A Python web wrapper and text mining package for scientific journals, including Nature, Science, ScienceDirect, Wiley, ACS Publications, and RSC Publishing, among others.

Project description

### pyliterature

A Python web wrapper and text mining package for scientific journals, including Nature, Science, ScienceDirect, Wiley, ACS Publications, and RSC Publishing, among others.

### Author

  • Xing Wang <xingwang1991@gmail.com>

### Dependencies

  • Python >= 2.7

  • spynner

  • beautifulsoup4

  • nltk

### Examples

```python
>>> from pyliterature import Pyliterature
>>> url = 'http://www.nature.com/nature/journal/v541/n7635/full/nature20782.html'
>>> keyword = 'DFT'
>>> liter = Pyliterature(url, keyword)
>>> liter.parser()
>>> liter.text
>>> for sent in liter.keysents:
...     print(sent)
...
```
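To give a feel for what a keyword-sentence list such as `liter.keysents` might contain, here is a minimal standalone sketch. It is not pyliterature's implementation: the helper names `split_sentences` and `keyword_sentences` and the sample text are hypothetical, and the regex-based splitter is a simplifying assumption (the package lists nltk as a dependency and may tokenize differently).

```python
import re


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r'(?<=[.!?])\s+', text) if s.strip()]


def keyword_sentences(text, keyword):
    """Return the sentences that contain `keyword` as a whole word (case-sensitive)."""
    pattern = re.compile(r'\b' + re.escape(keyword) + r'\b')
    return [s for s in split_sentences(text) if pattern.search(s)]


# Hypothetical article text, used only for illustration.
sample_text = (
    "We studied the surface with DFT calculations. "
    "The samples were annealed at 500 K. "
    "Both DFT and experiment agree on the adsorption site."
)

for sent in keyword_sentences(sample_text, 'DFT'):
    print(sent)
```

A naive splitter like this will mis-split on abbreviations such as "et al." or "Fig. 2", which is one reason a trained tokenizer is usually preferred for scientific text.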

If you want to add features or improvements, feel free to send a pull request. To report a problem, please open an issue.

### TODO

  • Read the bibliography list

  • Determine which article each sentence comes from

Source Distribution

pyliterature-0.1.4.tar.gz (3.1 kB view details)

Uploaded Source

File details

Details for the file pyliterature-0.1.4.tar.gz.

File metadata

  • Download URL: pyliterature-0.1.4.tar.gz
  • Upload date:
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pyliterature-0.1.4.tar.gz
Algorithm Hash digest
SHA256 3b342245bf5a2648b6c6f53e350ef5f91cf63bd801af6b01b5e5371927c3e84e
MD5 d05f726b3d2f5a79c26fc5c7ff18cba4
BLAKE2b-256 9b3828c0f666f4b58c92bd9fab0304f4b481ead29c1724a1c03602cac9cc2c7f
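The digests above can be used to check a downloaded archive. The following is a minimal sketch using only the standard library; the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the example call assumes the archive was saved in the current directory under its published file name.

```python
import hashlib

# SHA256 digest published for pyliterature-0.1.4.tar.gz (copied from the table above).
EXPECTED_SHA256 = '3b342245bf5a2648b6c6f53e350ef5f91cf63bd801af6b01b5e5371927c3e84e'


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, 'rb') as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected.lower()


# Example call (assumes the file exists locally):
# matches_expected('pyliterature-0.1.4.tar.gz')
```

If the function returns `False`, the file is corrupted or is not the published release and should not be installed.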
