Skip to main content

A tool for parsing academic papers

Project description

PaperParser

Parses academic paper information from URLs.

This is a project by Winder Research, a Cloud-Native Data Science consultancy.

Installation

pip install paperparser

Usage

CLI

$ paperparser --help                                
Usage: paperparser [OPTIONS] URL STRATEGY

  Parse the bibtex from a URL using the STRATEGY.

Options:
  --version  Show the version and exit.
  --help     Show this message and exit.

$ paperparser https://arxiv.org/abs/1812.02900 arxiv
{'title': 'Off-Policy Deep Reinforcement Learning without Exploration', 'journal': 'CoRR', 'volume': 'abs/1812.02900', 'year': '2018', 'url': 
'http://arxiv.org/abs/1812.02900', 'archivePrefix': 'arXiv', 'eprint': '1812.02900', 'timestamp': 'Tue, 01 Jan 2019 15:01:25 +0100', 'biburl': 'https://dblp.org/rec/journals/corr/abs-1812-02900.bib', 'bibsource': 'dblp computer science bibliography, https://dblp.org'}

Python

from paperparser import page

p = page.BibTeXPage(url=url, strategy=strategy)
print(p.as_dict())
print(p.abstract())

Strategies

"arxiv"

Parses bibtex from dblp and abstracts directly.

"nips"

Parses bibtex and abstracts directly.

"acm"

Parses bibtex via the doi from https://dx.doi.org/ and abstracts directly.

"ieee"

Parses bibtex via the doi from https://dx.doi.org/ and abstracts directly.

"sciencedirect"

Parses bibtex via the doi from https://dx.doi.org/ and abstracts directly.

"wiley"

Parses bibtex via the doi from https://dx.doi.org/ and abstracts directly.

"semanticscholar"

Prefer other parsers where possible. Semantic scholar tends to index other websites and therefore results are sketchy. Parses bibtex via the DOI if possible, manual creation if not. Abstracts are direct.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PaperParser-0.2.2.tar.gz (5.6 kB view details)

Uploaded Source

Built Distribution

PaperParser-0.2.2-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file PaperParser-0.2.2.tar.gz.

File metadata

  • Download URL: PaperParser-0.2.2.tar.gz
  • Upload date:
  • Size: 5.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.3 CPython/3.7.8 Linux/4.19.78-coreos

File hashes

Hashes for PaperParser-0.2.2.tar.gz
Algorithm Hash digest
SHA256 b30f6412d689dd9b9c174c163ea1542cc59fb7d1a5f6befb550088ba5f4d1e76
MD5 891b49ffaea3627866742c461c1286d0
BLAKE2b-256 38c498ccce7c0c665412e800c1da4664242ce35c43b98c8825a962f03a2f4dc8

See more details on using hashes here.

File details

Details for the file PaperParser-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: PaperParser-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.3 CPython/3.7.8 Linux/4.19.78-coreos

File hashes

Hashes for PaperParser-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2745590bf4991a8e4cc27413627e816c3d5206ba7964a682ccd2bb6524ce68c2
MD5 417dae7220555a3b3d2ba48195383020
BLAKE2b-256 358d4da6eff811f3d44abee0c9aba32f299e0cd81d6fd8ed3c7654d56c20866c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page