Skip to main content

Scrapy project for feeds into INSPIRE-HEP (http://inspirehep.net).

Project description

https://img.shields.io/travis/inspirehep/hepcrawl.svg https://img.shields.io/github/tag/inspirehep/hepcrawl.svg https://img.shields.io/pypi/dm/hepcrawl.svg https://img.shields.io/github/license/inspirehep/hepcrawl.svg

HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.

The project is currently in early stage of development.

See full documentation at http://pythonhosted.org/hepcrawl

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hepcrawl-13.0.83.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hepcrawl-13.0.83-py2.py3-none-any.whl (99.8 kB view details)

Uploaded Python 2Python 3

File details

Details for the file hepcrawl-13.0.83.tar.gz.

File metadata

  • Download URL: hepcrawl-13.0.83.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.20

File hashes

Hashes for hepcrawl-13.0.83.tar.gz
Algorithm Hash digest
SHA256 f5b5c028a663a02170109977e859e6249284714a77d8ce3317b1f630df39bb33
MD5 3b330b10fbb593b5a162d088b22e0e8c
BLAKE2b-256 d1bcc8a9f74c5acff8b667c7f9390141c89d1194b02a2a0f66cc9a9db4302f5f

See more details on using hashes here.

File details

Details for the file hepcrawl-13.0.83-py2.py3-none-any.whl.

File metadata

  • Download URL: hepcrawl-13.0.83-py2.py3-none-any.whl
  • Upload date:
  • Size: 99.8 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.20

File hashes

Hashes for hepcrawl-13.0.83-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 f5cc693f806ae8ccd72bf0ec9b7a35d6e77af7076f37a4801fe1c2422de31a52
MD5 7e1909fb0bdee00dcce4bba9a9e7495f
BLAKE2b-256 cfb2dc98c2b0650033d9354152f7cebde6a14a94c04662bcfcc90c047bf2a84a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page