Skip to main content

Scrapy project for feeds into INSPIRE-HEP (http://inspirehep.net).

Project description

https://img.shields.io/travis/inspirehep/hepcrawl.svg https://img.shields.io/coveralls/inspirehep/hepcrawl.svg https://img.shields.io/github/tag/inspirehep/hepcrawl.svg https://img.shields.io/pypi/dm/hepcrawl.svg https://img.shields.io/github/license/inspirehep/hepcrawl.svg

HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.

The project is currently in early stage of development.

See full documentation at http://pythonhosted.org/hepcrawl

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hepcrawl-13.0.67.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hepcrawl-13.0.67-py2.py3-none-any.whl (98.6 kB view details)

Uploaded Python 2Python 3

File details

Details for the file hepcrawl-13.0.67.tar.gz.

File metadata

  • Download URL: hepcrawl-13.0.67.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.18

File hashes

Hashes for hepcrawl-13.0.67.tar.gz
Algorithm Hash digest
SHA256 03a35dfbfa06967bb9bb7cb72a0be1fde0b6a6b2526c5eef2f2575baee6e6b02
MD5 8505a01499068e932347216dc3390b8f
BLAKE2b-256 13734b6ed2a7908567313817c7a6a0b83e51e57e96efdc982455fa4741e0d01c

See more details on using hashes here.

File details

Details for the file hepcrawl-13.0.67-py2.py3-none-any.whl.

File metadata

  • Download URL: hepcrawl-13.0.67-py2.py3-none-any.whl
  • Upload date:
  • Size: 98.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.18

File hashes

Hashes for hepcrawl-13.0.67-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 147ae6b29be7224c07d09036c89daaf847b56cb7dcf3fe0c9a25c8173d4463a6
MD5 5111fb66fd0bed46b749b84bc085c5a9
BLAKE2b-256 bbc6de2c110216edd079354b2fa7d17ef14059206393c572bcdf3b03ba2550eb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page