Scrapy project for feeds into INSPIRE-HEP (http://inspirehep.net).
Project description
HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.
The project is currently in early stage of development.
See full documentation at http://pythonhosted.org/hepcrawl
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
hepcrawl-13.0.21.tar.gz
(701.5 kB
view details)
File details
Details for the file hepcrawl-13.0.21.tar.gz.
File metadata
- Download URL: hepcrawl-13.0.21.tar.gz
- Upload date:
- Size: 701.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.24.0 setuptools/44.1.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/2.7.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
21fcb70b584a0d70ef1119a50d7b580cf0cb43c2621993f1789ae655daa49ff8
|
|
| MD5 |
3acde8902702d7c555b130a89deb82b5
|
|
| BLAKE2b-256 |
b732f132b4d78c7df89078c73a533003cf6bdb7b0a415706c3bcebf2d42fb382
|