Python library to work with ARC and WARC files
WARC (Web ARChive) is a file format for storing web crawls.
This warc library makes it very easy to work with WARC files.:
import warc f = warc.open("test.warc") for record in f: print record['WARC-Target-URI'], record['Content-Length']
The documentation of the warc library is available at http://warc.readthedocs.org/.
This software is licensed under GPL v2. See LICENSE file for details.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size warc-0.2.1.tar.gz (18.4 kB)||File type Source||Python version None||Upload date||Hashes View|