# archive-cdx
A library to interact with the archive.org Wayback Machine CDX server.
## How to
`archivecdx.Listing` is the basic list class.
Pass [CDX URL arguments](https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server#intro-and-usage-1) as keyword args.
Special args:
- `url` - the URL to look up (the CDX `url` param); the example below passes it as the first positional argument
- `delay` - the delay between automatic page count detection and grabbing the first page
- `page_count` - the maximum number of pages, passed as a keyword argument
```python
import archivecdx

# List captures of archive.org, capped at 40 pages, with three fields per row
listing = archivecdx.Listing("archive.org", page_count=40, fl=["original", "timestamp", "digest"])
print(listing[0])  # Row(original='http://www.archive.org:80/',timestamp='19970126045828',digest='Q4YULN754FHV2U6Q5JUT6Q2P57WEWNNY')
```
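To make the mapping from keyword args to CDX URL arguments more concrete, here is a minimal sketch of how such a query URL could be assembled by hand. It does not use `archivecdx` and is not necessarily how the library builds its requests; `build_cdx_url` is a hypothetical helper, and joining list values such as `fl` with commas is an assumption based on the CDX server's comma-separated field syntax. The sketch only builds the URL and performs no network request.

```python
from urllib.parse import urlencode

# Public Wayback Machine CDX endpoint, as given in the CDX server documentation.
CDX_ENDPOINT = "http://web.archive.org/cdx/search/cdx"


def build_cdx_url(url, page=None, **params):
    """Hypothetical helper: turn keyword args into a CDX query URL."""
    query = {"url": url}
    for key, value in params.items():
        # Assumption: list values (e.g. fl=[...]) become comma-separated strings.
        if isinstance(value, (list, tuple)):
            value = ",".join(str(v) for v in value)
        query[key] = value
    if page is not None:
        # The CDX pagination API selects a page with the `page` argument.
        query["page"] = page
    # Keep commas unescaped so the field list stays readable.
    return CDX_ENDPOINT + "?" + urlencode(query, safe=",")


print(build_cdx_url("archive.org", page=0, fl=["original", "timestamp", "digest"]))
```

The same helper could be called with `showNumPages="true"` to build the page-count query that the CDX server's pagination API describes.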