Wayback Machine news archive search API client
Wayback Machine News Archive Client
🚧 under construction 🚧
A simple client library to access the Wayback Machine news archive search.
Installation
pip install wayback-news-search
Basic Usage
Counting matching stories:
from waybacknews.searchapi import SearchApiClient
import datetime as dt
api = SearchApiClient("mediacloud")
api.count("coronavirus", dt.datetime(2022, 3, 1), dt.datetime(2022, 4, 1))
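To see how counts change over time, one option is to split the date range into smaller windows and call `count` once per window. The sketch below only builds the windows; `date_windows` is a hypothetical helper, not part of this library. Whether the API treats the end date as inclusive is not stated here, so stories on a window boundary might be counted twice.

```python
import datetime as dt
from typing import Iterator, Tuple


def date_windows(start: dt.datetime, end: dt.datetime,
                 step: dt.timedelta) -> Iterator[Tuple[dt.datetime, dt.datetime]]:
    """Yield consecutive (window_start, window_end) pairs covering start..end."""
    if step <= dt.timedelta(0):
        raise ValueError("step must be positive")
    cursor = start
    while cursor < end:
        # Clip the last window so it never runs past the requested end date.
        window_end = min(cursor + step, end)
        yield cursor, window_end
        cursor = window_end


# Hypothetical usage, with `api` created as in the example above:
# for window_start, window_end in date_windows(dt.datetime(2022, 3, 1),
#                                              dt.datetime(2022, 4, 1),
#                                              dt.timedelta(days=7)):
#     print(window_start.date(), api.count("coronavirus", window_start, window_end))
```

Each window costs one request, so a coarser step keeps the number of calls down.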
Paging over all matching results:
from waybacknews.searchapi import SearchApiClient
import datetime as dt
api = SearchApiClient("mediacloud")
for page in api.all_articles("coronavirus", dt.datetime(2022, 3, 1), dt.datetime(2022, 4, 1)):
    do_something(page)
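If you would rather handle one article at a time, the pages can be flattened into a single stream. This is a minimal sketch that assumes each page is a list of article records (dicts); `iter_articles` and the `title` field in the stand-in data are made up for illustration and are not part of the library.

```python
from typing import Any, Dict, Iterable, Iterator, List


def iter_articles(pages: Iterable[List[Dict[str, Any]]]) -> Iterator[Dict[str, Any]]:
    """Flatten an iterable of pages into a single stream of article records."""
    for page in pages:
        for article in page:
            yield article


# Stand-in data for illustration only; an empty page is simply skipped.
fake_pages = [[{"title": "a"}, {"title": "b"}], [], [{"title": "c"}]]
titles = [article["title"] for article in iter_articles(fake_pages)]
```

With a real client, the same helper could wrap the `api.all_articles(...)` call from the example above, assuming the page shape matches.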
Dev Installation
Install the dependencies for dev: pip install -e .[dev]
Distribution
- Run `pytest` to make sure all the tests pass
- Update the version number in `waybacknews/__init__.py`
- Make a brief note in the version history section below about the changes
- Commit the changes
- Tag the commit with a semantic version number - 'v*.*.*'
- Push the repo to GitHub
- Run `python setup.py sdist` to create an installation package
- Run `twine upload --repository-url https://test.pypi.org/legacy/ dist/*` to upload it to PyPI's test platform
- Run `twine upload dist/*` to upload it to PyPI
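Before pushing, it can help to check that the tag and the package version agree. The sketch below assumes tags look like `v1.2.1` and that you pass in the version string yourself; `tag_matches_version` is a hypothetical helper, and how the version is stored in `waybacknews/__init__.py` is not shown here.

```python
import re

# A 'v' followed by three dot-separated integers, e.g. v1.2.1
SEMVER_TAG = re.compile(r"v(\d+)\.(\d+)\.(\d+)")


def tag_matches_version(tag: str, version: str) -> bool:
    """Return True if `tag` has the form 'vX.Y.Z' and X.Y.Z equals `version`."""
    match = SEMVER_TAG.fullmatch(tag)
    return match is not None and ".".join(match.groups()) == version
```

For example, `tag_matches_version("v1.2.1", "1.2.1")` passes, while a tag without the leading `v` or with a different patch number does not.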
Version History
- v1.2.1 - fix paging bug triggered by no results
- v1.2.0 - add support for new `expanded` results, and more integration testing
- v1.1.0 - add new `paged_articles` method to allow paging over all results
- v1.0.3 - add 30 sec timeout, remove extra params mcproviders library might be adding
- v1.0.2 - fix to article endpoint
- v1.0.1 - automatically escape '/' in query strings, test case for `url` field search
- v1.0.0 - update to public API endpoint
- v0.1.5 - simpler return for top terms
- v0.1.4 - better error handling
- v0.1.3 - allow overriding base api URL
- v0.1.2 - fix `article` endpoint, test case for fetching content (`snippet`) via `article_url` property
- v0.1.1 - more consistent method names
- v0.1.0 - initial test-only release
Hashes for wayback-news-search-1.2.1.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 6c29d53d7ee79cbbbe5da43bc9d6b1b9838ac92d536cc649f7c43b2e725e4817
MD5 | 20323f89f0d6171b5907195f3b3072f3
BLAKE2b-256 | 3aa79e0fde439099b9bc4ebca633667ef42e056d92d8c0580e71414e2d62c61a