Search Providers package for Mediacloud
Media Cloud Providers Library
A package of search providers for Media Cloud, wrapping up interfaces for different social media platforms.
UNDER CONSTRUCTION: this probably won't get a huge amount of attention for a little while, but I'm putting it up now since I've already done the extraction.
Install with pip (pip install .) and the install.sh script.
Requires environment variables set for various interfaces to work correctly.
Build
Make sure to run pip install flit twine so you can build and deploy to PyPI.
- Bump the version number in pyproject.toml
- Add a note about changes to the version history below
- Commit the changes and tag it with a semantic version number
- A GitHub action will build and push the repository when a tagged version is committed
Version History
- v2.2.0 - Added an optional argument to providers to toggle caching behavior, added more specific error on 504
- v2.1.1 - Bugfix
- v2.1.0 - Mediacloud news client code incorporated into this package
- v2.0.5 - Build-system in pyproject.toml
- v2.0.4 - reintroduce stopwords
- v2.0.3 - version bump for automatic releases
- v2.0.2 - respect domain filters on Media Cloud searches
- v2.0.1 - more work on caching strategies
- v2.0.0 - change CachingManager interface to support online news providers better
- v1.0.1 - fix default timeout option that applies across all providers
- v1.0.0 - Remove legacy Media Cloud, add timeout option to provider_for
- v0.5.3 - Temporary fix to onlinenews-mediacloud search handling
- v0.5.3 - Tweaks to onlinenews-mediacloud for compatibility with new database pattern
- v0.5.2 - Fix to allow override of chunking in MC client
- v0.5.1 - Fix use of media cloud to respect domains clause on story list paging
- v0.5.0 - Integrate new mediacloud-news-client into onlinenews-mediacloud
- v0.4.0 - Specify custom base URLs via new string param to provider_by_name and provider_for
- v0.3.0 - Add support for paging through stories directly, and including text in paged results for speed
- v0.2.6 - Fixed querying by domain on new mediacloud system
- v0.2.5 - Alignment with new mediacloud system. Old onlinenews provider is now "onlinenews-mclegacy", "onlinenews-mediacloud" now queries the new index.
- v0.2.4 - Added support for api keys via "provider_by_name"
- v0.2.3 - removed support for API keys in environment variables; they are now expected as an argument in providers.provider_for
- v0.2.2 - transition to use the dedicated mediacloud-api-legacy package to avoid version conflicts
- v0.2.1 - add in a date hack to resolve a lower-level bug in the Media Cloud legacy count-over-time results
- v0.2.0 - add in support for Media Cloud legacy database
- v0.1.7 - corrected support for a "filters" kwarg in online_news
- v0.1.6 - Added support for a "filters" kwarg in online_news
- v0.1.5 - Added politeness wait to all chunked queries in twitter provider
- v0.1.4 - Added Query Chunking for large collections in the Twitter provider
- v0.1.3 - Added Query Chunking for large queries in the onlinenews provider
- v0.1.2 - Test Completeness
- v0.1.1 - Parity with web-search module, and language model
- v0.1.0 - Initial pypi upload