Media Cloud API Client Library

Project description

MediaCloud Python API Client

This is a python client for accessing the MediaCloud API v2. We support Python versions 2.7 and 3.6.

Usage

pip install mediacloud

Check CHANGELOG.md for a detailed history of changes.

Examples

Find out how many stories in the top US online news sites mentioned "Zimbabwe" in the last year:

import mediacloud.api
mc = mediacloud.api.MediaCloud('MY_API_KEY')
res = mc.storyCount('zimbabwe AND president AND tags_id_media:58722749', 'publish_date:[NOW-1YEAR TO NOW]')
print(res['count']) # prints the number of stories found

Get 2000 stories from the NYT about a topic in 2018 and dump the output to json:

import mediacloud.api, json, datetime
mc = mediacloud.api.MediaCloud('MY_API_KEY')

fetch_size = 500
stories = []
last_processed_stories_id = 0
while len(stories) < 2000:
    fetched_stories = mc.storyList('trump AND "north korea" AND media_id:1', 
                                   solr_filter=mc.publish_date_query(datetime.date(2018,1,1), datetime.date(2019,1,1)),
                                   last_processed_stories_id=last_processed_stories_id, rows= fetch_size)
    stories.extend(fetched_stories)
    if len( fetched_stories) < fetch_size:
        break
    last_processed_stories_id = stories[-1]['processed_stories_id']
print(json.dumps(stories))

Find the most commonly used words in stories from the US top online news sites that mentioned "Zimbabwe" and "president" in 2013:

import mediacloud.api, datetime
mc = mediacloud.api.MediaCloud('MY_API_KEY')
words = mc.wordCount('zimbabwe AND president AND tags_id_media:58722749',
                     mc.publish_date_query( datetime.date( 2013, 1, 1), datetime.date( 2014, 1, 1)))
print(words[0])  # prints the most common word

To find out all the details about one particular story by id:

import mediacloud.api
mc = mediacloud.api.MediaCloud('MY_API_KEY')
story = mc.story(169440976)
print(story['url'])  # prints the url the story came from

To save the first 100 stories from one day to a database:

import mediacloud.api, datetime
mc = mediacloud.api.MediaCloud('MY_API_KEY')
db = mediacloud.storage.MongoStoryDatabase('one_day')
stories = mc.storyList('*', mc.publish_date_query( datetime.date (2014, 01, 01), datetime.date(2014,01,02) ),
                       last_processed_stories_id=0,rows=100)
[db.addStory(s) for s in stories]
print(db.storyCount())

Take a look at the test in the mediacloud/test/ module for more detailed examples.

Development

If you are interested in adding code to this module, first clone the GitHub repository.

Testing

You need to create an MC_API_KEY envvar and set it to your API key (we use python-dotenv). Then run make test. We run continuous integration (via Travis), so every push runs the whole test suite (we also do this nightly and on PRs).

Distributing a New Version

If you want to, setup twin's keyring integration to avoid typing your PyPI password over and over.

Run make test to make sure all the test pass
Update the version number in mediacloud/__init__.py
Make a brief note in the CHANGELOG.md about what changes
Run make build-release to create an install package
Run make release-test to upload it to PyPI's test platform
Run make release to upload it to PyPI

Project details

Release history Release notifications | RSS feed

4.1.3

Feb 6, 2024

4.1.2

Dec 23, 2023

4.1.1

Dec 13, 2023

4.1.0

Dec 12, 2023

4.0.1

Mar 29, 2023

4.0.0

Dec 15, 2022

3.13.0

Mar 30, 2022

3.12.5

Oct 19, 2021

3.12.4

Sep 24, 2021

3.12.3

Apr 8, 2021

3.12.2

Apr 7, 2021

3.12.1

Oct 8, 2020

3.12.0

Jun 3, 2020

3.11.3

May 19, 2020

3.11.2

May 4, 2020

3.11.1

May 1, 2020

3.11.0

Apr 24, 2020

3.10.0

Apr 3, 2020

3.9.3

Mar 19, 2020

3.9.2

Mar 18, 2020

3.9.1

Mar 17, 2020

3.9.0

Mar 17, 2020

3.8.0

Mar 6, 2020

This version

3.7.6

Feb 11, 2020

3.7.5

Dec 19, 2019

3.7.4

Nov 21, 2019

3.7.3

Nov 20, 2019

3.7.2

Nov 6, 2019

3.7.1

Nov 5, 2019

3.7.0

Sep 16, 2019

3.6.5

Jul 8, 2019

3.6.4

Jul 5, 2019

3.6.3

May 21, 2019

3.6.2

May 17, 2019

3.6.1

May 14, 2019

3.6.0

May 14, 2019

3.5.1

Apr 10, 2019

3.5.0

Mar 6, 2019

3.4.4

Jan 29, 2019

3.4.3

Jan 4, 2019

3.4.2

Jan 4, 2019

3.4.1

Dec 28, 2018

3.4.0

Dec 4, 2018

3.3.1

Nov 21, 2018

3.3.0

Nov 15, 2018

3.2.3

Nov 1, 2018

3.2.2

Oct 1, 2018

3.2.1

Aug 20, 2018

3.2.0

Aug 20, 2018

3.1.1

Jul 19, 2018

3.1.0

Jun 21, 2018

3.0.4

Jun 6, 2018

3.0.3

May 18, 2018

3.0.2

May 18, 2018

3.0.1

May 18, 2018

3.0.0

May 18, 2018

3.0.0-b2 pre-release

May 11, 2018

3.0.0-b1 pre-release

May 10, 2018

2.53.0

May 1, 2018

2.52.0

Apr 11, 2018

2.51.0

Mar 20, 2018

2.50.0

Mar 7, 2018

2.49.0

Mar 6, 2018

2.48.0

Mar 5, 2018

2.47.0

Feb 23, 2018

2.46.0

Feb 22, 2018

2.45.0

Jan 3, 2018

2.44.0

Dec 29, 2017

2.43.3

Dec 11, 2017

2.43.2

Oct 19, 2017

2.43.1

Oct 17, 2017

2.43.0

Oct 13, 2017

2.42.0

Sep 20, 2017

2.41.0

Aug 25, 2017

2.40.1

Jun 9, 2017

2.40.0

Apr 27, 2017

2.39.2

Apr 25, 2017

2.39.1

Mar 31, 2017

2.39.0

Mar 30, 2017

2.38.2

Mar 19, 2017

2.38.1

Mar 17, 2017

2.38.0

Feb 17, 2017

2.37.0

Feb 12, 2017

2.36.2

Jan 12, 2017

2.36.1

Jan 11, 2017

2.36.0

Jan 6, 2017

2.35.6

Dec 28, 2016

2.35.5

Dec 22, 2016

2.35.3

Dec 21, 2016

2.35.2

Dec 21, 2016

2.35.1

Dec 21, 2016

2.35.0

Dec 20, 2016

2.34.0

Oct 11, 2016

2.33.1

Aug 22, 2016

2.33.0

Aug 16, 2016

2.32.0

Jul 20, 2016

2.31.0

Jul 19, 2016

2.30.0

Jul 18, 2016

2.29.1

Jun 24, 2016

2.29.0

Jun 14, 2016

2.28.0

May 17, 2016

2.27.0

May 12, 2016

2.26.1

Apr 15, 2016

2.26.0

Mar 31, 2016

2.25.0

Mar 17, 2016

2.24.1

Jul 15, 2015

2.24.0

Jun 25, 2015

2.23.0

Jun 24, 2015

2.22.2

Jun 18, 2015

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mediacloud-3.7.6.tar.gz (17.3 kB view hashes)

Uploaded Feb 11, 2020 Source

Hashes for mediacloud-3.7.6.tar.gz

Hashes for mediacloud-3.7.6.tar.gz
Algorithm	Hash digest
SHA256	`12f1c8f729ff102091431fd61b29483fb3d258f6ff97c4b40d3dac719d8a926b`
MD5	`e303cbbb11fb16bc1ea6bf7fb431fc36`
BLAKE2b-256	`a05a81e12f2e8afd78e963ff2a0f9327be172d254c449cd0da9e7a530d9f53a7`