A free database of news resources for information extraction
Project description
# Freebie
_A free database of news resources for information extraction_
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
### Resources
- `websources`: a list of news outlet web sites
- source_id
- name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- `twitter_handles`: a list of twitter handles of news outlets
- handle_id
- name
- handle
- cca (country code)
- language
- source_id
- source_url
- category
- `rss_feeds`: a list of RSS feeds of news outlets
- feed_id
- source_id
- name
- feed_name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- rss_url
### Usage
Import each of the three resources as a list of dictionaries.
```python
from freebie import websources
for resource in websources[:1]:
print(resource)
# {'source_id': '1', 'name': '3BL', 'language': 'en', 'cca': 'US', 'coverage': 'global', 'covered_area': 'world', 'url': 'https://www.3blmedia.com/'}
```
### Contribution
All contributions are wellcome. I'd be happy to review PRs or to receive any data that you would want added via [email](mailto:me@sasho.io).
_A free database of news resources for information extraction_
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
### Resources
- `websources`: a list of news outlet web sites
- source_id
- name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- `twitter_handles`: a list of twitter handles of news outlets
- handle_id
- name
- handle
- cca (country code)
- language
- source_id
- source_url
- category
- `rss_feeds`: a list of RSS feeds of news outlets
- feed_id
- source_id
- name
- feed_name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- rss_url
### Usage
Import each of the three resources as a list of dictionaries.
```python
from freebie import websources
for resource in websources[:1]:
print(resource)
# {'source_id': '1', 'name': '3BL', 'language': 'en', 'cca': 'US', 'coverage': 'global', 'covered_area': 'world', 'url': 'https://www.3blmedia.com/'}
```
### Contribution
All contributions are wellcome. I'd be happy to review PRs or to receive any data that you would want added via [email](mailto:me@sasho.io).
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
freebie-0.1.2.tar.gz
(70.5 kB
view hashes)
Built Distribution
Close
Hashes for freebie-0.1.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | bb9e7f199039f018fd52f3405e388ef85fd5c56de40d09806e87404e2a395406 |
|
MD5 | 21f0484839780fc69f1f0a7c69cdaa49 |
|
BLAKE2b-256 | 391edaf66cea2e5f2d1daa48d7fec555ea6e4341d05d180bd4c10862442878a0 |