A free database of news resources for information extraction
Project description
# Freebie
_A free database of news resources for information extraction_
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
### Resources
- `websources`: a list of news outlet web sites
- source_id
- name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- `twitter_handles`: a list of twitter handles of news outlets
- handle_id
- name
- handle
- cca (country code)
- language
- source_id
- source_url
- category
- `rss_feeds`: a list of RSS feeds of news outlets
- feed_id
- source_id
- name
- feed_name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- rss_url
### Usage
Import each of the three resources as a list of dictionaries.
```python
from freebie import websources
for resource in websources[:1]:
print(resource)
# {'source_id': '1', 'name': '3BL', 'language': 'en', 'cca': 'US', 'coverage': 'global', 'covered_area': 'world', 'url': 'https://www.3blmedia.com/'}
```
### Contribution
All contributions are wellcome. I'd be happy to review PRs or to receive any data that you would want added via [email](mailto:me@sasho.io).
_A free database of news resources for information extraction_
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
### Resources
- `websources`: a list of news outlet web sites
- source_id
- name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- `twitter_handles`: a list of twitter handles of news outlets
- handle_id
- name
- handle
- cca (country code)
- language
- source_id
- source_url
- category
- `rss_feeds`: a list of RSS feeds of news outlets
- feed_id
- source_id
- name
- feed_name
- language
- cca (country code)
- coverage (global, national, local)
- covered_area
- url
- rss_url
### Usage
Import each of the three resources as a list of dictionaries.
```python
from freebie import websources
for resource in websources[:1]:
print(resource)
# {'source_id': '1', 'name': '3BL', 'language': 'en', 'cca': 'US', 'coverage': 'global', 'covered_area': 'world', 'url': 'https://www.3blmedia.com/'}
```
### Contribution
All contributions are wellcome. I'd be happy to review PRs or to receive any data that you would want added via [email](mailto:me@sasho.io).
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
freebie-0.1.3.tar.gz
(70.5 kB
view hashes)
Built Distribution
Close
Hashes for freebie-0.1.3-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | efd126cec9486bcd73b7a994bb35e84dd48aa6ac5687b09553cd83d85c158a52 |
|
MD5 | ac2d539b5d088dce2de8ea282b58927e |
|
BLAKE2b-256 | ce5d28faa5fc47e0020539fddeaa207471e23c989dda149c8b1ccfe6ff09cec3 |