Simple RSS feed fetching framework
Project description
Simple RSS Feed Fetching Framework.
Decorator based feed fetcher.
Each function are given the RSS Feed’s info, Feed Entry ‘s info and html body.
Simple plugin system and useful standard plugins.
Installation
Supported python version is Python3.4 later.
pip install feedy
Tutorials
1. Getting feed entry’s title and link URL
Creating main.py like:
from feedy import Feedy
app = Feedy('feedy.dat')
@app.add('https://www.djangopackages.com/feeds/packages/latest/rss/')
def djangopackages(info, body):
print("- [{pkgname}]({link})".format(pkgname=info['article_title'], link=info['article_url']))
if __name__ == '__main__':
app.run()
And running:
$ python main.py - [django-dynamic-views](http://www.djangopackages.com/packages/p/django-dynamic-views/) - [django-simple-address](http://www.djangopackages.com/packages/p/django-simple-address/) - [django-db-sanitizer](http://www.djangopackages.com/packages/p/django-db-sanitizer/) :
After that please execute again.
$ python main.py # No output
It displays only newest feed entries.
If you want to check more detail messages:
$ feedy example.py feedy -t djangopackages --no-ignore-fetched -vvv [INFO] No updates in djangopackages. [DEBUG] A last_fetched_at is just updated.
2. Add feed patterns
Add CNN news feed for collecting images on each articles.
from feedy import Feedy
from bs4 import BeautifulSoup # You can select your favorite html parser.
app = Feedy(store='feedy.dat', ignore_fetched=True)
@app.add('http://rss.cnn.com/rss/edition.rss')
def cnn(info, body):
soup = BeautifulSoup(body, "html.parser")
for x in soup.find_all('img'):
print(x['src'])
@app.add('https://www.djangopackages.com/feeds/packages/latest/rss/')
def djangopackages(info, body):
"""Get the latest django library information."""
print("- [{pkgname}]({link})".format(pkgname=info['article_title'],
link=info['article_link']))
if __name__ == '__main__':
app.run()
And running:
$ python main.py - [django-dynamic-views](http://www.djangopackages.com/packages/p/django-dynamic-views/) - [django-simple-address](http://www.djangopackages.com/packages/p/django-simple-address/) - [django-db-sanitizer](http://www.djangopackages.com/packages/p/django-db-sanitizer/) : : http://i.cdn.turner.com/cnn/.e1mo/img/4.0/logos/menu_money.png http://i.cdn.turner.com/cnn/.e1mo/img/4.0/logos/menu_style.png http://edition.i.cdn.cnn.com/.a/1.269.4/assets/logo_cnn_nav_bottom.png : :
3. Command line interface
Feedy offers command line interface. It’s useful for debugging
help messages
$ feedy --help Usage: feedy [OPTIONS] SRC OBJ Run your feedy's project flexibly. Options: -v, --verbose Set log level -t, --targets TEXT The target function names. -s, --store TEXT A filename for store the last fetched time each RSS feed. -m, --max-entries INTEGER The maximum length for fetching entries every RSS feed --ignore-fetched / --no-ignore-fetched The maximum length for fetching entries every RSS feed --help Show this message and exit.
If you want to get specified entry for debugging, please execute following command:
$ feedy main.py app --max-entries 2 --no-ignore-fetched - [django-dynamic-views](http://www.djangopackages.com/packages/p/django-dynamic-views/) - [django-simple-address](http://www.djangopackages.com/packages/p/django-simple-address/) http://i.cdn.turner.com/cnn/.e1mo/img/4.0/logos/menu_money.png http://i.cdn.turner.com/cnn/.e1mo/img/4.0/logos/menu_style.png
And if you want to run only a cnn function, please execute:
$ feedy main.py app --max-entries 2 --no-ignore-fetched --target cnn http://i.cdn.turner.com/cnn/.e1mo/img/4.0/logos/menu_money.png http://i.cdn.turner.com/cnn/.e1mo/img/4.0/logos/menu_style.png
After that, please execute a following command:
4. Use plugins
You can easy developing by using plugins. For example, you can get shared count in social sns like facebook and pocket. There are two ways for applying the plugin.
Apply specified function using decorator
from feedy_plugins import social_share_plugin
@app.add('http://rss.cnn.com/rss/edition.rss')
@social_share_plugin
def cnn_shared(info, body, social_count):
article = {
'title': info['article_title'],
'pocket': social_count['pocket_count'],
'facebook': social_count['facebook_count'],
}
print(article)
And running:
$ feedy main.py app -t cnn_shared -m 2 {'title': 'Searchers locate Flight 804, EgyptAir vice chairman says', 'pocket': 4, 'facebook': 25} {'title': 'Security fears over French airports', 'pocket': 2, 'facebook': 9}
Apply all functions with ``.install()``
from feedy import Feedy
from feedy_plugins import social_share_plugin
from bs4 import BeautifulSoup
app = Feedy(store='feedy.dat', ignore_fetched=True)
app.install(social_shared_plugin) # apply each patterns.
@app.add('http://rss.cnn.com/rss/edition.rss')
def cnn_shared(info, body, social_count):
article = {
'title': info['article_title'],
'pocket': social_count['pocket_count'],
'facebook': social_count['facebook_count'],
}
print(article)
@app.add('https://www.djangopackages.com/feeds/packages/latest/rss/')
def djangopackages(info, body, social_count):
print("- [{pkgname}]({link})".format(pkgname=info['article_title'],
link=info['article_link']))
print(social_count['pocket_count'])
if __name__ == '__main__':
app.run()
5. Using feedy_utils
Download images using feedy_utils.
from feedy import Feedy
from feedy_utils import download_image
IMG_DIR = os.path.join(os.path.dirname('.'), 'img')
app = Feedy(store='feedy.dat', ignore_fetched=True)
@app.add('http://rss.cnn.com/rss/edition.rss')
def cnn(info, body):
download_image(body, info['site_url'],
filename="{title}-{i}".format(info['article_title']),
directory=IMG_DIR)
if __name__ == '__main__':
app.run()
Create Plugins
To write a new plugin, simply create decorator like:
def add_information_plugin(callback):
@wraps(callback)
def wrapper(*args, **kwargs):
additional_info = "This is custom plugin."
kwargs['additional_info'] = additional_info
callback(*args, **kwargs)
return wrapper
Happy hacking :)
Resources
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file feedy-0.0.4.tar.gz
.
File metadata
- Download URL: feedy-0.0.4.tar.gz
- Upload date:
- Size: 10.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 95be018c2ab3669a37205080f39e0bf99824ebecf2ae423f48297fd17b34684f |
|
MD5 | cf421bc4b50de87ec8c1c753e72f17ae |
|
BLAKE2b-256 | c43ddb004d086d4465e08d6a3796e693ec2e30cf071c939d616c2d5b58cfd698 |
File details
Details for the file feedy-0.0.4-py3-none-any.whl
.
File metadata
- Download URL: feedy-0.0.4-py3-none-any.whl
- Upload date:
- Size: 12.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 896bbc5bb07f52619ad693d32d480a33741e6326b2683542eaf63639a0bd5b3b |
|
MD5 | 06d1535561fe45488a5b2a3e2a213c45 |
|
BLAKE2b-256 | f24ea41dc6912c280980360d5af69c707d82a2e0eaa737515ff716c7d08dc1e7 |