resyndicator

Aggregates data from many sources into merged and filtered Atom feeds.

These details have not been verified by PyPI

Project links

Homepage

Project description

Purpose

The Resyndicator aggregates data from various sources into Atom feeds. If you have a list of a couple hundred data sources – such as feeds, sitemaps, and Twitter users – and want to share the aggregate of those entries or updates between your various devices (computers, phones, etc.), your colleagues, or even the visitors of your website, then that’s just what the Resyndicator is for.

It allows for queries as sophisticated as SQLAlchemy allows to filter your aggregate feed.
It allows you to subclass the fetchers, so you can write fetchers for endpoints as obscure as Adobe’s AMF.
It keeps all entries in Postgres, so you have a backup.

Setup

When you’ve installed it though Buildout or pip, you should get an endpoint like bin/resyndicator. If not and you know why, then please tell me, because I have the same problem. Otherwise just copy the entry_points parameter from setup.py to your setup.py to create a new one.

In your own package, you’ll need to create at least a settings.py and a resources.py. In settings.py, you can specify your database credential with something like DATABASE = 'postgresql://foo:bar@localhost/impactfeeder' (you may need to create the database and grant access rights to the user). For more options, see the settings.py included in the Resyndicator.

In resources.py, you list the feeds and (eponymous) resyndicators like so for example:

from datetime import timedelta
from sqlalchemy.sql import or_
from resyndicator import settings
from resyndicator.models import Entry
from resyndicator.fetchers import (
    FeedFetcher, SitemapIndexFetcher, SitemapFetcher,
    TwitterStreamer, ContentFetcher)
from resyndicator.resyndicators import Resyndicator

PAST = timedelta(days=7)

CONTENT_FETCHER = ContentFetcher(past=PAST, timeout=10)

RESYNDICATORS = [
    Resyndicator(
        title='Effective Altruism',
        past=PAST,
        query=or_(
            Entry.source_link.in_([
                'http://feeds.feedburner.com/TheGivewellBlog',
                'http://www.openphilanthropy.org/sitemap.xml',
            ])
        )
    )
]

FETCHERS = [
    FeedFetcher('http://feeds.feedburner.com/TheGivewellBlog',
                interval=10*60),
    SitemapFetcher('http://www.openphilanthropy.org/sitemap.xml',
                   defaults={'title': 'Open Phil Sitemap',
                             'author': 'Open Philanthropy Project'},
                   interval=30*60),
]

STREAMS = [
    TwitterStreamer(
        oauth_token=settings.OAUTH_TOKEN,
        oauth_secret=settings.OAUTH_SECRET,
        timeout=30*60),
]

For each resyndicator, you define a query and a title which will determine its ID and thus its identity. If you change the title you create a different feed. The query determine the entries of the feed and are written are SQLAlchemy where statements.

You can then start the scheduler of the fetchers with bin/resyndicator -s mypackage.settings fetchers, the first stream with bin/resyndicator -s mypackage.settings stream 0 (other streams analogously), and the content fetcher with bin/resyndicator -s mypackage.settings content unless your Buildout is configured some weird way.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.8.2

Jun 7, 2017

0.8.1

Jun 3, 2017

0.8.0

Jun 3, 2017

0.7.1

Jun 3, 2017

0.7.0

Jun 2, 2017

0.6.0

Jun 2, 2017

This version

0.5.4

May 20, 2017

0.5.3

May 20, 2017

0.5.2

May 20, 2017

0.5.1

May 16, 2017

0.5.0

May 16, 2017

0.2.1

Jan 6, 2017

0.1.4

May 24, 2016

0.1.3

May 23, 2016

0.1.2

May 20, 2016

0.1.1

May 20, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

resyndicator-0.5.4.tar.gz (15.4 kB view hashes)

Uploaded May 20, 2017 Source

Hashes for resyndicator-0.5.4.tar.gz

Hashes for resyndicator-0.5.4.tar.gz
Algorithm	Hash digest
SHA256	`c0026392b61045d79d4bfbccc206cbdb3cf6922a3e3bafa0a8198540f6f8bca8`
MD5	`6f748ff835261b55aa56e531e57840b0`
BLAKE2b-256	`874d41cf853c85439aa080bf4305a97b5872417570423c94f6c8771926b5db7b`