Skip to main content

Celery-based rss/atom/jsonfeed processing extension for InvenioRDM.

Project description

PyPI version docs License DOI

invenio-feeds

Celery-based feed processing extension for InvenioRDM, powering the Rogue Scholar science blog archive.

invenio-feeds replaces the former Flask/REST API with Celery workers that periodically fetch blog feeds, normalise posts, and upsert them as InvenioRDM records and communities. It follows the standard invenio-base extension pattern and integrates into any InvenioRDM instance.

Installation

Requires Python 3.14. Uses uv for dependency management.

uv sync

Environment variables

# InvenioRDM instance to read/write
FLASK_INVENIORDM_API=https://rogue-scholar.org   # default
FLASK_INVENIORDM_TOKEN=<bearer-token>

Running

Celery worker

celery -A invenio_feeds.tasks worker --loglevel=info

Celery beat scheduler (periodic tasks)

celery -A invenio_feeds.tasks beat --loglevel=info

Trigger a feed import manually

celery -A invenio_feeds.tasks call invenio_feeds.process_blog_feed --args='["<blog-slug>"]'

Scheduled tasks

Task Schedule Description
invenio_feeds.process_all_feeds every 30 min Fan-out: one process_blog_feed per active blog
invenio_feeds.process_blog_feed on-demand Fetch and upsert all new posts for one blog
invenio_feeds.classify_all_blogs daily 03:00 UTC Fan-out: classify untagged posts per blog

Development

We use pytest for testing:

uv run pytest

Follow along via Codeberg Issues. Please open an issue if you encounter a bug or have a feature request.

Note on Patches/Pull Requests

  • Fork the project
  • Write tests for your new feature or a test that reproduces a bug
  • Implement your feature or make a bug fix
  • Do not mess with Rakefile, version or history
  • Commit, push and make a pull request. Bonus points for topical branches.

Documentation

Documentation (work in progress) for using Rogue Scholar is available at the Rogue Scholar Documentation website.

Meta

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

License: MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

invenio_feeds-0.1.6.tar.gz (121.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

invenio_feeds-0.1.6-py3-none-any.whl (129.6 kB view details)

Uploaded Python 3

File details

Details for the file invenio_feeds-0.1.6.tar.gz.

File metadata

  • Download URL: invenio_feeds-0.1.6.tar.gz
  • Upload date:
  • Size: 121.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.5

File hashes

Hashes for invenio_feeds-0.1.6.tar.gz
Algorithm Hash digest
SHA256 052a6dd5778d45a98adc7d894c33a7bcc1b9c2337e20193fb8792aa7b12f0ceb
MD5 97f30d6f5683c6f283c58318910d79e7
BLAKE2b-256 beeba9cc7ae1e9028419af060c9aa40f37af76df7b1fda2d0db6fa7293dbe81f

See more details on using hashes here.

File details

Details for the file invenio_feeds-0.1.6-py3-none-any.whl.

File metadata

File hashes

Hashes for invenio_feeds-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 8f10391c4716214d64179b6d9e89deb864670cd50ad5c877b8757bd494feaa39
MD5 d187e75cc85b320c7fd95668d9552a22
BLAKE2b-256 3b36dff24532d2afd6367c45804b2f91182f5bf49146ce5b01e6e1c4db0dd5b8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page