Skip to main content

SoferAI's Torah Downloader

Project description

torah-dl - tools for downloading media from Torah websites.

uv PyPI - Version image image Actions status Coverage Status

Why torah-dl?

Most of our generation's Torah content is locked up in websites that are not easily accessible. Sofer.Ai is building a platform to make Torah accessible to everyone, and we decided to build key parts of that platform in the open. We intend to support every website with Torah audio on our platform, and realized quickly that even finding all the sites with audio would be a full-time job. So we open-sourced torah-dl to make it easier for others to download Torah audio from any website, and make Torah more accessible!

How does it work?

torah-dl is a library and a set of command-line tools for downloading media from Torah websites. You can use it as a command-line tool with uv (preferred), pipx, pip, poetry, venv, or any Python tool installer of your choice, simply by running uv tool install "torah-dl[cli]", and then running torah-dl.

For those who want to integrate torah-dl into their Python application, you can simply install it via uv add torah-dl or pip install torah-dl. You can then use the library in your code as you would any other Python library:

from torah_dl import extract

extraction = extract("https://www.yutorah.org/lectures/details?shiurid=1117416")

print(extraction.download_url) # https://download.yutorah.org/2024/34263/1117416/ketuvot-57a-b---preparation-for-nisuin.mp3

print(extraction.title) # Ketuvot 57a-b - Preparation for Nisuin

print(extraction.file_format) # audio/mp3

print(extraction.file_name) # ketuvot-57a-b---preparation-for-nisuin.mp3

What sites does it support?

Here is the list of sites that torah-dl supports already, and what's coming soon:

Note: Virtual Beit Midrash (Etzion) is currently temporarily degraded for automated clients in CI due to upstream Cloudflare/bot protection. Browser access may still work normally.

Contributing

We'd love your help! Please see our CONTRIBUTING.md for more information on how to get involved.

Frequently Asked Questions

Am I allowed to download Torah audio from these websites?

You are responsible for ensuring that you follow all Terms of Service agreements, Copyright agreements, and other legal agreements with these websites.

TODO: get a lawyer to review this.

How do I download audio from a site that is not on the list?

We'd love your help! Please see our CONTRIBUTING.md for more information on how to get involved.

What are the usecases for `torah-dl`?

Allowing transcription services to make Torah more accessible 😉

Other uses include downloading Torah audio for offline listening, or for use in Torah study tools, or for training AI models to understand Torah, or for other purposes (please see question above about permissions).

Contributors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torah_dl-0.6.0.tar.gz (169.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

torah_dl-0.6.0-py3-none-any.whl (35.3 kB view details)

Uploaded Python 3

File details

Details for the file torah_dl-0.6.0.tar.gz.

File metadata

  • Download URL: torah_dl-0.6.0.tar.gz
  • Upload date:
  • Size: 169.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for torah_dl-0.6.0.tar.gz
Algorithm Hash digest
SHA256 fb9ff769196389f2a074a9701885ef43176ae6cc0d47bf9426a1a56bda8ea929
MD5 4fc70ef60d675a1afa8905cb8ddd1b5d
BLAKE2b-256 04c920138f854e58ebf71e9e7f232eb3c9ca9aa0d83cea79116cc53bbb073479

See more details on using hashes here.

Provenance

The following attestation bundles were made for torah_dl-0.6.0.tar.gz:

Publisher: release.yml on SoferAi/torah-dl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file torah_dl-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: torah_dl-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 35.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for torah_dl-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6901c53dfaa832253c2b10d7fc2962deb7658382c2b6313d94af107384f58b85
MD5 48bdeb4cecdd3dffd619695526a6b4fe
BLAKE2b-256 e04a52360317c78cca4c62dd97548204fef5ca65353167a370432fe008bb32fb

See more details on using hashes here.

Provenance

The following attestation bundles were made for torah_dl-0.6.0-py3-none-any.whl:

Publisher: release.yml on SoferAi/torah-dl

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page