Skip to main content

Add your description here

Project description

scrape-forvo

Download pronunciation MP3s from Forvo search pages.

Installation

uv run python -m pip install -e .

Usage

Only this command is confirmed to work reliably:

scrape-forvo egg --use-playwright --headed

Scriptable Usage

You can also import scrape_forvo and use it from Python:

from scrape_forvo import scrape

result = scrape(
    "egg",
    outdir="forvo_mp3",
    use_playwright=True,
    headed=True,
)

print(result.downloaded_count)
for candidate in result.candidates:
    print(candidate.url, "->", candidate.out_path)

The scrape() arguments map directly to CLI flags, so both interfaces share the same behavior without duplicated logic. Internally, the search URL is built as https://forvo.com/search/<word>/no/.

Development

Set up the project virtual environment with uv:

uv sync

Then run commands from the environment:

source .venv/bin/activate

Install dev dependencies:

python -m pip install -e .[dev]

Run tests:

pytest

Optional live test

Set FORVO_LIVE_TEST=1 to enable the live integration test.

TODO

edge cases

  • when multiple pronunciation files come out. which one to pick?
  • when there's no pronunciation available.

integration

  • integration with the vocab repo

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrape_forvo-1.0.0.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scrape_forvo-1.0.0-py3-none-any.whl (9.5 kB view details)

Uploaded Python 3

File details

Details for the file scrape_forvo-1.0.0.tar.gz.

File metadata

  • Download URL: scrape_forvo-1.0.0.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for scrape_forvo-1.0.0.tar.gz
Algorithm Hash digest
SHA256 7e69f3d30bdf84821b44242e038816ab0baa115336a871760b7430f0aec223cd
MD5 282c4a3f7246b5c3f1a9e826f06626db
BLAKE2b-256 0b2f1ecdbad938faf122290886fc3363068d3f3cec8362e13a27bdd70faeae3a

See more details on using hashes here.

File details

Details for the file scrape_forvo-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: scrape_forvo-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 9.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for scrape_forvo-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1903946ad12882c61077aefa2d714fc47b9444040fcf09c1d3b4d8634e9c764b
MD5 e75c831df330a4747c47c3dae72b2189
BLAKE2b-256 c20ff9ddcabad8fc4933b1dfad973bde3fd3b9347e895221231a078d4839f4ac

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page