scrape-forvo

Download pronunciation MP3s from Forvo search pages.

Installation

python -m pip install -e .

Usage

So far, only the following invocation, which uses Playwright with a headed (visible) browser window, is confirmed to work reliably:

scrape-forvo https://forvo.com/search/egg/no/ --use-playwright --headed

Scriptable Usage

You can also import scrape_forvo and use it from Python:

from scrape_forvo import scrape

result = scrape(
    "https://forvo.com/search/egg/no/",
    outdir="forvo_mp3",
    use_playwright=True,
    headed=True,
)

print(result.downloaded_count)
for candidate in result.candidates:
    print(candidate.url, "->", candidate.out_path)

The scrape() arguments map directly to CLI flags, so both interfaces share the same behavior without duplicated logic.
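
One way such a mapping could be wired is a thin argparse wrapper that forwards the parsed options straight into scrape(). The sketch below is hypothetical and is not the package's actual CLI code: --use-playwright and --headed appear in the usage example above, while the --outdir flag, its default, and the build_parser and main names are assumptions made for illustration. A stub stands in for scrape() so the block runs without network access.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser; only --use-playwright and --headed are confirmed flags."""
    parser = argparse.ArgumentParser(prog="scrape-forvo")
    parser.add_argument("url", help="Forvo search page URL")
    parser.add_argument("--outdir", default="forvo_mp3")  # assumed flag and default
    parser.add_argument("--use-playwright", action="store_true")
    parser.add_argument("--headed", action="store_true")
    return parser


def main(argv, scrape_fn):
    """Parse argv and forward every option to scrape_fn as a keyword argument."""
    args = build_parser().parse_args(argv)
    # argparse stores --use-playwright under the attribute name use_playwright,
    # so the namespace keys already match the keyword names used by scrape().
    options = dict(vars(args))
    url = options.pop("url")
    return scrape_fn(url, **options)


def fake_scrape(url, **kwargs):
    """Stand-in for scrape_forvo.scrape that just echoes what it received."""
    return url, kwargs


url, kwargs = main(
    ["https://forvo.com/search/egg/no/", "--use-playwright", "--headed"],
    fake_scrape,
)
print(url, kwargs)
```

Because the parser's destination names match the keyword names, the CLI layer needs no per-option glue; adding an option means adding one add_argument call and one parameter.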

Development

Install dev dependencies:

python -m pip install -e ".[dev]"

Run tests:

pytest

Optional live test

Set FORVO_LIVE_TEST=1 to enable the live integration test.
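
A common way to implement this kind of opt-in is to check the environment variable and skip the test unless it is set. The helper below is a sketch under that assumption; the function name is hypothetical, and the project's tests may gate the live test differently (for example, by accepting other truthy values).

```python
import os


def live_test_enabled(environ=None) -> bool:
    """Return True only when FORVO_LIVE_TEST is set to exactly "1".

    Hypothetical helper: the real test suite may check the variable differently.
    """
    env = os.environ if environ is None else environ
    return env.get("FORVO_LIVE_TEST") == "1"


print(live_test_enabled({"FORVO_LIVE_TEST": "1"}))
print(live_test_enabled({}))
```

In a pytest suite, a helper like this would typically feed a pytest.mark.skipif(not live_test_enabled(), reason="...") decorator on the live test, so a plain pytest run never touches the network.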

TODO

Edge cases

  • When a search returns multiple pronunciation files, which one should be picked?
  • When no pronunciation is available, how should that be handled?
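
Until these cases are decided, one possible interim policy is to take the first candidate in page order and to treat an empty result as "nothing to download" rather than an error. The sketch below only illustrates that policy; the Candidate dataclass mirrors the url and out_path attributes used in the scripting example, but its definition and the pick_candidate name are assumptions, not the package's API.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Candidate:
    """Assumed shape, based on the attributes printed in the usage example."""
    url: str
    out_path: str


def pick_candidate(candidates: Sequence[Candidate]) -> Optional[Candidate]:
    """Return the first candidate, or None when no pronunciation was found."""
    return candidates[0] if candidates else None


# Placeholder URLs for illustration only.
found = [
    Candidate("https://example.invalid/a.mp3", "forvo_mp3/a.mp3"),
    Candidate("https://example.invalid/b.mp3", "forvo_mp3/b.mp3"),
]
print(pick_candidate(found))
print(pick_candidate([]))
```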

Integration

  • Integration with the vocab repo
