Skip to main content

A library for downloading pronunciation MP3 files from Google's dictionary service

Project description

Google Pronouncer

A Python library for downloading pronunciation MP3 files from Google's dictionary service.

Installation

pip install google-pronouncer

Command Line Usage

The package provides a simple command-line interface:

Download Pronunciations

# Download US pronunciation (default) - saves in current directory
google-pronouncer -d hello world

# Download British pronunciation
google-pronouncer -d hello -a gb

# Download both accents (automatically uses subdirectories)
google-pronouncer -d hello -a all

# Save files in a specific directory
google-pronouncer -d hello -o ./pronunciations

# Force using subdirectories even for single accent
google-pronouncer -d hello --use-subdirs

# Download from a file (one word per line)
google-pronouncer -f words.txt

# Force fresh download (ignore cache)
google-pronouncer -d hello --force-download

# Disable cache usage
google-pronouncer -d hello --no-cache

# Download with verbose logging
google-pronouncer -v -d hello world

File Organization

By default:

  • Files are saved in the current directory
  • Single accent downloads: Saved as word_accent.mp3
  • Multiple accent downloads: Organized in subdirectories as word/word_accent.mp3
  • Use -o to specify a different output directory
  • Use --use-subdirs to force subdirectory organization even for single accent downloads

Cache Management

# Show cache information for all words
google-pronouncer cache-info

# Show cache info for specific words
google-pronouncer cache-info hello world

# Clear all cache
google-pronouncer clear-cache

# Clear cache for specific words
google-pronouncer clear-cache hello world

Global Options

  -d, --download WORD   Download pronunciations for one or more words
  -f, --file FILE      File containing words to download (one word per line)
  -a, --accent {gb,us,all}  Accent to download (us=American, gb=British, all=both) (default: us)
  -j, --jobs N         Number of parallel downloads (default: 4)
  -o, --output-dir PATH  Directory to save pronunciations (default: ./pronunciations)
  -t, --timeout SECONDS  Request timeout in seconds (default: 10)
  -v, --verbose        Enable verbose logging
  --no-cache          Disable cache usage
  --force-download    Force download even if cached
  --use-subdirs       Use subdirectories for each word (default: only when downloading multiple accents)

Python Library Usage

Basic Usage

from google_pronouncer import GooglePronunciationDownloader, DownloadConfig, AccentType

# Create configuration (files will be saved in current directory)
config = DownloadConfig(
    output_dir=".",  # Current directory
    use_cache=True,  # Enable caching (default: True)
    force_download=False  # Force fresh download (default: False)
)

# Or specify a different output directory
config = DownloadConfig(
    output_dir="./pronunciations",
    use_cache=True,
    force_download=False
)

# Initialize downloader
downloader = GooglePronunciationDownloader(config)

# Download US pronunciation (default)
path = downloader.download_pronunciation("hello", AccentType.AMERICAN)
print(f"Downloaded to: {path}")

# Download British pronunciation
path = downloader.download_pronunciation("hello", AccentType.BRITISH)
print(f"Downloaded to: {path}")

# Download both accents for a word
paths = downloader.download_all_accents("world")
print(f"Downloaded files: {paths}")

# Download multiple words (US accent by default)
words = ["hello", "world", "python"]
paths = downloader.download_words(words)
print(f"Downloaded files: {paths}")

Cache Management

# Get cache information
info = downloader.get_cache_info()  # All words
info = downloader.get_cache_info("hello")  # Specific word

# Clear cache
downloader.clear_cache()  # All words
downloader.clear_cache("hello")  # Specific word

Configuration Options

config = DownloadConfig(
    output_dir="pronunciations",    # Directory to save files
    timeout=10,                     # Request timeout in seconds
    user_agent="Custom User Agent", # Optional custom user agent
    use_cache=True,                # Whether to use cached files
    min_file_size=1024,           # Minimum valid file size in bytes
    force_download=False          # Force fresh download
)

Available Accents

  • AccentType.BRITISH - British English pronunciation (gb)
  • AccentType.AMERICAN - American English pronunciation (us)

Error Handling

from google_pronouncer import DownloadError, CacheError

try:
    path = downloader.download_pronunciation("word", AccentType.BRITISH)
except DownloadError as e:
    print(f"Download failed: {e}")
except CacheError as e:
    print(f"Cache error: {e}")

Features

  • Download pronunciations in British and American English
  • Configurable output directory and request settings
  • Smart caching system with validation
  • Cache management tools (info, clear)
  • Command-line interface
  • Proper error handling and logging
  • Type hints for better IDE support
  • Support for downloading multiple words and accents
  • Clean and simple API

Cache System

The library includes a smart caching system that:

  • Automatically caches downloaded files
  • Validates cached files before use
  • Provides cache information and management
  • Supports forced fresh downloads
  • Includes file size validation
  • Maintains cache metadata

Requirements

  • Python 3.7+
  • requests library (>=2.31.0)

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

google_pronouncer-0.3.0.tar.gz (11.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

google_pronouncer-0.3.0-py3-none-any.whl (10.8 kB view details)

Uploaded Python 3

File details

Details for the file google_pronouncer-0.3.0.tar.gz.

File metadata

  • Download URL: google_pronouncer-0.3.0.tar.gz
  • Upload date:
  • Size: 11.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for google_pronouncer-0.3.0.tar.gz
Algorithm Hash digest
SHA256 0a0eff24d49cba2d8434f84d487f38a99227a7cc2b66dea2e6a240e1e69c9273
MD5 880fb2b49fb7e109c2963d5cbb0bd5bb
BLAKE2b-256 c4ee2c9e115b941bba0c12957e01622b94ad4045f39cd4367bfa3603d917d79f

See more details on using hashes here.

Provenance

The following attestation bundles were made for google_pronouncer-0.3.0.tar.gz:

Publisher: publish-to-pypi.yml on HachiroSan/google-pronouncer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file google_pronouncer-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for google_pronouncer-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2299362c41818461227a0f905e6f121faae7cfe4794d755eda492913a7555b2d
MD5 e48f9fefb23bd201510e5b78aa7a37a2
BLAKE2b-256 15eeb97a2b8fbc726b08f7d8df588165d07fbb73a3a9afe20b9a4b16aa0d261a

See more details on using hashes here.

Provenance

The following attestation bundles were made for google_pronouncer-0.3.0-py3-none-any.whl:

Publisher: publish-to-pypi.yml on HachiroSan/google-pronouncer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page