addok-fr

Addok plugin for improved French geocoding with phonetic normalization and French-specific synonym handling.

Features

  • Phonetic normalization: French-optimized phonetic processor that handles typos and spelling variations in address searches
  • French synonyms: Pre-configured synonym database for common French address terms (rue, avenue, boulevard, etc.)

Installation

pip install addok-fr

Configuration

Enable the phonemicize processor

Add phonemicize to PROCESSORS_PYPATHS, keeping your existing processors ahead of it:

PROCESSORS_PYPATHS = [
    # ... your existing processors ...
    'addok_fr.phonemicize',
]

Cache configuration (optional)

The phonemicize processor uses an LRU cache to improve performance. By default, the cache holds up to 500,000 entries (~86 MB of memory), one entry per unique word.

You can adjust the cache size in your Addok configuration file:

PHONEMICIZE_CACHE_SIZE = 500_000  # Default value

Recommendations:

  • 500K entries (~86 MB): Default, suitable for most French address datasets
  • 1M entries (~172 MB): For larger datasets with more unique words
  • 250K entries (~43 MB): For memory-constrained environments
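
The three recommendations above all work out to roughly 172 bytes per entry (86 MB / 500,000). As a rough sketch, assuming that ratio stays constant and that 1 MB means 1,000,000 bytes, the memory needed for another cache size can be estimated like this. The helper names are illustrative and are not part of addok-fr:

```python
# Approximate per-entry cost implied by the figures above
# (86 MB / 500,000 entries). This is an estimate, not a measured value.
BYTES_PER_ENTRY = 86_000_000 / 500_000  # 172 bytes


def estimate_cache_mb(entries: int) -> float:
    """Return the approximate cache memory in MB for a given entry count."""
    return entries * BYTES_PER_ENTRY / 1_000_000


def entries_for_budget(budget_mb: float) -> int:
    """Return roughly how many entries fit in a memory budget given in MB."""
    return int(budget_mb * 1_000_000 / BYTES_PER_ENTRY)


if __name__ == "__main__":
    for size in (250_000, 500_000, 1_000_000):
        print(f"{size:>9,} entries ~ {estimate_cache_mb(size):.0f} MB")
```

The result of entries_for_budget can then be used as the value of PHONEMICIZE_CACHE_SIZE. Actual memory use depends on word lengths and the Python version, so treat these numbers as a starting point.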

The cache uses an LRU (Least Recently Used) eviction strategy: once the maximum size is reached, the entry that has gone unused the longest is evicted first, so words that keep appearing in searches stay cached.
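
To illustrate how LRU eviction behaves, here is a minimal sketch using functools.lru_cache from the Python standard library. It is not taken from addok-fr, whose internal cache implementation is not described here. The process function is a hypothetical stand-in for an expensive per-word computation, and the cache size is set to 2 only to make eviction easy to observe:

```python
from functools import lru_cache

calls = []  # records every cache miss, in order


@lru_cache(maxsize=2)  # tiny size so eviction is easy to observe
def process(word: str) -> str:
    """Hypothetical stand-in for an expensive per-word computation."""
    calls.append(word)
    return word.upper()


process("rue")      # miss: cache holds rue
process("avenue")   # miss: cache holds rue, avenue
process("rue")      # hit: "rue" becomes the most recently used entry
process("chemin")   # miss: evicts "avenue", the least recently used entry
process("rue")      # hit: still cached
process("avenue")   # miss: it was evicted earlier, so it is recomputed

info = process.cache_info()
print(calls)                    # words that had to be computed
print(info.hits, info.misses)   # cache statistics
```

A word that is looked up regularly, like "rue" here, is never evicted, while a word that has not been seen for a while makes room for new ones.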
