addok-fr
Addok plugin for improved French geocoding with phonetic normalization and French-specific synonym handling.
Features
- Phonetic normalization: French-optimized phonetic processor that handles typos and spelling variations in address searches
- French synonyms: Pre-configured synonym database for common French address terms (rue, avenue, boulevard, etc.)
Installation
pip install addok-fr
Configuration
Enable the phonemicize processor
Add addok_fr.phonemicize to PROCESSORS_PYPATHS in your Addok configuration file:
PROCESSORS_PYPATHS = [
…,
'addok_fr.phonemicize'
]
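If the processor list is built up in code rather than written out by hand, the same change can be sketched as below. This is only an illustration: the helper `with_phonemicize` and the two placeholder processor paths are hypothetical and not part of Addok or addok-fr; only 'addok_fr.phonemicize' comes from this README.

```python
def with_phonemicize(processors: list[str]) -> list[str]:
    """Return a copy of the processor list with the French phonetic
    processor appended last, without adding it twice."""
    path = "addok_fr.phonemicize"
    if path in processors:
        return list(processors)
    return [*processors, path]


# Placeholder entries standing in for whatever your configuration already lists.
existing = ["example.processor_a", "example.processor_b"]

PROCESSORS_PYPATHS = with_phonemicize(existing)
print(PROCESSORS_PYPATHS)
```

Appending at the end mirrors the position shown in the configuration snippet above.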
Cache configuration (optional)
The phonemicize processor uses an LRU cache to improve performance. By default, the cache holds up to 500,000 entries (~86 MB of memory), enough for approximately 500,000 unique words.
You can adjust the cache size in your Addok configuration file:
PHONEMICIZE_CACHE_SIZE = 500_000 # Default value
Recommendations:
- 500K entries (~86 MB): Default, suitable for most French address datasets
- 1M entries (~172 MB): For larger datasets with more unique words
- 250K entries (~43 MB): For memory-constrained environments
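The figures above work out to roughly 172 bytes per cached entry. For other cache sizes, a rough estimate can be sketched as follows, assuming memory grows linearly with the number of entries (actual usage depends on word lengths and the Python build). The function name `estimate_cache_memory_mb` is made up for this example.

```python
# Assumption: linear scaling at the per-entry cost implied by the
# documented default (~86 MB for 500,000 entries).
BYTES_PER_ENTRY = 86_000_000 / 500_000  # ~172 bytes per cached word


def estimate_cache_memory_mb(entries: int) -> float:
    """Rough memory estimate, in MB, for a given PHONEMICIZE_CACHE_SIZE."""
    return entries * BYTES_PER_ENTRY / 1_000_000


for size in (250_000, 500_000, 1_000_000):
    print(f"{size:>9,} entries -> ~{estimate_cache_memory_mb(size):.0f} MB")
```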
The cache uses an LRU (Least Recently Used) eviction strategy: once the maximum size is reached, the least recently used words are evicted first, so words that keep appearing in searches remain cached.
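To see LRU eviction in action, here is a minimal sketch using Python's standard `functools.lru_cache` with a deliberately tiny size. It is not the plugin's implementation: `fake_phonemicize` is a hypothetical stand-in that only lowercases its input.

```python
from functools import lru_cache


@lru_cache(maxsize=2)  # tiny size so eviction is easy to observe
def fake_phonemicize(word: str) -> str:
    # Hypothetical stand-in, NOT the plugin's phonetic algorithm.
    return word.lower()


fake_phonemicize("Rue")        # miss: computed and cached
fake_phonemicize("Avenue")     # miss: cache is now full
fake_phonemicize("Rue")        # hit: "Rue" becomes the most recently used
fake_phonemicize("Boulevard")  # miss: evicts "Avenue", the least recently used
fake_phonemicize("Rue")        # hit: still cached
fake_phonemicize("Avenue")     # miss: it was evicted earlier

info = fake_phonemicize.cache_info()
print(info)
```

Words that are looked up again and again, such as common street types, tend to stay in the cache, while rarely seen words are the ones that get evicted.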