Phonetics algorithms (Soundex and Metaphone) for russian, english and finnish languages
Project description
# Fonetika
Russian, English and Finnish Phonetic algorithm based on Soundex/Metaphone.
Package has both implemented phoneme transformation into letter-number sequence and distance engine for comparison of phonetic sequences (based on Levenstein and Hamming distances).
### Quick start
1. Install this package via ```pip```
```python
pip install fonetika
```
2. Import Soundex algorithm.
Package supports a lot of opportunities, it's possible to cut a result sequence (like in the original Soundex version) or also code vowels.
```python
from fonetika.soundex import RussianSoundex
soundex = RussianSoundex(delete_first_letter=True)
soundex.transform('ёлочка')
...
J070530
soundex = Soundex(delete_first_letter=True, code_vowels=True)
soundex.transform('ёлочка')
...
JA7A53A
```
> A structure of the library is scalable, `RussianSoundex` class inherits basic class `Soundex` (original for English language). In order to extend our algorithm, you need just inherit own class from `Soundex` and override methods.
3. Import Soundex distance for usage of string comparision
```python
from fonetika.distance import PhoneticsDistance
soundex = RussianSoundex(delete_first_letter=True)
phon_distance = PhoneticsDistance(soundex)
phon_distance.distance('ёлочка', 'йолочка')
...
0
```
Russian, English and Finnish Phonetic algorithm based on Soundex/Metaphone.
Package has both implemented phoneme transformation into letter-number sequence and distance engine for comparison of phonetic sequences (based on Levenstein and Hamming distances).
### Quick start
1. Install this package via ```pip```
```python
pip install fonetika
```
2. Import Soundex algorithm.
Package supports a lot of opportunities, it's possible to cut a result sequence (like in the original Soundex version) or also code vowels.
```python
from fonetika.soundex import RussianSoundex
soundex = RussianSoundex(delete_first_letter=True)
soundex.transform('ёлочка')
...
J070530
soundex = Soundex(delete_first_letter=True, code_vowels=True)
soundex.transform('ёлочка')
...
JA7A53A
```
> A structure of the library is scalable, `RussianSoundex` class inherits basic class `Soundex` (original for English language). In order to extend our algorithm, you need just inherit own class from `Soundex` and override methods.
3. Import Soundex distance for usage of string comparision
```python
from fonetika.distance import PhoneticsDistance
soundex = RussianSoundex(delete_first_letter=True)
phon_distance = PhoneticsDistance(soundex)
phon_distance.distance('ёлочка', 'йолочка')
...
0
```
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fonetika-1.1.4.tar.gz
(5.0 kB
view details)
File details
Details for the file fonetika-1.1.4.tar.gz
.
File metadata
- Download URL: fonetika-1.1.4.tar.gz
- Upload date:
- Size: 5.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.18.4 setuptools/37.0.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
6817b1e411dae238611c4ce6d0cc8cb8bd1bfddf873bf4633d3e38918761658c
|
|
MD5 |
2f4705ec835dece8ead7eb6a1e150f70
|
|
BLAKE2b-256 |
3e465369a7924ea1345d40a4bfc380cfa26d396a04b4f9071b6acbbd5aa42ca5
|