Implementation of the Polyphon phonetic algorithm (PyO3 core wrapper)

About

This is an efficient implementation of the Polyphon phonetic algorithm.

Original paper: «Polyphon: An Algorithm for Phonetic String Matching in Russian Language».

Authors: Viacheslav V. Paramonov, Alexey O. Shigarov, Gennady M. Ruzhnikov, Polina V. Belykh.

We propose a new phonetic algorithm for string matching in the Russian language without transliteration from Cyrillic to Latin characters. It is based on the rules of sound formation in the Russian language.

Usage

Add the dependency:

$ pip install polyphon

And then use:

from polyphon import encode

print(encode("литие"))  # лата
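Phonetic codes are usually compared rather than read: two spellings are treated as a match when their codes are equal. The following is a minimal sketch of that idea, not part of the package. The helper name `group_by_code` is hypothetical, and the encoder is passed in as a parameter so the snippet does not assume anything about `polyphon` beyond a function that maps one word to one code.

```python
from typing import Callable, Dict, Iterable, List


def group_by_code(
    words: Iterable[str],
    encode_word: Callable[[str], str],
) -> Dict[str, List[str]]:
    """Group words whose phonetic codes are equal.

    `encode_word` is any single-word encoder (for example, the `encode`
    function imported from polyphon). This helper is illustrative only.
    """
    groups: Dict[str, List[str]] = {}
    for word in words:
        # Words that share a code end up in the same bucket.
        groups.setdefault(encode_word(word), []).append(word)
    return groups
```

With the package installed, calling `group_by_code(names, encode)` should return a dictionary keyed by phonetic code; any bucket with more than one entry holds candidate phonetic duplicates.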

Note: encode works on a single word and removes any non-Russian characters (including spaces). If you want to encode multiple words, split them first and encode each separately.
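The splitting step described in the note can be sketched as follows. This is an assumption-laden example rather than package functionality: the helper `encode_words` is hypothetical, the encoder is injected as a parameter, and words are taken to be maximal runs of Russian letters, which means hyphenated words are split into their parts.

```python
import re
from typing import Callable, List

# Assumption: a "word" is a maximal run of Russian letters (including Ё/ё).
CYRILLIC_WORD = re.compile(r"[А-Яа-яЁё]+")


def encode_words(text: str, encode_word: Callable[[str], str]) -> List[str]:
    """Split `text` into Russian words and encode each one separately.

    `encode_word` is any single-word encoder, such as polyphon's `encode`.
    Punctuation, digits, and Latin characters act as separators.
    """
    return [encode_word(word) for word in CYRILLIC_WORD.findall(text)]
```

With the package installed, `encode_words(phrase, encode)` would yield one code per word, in the order the words appear in the phrase.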

This package is a wrapper around a Rust implementation; the source code is in the project repository.

Source Distribution

polyphon-1.0.0.tar.gz (3.4 kB)

Uploaded Source

Built Distribution

polyphon-1.0.0-cp37-abi3-win_amd64.whl (181.8 kB)

Uploaded CPython 3.7+, Windows x86-64

