Implementation of the ARCoder Transliterated Arabic Name Matching Encoding Algorithm
Project description
ARCoder
arcoder.py implements an abstract base class called Encoder
along with two concrete
implementations: ARCoder
and Holmes
. Each has an encode
method that takes a
string and encodes it into a series of symbols designed to be used for similarity
measurements.
>>> from arcoder import ARCoder, Holmes
>>> a = ARCoder()
>>> a.encode("Sohaib")
['suhaeb', 'suhib']
>>> h = Holmes()
>>> h.encode("Sohaib")
['sohayb']
The ARCoder algorithm is described more fully in Moore, J., Hamid, S., and Bromberger, S.: "An Evaluation of Transliterated Arabic Name Matching Methods".
The Holmes implementation is derived from Holmes, D., Kashfi, S., Aqeel, S. U.: "Transliterated arabic name search", Communications, Internet, and Information Technology, pp. 267-273. (2004).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file arcoder-0.1.0.tar.gz
.
File metadata
- Download URL: arcoder-0.1.0.tar.gz
- Upload date:
- Size: 4.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
f454aa5cd9bc61b2bdc5af84aaad130401ce0bb778b3da543a67179c3a376a55
|
|
MD5 |
a63c77099a5e8e7d4c5c17e95cb4e9ff
|
|
BLAKE2b-256 |
2d268e96ec7e8b4489857b31766822884420af91fd432623684793b0ba5865c8
|
File details
Details for the file arcoder-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: arcoder-0.1.0-py3-none-any.whl
- Upload date:
- Size: 5.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
774a3d918e8bb9b4f40f1768c2fed211e667cefb6f07ae18d515b1aeac319454
|
|
MD5 |
e43897a9ad0a4fcbea7a31ebb21b2972
|
|
BLAKE2b-256 |
07eacbd3f54f4a0f9b5f043777dac748bc092d1e92052e6c01c31891eb52dc0e
|