Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup.
Project description
Installation
With pip
pip install fasttextsearch
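Note that the distribution name on PyPI (fasttextsearch) differs from the import name (textsearch). To check whether the pip-installed distribution is present in the current environment, a minimal sketch along these lines may help. The helper name installed_version is hypothetical, and the snippet assumes Python 3.8 or later, where importlib.metadata is part of the standard library.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None


# "fasttextsearch" is the name used with pip; the import name is "textsearch".
found = installed_version("fasttextsearch")
print(found if found is not None else "fasttextsearch is not installed via pip")
```

A result of None here does not rule out a developer build made visible through PYTHONPATH, since that route does not register any distribution metadata.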
For developers
pip install numpy
git clone https://github.com/danpovey/text_search
cd text_search
mkdir build
cd build
cmake ..
make -j
make test
# set PYTHONPATH so that you can use "import textsearch"
export PYTHONPATH=$PWD/../textsearch/python:$PWD/lib:$PYTHONPATH
You can now check that the module is importable:
python3 -c "import textsearch; print(textsearch.__file__)"
Caution: We did not use either python3 setup.py install or pip install. We only set the environment variable PYTHONPATH.
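Because the developer setup relies only on PYTHONPATH, it can be useful to see which copy of textsearch Python would actually pick up, without importing it. The following is a sketch using only the standard library; the helper name locate_module is hypothetical and is not part of textsearch.

```python
import importlib.util


def locate_module(name):
    """Return the file a top-level module would be loaded from, or None if not importable."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        # Nothing named `name` is reachable through sys.path (and hence PYTHONPATH).
        return None
    return spec.origin


path = locate_module("textsearch")
if path is None:
    print("textsearch not found; check that PYTHONPATH is set in this shell")
else:
    print("textsearch would be imported from:", path)
```

If the printed path points into the build tree rather than site-packages, the PYTHONPATH route is the one in effect.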