Python library for performing string similarity joins.

Project description

py_stringsimjoin

This project seeks to build a Python software package that provides a scalable implementation of string similarity joins over two tables for commonly used similarity measures such as Jaccard, Dice, cosine, overlap, overlap coefficient, and edit distance. The package is free, open-source, and BSD-licensed.
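
To make the idea of a string similarity join concrete, here is a minimal pure-Python sketch of a Jaccard join over two small tables. It does not use py_stringsimjoin and is not how the package is implemented: it compares every pair of rows, which is the naive approach that a scalable implementation avoids. The function names (`whitespace_tokens`, `jaccard`, `naive_jaccard_join`) and the sample rows are hypothetical, chosen only for illustration.

```python
from itertools import product


def whitespace_tokens(text):
    """Lowercase a string and split it into a set of whitespace-delimited tokens."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity of two token sets: |a & b| / |a | b|."""
    if not a and not b:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(a & b) / len(a | b)


def naive_jaccard_join(left, right, threshold):
    """Return (left_id, right_id, score) for every pair scoring >= threshold.

    `left` and `right` are lists of (id, string) tuples. Every pair is
    compared, so the cost grows with len(left) * len(right).
    """
    results = []
    for (l_id, l_str), (r_id, r_str) in product(left, right):
        score = jaccard(whitespace_tokens(l_str), whitespace_tokens(r_str))
        if score >= threshold:
            results.append((l_id, r_id, score))
    return results


if __name__ == "__main__":
    table_a = [(1, "data base systems"), (2, "string matching")]
    table_b = [(10, "database systems"), (11, "approximate string matching")]
    for row in naive_jaccard_join(table_a, table_b, threshold=0.5):
        print(row)
```

With a threshold of 0.5, only the pair (2, 11) is returned: the two strings share two of three distinct tokens, while (1, 10) shares one of four.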

Dependencies

py_stringsimjoin has been tested on each Python version between 3.7 and 3.11, inclusive.

Building the package requires pandas 0.16.0 or higher, py_stringmatching 0.2.1 or higher, joblib, pyprind, six, and a C++ compiler. The development version additionally requires Cython.

Platforms

py_stringsimjoin has been tested on Linux, OS X and Windows.

Download files

Download the file for your platform.

Source Distribution

py_stringsimjoin_temp-0.3.3.tar.gz (1.4 MB)

Uploaded: Source

Built Distributions

py_stringsimjoin_temp-0.3.3-cp311-cp311-macosx_11_0_arm64.whl (2.0 MB)

Uploaded: CPython 3.11, macOS 11.0+ ARM64

py_stringsimjoin_temp-0.3.3-cp39-cp39-macosx_10_9_universal2.whl (2.6 MB)

Uploaded: CPython 3.9, macOS 10.9+ universal2 (ARM64, x86-64)
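
The wheel file names above follow the standard wheel naming scheme: distribution name, version, an optional build tag, Python tag, ABI tag, and platform tag, separated by hyphens. The sketch below splits a file name into those fields. The helper name `parse_wheel_filename` is hypothetical, and the parsing is deliberately simple: it assumes a well-formed name and does no validation of the individual tags.

```python
def parse_wheel_filename(filename):
    """Split a wheel file name into its hyphen-separated components.

    Expected shape: name-version[-build]-python-abi-platform.whl
    """
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel file name: {filename!r}")

    parts = filename[: -len(suffix)].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build_tag = None  # the build tag is optional and usually absent
    elif len(parts) == 6:
        name, version, build_tag, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")

    return {
        "name": name,
        "version": version,
        "build_tag": build_tag,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


if __name__ == "__main__":
    info = parse_wheel_filename(
        "py_stringsimjoin_temp-0.3.3-cp311-cp311-macosx_11_0_arm64.whl"
    )
    for key, value in info.items():
        print(f"{key}: {value}")
```

For the first wheel listed, the Python and ABI tags are both `cp311` (CPython 3.11) and the platform tag is `macosx_11_0_arm64`, which matches the "Uploaded" line beneath it.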

File details

Details for the file py_stringsimjoin_temp-0.3.3.tar.gz.

File metadata

  • Download URL: py_stringsimjoin_temp-0.3.3.tar.gz
  • Size: 1.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for py_stringsimjoin_temp-0.3.3.tar.gz
Algorithm Hash digest
SHA256 6a5060d3ad5b875f1e7253d132ac063ee61a9085950cea5cedb96d884ed6660e
MD5 55c43d0e671e1728e4be4e2050594204
BLAKE2b-256 b0e2f8ab533af8a238dd7cc5319d11c022e33a517d318cfa4e62ea052f261da4
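
One way to use a published digest is to recompute it over the downloaded file and compare. The sketch below does this for SHA256 with Python's standard `hashlib` module. The helper names (`sha256_of_file`, `matches_published_hash`) are hypothetical, and the local path in the usage section is an assumption about where the file was saved.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel keeps reading until read() returns b""
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed local path; adjust to wherever the archive was downloaded.
    archive = "py_stringsimjoin_temp-0.3.3.tar.gz"
    published = "6a5060d3ad5b875f1e7253d132ac063ee61a9085950cea5cedb96d884ed6660e"
    print("match" if matches_published_hash(archive, published) else "MISMATCH")
```

A mismatch means the local file differs from the one that was uploaded, so it should not be installed.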

File details

Details for the file py_stringsimjoin_temp-0.3.3-cp311-cp311-macosx_11_0_arm64.whl.

File hashes

Hashes for py_stringsimjoin_temp-0.3.3-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 37144717c02e3fb597d1af8bd3f8a5fcb58370b638f8eecbac9e94de0ca36bfa
MD5 2d5f7fef4f983737a2459a5e89cd7d9f
BLAKE2b-256 7f6c230066ba68e41144dbde57d152c862dc428d11e846c9a4c8bc470807ecfa

File details

Details for the file py_stringsimjoin_temp-0.3.3-cp39-cp39-macosx_10_9_universal2.whl.

File hashes

Hashes for py_stringsimjoin_temp-0.3.3-cp39-cp39-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 66c016dc3d811c54c0a9fc29c197e711d55e84299de897cb659f4602d89836a5
MD5 dfd207f1e48e41ec3d3b49c4a0861c50
BLAKE2b-256 fb1d2a2e5de1dbf56cd814e2ba66694f1a60dc0b48ac3cdc6b02369c4eac9eb7
