Utilities for dataset similarity and joins
Jellyjoin Python Package
"Jellyjoin: the softest of joins."
Join dataframes or lists based on semantic similarity.
Author: Oran Looney
License: MIT
Year: 2025
Installation
pip install jellyjoin
Usage
import jellyjoin

# left_df and right_df are the two dataframes to join (defined elsewhere).
jelly_df = jellyjoin.jellyjoin(
    left_df,
    right_df,
    left_column="Column Name",
    right_column="Other Column Name",
    threshold=0.7,
    similarity_strategy=jellyjoin.PairwiseSimilarity(),
)

print(jelly_df)
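To make the idea of a threshold-based similarity join concrete, here is a minimal standalone sketch. It does not use jellyjoin and is not how the package is implemented: `soft_join` is a hypothetical helper, and it substitutes character-level similarity from the standard library's `difflib.SequenceMatcher` for semantic similarity. It also assumes that `threshold` acts as a minimum similarity score, which the usage above suggests but does not state.

```python
from difflib import SequenceMatcher


def soft_join(left, right, threshold=0.7):
    """Pair items from two lists whose similarity meets the threshold.

    Hypothetical helper for illustration only; not part of jellyjoin.
    """
    matches = []
    for left_item in left:
        for right_item in right:
            # Character-level similarity in [0, 1], compared case-insensitively.
            score = SequenceMatcher(
                None, left_item.lower(), right_item.lower()
            ).ratio()
            if score >= threshold:
                matches.append((left_item, right_item, round(score, 3)))
    return matches


left = ["Apple Inc.", "Microsoft Corporation"]
right = ["apple inc", "Microsoft Corp", "Alphabet"]

pairs = soft_join(left, right, threshold=0.7)
for left_item, right_item, score in pairs:
    print(f"{left_item!r} ~ {right_item!r} (score={score})")
```

Raising the threshold keeps only closer matches; lowering it admits looser ones at the cost of more false positives.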
Development
To set up a development environment:
git clone https://github.com/<your-username>/jellyjoin.git
cd jellyjoin
pip install -e ".[dev]"
Run tests:
pytest
Download files
Source Distribution
jellyjoin-0.1.1.tar.gz (7.1 kB)
Built Distribution
jellyjoin-0.1.1-py3-none-any.whl (7.2 kB)
File details
Details for the file jellyjoin-0.1.1.tar.gz.
File metadata
- Download URL: jellyjoin-0.1.1.tar.gz
- Upload date:
- Size: 7.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 35f73589da9c80f66cef7c28121214af949d5c501233f4f874aea8fc90496c7b |
| MD5 | eea545a994df961e846d4f2deee3ccac |
| BLAKE2b-256 | 39f32853f18005078218c8174071d1974ded612180714f1f4af165aa9fdfbfc3 |
File details
Details for the file jellyjoin-0.1.1-py3-none-any.whl.
File metadata
- Download URL: jellyjoin-0.1.1-py3-none-any.whl
- Upload date:
- Size: 7.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 6863e306dbf6f0525f8315c512b833295bc178bf4009b10f28de8a15121e2ce7 |
| MD5 | 1a0595236f0be283ea7d321ddc3d63d3 |
| BLAKE2b-256 | e835c3fb98426d1f6fb497a03b36f5a7aff2702753826e06df9c59ffc62e76b2 |
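If you download one of these files manually, you can check it against the SHA256 digest listed above. The following is a minimal sketch using only the standard library. The helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local file path is whatever location you saved the download to.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks until an empty bytes object signals EOF.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("jellyjoin-0.1.1.tar.gz", "35f73589da9c80f66cef7c28121214af949d5c501233f4f874aea8fc90496c7b")` should return `True` for an unmodified copy of the source distribution, assuming the file sits in the current directory.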