Utilities for dataset similarity and joins
Project description
Jellyjoin Python Package
"Jellyjoin: the softest of joins."
Join dataframes or lists based on semantic similarity.
Author: Oran Looney License: MIT Year: 2025
Installation
pip install jellyjoin
Usage
import jellyjoin
jelly_df = jellyjoin.jellyjoin(
left_df,
right_df,
left_column="Column Name",
right_column="Other Column Name",
threshold=0.7,
similarity_strategy=jellyjoin.PairwiseSimilarity(),
)
print(jelly_df)
Development
To set up a development environment:
git clone https://github.com/<your-username>/jellyjoin.git
cd jellyjoin
pip install -e .[dev]
Run tests:
pytest
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
jellyjoin-0.1.2.tar.gz
(9.0 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file jellyjoin-0.1.2.tar.gz.
File metadata
- Download URL: jellyjoin-0.1.2.tar.gz
- Upload date:
- Size: 9.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5354708f0e48c6f51bb1881a6a6ee29a09fafbb8bca3dc1556a5851d200084df
|
|
| MD5 |
96317bf7c747003599292bdbe43b0f23
|
|
| BLAKE2b-256 |
aff3c5446f014b9dc5d5c250564301110baa4b7f7b8d486400288b133215dc12
|
File details
Details for the file jellyjoin-0.1.2-py3-none-any.whl.
File metadata
- Download URL: jellyjoin-0.1.2-py3-none-any.whl
- Upload date:
- Size: 7.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
eabfd071e6862d747f89c0383a54d86abbe9b666e52a58f6120b5a7327bfde82
|
|
| MD5 |
13ca74f59de6f84c3777609b611333fe
|
|
| BLAKE2b-256 |
7f88e9ce97f0d721cea59bf31c47a71d589eb8586a17030a107ac87f897f5834
|