No project description provided
Project description
retsim-pytorch
Welcome to retsim-pytorch
, the PyTorch adaptation of Google's RETSim (Resilient and Efficient Text Similarity) model, which is part of the UniSim (Universal Similarity) framework.
This model is designed for efficient and accurate multilingual fuzzy string matching, near-duplicate detection, and assessing string similarity. For more information, please refer to the UniSim documentation.
Installation
You can easily install retsim-pytorch
via pip:
pip install retsim-pytorch
Usage
You can configure the model using the RETSimConfig
class. By default, it utilizes the same configuration as the original UniSim model. If you wish to use the same weights as the original Google model, you can download a SafeTensors port of the weights here.
Here's how to use the model in your code:
import torch
from retsim_pytorch import RETSim, RETSimConfig
from retsim_pytorch.preprocessing import binarize
# Configure the model
config = RETSimConfig()
model = RETSim(config)
# Prepare and run inference
binarized_inputs, chunk_ids = binarize(["hello world"])
embedded, unpooled = model(torch.tensor(binarized_inputs))
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file retsim_pytorch-0.1.1.tar.gz
.
File metadata
- Download URL: retsim_pytorch-0.1.1.tar.gz
- Upload date:
- Size: 15.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 440cb3e1f4ef8ac98cd03561d635540f9033b13b0065573d79d0ee733a54208e |
|
MD5 | 1227ae95e01028eaaa54ab13612b7e48 |
|
BLAKE2b-256 | 584f90ad9f97aeb9f3618e48ddd2b984fa5981c28d99e4f0a702ae8ce1ac8836 |
File details
Details for the file retsim_pytorch-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: retsim_pytorch-0.1.1-py3-none-any.whl
- Upload date:
- Size: 13.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 880095ecd2c59806985edecca2a96234ea3d3c9711bed06c3e3c684917d58b52 |
|
MD5 | 2fc01140d962abe6fc1f3c01a9aab1e3 |
|
BLAKE2b-256 | 567848db09ceeb243f195b46aa105cafb474f9862713f34b698517b7dd415fc2 |