A Python library for near deduplication and record linkage.

Introduction

Liken is a library providing enhanced deduplication tooling for DataFrames.

The key features are:

  • Near deduplication
  • Ready-to-use deduplication methods
  • Record linkage and canonicalization
  • Rules-based deduplication
  • Pandas, Polars and PySpark support
  • Customizable in pure Python

A flexible API

Check out the API Documentation.

Installation

pip install liken
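
To confirm the install succeeded, one option is to query the installed package metadata from Python. This is a minimal sketch using only the standard library; the helper name `installed_version` is an assumption for illustration and is not part of Liken.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints the installed Liken version, or None if the install did not succeed.
print(installed_version("liken"))
```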

Example

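import liken as lk

# `df` is an existing pandas, Polars, or PySpark DataFrame.
# Apply fuzzy matching, then drop near duplicates on the "address" column.
df = lk.dedupe(df).apply(lk.fuzzy()).drop_duplicates("address").collect()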
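
For readers new to the idea, the sketch below illustrates what near deduplication on a single column means. It uses only the standard library and does not call Liken, nor does it claim to reflect how `lk.fuzzy()` works internally. The helper names `similarity` and `drop_near_duplicates`, the 0.85 threshold, and the sample rows are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two stripped, lowercased strings."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def drop_near_duplicates(rows, key, threshold=0.85):
    """Keep the first row of each group whose `key` values are near-identical.

    Hypothetical helper for illustration only: a greedy O(n^2) pass that
    compares each row against the rows already kept.
    """
    kept = []
    for row in rows:
        if not any(similarity(row[key], k[key]) >= threshold for k in kept):
            kept.append(row)
    return kept


# Made-up sample data: rows 1 and 2 differ only by an abbreviation.
rows = [
    {"id": 1, "address": "12 Baker Street, London"},
    {"id": 2, "address": "12 Baker St, London"},
    {"id": 3, "address": "7 Elm Road, Leeds"},
]

deduped = drop_near_duplicates(rows, "address")
print([row["id"] for row in deduped])
```

Exact deduplication would keep all three rows here, since no two address strings are identical. The similarity threshold is what lets the abbreviated address be treated as a duplicate of the first.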

License

This project is licensed under the Apache-2.0 License. See the LICENSE file for more details.
