Skip to main content

A Python library for near deduplication and record linkage.

Project description

PyPI Version PyPI - Python Version PyPI Downloads Tests Coverage License

Introduction

Liken is a library providing enhanced deduplication tooling for DataFrames.

The key features are:

  • Near deduplication
  • Ready-to-use deduplication methods
  • Record linkage and canonicalization
  • Rules-based deduplication
  • Pandas, Polars and PySpark support
  • Customizable in pure Python

A flexible API

Checkout the API Documentation

Installation

pip install liken

Example

import liken as lk

df = lk.dedupe(df).apply(lk.fuzzy()).drop_duplicates("address").collect()

License

This project is licensed under the Apache-2.0 License. See the LICENSE file for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

liken-0.7.3.tar.gz (31.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

liken-0.7.3-py3-none-any.whl (37.5 kB view details)

Uploaded Python 3

File details

Details for the file liken-0.7.3.tar.gz.

File metadata

  • Download URL: liken-0.7.3.tar.gz
  • Upload date:
  • Size: 31.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for liken-0.7.3.tar.gz
Algorithm Hash digest
SHA256 984f9876d32d9c0651d1dc09799320bb82a56393b0c3595f95f6c002c5df6f3e
MD5 ee26b6d35063e4ccd0ec8caf4cc96b72
BLAKE2b-256 a6290d443f5dbeb253943accaa4190bfaeea0cf53265d01984b3fbc8b5330e4f

See more details on using hashes here.

File details

Details for the file liken-0.7.3-py3-none-any.whl.

File metadata

  • Download URL: liken-0.7.3-py3-none-any.whl
  • Upload date:
  • Size: 37.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for liken-0.7.3-py3-none-any.whl
Algorithm Hash digest
SHA256 d815d83a36ed07e11cb1e5f23b818c67f466ef99630f6c5939bc142fabbcf1bd
MD5 3291d7b271bfee95cb4c5832701dd55e
BLAKE2b-256 8bc4f7d13eab5409feb25b0748afe7eb781839130a5a487df542e288764c9ec9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page