Identity resolution library built on Splink and DuckDB.

Project description

ID Grapher

span_data is an identity resolution library built on Splink and DuckDB. It links records that refer to the same entity and produces a unified ID graph.

Core Features

  • Ingest source data from pandas DataFrames, CSV files, and Snowflake tables or views.
  • Clean and normalize source fields using FieldDefinitionMap.
  • Resolve entities into profile graphs with Splink-based matching.
  • Export graph results to DataFrames, CSV, Parquet, DuckDB relations, and Snowflake.
  • Apply manual override workflows for merge, split, and assign operations.
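As a rough illustration of what "resolving entities into profile graphs" means, the sketch below groups record IDs into entities once pairwise match decisions are already known. It is not span_data's API and does not reproduce its Splink-based matching: `build_id_graph`, the record IDs, and the match list are all hypothetical, and the grouping uses a plain union-find (connected components) as a stand-in.

```python
from collections import defaultdict


def build_id_graph(record_ids, matched_pairs):
    """Group record IDs into entities from pairwise match decisions.

    Hypothetical helper for illustration only; not part of span_data.
    """
    parent = {rid: rid for rid in record_ids}

    def find(rid):
        # Walk up to the root, shortening the path as we go.
        while parent[rid] != rid:
            parent[rid] = parent[parent[rid]]
            rid = parent[rid]
        return rid

    for left, right in matched_pairs:
        root_left, root_right = find(left), find(right)
        if root_left != root_right:
            # Keep the smaller ID as the root so the output is deterministic.
            if root_right < root_left:
                root_left, root_right = root_right, root_left
            parent[root_right] = root_left

    clusters = defaultdict(list)
    for rid in record_ids:
        clusters[find(rid)].append(rid)
    return {root: sorted(members) for root, members in clusters.items()}


# Made-up records from three sources and made-up match decisions.
records = ["crm:1", "crm:2", "web:7", "web:9", "pos:4"]
matches = [("crm:1", "web:7"), ("web:7", "pos:4"), ("crm:2", "web:9")]
graph = build_id_graph(records, matches)
print(graph)
```

Each key in the result acts as a unified entity ID and each value lists the source records linked to it. Records linked only transitively (here `crm:1` and `pos:4`, via `web:7`) still end up in the same entity.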

Python Support

This package currently supports Python 3.11 only (>=3.11,<3.12).
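Because the supported range is narrow, it can help to check the interpreter before installing. The following is a minimal sketch using only the standard library; `is_supported` is a hypothetical helper, not something span_data provides.

```python
import sys

SUPPORTED_MIN = (3, 11)  # inclusive lower bound, from ">=3.11"
SUPPORTED_MAX = (3, 12)  # exclusive upper bound, from "<3.12"


def is_supported(version_info=None):
    """Return True if the (major, minor) version falls in the supported range."""
    if version_info is None:
        version_info = sys.version_info
    major_minor = tuple(version_info[:2])
    return SUPPORTED_MIN <= major_minor < SUPPORTED_MAX


if is_supported():
    print("This interpreter is in the supported range.")
else:
    print("This interpreter is outside the supported range (needs Python 3.11).")
```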

Usage

See SPAN Product Documentation for usage instructions.

License

This project is licensed under the BSD 3-Clause License. See LICENSE.

Third-party dependency license information is documented in THIRD_PARTY_NOTICES.md.

Release history is documented in CHANGELOG.md.

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

span_data-0.1.3-cp311-cp311-win_amd64.whl (287.2 kB)

Tags: CPython 3.11, Windows x86-64

span_data-0.1.3-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl (365.3 kB)

Tags: CPython 3.11, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64

span_data-0.1.3-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl (349.0 kB)

Tags: CPython 3.11, manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64

span_data-0.1.3-cp311-cp311-macosx_11_0_arm64.whl (270.6 kB)

Tags: CPython 3.11, macOS 11.0+ ARM64

span_data-0.1.3-cp311-cp311-macosx_10_9_x86_64.whl (284.5 kB)

Tags: CPython 3.11, macOS 10.9+ x86-64
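The wheel file names above follow the standard `name-version-pythontag-abitag-platformtag.whl` layout, with several platform tags joined by dots. The sketch below splits a file name into those parts; `parse_wheel_name` is a hypothetical helper written for this illustration, and for real tooling the `packaging` library's `parse_wheel_filename` is likely the better choice.

```python
def parse_wheel_name(filename):
    """Split a wheel file name into its components (simplified sketch)."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file: {filename}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between the version and the Python tag.
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected wheel file name: {filename}")
    return {
        "name": name,
        "version": version,
        "build": build,
        "python": python_tag,
        "abi": abi_tag,
        # Compressed tag sets list several platforms separated by dots.
        "platforms": platform_tag.split("."),
    }


info = parse_wheel_name(
    "span_data-0.1.3-cp311-cp311-"
    "manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl"
)
print(info["python"], info["platforms"])
```

Here `cp311` appears twice because both the Python tag and the ABI tag target CPython 3.11.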

File details

Details for the file span_data-0.1.3-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: span_data-0.1.3-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 287.2 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for span_data-0.1.3-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 62cecb13bc6a8e47258bbe780567f710c101a48f8afeebfc2d5376ed719e1a5f
MD5 ae6b14cafe2531f42b483865111f80ac
BLAKE2b-256 e85870b4fc595d60676fc823b6ce041d764109d5ae96b7948c89546832660409

See more details on using hashes here.
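One way to use the published digests is to hash a downloaded wheel locally and compare the result with the SHA256 value listed for that file. The following is a minimal sketch with the standard library only; `sha256_of` and `matches_published` are hypothetical helper names, and the file path is whatever location the wheel was saved to.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

Calling `matches_published` with the path of the downloaded wheel and the SHA256 value listed for that same file should return `True`; a `False` result suggests the download is corrupted or is not the published file.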

File details

Details for the file span_data-0.1.3-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl.

File hashes

Hashes for span_data-0.1.3-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b5e75b98a94385922b51d8c36cdcad08308764070f3af9f62ab0dabfb418c81f
MD5 14ab7d120e918a9579680f99d2b29fed
BLAKE2b-256 c4095ae1589ec5915ffc767a2062454ae155945c136735da775198ea081d928e

File details

Details for the file span_data-0.1.3-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl.

File hashes

Hashes for span_data-0.1.3-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a9c691b337186a359bcd87e5418f9ea6919fdc9bf13688f3b3bb98a06d63aacd
MD5 a52dd87d622066840b8f8528eaf97ff9
BLAKE2b-256 46e5aa8c1b6d2225e90b83c78f3ff5b73c36e1ffddaf04b9f8237607cffcb80c

File details

Details for the file span_data-0.1.3-cp311-cp311-macosx_11_0_arm64.whl.

File hashes

Hashes for span_data-0.1.3-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 a6d49608bdca676643bc13798c06224bf5617a36903a24de44a67227a96312fb
MD5 b1325b1720f98c1cdb83f2a6738c72fd
BLAKE2b-256 b1f6dfdf008c1867cf32725e2f1ecf676114601cd979ee2b2d84ef098e8e1e82

File details

Details for the file span_data-0.1.3-cp311-cp311-macosx_10_9_x86_64.whl.

File hashes

Hashes for span_data-0.1.3-cp311-cp311-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 57bad906076f0f5d1ddaeca9566560e6990fb0277668b30cb17c671a9e2f86bf
MD5 d488afc43fb697534ae07f03891f5333
BLAKE2b-256 69c3fed92312beffb18244dcf6feb8b1827a53f82dd75281feceb348c96749e8
