Skip to main content

Identity resolution library built on Splink and DuckDB.

Project description

SPAN

span_data is an identity resolution library built on Splink and DuckDB. It links records that refer to the same entity and produces a unified ID graph.

Early release. This package is under active development, and the public API and import paths may change between releases. We expect to follow semantic versioning more strictly after a 1.0 stable baseline; until then, treat upgrades as potentially breaking. Pin a specific version in production and review span_data/packaging_info/CHANGELOG.md before upgrading.

Core Features

  • Ingest source data from pandas DataFrames, CSV files, and Snowflake tables or views.
  • Clean and normalize source fields.
  • Resolve entities into profile graphs with Splink-based matching.
  • Export graph results to DataFrames, CSV, Parquet, DuckDB relations, and Snowflake.
  • Apply manual override workflows for merge, split, and assign operations.

Python Support

This package currently supports Python >=3.11,<3.12.

Usage

See SPAN Product Documentation for usage instructions.

License

This project is licensed under the BSD 3-Clause License. See span_data/packaging_info/LICENSE.

Third-party dependency license information is documented in span_data/packaging_info/THIRD_PARTY_NOTICES.md.

Release history is documented in span_data/packaging_info/CHANGELOG.md.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

span_data-0.1.4-cp311-cp311-win_amd64.whl (287.6 kB view details)

Uploaded CPython 3.11Windows x86-64

span_data-0.1.4-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl (366.0 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64manylinux: glibc 2.28+ x86-64

span_data-0.1.4-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl (349.7 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ ARM64manylinux: glibc 2.28+ ARM64

span_data-0.1.4-cp311-cp311-macosx_11_0_arm64.whl (271.3 kB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

span_data-0.1.4-cp311-cp311-macosx_10_9_x86_64.whl (285.2 kB view details)

Uploaded CPython 3.11macOS 10.9+ x86-64

File details

Details for the file span_data-0.1.4-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: span_data-0.1.4-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 287.6 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for span_data-0.1.4-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 32f8520d11f6caa63a4c05540691adcabb74d2abfd8b71c862b6992eb3e2734e
MD5 1321529539807e129ec2796c36c54247
BLAKE2b-256 18b3f5a804bda2e680cd9a1b3cdd1be6db919a3a71e388e0dd3f26c4538a8035

See more details on using hashes here.

File details

Details for the file span_data-0.1.4-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for span_data-0.1.4-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 44b76825a7d2cb50088a5d7b6afc875877d563055deab0b8d53477bc1ce55e47
MD5 9f70a7a11739b5315747416c6dc95d4b
BLAKE2b-256 609e9d74b297cc8d73fe64ab8a320d49340511dc6248e2b55506a16e512a18ce

See more details on using hashes here.

File details

Details for the file span_data-0.1.4-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for span_data-0.1.4-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 953df99d6bcadd2a2fdeaceb652a45e7a98c8d9f81dfb262c0c9b7401a47551c
MD5 23d339e38ebb38cd001ae386381ae7a5
BLAKE2b-256 5fcb4ccfea64d73580c8fb99e20378932b956ab561bf0c9a3df3af0298bfc03f

See more details on using hashes here.

File details

Details for the file span_data-0.1.4-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for span_data-0.1.4-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 1a17547ac9ec820807888a824b1bc46604bae37f4b463661cff65510440415de
MD5 1e52a42524a9c0e645b80513753b1726
BLAKE2b-256 15977dca9ddc868210daf55130651bbc305550c50f6205f7a61c98fe9d0b2a63

See more details on using hashes here.

File details

Details for the file span_data-0.1.4-cp311-cp311-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for span_data-0.1.4-cp311-cp311-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 7b83d6b6af0ac00fd921ff1ab9a6b87457eb9d0e99c7fd7ee1d0fb83f9ef2353
MD5 dda3538c695495d594c38e838abbba0a
BLAKE2b-256 bab965bd351e114f5be735c98a227b012ba3ac55feeac7e71a94825257ae2450

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page