Skip to main content

Identity resolution library built on Splink and DuckDB.

Project description

SPAN

span_data is an identity resolution library built on Splink and DuckDB. It links records that refer to the same entity and produces a unified ID graph.

Early release. This package is under active development, and the public API and import paths may change between releases. We expect to follow semantic versioning more strictly after a 1.0 stable baseline; until then, treat upgrades as potentially breaking. Pin a specific version in production and review span_data/packaging_info/CHANGELOG.md before upgrading.

Core Features

  • Ingest source data from pandas DataFrames, CSV files, and Snowflake tables or views.
  • Clean and normalize source fields.
  • Resolve entities into profile graphs with Splink-based matching.
  • Export graph results to DataFrames, CSV, Parquet, DuckDB relations, and Snowflake.
  • Apply manual override workflows for merge, split, and assign operations.

Python Support

This package currently supports Python >=3.11,<3.12.

Usage

See SPAN Product Documentation for usage instructions.

License

This project is licensed under the BSD 3-Clause License. See span_data/packaging_info/LICENSE.

Third-party dependency license information is documented in span_data/packaging_info/THIRD_PARTY_NOTICES.md.

Release history is documented in span_data/packaging_info/CHANGELOG.md.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

span_data-0.1.5-cp311-cp311-win_amd64.whl (287.7 kB view details)

Uploaded CPython 3.11Windows x86-64

span_data-0.1.5-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl (366.0 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64manylinux: glibc 2.28+ x86-64

span_data-0.1.5-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl (349.8 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ ARM64manylinux: glibc 2.28+ ARM64

span_data-0.1.5-cp311-cp311-macosx_11_0_arm64.whl (271.3 kB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

span_data-0.1.5-cp311-cp311-macosx_10_9_x86_64.whl (285.3 kB view details)

Uploaded CPython 3.11macOS 10.9+ x86-64

File details

Details for the file span_data-0.1.5-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: span_data-0.1.5-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 287.7 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for span_data-0.1.5-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 212745b94984a8ecf9cf0237bb97aab9e407a39132cbe3893ed1241cbb830f71
MD5 1a5fd7ab0f4b816df2a183ba95710716
BLAKE2b-256 2212c8ec2d02354893a50a23ad367a4ac053b79ecb674b65b3d01312dead6138

See more details on using hashes here.

Provenance

The following attestation bundles were made for span_data-0.1.5-cp311-cp311-win_amd64.whl:

Publisher: build-wheels.yml on thedataloft/SPAN

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file span_data-0.1.5-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for span_data-0.1.5-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 31658907595551a81d90f55f3509da0a189d4f6add4e4b5ea10a12f62cdef654
MD5 9cc4a298a424e1aaf90d02190f6fe46b
BLAKE2b-256 6f73e11a4e34644aae11ad066ccf117b70d059446405465c3804643f21557a1f

See more details on using hashes here.

Provenance

The following attestation bundles were made for span_data-0.1.5-cp311-cp311-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl:

Publisher: build-wheels.yml on thedataloft/SPAN

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file span_data-0.1.5-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for span_data-0.1.5-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a7c68dac505186bda37c3f5ce8c9c1297be1932f131845abd33cdef1778c1325
MD5 12aa3522b5ff2f8deb6aabf9bccade1e
BLAKE2b-256 87c51973b10293e82f18b80e8ad915a5152e73cc38b9d23d19c54ad7c48fabe8

See more details on using hashes here.

Provenance

The following attestation bundles were made for span_data-0.1.5-cp311-cp311-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl:

Publisher: build-wheels.yml on thedataloft/SPAN

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file span_data-0.1.5-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for span_data-0.1.5-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 8c787b6f01c2af1bb2e1bbf3b41b41b826c818f0cb8d7399758c5ba71dee1ddd
MD5 a76354aec4ca2f19ffe40c5472991ce3
BLAKE2b-256 91b1393d053932f4ae7d066fdd12972abb9492a57f946397ba3c9ffde9f45d76

See more details on using hashes here.

Provenance

The following attestation bundles were made for span_data-0.1.5-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: build-wheels.yml on thedataloft/SPAN

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file span_data-0.1.5-cp311-cp311-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for span_data-0.1.5-cp311-cp311-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 1831a4b3baec35a8eb93bfe53d72060031f3e379d68732f03919faa3c9e33383
MD5 fb386aa57d482eb3dfacbda2afa8d771
BLAKE2b-256 eb43eef5f0fd92e5ed2cf391a7310998d68d4d72803460f962c0404f81c98f13

See more details on using hashes here.

Provenance

The following attestation bundles were made for span_data-0.1.5-cp311-cp311-macosx_10_9_x86_64.whl:

Publisher: build-wheels.yml on thedataloft/SPAN

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page