Skip to main content

A library for blocking in record linkage

Project description

codecov Documentation Status Typechecking Testing Downloads

Blocklib

Python implementations of record linkage blocking techniques. Blocking is a technique that makes record linkage scalable. It is achieved by partitioning datasets into groups, called blocks and only comparing records in corresponding blocks. This can reduce the number of comparisons that need to be conducted to find which pairs of records should be linked.

blocklib is part of the Anonlink project for privacy preserving record linkage.

Installation

Install with pip:

pip install blocklib

Documents

You can find comprehensive documentation and tutorials in readthedocs

Tests

Run unit tests with pytest::

$ pytest

Discussion

If you run into bugs, you can file them in our issue tracker on GitHub.

There is also an anonlink mailing list for development discussion and release announcements.

Wherever we interact, we strive to follow the Python Community Code of Conduct

License and Copyright

blocklib is copyright (c) Commonwealth Scientific and Industrial Research Organisation (CSIRO).

Licensed under the Apache License, Version 2.0 (the "License"). You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

blocklib-0.1.11.tar.gz (21.0 kB view details)

Uploaded Source

Built Distribution

blocklib-0.1.11-py3-none-any.whl (26.9 kB view details)

Uploaded Python 3

File details

Details for the file blocklib-0.1.11.tar.gz.

File metadata

  • Download URL: blocklib-0.1.11.tar.gz
  • Upload date:
  • Size: 21.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for blocklib-0.1.11.tar.gz
Algorithm Hash digest
SHA256 8df6e827ba3a5649d326a53932073699bf2b653ea0e45a504d3ed0c10ae6d753
MD5 d8a4bf9ebc157339a0404c9373245112
BLAKE2b-256 db81a0094ee08af2820e0dcd9d935c282fba291532b524e4f8a6f1bbe06e076c

See more details on using hashes here.

File details

Details for the file blocklib-0.1.11-py3-none-any.whl.

File metadata

  • Download URL: blocklib-0.1.11-py3-none-any.whl
  • Upload date:
  • Size: 26.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for blocklib-0.1.11-py3-none-any.whl
Algorithm Hash digest
SHA256 5f77bc5c39d02b605488f52bed79e873767b48652c5c2838c5dd93235a1f72cd
MD5 0c97889398ca0145bd085259ddc871d8
BLAKE2b-256 4313d5119666c8d68967bc11d2215f5e33d23ff61faf37cd0cf2caded1077a9d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page