A library for blocking in record linkage
Project description
Blocklib
Python implementations of record linkage blocking techniques. Blocking is a technique that makes record linkage scalable. It is achieved by partitioning datasets into groups, called blocks and only comparing records in corresponding blocks. This can reduce the number of comparisons that need to be conducted to find which pairs of records should be linked.
blocklib
is part of the Anonlink project for privacy preserving record linkage.
Installation
Install with pip:
pip install blocklib
Documents
You can find comprehensive documentation and tutorials in readthedocs
Tests
Run unit tests with pytest
::
$ pytest
Discussion
If you run into bugs, you can file them in our issue tracker on GitHub.
There is also an anonlink mailing list for development discussion and release announcements.
Wherever we interact, we strive to follow the Python Community Code of Conduct
License and Copyright
blocklib
is copyright (c) Commonwealth Scientific and Industrial Research Organisation (CSIRO).
Licensed under the Apache License, Version 2.0 (the "License"). You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file blocklib-0.1.11.tar.gz
.
File metadata
- Download URL: blocklib-0.1.11.tar.gz
- Upload date:
- Size: 21.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8df6e827ba3a5649d326a53932073699bf2b653ea0e45a504d3ed0c10ae6d753 |
|
MD5 | d8a4bf9ebc157339a0404c9373245112 |
|
BLAKE2b-256 | db81a0094ee08af2820e0dcd9d935c282fba291532b524e4f8a6f1bbe06e076c |
File details
Details for the file blocklib-0.1.11-py3-none-any.whl
.
File metadata
- Download URL: blocklib-0.1.11-py3-none-any.whl
- Upload date:
- Size: 26.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5f77bc5c39d02b605488f52bed79e873767b48652c5c2838c5dd93235a1f72cd |
|
MD5 | 0c97889398ca0145bd085259ddc871d8 |
|
BLAKE2b-256 | 4313d5119666c8d68967bc11d2215f5e33d23ff61faf37cd0cf2caded1077a9d |