Geo distance scoring plugin for csv-reconcile
Project description
Table of Contents
CSV Reconcile Geo distance scoring plugin
A scoring plugin for csv-reconcile using geodesic distance. See csv-reconcile for details.
Reconciliation
This plugin is used to reconcile values representing points on the globe. It expects those
values to be in well-known text format for a point. That is, like so: POINT( longitude latitude )
.
The pre-processor automatically strips off literal datatypes when present as well as double quotes.
The CSV column to be reconciled needs to be in the same format. In addition, there must be at most one instance of any id column. For instance, if reconciling against coordinate location for a wikidata item, there must be at most one location per item.
Scoring
The scoring used is more or less arbitrary but has the following properties:
- The highest score is 100 and occurs when the distance to the reconciliation candidate is zero
- The lower the score the greater the distance to the reconciliation candidate
- The score is scaled so that a distance of 10km yields a score of 50
Future enhancements
Some of the current implementation was driven by the current design of csv-reconcile. Both may be updated to accommodate the following:
- Allow for separate latitude and longitude column in the CSV file
- Do some preliminary checks on coordinates before calculating distance for performance
- Add some scoring options such as the following:
- Change the scale of the scoring to place the distance for a score of 50 somewhere else for better granularity
- Allow for overriding the scaling function
- etc.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for csv_reconcile_geo-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0ea72d27db62d7a4877569691d5282c5e5c0dfb3870503812a29d12c88f282fd |
|
MD5 | e0cb0d575f7ac444baf186077f2d5983 |
|
BLAKE2b-256 | f591249a1032faaf844c0cab5e6ca2650ebe3000c28d8e8de793e8d845950c81 |