Typology-based semantic labelling of numeric columns

TTLA

This package is an automated experiment rather than a standalone application for annotating numeric columns. We plan to build an application based on this approach; details will be added here once work starts.

Install via pip

pip install ttla

Run the experiments

To download the T2Dv2 data automatically:

python data/preprocessing.py

Detection

python -m experiments.web_commons_v2 detect

Labeling

  1. Label (may take up to an hour; requires an internet connection)
python -m experiments.web_commons_v2 label
  2. Get the kinds (offline, quick)
python -m experiments.web_commons_v2 addkinds
  3. Show scores (offline, quick)
python -m experiments.web_commons_v2 scores
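
The three labeling steps can also be chained from a small script. This is a minimal sketch, assuming the commands are run from the repository root and that the module is invoked with -m as in the detection step. The helpers build_command and run_steps are hypothetical names and are not part of ttla.

```python
import subprocess
import sys

# Module name and step names are taken from the commands listed above.
MODULE = "experiments.web_commons_v2"
STEPS = ["label", "addkinds", "scores"]


def build_command(step, python=sys.executable):
    """Return the argument list for one step (hypothetical helper)."""
    if step not in STEPS:
        raise ValueError(f"unknown step: {step}")
    return [python, "-m", MODULE, step]


def run_steps(steps=STEPS, runner=subprocess.run):
    """Run the steps in order; stop at the first one that fails."""
    for step in steps:
        # check=True makes subprocess.run raise CalledProcessError
        # when the command exits with a non-zero status.
        runner(build_command(step), check=True)
```

Calling run_steps() runs label, addkinds, and scores in that order. Because the label step needs network access, a failure there stops the run before the offline steps start.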
 

Tests

Quick tests (these test the algorithms but do not include the T2Dv2 experiment)

sh run_tests.sh

Run the tests with the T2Dv2 experiment (may take up to an hour)

sh run_t2dv2_tests.sh

Note that some tests may fail over time, as they depend on DBpedia.

Coverage

Coverage of the quick tests

sh run_cov.sh

Coverage of T2Dv2 tests

sh run_t2dv2_cov.sh

To publish

python setup.py sdist bdist_wheel
twine upload dist/*

Contribution

To contribute, please read the following and follow the same conventions.

Code structure

  • The source code for detecting data types (e.g. categorical, continuous, ...) is located under detect.
  • The files for annotating semantic types (e.g. the height of a person) are located under label.
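
To make the difference between data types concrete, here is a toy heuristic. It is not the algorithm implemented under detect. It calls a numeric column categorical when it holds few distinct values relative to its length, and continuous otherwise. The function name guess_kind and the 0.1 threshold are assumptions chosen only for illustration.

```python
def guess_kind(values, max_distinct_ratio=0.1):
    """Toy heuristic, not ttla's method: classify a numeric column.

    Returns "categorical" when the share of distinct values is at most
    max_distinct_ratio (an assumed threshold), otherwise "continuous".
    """
    if not values:
        raise ValueError("empty column")
    distinct_ratio = len(set(values)) / len(values)
    if distinct_ratio <= max_distinct_ratio:
        return "categorical"
    return "continuous"
```

For example, a column of 40 values that only ever takes the values 1 and 2 would come out as categorical, while a short column of distinct person heights would come out as continuous.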

Download files

Source Distribution

ttla-1.0.4.tar.gz (19.1 kB)

Uploaded Source

Built Distribution

ttla-1.0.4-py3-none-any.whl (36.7 kB)

Uploaded Python 3
