Skip to main content

Typology-based semantic labelling of numeric columns

Project description

TTLA

DOI

This application is meant to be an automated experiment and not an application by it self to annotated numeric columns. Nonetheless, we are planning to create an application based on this approach details will be mentioned here once we start.

Install via pip

pip install ttla

Run the experiments

To download the data of T2Dv2 automatically

python data/preprocessing.py

Detection

python -m experiments.web_commons_v2 detect

Labeling

  1. Label (may take up to an hour, it needs to be connected to the internet)
python experiments.web_commons_v2 label
  1. Get the kinds (offline, quick)
python experiments.web_commons_v2 addkinds
 
  1. Show scores (offline, quick)
python experiments.web_commons_v2 scores
 

Tests

Quick tests (test the algorithms, but does not include the t2d experiment)

sh run_tests.sh

run tests with the T2Dv2 experiment (may take up to an hour)

sh run_t2dv2_tests.sh

not that some tests may fail overtime as they depend on dbpedia

Coverage:

Coverage of the quick tests

sh run_cov.sh

Coverage of T2Dv2 tests

sh run_t2dv2_cov.sh

To publish

python setup.py sdist bdist_wheel
twine upload dist/*

Contribution

To contribute, please read the below to follow the same convention

Code structure

  • The source code related to detection of data types (e.g. categorical, continuous, ...) is located under detect.
  • while the files related to the annotation of the semantic types (e.g. height of a person) are located under label.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ttla-1.0.4.tar.gz (19.1 kB view details)

Uploaded Source

Built Distribution

ttla-1.0.4-py3-none-any.whl (36.7 kB view details)

Uploaded Python 3

File details

Details for the file ttla-1.0.4.tar.gz.

File metadata

  • Download URL: ttla-1.0.4.tar.gz
  • Upload date:
  • Size: 19.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for ttla-1.0.4.tar.gz
Algorithm Hash digest
SHA256 0df30e1a4f6a7257f993b012f84f5963e8726c372780a391d46b38a6493c49bc
MD5 728b66d4e84480eb4dee454ab4bcb8ba
BLAKE2b-256 13358708fb861441c5eaa32684014422a5fe6b21915693af72cda1f281e9c265

See more details on using hashes here.

File details

Details for the file ttla-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: ttla-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 36.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for ttla-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 0aceca82c76d7f0a05b045385dba80f3d68bc7847a2d2905b1ae830e8a5f727f
MD5 b19cfbb8ee19c7e6d0e2118b720229f0
BLAKE2b-256 4f89ac857cb655ba57fe7274acaee77fb7d6584dbf8a24bfa9854b13832a4623

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page