Skip to main content

Typology-based semantic labelling of numeric columns

Project description

TTLA

DOI

This application is meant to be an automated experiment and not an application by it self to annotated numeric columns. Nonetheless, we are planning to create an application based on this approach details will be mentioned here once we start.

Install via pip

pip install ttla

Run the experiments

To download the data of T2Dv2 automatically

python data/preprocessing.py

Detection

python experiments/web_commons_v2.py detect

Labeling

  1. Label (may take up to an hour, it needs to be connected to the internet)
python experiments/web_commons_v2.py label
  1. Get the kinds (offline, quick)
python experiments/web_commons_v2.py addkinds
 
  1. Show scores (offline, quick)
python experiments/web_commons_v2.py scores
 

Tests

Quick tests (test the algorithms, but does not include the t2d experiment)

sh run_tests.sh

run tests with the T2Dv2 experiment (may take up to an hour)

sh run_t2dv2_tests.sh

not that some tests may fail overtime as they depend on dbpedia

Coverage:

Coverage of the quick tests

sh run_cov.sh

Coverage of T2Dv2 tests

sh run_t2dv2_cov.sh

To publish

python setup.py sdist bdist_wheel
twine upload dist/*

Contribution

To contribute, please read the below to follow the same convention

Code structure

  • The source code related to detection of data types (e.g. categorical, continuous, ...) is located under detect.
  • while the files related to the annotation of the semantic types (e.g. height of a person) are located under label.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ttla-1.0.3.tar.gz (19.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ttla-1.0.3-py3-none-any.whl (21.8 kB view details)

Uploaded Python 3

File details

Details for the file ttla-1.0.3.tar.gz.

File metadata

  • Download URL: ttla-1.0.3.tar.gz
  • Upload date:
  • Size: 19.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for ttla-1.0.3.tar.gz
Algorithm Hash digest
SHA256 8ac8759285909754dbc4ad2a3b5d9bfa6ac261acde893bc07d1aadd5de435c4e
MD5 613101c6d3fa93dc9ad3244807034d2f
BLAKE2b-256 6c5e9d474636a9691ef5112e9e6c88555d67d955070685615d352ae98d95a6da

See more details on using hashes here.

File details

Details for the file ttla-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: ttla-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 21.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for ttla-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 3a0d300e1c288531514ef98dc5c6cdb49a1f1ed47868d3eb9f2d271e46c3ed3a
MD5 81512d6c543c5b494cf8d850643c32fb
BLAKE2b-256 613bac4f2566b4ecc813754f044b4e297b08ca40a5b7ebf9876ed5b8f69b706c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page