Skip to main content

No project description provided

Project description

Costra

This is a tool for automatic evaluation of Czech sentence embeddings using Costra 1.1 dataset.

More information can be found in the following paper:

The presentation of the paper with the accompanying video can be found here.

Installation

$ pip install costra

Usage

  1. You can get sentences from Costra using the following command:
from costra import costra
sentences = costra.get_sentences()
  1. Use the sentences to generate your embeddings. The embeddings are evaluating the following way:
costra.evaluate(YOUR_EMBEDDINGS)

Citation

If you use the tool for academic purporses, please consider citing the following paper:

@inproceedings{Costra,
  author    = {Petra Baran{\v{\c}}{\'{\i}}kov{\'{a}} and Ond{\v{\r}}ej Bojar},
  editor    = {Petr Sojka and Ivan Kope{\v{\c}}ek and Karel Pala and Ales Hor{\'{a}}k},
  title     = {Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces},
  booktitle = {Text, Speech, and Dialogue - 23rd International Conference, {TSD}
               2020, Brno, Czech Republic, September 8-11, 2020, Proceedings},
  series    = {Lecture Notes in Computer Science},
  volume    = {12284},
  pages     = {135--143},
  publisher = {Springer},
  year      = {2020},
  url       = {https://doi.org/10.1007/978-3-030-58323-1\_14},
  doi       = {10.1007/978-3-030-58323-1\_14},
}

License

The data is distributed under the Creative Commons 4.0 BY.

Project details


Release history Release notifications | RSS feed

This version

1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

costra-1.0.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

costra-1.0-py3-none-any.whl (229.6 kB view details)

Uploaded Python 3

File details

Details for the file costra-1.0.tar.gz.

File metadata

  • Download URL: costra-1.0.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.8.5

File hashes

Hashes for costra-1.0.tar.gz
Algorithm Hash digest
SHA256 13546ab00f5be0ceffa7726da6d23db85e8494499d7654ff8e6071c155c9577e
MD5 62875ab1fadb1c90117ad6e33028a0c6
BLAKE2b-256 0d5c2340c797de9e5e7b7757bb29643346e7a6509ce4485c799b451b46f631d4

See more details on using hashes here.

File details

Details for the file costra-1.0-py3-none-any.whl.

File metadata

  • Download URL: costra-1.0-py3-none-any.whl
  • Upload date:
  • Size: 229.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.8.5

File hashes

Hashes for costra-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f516fceb3f07f6c7fce47143a3ec12a34e8ef6451b92f699e6b1d700dc2152f9
MD5 49a68fc763ea6232eb41d642046cb680
BLAKE2b-256 eed502a1aad011f6682fa181405229eeef5938fae5aa13dd9bbf5cbba3b210dc

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page