Skip to main content

No project description provided

Project description

Costra

This is a tool for automatic evaluation of Czech sentence embeddings using Costra 1.1 dataset.

More information can be found in the following paper:

The presentation of the paper with the accompanying video can be found here.

Installation

$ pip install costra

Usage

  1. You can get sentences from Costra using the following command:
from costra import costra
sentences = costra.get_sentences()
  1. Use the sentences to generate your embeddings. The embeddings are evaluating the following way:
costra.evaluate(YOUR_EMBEDDINGS)

Citation

If you use the tool for academic purporses, please consider citing the following paper:

@inproceedings{Costra,
  author    = {Petra Baran{\v{\c}}{\'{\i}}kov{\'{a}} and Ond{\v{\r}}ej Bojar},
  editor    = {Petr Sojka and Ivan Kope{\v{\c}}ek and Karel Pala and Ales Hor{\'{a}}k},
  title     = {Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces},
  booktitle = {Text, Speech, and Dialogue - 23rd International Conference, {TSD}
               2020, Brno, Czech Republic, September 8-11, 2020, Proceedings},
  series    = {Lecture Notes in Computer Science},
  volume    = {12284},
  pages     = {135--143},
  publisher = {Springer},
  year      = {2020},
  url       = {https://doi.org/10.1007/978-3-030-58323-1\_14},
  doi       = {10.1007/978-3-030-58323-1\_14},
}

License

The data is distributed under the Creative Commons 4.0 BY.

Project details


Release history Release notifications | RSS feed

This version

1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

costra-1.0.tar.gz (4.5 kB view hashes)

Uploaded Source

Built Distribution

costra-1.0-py3-none-any.whl (229.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page