

Project description

Colibri Core

Maarten van Gompel <proycon@anaproy.nl>, Radboud University Nijmegen

Licensed under GPLv3 (see http://www.gnu.org/licenses/gpl-3.0.html)

Colibri Core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e., patterns with one or more gaps, of either fixed or dynamic size) in a quick and memory-efficient way. At its core is the tool colibri-patternmodeller, which allows you to build, view, manipulate and query pattern models.
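To make the terms n-gram and skipgram concrete, here is a minimal sketch in plain Python. It does not use the Colibri Core API and is far less efficient than the library. The function names and the gap marker are hypothetical, chosen only for this illustration. The sketch also assumes fixed-size gaps (each marker stands for exactly one token) and assumes a skipgram never starts or ends with a gap.

```python
from itertools import combinations

GAP = "{*}"  # hypothetical marker for a gap of exactly one token


def ngrams(tokens, n):
    """Return all contiguous n-grams of the token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n):
    """Return the distinct skipgrams spanning n tokens.

    Each skipgram is an n-gram in which one or more inner tokens
    are replaced by a gap; the first and last token are kept.
    """
    inner = range(1, n - 1)  # positions that may become gaps
    results = set()
    for gram in ngrams(tokens, n):
        for k in range(1, len(inner) + 1):
            for positions in combinations(inner, k):
                results.add(tuple(
                    GAP if i in positions else token
                    for i, token in enumerate(gram)
                ))
    return sorted(results)


tokens = "to be or not to be".split()
print(ngrams(tokens, 2))
print(skipgrams(tokens, 3))
```

For the sentence "to be or not to be", the bigram list contains ("to", "be") twice, once for each occurrence, and the skipgrams of length 3 each have a single gap in the middle position.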

Please consult the documentation at http://proycon.github.io/colibri-core

This software is developed in the scope of the Ph.D. research project Constructions as Linguistic Bridges. This research examines the identification and extraction of aligned constructions or patterns across natural languages, and the usage of such constructions in Machine Translation. The aligned constructions are not identified on the basis of an extensive and explicitly defined grammar or expert database of linguistic knowledge, but rather are implicitly distilled from large amounts of example data. Our notion of constructions is broad and transcends the idea of words or variable-length phrases.

Download files

Source Distribution

colibricore-0.5.7.2.tar.gz (593.3 kB)

File details

Details for the file colibricore-0.5.7.2.tar.gz.

File hashes

Hashes for colibricore-0.5.7.2.tar.gz
Algorithm    Hash digest
SHA256       2d02453b298e813cd411aee8bfc6de779251f219967ed30787bdf119af10b414
MD5          2e83cad7a050cf28303e0cd98ac508bb
BLAKE2b-256  725ce68e4cf987066d984e6ae1294b07d94bb5c8b2e477d622dbcecd5e90a209
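The SHA256 digest listed above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch using Python's standard hashlib module. The helper names are hypothetical, and the local path in the usage note is an assumption about where the file was saved.

```python
import hashlib

# SHA256 digest listed above for colibricore-0.5.7.2.tar.gz
EXPECTED_SHA256 = "2d02453b298e813cd411aee8bfc6de779251f219967ed30787bdf119af10b414"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()
```

Assuming the archive was saved in the current directory, calling matches_expected("colibricore-0.5.7.2.tar.gz") returns True only if the file matches the published digest.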
