Skip to main content

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate and query pattern models.

Project description

https://travis-ci.org/proycon/colibri-core.svg?branch=master

by Maarten van Gompel, proycon@anaproy.nl, Radboud University Nijmegen

Licensed under GPLv3 (See http://www.gnu.org/licenses/gpl-3.0.html)

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool colibri-patternmodeller which allows you to build, view, manipulate and query pattern models.

Please consult the documentation at http://proycon.github.io/colibri-core

This software is developed in the scope of the Ph.D. research project Constructions as Linguistic Bridges. This research examines the identification and extraction of aligned constructions or patterns across natural languages, and the usage of such constructions in Machine Translation. The aligned constructions are not identified on the basis of an extensive and explicitly defined grammar or expert database of linguistic knowledge, but rather are implicitly distilled from large amounts of example data. Our notion of constructions is broad and transcends the idea of words or variable-length phrases.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

colibricore-0.5.7.4.tar.gz (593.9 kB view details)

Uploaded Source

File details

Details for the file colibricore-0.5.7.4.tar.gz.

File metadata

File hashes

Hashes for colibricore-0.5.7.4.tar.gz
Algorithm Hash digest
SHA256 7980906488a055382b4d539225756560a2db98566d4156d9ea74b1e68d6d3afa
MD5 f7e73dfdbb55fa1ff170b1e0bf5f9075
BLAKE2b-256 ba193a143a43dd20a37ca6d6bb9bac65df5ddbf3ddf0142940d631ad057d9c8a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page