Skip to main content

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate and query pattern models.

Project description

Colibri Core

Maarten van Gompel proycon@anaproy.nl Radboud University Nijmegen

Licensed under GPLv3 (See http://www.gnu.org/licenses/gpl-3.0.html)

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool colibri-patternmodeller which allows you to build, view, manipulate and query pattern models.

Please consult the documentation at http://proycon.github.io/colibri-core

This software is developed in the scope of the Ph.D. research project Constructions as Linguistic Bridges. This research examines the identification and extraction of aligned constructions or patterns across natural languages, and the usage of such constructions in Machine Translation. The aligned constructions are not identified on the basis of an extensive and explicitly defined grammar or expert database of linguistic knowledge, but rather are implicitly distilled from large amounts of example data. Our notion of constructions is broad and transcends the idea of words or variable-length phrases.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

colibricore-0.5.7.3.tar.gz (593.3 kB view details)

Uploaded Source

File details

Details for the file colibricore-0.5.7.3.tar.gz.

File metadata

File hashes

Hashes for colibricore-0.5.7.3.tar.gz
Algorithm Hash digest
SHA256 fdfad2e9024eacbdf624a93aca37024b6275f464242e6bd2fa2a91b4cf357f75
MD5 e7222285467ef359a88f173b1de873b1
BLAKE2b-256 e7113ac6455d0e555026395eaa517fe45eea3c26c1cacd17b4c1a31bd15b6ac9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page