Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate and query pattern models.
Project description
- Colibri Core
Maarten van Gompel proycon@anaproy.nl Radboud University Nijmegen
Licensed under GPLv3 (See http://www.gnu.org/licenses/gpl-3.0.html)
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool colibri-patternmodeller which allows you to build, view, manipulate and query pattern models.
Please consult the documentation at http://proycon.github.io/colibri-core
This software is developed in the scope of the Ph.D. research project Constructions as Linguistic Bridges. This research examines the identification and extraction of aligned constructions or patterns across natural languages, and the usage of such constructions in Machine Translation. The aligned constructions are not identified on the basis of an extensive and explicitly defined grammar or expert database of linguistic knowledge, but rather are implicitly distilled from large amounts of example data. Our notion of constructions is broad and transcends the idea of words or variable-length phrases.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file colibricore-0.5.7.3.tar.gz
.
File metadata
- Download URL: colibricore-0.5.7.3.tar.gz
- Upload date:
- Size: 593.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fdfad2e9024eacbdf624a93aca37024b6275f464242e6bd2fa2a91b4cf357f75 |
|
MD5 | e7222285467ef359a88f173b1de873b1 |
|
BLAKE2b-256 | e7113ac6455d0e555026395eaa517fe45eea3c26c1cacd17b4c1a31bd15b6ac9 |