Multilingual POS-tagger and Dependency-parser
MultiCOMBO
Multilingual POS-Tagger and Dependency-Parser with COMBO-pytorch and spaCy.
Basic usage
>>> import multicombo
>>> nlp=multicombo.load()
>>> doc=nlp('Who plays "La vie en rose"?')
>>> print(multicombo.to_conllu(doc))
# text = Who plays "La vie en rose"?
1 Who who PRON PronType=Int 2 nsubj _ Translit=who
2 plays play VERB Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin 0 root _ _
3 " " PUNCT _ 5 punct _ SpaceAfter=No
4 La le DET Definite=Def|Gender=Fem|Number=Sing|PronType=Art 5 det _ Translit=la
5 vie vie NOUN Gender=Fem|Number=Sing 2 obj _ _
6 en en ADP _ 7 case _ _
7 rose rose NOUN Number=Sing 5 nmod _ SpaceAfter=No
8 " "' PUNCT _ 5 punct _ SpaceAfter=No
9 ? ? PUNCT _ 2 punct _ SpaceAfter=No
>>> import deplacy
>>> deplacy.render(doc)
Who PRON <════════════╗ nsubj
plays VERB ═══════════╗═╝═╗ ROOT
" PUNCT <══════╗ ║ ║ punct
La DET <════╗ ║ ║ ║ det
vie NOUN ═══╗═╝═╝═╗<╝ ║ obj
en ADP <╗ ║ ║ ║ case
rose NOUN ═╝<╝ ║ ║ nmod
" PUNCT <════════╝ ║ punct
? PUNCT <══════════════╝ punct
>>> deplacy.serve(doc)
http://127.0.0.1:5000
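The CoNLL-U text printed by multicombo.to_conllu(doc) above can also be processed programmatically. The following is a minimal sketch, assuming the string uses the standard CoNLL-U layout of ten tab-separated columns per token. The names parse_conllu, FIELDS and SAMPLE are hypothetical helpers introduced for illustration and are not part of MultiCOMBO; SAMPLE is a hand-written sentence, not actual tagger output.

```python
# The ten CoNLL-U columns, in the order defined by Universal Dependencies.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


def parse_conllu(text):
    """Parse CoNLL-U text into a list of dicts, skipping comments and blank lines."""
    rows = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            raise ValueError(
                f"expected {len(FIELDS)} tab-separated columns, "
                f"got {len(cols)}: {line!r}"
            )
        rows.append(dict(zip(FIELDS, cols)))
    return rows


# Hand-written sample in standard CoNLL-U layout (an assumption for this
# sketch, not output copied from MultiCOMBO).
SAMPLE = "\n".join([
    "# text = Who plays?",
    "1\tWho\twho\tPRON\t_\tPronType=Int\t2\tnsubj\t_\t_",
    "2\tplays\tplay\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No",
    "3\t?\t?\tPUNCT\t_\t_\t2\tpunct\t_\tSpaceAfter=No",
])

# Print form, universal POS tag, head index and dependency relation per token.
for row in parse_conllu(SAMPLE):
    print(row["form"], row["upos"], row["head"], row["deprel"])
```

With MultiCOMBO installed, the same function could presumably be applied to multicombo.to_conllu(doc) in place of SAMPLE, provided the output keeps all ten columns.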
multicombo.load(lang="xx")
loads a spaCy Language pipeline with bert-base-multilingual-cased and the spacy.lang.xx.MultiLanguage tokenizer. Other language-specific tokenizers can be loaded with the lang option, though several languages require additional packages:
- lang="ja": Japanese requires SudachiPy and SudachiDict-core.
- lang="th": Thai requires PyThaiNLP.
- lang="vi": Vietnamese requires pyvi.
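Since some values of lang depend on extra packages, it may help to check for them before loading a pipeline. The sketch below is one possible way to do that. The helper names (EXTRA_MODULES, missing_modules, load_pipeline) are hypothetical, and the import names sudachipy, sudachidict_core, pythainlp and pyvi are assumed to correspond to the packages listed above.

```python
import importlib.util

# Assumed import names for the additional packages listed above.
EXTRA_MODULES = {
    "ja": ["sudachipy", "sudachidict_core"],
    "th": ["pythainlp"],
    "vi": ["pyvi"],
}


def missing_modules(lang, extras=EXTRA_MODULES):
    """Return the modules assumed necessary for `lang` that cannot be found."""
    return [
        name for name in extras.get(lang, [])
        if importlib.util.find_spec(name) is None
    ]


def load_pipeline(lang="xx"):
    """Call multicombo.load(lang=...) after checking the assumed extras."""
    missing = missing_modules(lang)
    if missing:
        raise ImportError(
            f"lang={lang!r} needs additional modules: {', '.join(missing)}"
        )
    import multicombo  # imported lazily so the check runs without it
    return multicombo.load(lang=lang)


# Report which of the assumed extras are absent in this environment.
for code in ("ja", "th", "vi"):
    print(code, missing_modules(code))
```

Calling load_pipeline("ja") would then either raise an ImportError that names the absent modules or return the pipeline from multicombo.load(lang="ja").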
Installation for Linux
pip3 install multicombo --user