Custom French lemmatizer based on Leff for spacy
Project description
spacy-lefff : Custom French lemmatizer based on Lefff for spacy
spacy v2.0 extension and pipeline component for adding a French lemmatizer based on Lefff.
Installation
spacy-lefff
requires spacy
<= v2.0.9.
pip install spacy-lefff
Usage
Import and initialize your nlp
spacy object and add the custom component after it parsed the document so you can benefit the POS tags.
Be aware to work with UTF-8
.
import spacy
from spacy_lefff import LefffLemmatizer
nlp = spacy.load('fr')
french_lemmatizer = LefffLemmatizer()
nlp.add_pipe(french_lemmatizer, name='lefff', after='parser')
doc = nlp(u"Paris est une ville très chère.")
for d in doc:
print(d.text, d.pos_, d._.lefff_lemma, d.tag_)
Credits
Sagot, B. (2010). The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French. In 7th international conference on Language Resources and Evaluation (LREC 2010).
Benoît Sagot Webpage about LEFFF
http://alpage.inria.fr/~sagot/lefff-en.html
First work of Claude Coulombe to support Lefff with Python : https://github.com/ClaudeCoulombe
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.