French dictionaries from Laboratoire d'Automatique Documentaire et Linguistique (LADL)
Project description
Installation
pip install dict-fr-DELA
French dictionaries from Laboratoire d'Automatique Documentaire et Linguistique (LADL)
DESCRIPTION
This package contains several dictionaries processed from one of those made available by the former Laboratoire d'Automatique Documentaire et Linguistique (LADL), now integrated into Institut Gaspard Monge (IGM) of the Université Gustave Eiffel.
The selected dictionary is the inflected form DELA French dictionary in UTF-16 LE encoding, from March 16, 2006, with 683.824 simple entries for 102.073 different lemmas and 108.436 compounded entries for 83.604 different lemmas.
FILES
All files are installed in Python's /usr/local equivalent, under share/dict.
Original files
Filename | Description |
---|---|
dict-fr-DELA | 792.120 entries inflected form DELA French dictionary |
dict-fr-DELA-License | Lesser General Public License For Linguistic Resources |
The dict-fr-DELA file has undergone the following transformations:
- conversion from UTF-16 LE to UTF-8
- removal of MS-DOS end of lines
- sort & removal of duplicates
Generated files
Filename | Description |
---|---|
dict-fr-DELA.ascii | French words and compound words list (unaccented) |
dict-fr-DELA.unicode | 742.889 entries French words and compound words list (accented) |
dict-fr-DELA.combined | French words and compound words list (with both accented and unaccented words) |
dict-fr-DELA-proper_nouns.ascii | French proper nouns list (unaccented, sometimes compounded) |
dict-fr-DELA-proper_nouns.unicode | 823 entries French proper nouns list (accented, sometimes compounded) |
dict-fr-DELA-proper_nouns.combined | French proper nouns list (with both accented and unaccented words, sometimes compounded) |
dict-fr-DELA-common-words.ascii | French common words list (unaccented) |
dict-fr-DELA-common-words.unicode | 641.759 entries French common words list (accented) |
dict-fr-DELA-common-words.combined | French common words list (with both accented and unaccented words) |
dict-fr-DELA-common-compound-words.ascii | French common compound words list (unaccented) |
dict-fr-DELA-common-compound-words.unicode | 100.320 entries French common compound words list (accented) |
dict-fr-DELA-common-compound-words.combined | French common compound words list (with both accented and unaccented words) |
These generated files went through the following transformations:
- removal of escape backslashes
- removal of lemma and grammatical info from dict-fr-DELA
- lossless conversion of accents for the *-ascii versions
- combination of the *-ascii and *-unicode versions into the *-combined ones (without duplicates)
SEE ALSO
spell(1) like tools, anagram(6), conjuguer(1)
HISTORY
DELA means "Dictionnaire Electronique du LADL" (LADL's electronic dictionaries). These dictionaries were initiated by the lab's founder, Maurice Gross.
This package of data files was originally intended to be used with the PNU project's conjuguer command, as well as many other text processing tools.
I wrote an history of Unix & French dictionaries (in French only), which covers this dictionary and many others.
LICENSE
The original contents, as well as this package, are licensed under the Lesser General Public License For Linguistic Resources.
AUTHORS
Laboratoire d'Automatique Documentaire et Linguistique (LADL) for the original contents.
Hubert Tournier for the package.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dict-fr-DELA-2021.8.27.tar.gz
.
File metadata
- Download URL: dict-fr-DELA-2021.8.27.tar.gz
- Upload date:
- Size: 19.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.7.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7ab1905a34a20af4447ddf242a8d76dd788b2c07c9d7b1da2e1e3b7d21735eab |
|
MD5 | 8dd5d5367b2a6d4a593416d89bcfa408 |
|
BLAKE2b-256 | f8c8d297bdf331d8fa3906f0f6f82714ac02a1541ca026917e8300741e7502f0 |
File details
Details for the file dict_fr_DELA-2021.8.27-py3-none-any.whl
.
File metadata
- Download URL: dict_fr_DELA-2021.8.27-py3-none-any.whl
- Upload date:
- Size: 19.0 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.7.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 105775cf9f3235964c28eff6602cc82652468eadbbb0a94ec6bcbc7c501d4828 |
|
MD5 | 15ecb691ff0077a0e22dee88602e7145 |
|
BLAKE2b-256 | c0bfad322a9433bb7cd5e87638d88079bf261a79dfc72bc51da9e3b99ae47fa7 |