Skip to main content

French dictionaries from Laboratoire d'Automatique Documentaire et Linguistique (LADL)

Project description

Installation

pip install dict-fr-DELA

French dictionaries from Laboratoire d'Automatique Documentaire et Linguistique (LADL)

DESCRIPTION

This package contains several dictionaries processed from one of those made available by the former Laboratoire d'Automatique Documentaire et Linguistique (LADL), now integrated into Institut Gaspard Monge (IGM) of the Université Gustave Eiffel.

The selected dictionary is the inflected form DELA French dictionary in UTF-16 LE encoding, from March 16, 2006, with 683.824 simple entries for 102.073 different lemmas and 108.436 compounded entries for 83.604 different lemmas.

FILES

All files are installed in Python's /usr/local equivalent, under share/dict.

Original files

Filename Description
dict-fr-DELA 792.120 entries inflected form DELA French dictionary
dict-fr-DELA-License Lesser General Public License For Linguistic Resources

The dict-fr-DELA file has undergone the following transformations:

  • conversion from UTF-16 LE to UTF-8
  • removal of MS-DOS end of lines
  • sort & removal of duplicates

Generated files

Filename Description
dict-fr-DELA.ascii French words and compound words list (unaccented)
dict-fr-DELA.unicode 742.889 entries French words and compound words list (accented)
dict-fr-DELA.combined French words and compound words list (with both accented and unaccented words)
dict-fr-DELA-proper_nouns.ascii French proper nouns list (unaccented, sometimes compounded)
dict-fr-DELA-proper_nouns.unicode 823 entries French proper nouns list (accented, sometimes compounded)
dict-fr-DELA-proper_nouns.combined French proper nouns list (with both accented and unaccented words, sometimes compounded)
dict-fr-DELA-common-words.ascii French common words list (unaccented)
dict-fr-DELA-common-words.unicode 641.759 entries French common words list (accented)
dict-fr-DELA-common-words.combined French common words list (with both accented and unaccented words)
dict-fr-DELA-common-compound-words.ascii French common compound words list (unaccented)
dict-fr-DELA-common-compound-words.unicode 100.320 entries French common compound words list (accented)
dict-fr-DELA-common-compound-words.combined French common compound words list (with both accented and unaccented words)

These generated files went through the following transformations:

  • removal of escape backslashes
  • removal of lemma and grammatical info from dict-fr-DELA
  • lossless conversion of accents for the *-ascii versions
  • combination of the *-ascii and *-unicode versions into the *-combined ones (without duplicates)

SEE ALSO

spell(1) like tools, anagram(6), conjuguer(1)

HISTORY

DELA means "Dictionnaire Electronique du LADL" (LADL's electronic dictionaries). These dictionaries were initiated by the lab's founder, Maurice Gross.

This package of data files was originally intended to be used with the PNU project's conjuguer command, as well as many other text processing tools.

I wrote an history of Unix & French dictionaries (in French only), which covers this dictionary and many others.

LICENSE

The original contents, as well as this package, are licensed under the Lesser General Public License For Linguistic Resources.

AUTHORS

Laboratoire d'Automatique Documentaire et Linguistique (LADL) for the original contents.

Hubert Tournier for the package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dict-fr-DELA-2021.8.27.tar.gz (19.1 MB view details)

Uploaded Source

Built Distribution

dict_fr_DELA-2021.8.27-py3-none-any.whl (19.0 MB view details)

Uploaded Python 3

File details

Details for the file dict-fr-DELA-2021.8.27.tar.gz.

File metadata

  • Download URL: dict-fr-DELA-2021.8.27.tar.gz
  • Upload date:
  • Size: 19.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.7.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11

File hashes

Hashes for dict-fr-DELA-2021.8.27.tar.gz
Algorithm Hash digest
SHA256 7ab1905a34a20af4447ddf242a8d76dd788b2c07c9d7b1da2e1e3b7d21735eab
MD5 8dd5d5367b2a6d4a593416d89bcfa408
BLAKE2b-256 f8c8d297bdf331d8fa3906f0f6f82714ac02a1541ca026917e8300741e7502f0

See more details on using hashes here.

File details

Details for the file dict_fr_DELA-2021.8.27-py3-none-any.whl.

File metadata

  • Download URL: dict_fr_DELA-2021.8.27-py3-none-any.whl
  • Upload date:
  • Size: 19.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.7.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11

File hashes

Hashes for dict_fr_DELA-2021.8.27-py3-none-any.whl
Algorithm Hash digest
SHA256 105775cf9f3235964c28eff6602cc82652468eadbbb0a94ec6bcbc7c501d4828
MD5 15ecb691ff0077a0e22dee88602e7145
BLAKE2b-256 c0bfad322a9433bb7cd5e87638d88079bf261a79dfc72bc51da9e3b99ae47fa7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page