Skip to main content

French dictionaries from Association des Bibliophiles Universels (ABU)

Project description

Installation

pip install dict-fr-ABU

French dictionaries from Association des Bibliophiles Universels (ABU)

DESCRIPTION

This package contains several dictionaries processed from those made available by the Association des Bibliophiles Universels (ABU) organization before 2003.

FILES

All files are installed in Python's /usr/local equivalent, under share/dict.

Original files

Filename Description
dict-fr-ABU-cites 39.076 French cities list (accented, with compound words), along with postal zip code
dict-fr-ABU-Header-cites French cities list (mandatory header)
dict-fr-ABU-dicorth 1.500 French orthographical difficulties by decreasing frequency (with compound words)
dict-fr-ABU-Header-dicorth French orthographical difficulties (mandatory header)
dict-fr-ABU-mots_communs 255.282 French common words (including female and plural forms, as well as conjugated verbs), along with singular / unconjugated form, and type
dict-fr-ABU-pays 170 countries and regions (with compound words)
dict-fr-ABU-Header-pays Countries and regions (mandatory header)
dict-fr-ABU-prenoms 12.437 firstnames (unaccented)
dict-fr-ABU-Header-prenoms Firstnames (mandatory header)
dict-fr-ABU-License ABU 1.1 License

Generated files

Filename Description
dict-fr-ABU-cites.ascii French cities list (unaccented)
dict-fr-ABU-cites.unicode French cities list (accented)
dict-fr-ABU-cites.combined French cities list (with both accented and unaccented words)
dict-fr-ABU-mots_communs.ascii French common words (unaccented)
dict-fr-ABU-mots_communs.combined French common words (accented)
dict-fr-ABU-mots_communs.unicode French common words (with both accented and unaccented words)
dict-fr-ABU-pays.ascii Countries and regions (unaccented)
dict-fr-ABU-pays.combined Countries and regions (accented)
dict-fr-ABU-pays.unicode Countries and regions (with both accented and unaccented words)
dict-fr-ABU-prenoms.ascii Firstnames (unaccented)

These generated files went through the following transformations:

  • extraction of the headers in the dict-fr-header-* files above
  • conversion from ISO-Latin-1 to UTF-8
  • sort
  • removal of duplicates
  • removal of lemma and grammatical info from dict-fr-ABU-mots_communs
  • removal of the zip codes from dict-fr-ABU-cites
  • lossless conversion of accents for the *-ascii versions
  • combination of the *-ascii and *-unicode versions into the *-combined ones (without duplicates)

SEE ALSO

spell(1) like tools, anagram(6)

HISTORY

These data files were originally intended to be used with the PNU project's anagram command, as well as many other text processing tools.

I wrote an history of Unix & French dictionaries (in French only), which covers this dictionary and many others.

LICENSE

The original contents, as well as this package, are licensed under the ABU 1.1 license.

Some source files had mandatory headers that were kept under data/dict-fr-ABU-Header-* rather than in the files themselves, in order to ease direct processing with other tools.

AUTHORS

Association des Bibliophiles Universels (ABU) for the original contents.

Hubert Tournier for the package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dict-fr-ABU-2021.8.27.tar.gz (4.6 MB view details)

Uploaded Source

Built Distribution

dict_fr_ABU-2021.8.27-py3-none-any.whl (4.6 MB view details)

Uploaded Python 3

File details

Details for the file dict-fr-ABU-2021.8.27.tar.gz.

File metadata

  • Download URL: dict-fr-ABU-2021.8.27.tar.gz
  • Upload date:
  • Size: 4.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.7.0 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11

File hashes

Hashes for dict-fr-ABU-2021.8.27.tar.gz
Algorithm Hash digest
SHA256 a7159242c3b4b709e8fe97d960d2aec20a37a1e616a4cd1b50fc734661b45cf6
MD5 89ab681fae8b360a744d7ec8389c7adc
BLAKE2b-256 f8d3e6cbe2fd2df8b4ccb4f3fb91260f8884914fd041e05f0d7e1e0e33ffba21

See more details on using hashes here.

File details

Details for the file dict_fr_ABU-2021.8.27-py3-none-any.whl.

File metadata

  • Download URL: dict_fr_ABU-2021.8.27-py3-none-any.whl
  • Upload date:
  • Size: 4.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.7.0 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11

File hashes

Hashes for dict_fr_ABU-2021.8.27-py3-none-any.whl
Algorithm Hash digest
SHA256 17fec7ba6b0d311489ad3014ba50d177c6cee754d738c57856b9819d12a98372
MD5 20e8935dd485d08c7846ec58af9085a4
BLAKE2b-256 780e3852516007a78171eeb37f614d5391207811c51a8480f466695032c18b61

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page