French dictionaries from Association des Bibliophiles Universels (ABU)
Project description
Installation
pip install dict-fr-ABU
French dictionaries from Association des Bibliophiles Universels (ABU)
DESCRIPTION
This package contains several dictionaries processed from those made available by the Association des Bibliophiles Universels (ABU) organization before 2003.
FILES
All files are installed in Python's /usr/local equivalent, under share/dict.
Original files
Filename | Description |
---|---|
dict-fr-ABU-cites | 39.076 French cities list (accented, with compound words), along with postal zip code |
dict-fr-ABU-Header-cites | French cities list (mandatory header) |
dict-fr-ABU-dicorth | 1.500 French orthographical difficulties by decreasing frequency (with compound words) |
dict-fr-ABU-Header-dicorth | French orthographical difficulties (mandatory header) |
dict-fr-ABU-mots_communs | 255.282 French common words (including female and plural forms, as well as conjugated verbs), along with singular / unconjugated form, and type |
dict-fr-ABU-pays | 170 countries and regions (with compound words) |
dict-fr-ABU-Header-pays | Countries and regions (mandatory header) |
dict-fr-ABU-prenoms | 12.437 firstnames (unaccented) |
dict-fr-ABU-Header-prenoms | Firstnames (mandatory header) |
dict-fr-ABU-License | ABU 1.1 License |
Generated files
Filename | Description |
---|---|
dict-fr-ABU-cites.ascii | French cities list (unaccented) |
dict-fr-ABU-cites.unicode | French cities list (accented) |
dict-fr-ABU-cites.combined | French cities list (with both accented and unaccented words) |
dict-fr-ABU-mots_communs.ascii | French common words (unaccented) |
dict-fr-ABU-mots_communs.combined | French common words (accented) |
dict-fr-ABU-mots_communs.unicode | French common words (with both accented and unaccented words) |
dict-fr-ABU-pays.ascii | Countries and regions (unaccented) |
dict-fr-ABU-pays.combined | Countries and regions (accented) |
dict-fr-ABU-pays.unicode | Countries and regions (with both accented and unaccented words) |
dict-fr-ABU-prenoms.ascii | Firstnames (unaccented) |
These generated files went through the following transformations:
- extraction of the headers in the dict-fr-header-* files above
- conversion from ISO-Latin-1 to UTF-8
- sort
- removal of duplicates
- removal of lemma and grammatical info from dict-fr-ABU-mots_communs
- removal of the zip codes from dict-fr-ABU-cites
- lossless conversion of accents for the *-ascii versions
- combination of the *-ascii and *-unicode versions into the *-combined ones (without duplicates)
SEE ALSO
spell(1) like tools, anagram(6)
HISTORY
These data files were originally intended to be used with the PNU project's anagram command, as well as many other text processing tools.
I wrote an history of Unix & French dictionaries (in French only), which covers this dictionary and many others.
LICENSE
The original contents, as well as this package, are licensed under the ABU 1.1 license.
Some source files had mandatory headers that were kept under data/dict-fr-ABU-Header-* rather than in the files themselves, in order to ease direct processing with other tools.
AUTHORS
Association des Bibliophiles Universels (ABU) for the original contents.
Hubert Tournier for the package.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dict-fr-ABU-2021.8.27.tar.gz
.
File metadata
- Download URL: dict-fr-ABU-2021.8.27.tar.gz
- Upload date:
- Size: 4.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.7.0 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7159242c3b4b709e8fe97d960d2aec20a37a1e616a4cd1b50fc734661b45cf6 |
|
MD5 | 89ab681fae8b360a744d7ec8389c7adc |
|
BLAKE2b-256 | f8d3e6cbe2fd2df8b4ccb4f3fb91260f8884914fd041e05f0d7e1e0e33ffba21 |
File details
Details for the file dict_fr_ABU-2021.8.27-py3-none-any.whl
.
File metadata
- Download URL: dict_fr_ABU-2021.8.27-py3-none-any.whl
- Upload date:
- Size: 4.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.7.0 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 17fec7ba6b0d311489ad3014ba50d177c6cee754d738c57856b9819d12a98372 |
|
MD5 | 20e8935dd485d08c7846ec58af9085a4 |
|
BLAKE2b-256 | 780e3852516007a78171eeb37f614d5391207811c51a8480f466695032c18b61 |