A package for curating dictionaries (especially in Babylon and StarDict formats).
dict curation
A package for curating doc file collections. Prominent features:
Scrape texts from various sites, such as Wikisource. See example here. (PS: Consider contributing to the raw_etexts repo.)
OCR a PDF with Google Drive. The file is automatically split into 25-page segments, and each segment is OCRed individually. See usage example here, function here.
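The 25-page splitting described above can be illustrated with a short sketch. This is not the package's own code: the helper name page_chunks and its parameters are hypothetical, and it only computes the page ranges, leaving the PDF splitting and the Google Drive upload out.

```python
def page_chunks(total_pages, chunk_size=25):
    """Return inclusive, 1-based (first, last) page ranges covering a document.

    Hypothetical helper for illustration; not part of dict_curation.
    """
    if total_pages < 0 or chunk_size < 1:
        raise ValueError("total_pages must be >= 0 and chunk_size >= 1")
    return [
        # The last range is clipped so it never runs past the final page.
        (start, min(start + chunk_size - 1, total_pages))
        for start in range(1, total_pages + 1, chunk_size)
    ]


if __name__ == "__main__":
    # A 60-page PDF yields two full segments and one shorter one.
    for first, last in page_chunks(60):
        print(f"OCR pages {first}-{last}")
```

Each range would then be OCRed on its own and the resulting texts concatenated in order.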
For users
Manually and periodically generated docs here
For detailed examples and help, please see individual module files in this package.
Installation or upgrade:
sudo pip install dict_curation -U
sudo pip install git+https://github.com/sanskrit-coders/dict_curation/@master -U
Web.
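After installing or upgrading, it can be useful to confirm which version is active. The sketch below uses only the standard library (importlib.metadata, Python 3.8+); the helper name installed_version is an assumption made for this example and is not provided by the package.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent.

    Hypothetical helper for illustration; not part of dict_curation.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints the version string if the package is installed, otherwise None.
    print(installed_version("dict_curation"))
```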
For contributors
Contact
Have a problem or question? Please head to GitHub.
Packaging
~/.pypirc should have your PyPI login credentials.
python setup.py bdist_wheel
twine upload dist/* --skip-existing
Build documentation
Sphinx HTML docs can be generated with: cd docs; make html
Testing
Run pytest in the root directory.
Auxiliary tools
Hashes for dict_curation-0.0.2-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 4b4d0ab6c32775c9de8884e03cce557f0a8fbfc2e807734ee5c644ef47fea367
MD5 | 27de6118e8786cade1bee1588a9456c1
BLAKE2b-256 | e5690d21a0adb7097ebb6f147761f7e964dd961b1bd7eb59c960b02a7f9ce369