Skip to main content

A package for curating dictionaries (esp in babylon and stardict formats).

Project description

^Build status Build Status Documentation Status PyPI version

dict curation

A package for curating doc file collections. Prominent features:

  • Scrape texts off various sites, such as Wikisource. See example here. (PS: Consider contributing to raw_etexts repo. )

  • OCR some pdf with google drive. Automatically splits into 25 page bits and ocrs them individually. See usage example here, function here.

For users

Installation or upgrade:

  • sudo pip install dict_curation -U

  • sudo pip install git+https://github.com/sanskrit-coders/dict_curation/@master -U

  • Web.

For contributors

Contact

Have a problem or question? Please head to github.

Packaging

  • ~/.pypirc should have your pypi login credentials.

python setup.py bdist_wheel
twine upload dist/* --skip-existing

Build documentation

  • sphinx html docs can be generated with cd docs; make html

Testing

Run pytest in the root directory.

Auxiliary tools

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dict_curation-0.0.3.tar.gz (40.7 kB view details)

Uploaded Source

Built Distribution

dict_curation-0.0.3-py3-none-any.whl (45.1 kB view details)

Uploaded Python 3

File details

Details for the file dict_curation-0.0.3.tar.gz.

File metadata

  • Download URL: dict_curation-0.0.3.tar.gz
  • Upload date:
  • Size: 40.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for dict_curation-0.0.3.tar.gz
Algorithm Hash digest
SHA256 94e98e93891d64a3c60f6811d31322b61f0f0f27490450e42e50db131de80cc4
MD5 5cf662f33fe3f2c55ca3429970627c3a
BLAKE2b-256 747e6861f5810aa3478fb91d0b99fcb9fc5f5a1602457c76326c392d3c54c8bf

See more details on using hashes here.

File details

Details for the file dict_curation-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: dict_curation-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 45.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.1.2 requests-toolbelt/0.9.1 tqdm/4.55.2 CPython/3.9.1

File hashes

Hashes for dict_curation-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 ff1063e756837cc1852c712f988c376971e82bd37eaa0fd0cbd93836c60795d6
MD5 abc6ce4942c723cbb652fba97a36f15d
BLAKE2b-256 d6ad252f5e0d2f8f80da982869a2f21655d9ae685e68140ffe5f956cd54673f0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page