Natural Language Processing (NLP) library for Urdu language.

Project description

Urduhack: NLP library for ( 🇵🇰 ) Urdu language

Feature Support

  • Normalization
    • Arabic and Urdu Unicode Redundancy Problem
    • Character Normalization
    • Combined Characters Normalization
    • Diacritics Removal
    • Spaces Before & After Digits
    • Spaces After Punctuations
    • Joined Words Fix
  • Tokenization
    • Sentence Tokenization
    • Words Tokenization
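
To make the feature list above more concrete, here is a minimal, standard-library-only sketch of three of the normalization steps (combined characters, Arabic-to-Urdu character mapping, diacritics removal) and of sentence tokenization. It does not call Urduhack and is not the library's implementation; the names `ARABIC_TO_URDU`, `normalize_sketch` and `sentence_split_sketch` are hypothetical, and the character mapping covers only two well-known cases.

```python
import re
import unicodedata

# Hypothetical, deliberately tiny mapping (illustration only):
# Arabic code points that look like, but differ from, their Urdu counterparts.
ARABIC_TO_URDU = str.maketrans({
    "\u064A": "\u06CC",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
    "\u0643": "\u06A9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
})

# Sentence terminators assumed for this sketch:
# Urdu full stop, Arabic question mark, exclamation mark.
_TERMINATORS = "\u06D4\u061F!"
_SENTENCE_RE = re.compile("[^{0}]+[{0}]?".format(_TERMINATORS))


def normalize_sketch(text: str) -> str:
    """Compose combined characters, map Arabic letters to Urdu, drop diacritics."""
    # 1. Combined characters: NFC turns ALEF + MADDAH ABOVE into ALEF WITH MADDA ABOVE.
    text = unicodedata.normalize("NFC", text)
    # 2. Arabic/Urdu redundancy: replace Arabic code points with Urdu ones.
    text = text.translate(ARABIC_TO_URDU)
    # 3. Diacritics removal: drop any remaining non-spacing marks (zabar, zer, pesh, ...).
    return "".join(ch for ch in text if unicodedata.category(ch) != "Mn")


def sentence_split_sketch(text: str) -> list:
    """Split text into sentences, keeping the terminator with each sentence."""
    parts = (match.strip() for match in _SENTENCE_RE.findall(text))
    return [part for part in parts if part]


if __name__ == "__main__":
    print(normalize_sketch("\u0643\u064A"))  # Arabic kaf + yeh, mapped to Urdu forms
    print(sentence_split_sketch("یہ کتاب ہے۔ وہ کون ہے؟"))
```

The order of the steps matters in this sketch: composing first keeps ALEF WITH MADDA ABOVE intact, because its madda would otherwise be stripped as a diacritic. For real use, consult the Urduhack documentation for the library's own functions.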

Roadmap

  • Classification
    • Sentiment Analysis
    • Sentence Classification
    • Documents Classification
  • Named Entity Recognition
  • Image to Text
  • Speech to Text

Installation

Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.

To install Urduhack, simply use pip:

$ pip install urduhack

Documentation

Fantastic documentation is available at https://urduhack.readthedocs.io/

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and published. :)

Contributors

Special thanks to everyone who contributed to getting Urduhack to its current state.

Backers

Thank you to all our backers! 🙏 [Become a backer]

Sponsors

Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]

Copyright and license

Code released under the MIT License.

Download files

Source Distribution

urduhack-0.1.2.tar.gz (57.2 kB)

Uploaded Source

Built Distribution

urduhack-0.1.2-py3-none-any.whl (60.7 kB)

Uploaded Python 3

File details

Details for the file urduhack-0.1.2.tar.gz.

File metadata

  • Download URL: urduhack-0.1.2.tar.gz
  • Size: 57.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.3

File hashes

Hashes for urduhack-0.1.2.tar.gz
Algorithm Hash digest
SHA256 9203d2ef84e5951988858f7883b1b2ff0a3bf950d058ea9075088419dfd672ff
MD5 b17955ecae55d7d90445a6082aaaecaf
BLAKE2b-256 1c8aadf238a5f558d9b6486106df51ce15be65e6446d7f71ef790b416e6edca7
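
A downloaded file can be checked against the digests listed above. The following is a minimal sketch using only the Python standard library; the helper name `sha256_of_file` and the local file path are assumptions for illustration, and the expected digest is the SHA256 value listed above for urduhack-0.1.2.tar.gz.

```python
import hashlib
import os

# SHA256 digest listed above for urduhack-0.1.2.tar.gz.
EXPECTED_SHA256 = "9203d2ef84e5951988858f7883b1b2ff0a3bf950d058ea9075088419dfd672ff"


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumed location of the downloaded archive; adjust to where you saved it.
    archive = "urduhack-0.1.2.tar.gz"
    if os.path.exists(archive):
        matches = sha256_of_file(archive) == EXPECTED_SHA256
        print("hash matches" if matches else "hash MISMATCH")
    else:
        print("archive not found:", archive)
```

pip can also enforce hashes at install time through its hash-checking mode, which avoids a manual comparison.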

File details

Details for the file urduhack-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: urduhack-0.1.2-py3-none-any.whl
  • Size: 60.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.3

File hashes

Hashes for urduhack-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 99a19ea05b298734d3ae9cbf615804fb1dde781caa20a2cea73b7b8c5788fb04
MD5 aba153609d026a42ea543714567282f8
BLAKE2b-256 4a71ea43a1ee52dbf7742c176b3446944f34d1d0e4550bc1947c68ccb2783f7f
