Skip to main content

Natural Language Processing (NLP) library for Urdu language.

Project description

Urduhack: NLP library for ( 🇵🇰 ) Urdu language

License: MIT image image wheel Build Status codecov Last commit image Downloads Join Slack Say Thanks!

Feature Support

  • Normalization
    • Arabic and Urdu Unicode Redundancy Problem
    • Character Normalization
    • Combined Characters Normalization
    • Diacritics Removal
    • Spaces Before & After Digits
    • Spaces After Punctuations
    • Joined Words Fix
  • Tokenization
    • Sentence Tokenization
    • Words Tokenization

Roadmap

  • Classification
    • Sentimental Analysis
    • Sentence Classification
    • Documents Classification
  • Name Entity Recognition
  • Image to Text
  • Speak to Text

Installation

Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.

To install Requests, simply use pip

$ pip install urduhack

Documentation

Fantastic documentation is available at https://urduhack.readthedocs.io/

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and published. :)

Contributors

Special thanks to everyone who contributed to getting the UrduHack to the current state.

Backers Backers on Open Collective

Thank you to all our backers! 🙏 [Become a backer]

Sponsors Sponsors on Open Collective

Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]

Copyright and license

Code released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urduhack-0.1.4.tar.gz (59.3 kB view details)

Uploaded Source

Built Distribution

urduhack-0.1.4-py3-none-any.whl (62.9 kB view details)

Uploaded Python 3

File details

Details for the file urduhack-0.1.4.tar.gz.

File metadata

  • Download URL: urduhack-0.1.4.tar.gz
  • Upload date:
  • Size: 59.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.1

File hashes

Hashes for urduhack-0.1.4.tar.gz
Algorithm Hash digest
SHA256 7a69d303ab709114e3059507f5ac6f168fe9eea9148c5407f8d923bccabcf5e1
MD5 c88b6d95f51dbb237925aa40156e1472
BLAKE2b-256 56f29060ba82d4b682b50063d372e7f13d29e3e44ef273599965fc2cf7c78f11

See more details on using hashes here.

File details

Details for the file urduhack-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: urduhack-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 62.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.1

File hashes

Hashes for urduhack-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 152406d25b0545f41de6bc5ab2b3dcc100802ad6b3476180a622c619a6d5975d
MD5 301316659319b9dde5733d9e974f53a1
BLAKE2b-256 a959ea10cac4c2753451e7f6b738f8cf1b7a9a4f5175523ff901574fa511f084

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page