Natural Language Processing (NLP) library for the Urdu language.

Project description

Urduhack: NLP library for ( 🇵🇰 ) Urdu language

Feature Support

  • Normalization
    • Arabic and Urdu Unicode Redundancy Problem
    • Character Normalization
    • Combined Characters Normalization
    • Diacritics Removal
    • Spaces Before & After Digits
    • Spaces After Punctuation
    • Joined Words Fix
  • Tokenization
    • Sentence Tokenization
    • Word Tokenization
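
The normalization and tokenization steps listed above can be illustrated with a minimal standard-library sketch. It does not use or reproduce Urduhack's own API: the function names (normalize, sentence_tokenize, word_tokenize), the three-entry character table, and the regular expressions are all assumptions chosen for illustration, and they cover only a small part of what a real Urdu normalizer has to handle. The Joined Words Fix is not attempted here.

```python
import re
import unicodedata

# Hypothetical, deliberately partial table: Arabic code points that resemble
# Urdu letters, mapped to the code points normally used in Urdu text.
ARABIC_TO_URDU = str.maketrans({
    "\u0643": "\u06A9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
    "\u064A": "\u06CC",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
    "\u0647": "\u06C1",  # ARABIC LETTER HEH -> ARABIC LETTER HEH GOAL
})

# Fathatan through sukun, plus superscript alef (an assumed diacritic set).
DIACRITICS = re.compile(r"[\u064B-\u0652\u0670]")

# Urdu comma, semicolon, question mark, and full stop.
URDU_PUNCT = "\u060C\u061B\u061F\u06D4"


def normalize(text):
    """Apply a few illustrative normalization steps to Urdu text."""
    # Combined characters: compose sequences such as alef + madda above.
    text = unicodedata.normalize("NFC", text)
    # Arabic/Urdu redundancy: replace look-alike Arabic letters.
    text = text.translate(ARABIC_TO_URDU)
    # Diacritics removal.
    text = DIACRITICS.sub("", text)
    # Spaces before and after digits that touch a letter.
    text = re.sub(r"([^\W\d_])(\d)", r"\1 \2", text)
    text = re.sub(r"(\d)([^\W\d_])", r"\1 \2", text)
    # Space after Urdu punctuation when the next character is not whitespace.
    text = re.sub("([" + URDU_PUNCT + r"])(?=\S)", r"\1 ", text)
    return text


def sentence_tokenize(text):
    """Split on the Urdu full stop, the Urdu question mark, or '!'."""
    parts = re.findall(r"[^\u06D4\u061F!]+[\u06D4\u061F!]?", text)
    return [part.strip() for part in parts if part.strip()]


def word_tokenize(sentence):
    """Naive whitespace split; real Urdu word segmentation is harder."""
    return sentence.split()
```

Running NFC composition first means a decomposed alef plus madda is folded into a single letter before the diacritic pattern is applied, so the two steps do not interfere with each other.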

Roadmap

  • Classification
    • Sentiment Analysis
    • Sentence Classification
    • Document Classification
  • Named Entity Recognition
  • Image to Text
  • Speech to Text

Installation

Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.

To install Urduhack, use pip:

$ pip install urduhack

Documentation

Documentation is available at https://urduhack.readthedocs.io/

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and published. :)

Community

Get updates on Urduhack NLP development and chat with the project maintainers and community members.

Contributors

Special thanks to everyone who contributed to getting Urduhack to its current state.

Sponsors

Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]

Copyright and license

Code released under the MIT License.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urduhack-0.1.1.tar.gz (56.8 kB)

Uploaded Source

Built Distribution

urduhack-0.1.1-py3-none-any.whl (60.2 kB)

Uploaded Python 3

File details

Details for the file urduhack-0.1.1.tar.gz.

File metadata

  • Download URL: urduhack-0.1.1.tar.gz
  • Size: 56.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.3

File hashes

Hashes for urduhack-0.1.1.tar.gz

Algorithm    Hash digest
SHA256       754564f3238d4e4303787b6647ee35a0c55e8a0431702539283c18488093f9be
MD5          04f2622ae06ef55fca326c1904049fe3
BLAKE2b-256  139ecd8683817f632d0d0b7797eb3b7142d593257a511d87834cfbddd15d724b

File details

Details for the file urduhack-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: urduhack-0.1.1-py3-none-any.whl
  • Size: 60.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.3

File hashes

Hashes for urduhack-0.1.1-py3-none-any.whl

Algorithm    Hash digest
SHA256       c57df5f38f68586c6934ba99afd61d9c7530d222bc03350e912f1b2572e2fef2
MD5          6194076715d9c9f799b572b373e7fc26
BLAKE2b-256  cda458373605034ae8e3e7b655c14ff7cab08e39fcbc8f6a85883f6a9941ad05
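
A downloaded file can be checked against the SHA256 digests listed for the two distributions. The following is a minimal sketch using only Python's hashlib; the helper names (sha256_of, verify) and the EXPECTED_SHA256 table are introduced here for illustration, and the file path you pass in is assumed to be wherever you saved the download.

```python
import hashlib

# SHA256 digests copied from the file listings for urduhack 0.1.1.
EXPECTED_SHA256 = {
    "urduhack-0.1.1.tar.gz":
        "754564f3238d4e4303787b6647ee35a0c55e8a0431702539283c18488093f9be",
    "urduhack-0.1.1-py3-none-any.whl":
        "c57df5f38f68586c6934ba99afd61d9c7530d222bc03350e912f1b2572e2fef2",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at path, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Return True if the file's SHA256 digest matches the expected hex string."""
    return sha256_of(path) == expected.lower()
```

For example, verify("urduhack-0.1.1.tar.gz", EXPECTED_SHA256["urduhack-0.1.1.tar.gz"]) should return True for an intact copy of the source distribution saved in the current directory.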
