Skip to main content

Natural Language Processing (NLP) library for Urdu language.

Project description

Urduhack: A Python NLP library for Urdu language

image image Azure DevOps builds Azure DevOps tests Build Status CodeFactor codecov image Downloads Gitter License: MIT

Urduhack is a NLP library for urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way possible.

🔥 Features Support

  • Normalization
    • Arabic and Urdu Unicode Redundancy Problem
    • Character Normalization
    • Combined Characters Normalization
    • Diacritics Removal
    • Spaces Before & After Digits
    • Spaces After Punctuations
    • Joined Words Fix
  • Tokenization
    • Sentence Tokenization
    • Words Tokenization
  • Data Pre-processing
    • Handles all kind of numbers, emails, currencies and urls etc.
  • Tasks
    • Sentimental analysis
    • Sentence classification
    • Documents classification
    • Name entity recognition
    • Image to text
    • Speech to text
  • Datasets
    • IMDB Urdu movies review dataset
    • Hand written digits datasets

🛠 Installation

Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.

Installing with tensorflow cpu version.

$ pip install urduhack[tf]

Installing with tensorflow gpu version.

$ pip install urduhack[tf-gpu]

🔗 Documentation

Fantastic documentation is available at https://urduhack.readthedocs.io/

Documentation
Installation How to install Urduhack and download models
Quickstart New to Urduhack? Here's everything you need to know!
API Reference The detailed reference for Urduhack's API.

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and published. :)

👍 Contributors

Special thanks to everyone who contributed to getting the UrduHack to the current state.

Backers Backers on Open Collective

Thank you to all our backers! 🙏 [Become a backer]

Sponsors Sponsors on Open Collective

Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]

📝 Copyright and license

Code released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urduhack-0.3.4.tar.gz (71.0 kB view details)

Uploaded Source

Built Distribution

urduhack-0.3.4-py3-none-any.whl (81.9 kB view details)

Uploaded Python 3

File details

Details for the file urduhack-0.3.4.tar.gz.

File metadata

  • Download URL: urduhack-0.3.4.tar.gz
  • Upload date:
  • Size: 71.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.7.1

File hashes

Hashes for urduhack-0.3.4.tar.gz
Algorithm Hash digest
SHA256 af6defddae2abd5b2186decaed4c354772f41975fa7887aa8b47e90b47501315
MD5 52ab770e97d1200f160a7f6fb0abe1d2
BLAKE2b-256 3e1255f2e483bfea0f646b7d5baf5359347c584a0589351d72407f5cecdf322e

See more details on using hashes here.

File details

Details for the file urduhack-0.3.4-py3-none-any.whl.

File metadata

  • Download URL: urduhack-0.3.4-py3-none-any.whl
  • Upload date:
  • Size: 81.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.7.1

File hashes

Hashes for urduhack-0.3.4-py3-none-any.whl
Algorithm Hash digest
SHA256 eb910540b31f925c0632b6aee97a9b755a96bada768e038201f5217a360aa91f
MD5 a91f58266dcf65b2f8061fec5b4e66db
BLAKE2b-256 03677c10f834f5456e3c8502122bb43d84d27723bc0192566a6789df2fe2abd0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page