A Natural Language Processing (NLP) library for the Urdu language.

Urduhack: A Python NLP library for Urdu language

Urduhack is an NLP library for the Urdu language. It comes with many batteries-included features to help you process Urdu data as easily as possible.

Our Goal

  • Academic users: Experiment more easily and test hypotheses without coding from scratch.
  • NLP beginners: Learn how to build an NLP project with production-level code quality.
  • NLP developers: Build a production-level application within minutes.

🔥 Features Support

  • Normalization
  • Preprocessing
  • Tokenization
  • Pipeline Module
  • Models
    • POS tagger
    • Sentiment analysis
    • Sentence classification
    • Document classification
    • Named entity recognition
    • Image to text
    • Speech to text
  • Datasets loader
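
To make the Normalization item above more concrete: Urdu text often contains Arabic code points that look like Urdu letters but are distinct characters, and normalization maps them to one consistent form. The sketch below illustrates that idea with the standard library only. It is not Urduhack's implementation; the function name and the two-entry mapping table are hypothetical and far from complete.

```python
# Illustrative only: this is NOT Urduhack's implementation.
# The mapping table and function name are hypothetical.

# Arabic code points that resemble, but differ from, the letters used in Urdu text.
ARABIC_TO_URDU = {
    "\u064A": "\u06CC",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
    "\u0643": "\u06A9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
}

_TABLE = str.maketrans(ARABIC_TO_URDU)


def normalize_chars(text: str) -> str:
    """Replace Arabic look-alike characters with their Urdu counterparts."""
    return text.translate(_TABLE)
```

Characters outside the table pass through unchanged, so the function is safe to run on mixed-script text.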

🛠 Installation

Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.
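
If you want to check whether your interpreter falls in the range stated above before installing, something like the following sketch may help. The constants and the function name are hypothetical; only the 3.6 to 3.7 range comes from this README.

```python
import sys

# Range taken from the support statement above; the names are hypothetical.
MIN_SUPPORTED = (3, 6)
MAX_SUPPORTED = (3, 7)


def is_supported_python(version=None) -> bool:
    """Return True if (major, minor) falls within the officially supported range."""
    major, minor = (version or sys.version_info)[:2]
    return MIN_SUPPORTED <= (major, minor) <= MAX_SUPPORTED


if __name__ == "__main__":
    if not is_supported_python():
        print("Warning: this interpreter is outside the officially supported range.")
```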

Install with the CPU version of TensorFlow:

$ pip install urduhack[tf]

Install with the GPU version of TensorFlow:

$ pip install urduhack[tf-gpu]

Usage

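import urduhack

# Download the models used by the pipeline
urduhack.download()

# Build the pipeline and run it on a piece of Urdu text
nlp = urduhack.Pipeline()
text = ""  # put the Urdu text to process here
doc = nlp(text)

for sentence in doc.sentences:
    print(sentence.text)
    # Part-of-speech tag for each word
    for word in sentence.words:
        print(f"{word.text}\t{word.pos}")

    # Named-entity tag for each token
    for token in sentence.tokens:
        print(f"{token.text}\t{token.ner}")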
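
If you would rather collect the tags than print them, the loop above can be wrapped in a small helper. This is a minimal sketch: `collect_pos_tags` is a hypothetical name, not part of Urduhack, and it relies only on the `sentences`, `words`, `text` and `pos` attributes shown in the usage example. The empty-input guard is an assumption, not documented library behavior.

```python
from typing import Callable, List, Tuple


def collect_pos_tags(nlp: Callable, text: str) -> List[Tuple[str, str]]:
    """Run `nlp` on `text` and return (word, POS tag) pairs for every sentence."""
    if not text.strip():
        # Skip the pipeline call for empty input; this guard is an assumption.
        return []
    doc = nlp(text)
    return [
        (word.text, word.pos)
        for sentence in doc.sentences
        for word in sentence.words
    ]
```

With Urduhack installed and the models downloaded, this could be called as collect_pos_tags(urduhack.Pipeline(), text). Passing the pipeline in as an argument also makes the helper easy to test with a stub.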

🔗 Documentation

Fantastic documentation is available at https://urduhack.readthedocs.io/

  • Installation: How to install Urduhack and download models.
  • Quickstart: New to Urduhack? Here's everything you need to know!
  • API Reference: The detailed reference for Urduhack's API.

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and published. :)

👍 Contributors

Special thanks to everyone who helped bring Urduhack to its current state.

Backers

Thank you to all our backers! 🙏 [Become a backer]

Sponsors

Support this project by becoming a sponsor. [Become a sponsor]

📝 Copyright and license

Code released under the MIT License.
