hazm · PyPI

Python library for digesting Persian text.

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Natural Language
- Persian
Programming Language
- Python :: 2.7
- Python :: 3.3
Topic
- Text Processing

Project description

Python library for digesting Persian text.

Text cleaning
Sentence and word tokenizer
Word lemmatizer
POS tagger
Dependency parser
Corpus readers for Hamshahri and Bijankhan
NLTK compatible
Python 3.3 and 2.7 support
|Build Status|

Usage

>>> from hazm import Normalizer
>>> normalizer = Normalizer()
>>> normalizer.normalize('اصلاح نويسه ها و استفاده از نیم‌فاصله پردازش را آسان مي كند')
'اصلاح نویسه‌ها و استفاده از نیم‌فاصله پردازش را آسان می‌کند'

>>> from hazm import sent_tokenize, word_tokenize
>>> sent_tokenize('ما هم برای وصل کردن آمدیم! ولی برای پردازش، جدا بهتر نیست؟')
['ما هم برای وصل کردن آمدیم!', 'ولی برای پردازش، جدا بهتر نیست؟']
>>> word_tokenize('ولی برای پردازش، جدا بهتر نیست؟')
['ولی', 'برای', 'پردازش', '،', 'جدا', 'بهتر', 'نیست', '؟']

>>> from hazm import Stemmer, Lemmatizer
>>> stemmer = Stemmer()
>>> stemmer.stem('کتاب‌ها')
'کتاب'
>>> lemmatizer = Lemmatizer()
>>> lemmatizer.lemmatize('می‌روم')
'رفت#رو'

>>> from hazm import POSTagger
>>> tagger = POSTagger()
>>> tagger.tag(word_tokenize('ما بسیار کتاب می‌خوانیم'))
[('ما', 'PR'), ('بسیار', 'ADV'), ('کتاب', 'N'), ('می‌خوانیم', 'V')]

>>> from hazm import DependencyParser
>>> parser = DependencyParser(tagger=POSTagger())
>>> parser.parse(word_tokenize('زنگ‌ها برای که به صدا درمی‌آید ؟'))
<DependencyGraph with 8 nodes>

Installation

pip install hazm

We also trained tagger and parser models which you may put them in resources folder of your project.

Thanks

from constributors: Mojtaba Khallash and Mohsen Imany.
from Virastyar for persian word list.

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Natural Language
- Persian
Programming Language
- Python :: 2.7
- Python :: 3.3
Topic
- Text Processing

Release history Release notifications | RSS feed

0.12.1

Apr 1, 2026

0.12.0

Apr 1, 2026

0.11.0

Dec 20, 2025

0.10.0

Jan 16, 2024

0.9.4

Oct 1, 2023

0.9.3

Jul 19, 2023

0.9.2

Jul 8, 2023

0.9.1

Jun 30, 2023

0.7.0

Oct 12, 2018

0.6.0.1

Oct 12, 2018

0.5.2

Oct 7, 2015

0.5.1

Jun 29, 2015

0.5

Mar 20, 2015

0.4

Dec 16, 2014

0.3

Aug 29, 2014

0.2

Jul 11, 2014

This version

0.1

Dec 14, 2013

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hazm-0.1.tar.gz (134.9 kB view details)

Uploaded Dec 14, 2013 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hazm-0.1.linux-x86_64.exe (198.6 kB view details)

Uploaded Dec 14, 2013 Source

File details

Details for the file hazm-0.1.tar.gz.

File metadata

Download URL: hazm-0.1.tar.gz
Upload date: Dec 14, 2013
Size: 134.9 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for hazm-0.1.tar.gz
Algorithm	Hash digest
SHA256	`1101cf8b66884e9f64c8fa88c129f66150a466551578812d5da7b876213c862e`
MD5	`29bd9c844d18547b3163271b6d61c4dc`
BLAKE2b-256	`da4d7065524c9cede2f2a0e57f0a1d6663af036495f084fc60834da31bce33b3`

See more details on using hashes here.

File details

Details for the file hazm-0.1.linux-x86_64.exe.

File metadata

Download URL: hazm-0.1.linux-x86_64.exe
Upload date: Dec 14, 2013
Size: 198.6 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for hazm-0.1.linux-x86_64.exe
Algorithm	Hash digest
SHA256	`53548952bd94091256e3bb1cd9e7adc6b615bfeae2e83d83cb9ec4305828adf5`
MD5	`2b6ea27cdd0b84153007884132efe995`
BLAKE2b-256	`942e5aa15812f9ca87c20210c1a94645770dd3e2fdb0b4a91279c75ab22381ca`

See more details on using hashes here.

hazm 0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Usage

Installation

Thanks

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes