UltraNLP - Ultra-Fast NLP Preprocessing Library
🚀 Ultra-fast, comprehensive NLP preprocessing with advanced tokenization
Features
- ⚡ Ultra-fast tokenization - Handles currency amounts ($20, 20Rs), emails, hashtags, and emojis
- 🧹 Comprehensive text cleaning - HTML, URLs, emojis, normalization
- 🔤 Smart spell correction - With caching and performance optimization
- 📦 Batch processing - Parallel processing for large datasets
- 🎯 Production ready - Memory efficient, thread-safe
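To make the tokenization and cleaning features above concrete, here is a minimal sketch using only the Python standard library. It is not UltraNLP's API or implementation: the names `TOKEN_PATTERN`, `tokenize`, and `clean_text` are hypothetical, and the pattern order and emoji ranges are assumptions chosen for illustration.

```python
import html
import re
import unicodedata

# Hypothetical pattern (not taken from UltraNLP). More specific token types
# come first so an email is not split at "@" and "$20" stays one token.
TOKEN_PATTERN = re.compile(
    r"""
    [\w.+-]+@[\w-]+(?:\.[\w-]+)+            # email addresses
  | https?://\S+                            # URLs
  | [#@]\w+                                 # hashtags and mentions
  | [$€£₹]\d+(?:\.\d+)?                     # symbol-prefixed amounts, e.g. $20
  | \d+(?:\.\d+)?(?:Rs|USD|EUR)\b           # suffixed amounts, e.g. 20Rs
  | [\U0001F300-\U0001FAFF\u2600-\u27BF]    # common emoji ranges (not exhaustive)
  | \w+(?:'\w+)?                            # words, with an optional apostrophe part
  | [^\w\s]                                 # any other single symbol
    """,
    re.VERBOSE,
)

TAG_RE = re.compile(r"<[^>]+>")
URL_RE = re.compile(r"https?://\S+")
WHITESPACE_RE = re.compile(r"\s+")


def tokenize(text):
    """Return the tokens matched by TOKEN_PATTERN, in order of appearance."""
    return TOKEN_PATTERN.findall(text)


def clean_text(text):
    """Strip HTML tags and URLs, normalize Unicode, collapse whitespace, lowercase."""
    text = html.unescape(TAG_RE.sub(" ", text))
    text = URL_RE.sub(" ", text)
    text = unicodedata.normalize("NFKC", text)
    return WHITESPACE_RE.sub(" ", text).strip().lower()


if __name__ == "__main__":
    print(tokenize("Pay $20 or 20Rs to me@example.com #deal 🚀"))
    print(clean_text("<p>Visit https://example.com NOW &amp; save</p>"))
```

Ordering the alternatives from most to least specific is the design choice that matters here: a plain word pattern placed first would split `me@example.com` into several tokens.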
Quick Start
Download files
Source Distribution
ultranlp-1.0.0.tar.gz (8.0 kB)
Built Distribution
ultranlp-1.0.0-py3-none-any.whl (9.8 kB)
File details
Details for the file ultranlp-1.0.0.tar.gz.
File metadata
- Download URL: ultranlp-1.0.0.tar.gz
- Upload date:
- Size: 8.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | fab85f4bf6050f13ac394428d6101651fe174253fe32bce8bcac5fc8fe11c57c |
| MD5 | dfa66d8d9ba61d0dbe2c0310ff254ef4 |
| BLAKE2b-256 | 4e845754190ccd540256c4a7abee199c8d6ed9595873d36f66778d8a205e5863 |
File details
Details for the file ultranlp-1.0.0-py3-none-any.whl.
File metadata
- Download URL: ultranlp-1.0.0-py3-none-any.whl
- Upload date:
- Size: 9.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0044f45127076f731a09fd6f9891f3c2b528ed111de5c29604d43b55c50a8093 |
| MD5 | ea309fb4a0264479aa5239a36d56eb88 |
| BLAKE2b-256 | 2136b6d31360ade75d1f857006b99b49d92b66a37a6d445fe311bdffc8ba9ac0 |
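
The digests listed for each file can be checked against a local download. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `file_digest` and `matches` are hypothetical, and the sketch assumes the file has already been downloaded into the current directory.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=65536):
    """Hash a file in chunks and return the hex digest.

    `algorithm` may be any name accepted by hashlib.new (e.g. "sha256",
    "md5"), or the label "blake2b-256" for BLAKE2b with a 32-byte digest.
    """
    if algorithm == "blake2b-256":
        hasher = hashlib.blake2b(digest_size=32)
    else:
        hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read in fixed-size chunks so large files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches(path, expected, algorithm="sha256"):
    """Return True if the file's digest equals the expected hex string."""
    return file_digest(path, algorithm) == expected.strip().lower()


# Example call, assuming the sdist sits next to this script:
# matches(
#     "ultranlp-1.0.0.tar.gz",
#     "fab85f4bf6050f13ac394428d6101651fe174253fe32bce8bcac5fc8fe11c57c",
# )
```

SHA256 is the sensible default for this check. MD5 is no longer collision-resistant and is best used only to detect accidental corruption.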