Text preprocessing utilities for ACACE
Project description
ACACE Text Preprocessor
A text preprocessing module for the Adaptive Context-Aware Content Engine (ACACE).
Features
- HTML tag removal and entity decoding
- Whitespace normalization
- Simple and efficient text cleaning
Installation
pip install acace_text_preprocessor
Usage
from acace_text_preprocessor import preprocess_text
# Clean and normalize text
cleaned_text = preprocess_text("Your <b>HTML</b> text with extra spaces")
print(cleaned_text) # Output: "Your HTML text with extra spaces"
License
MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file acace_text_preprocessor-0.1.1.tar.gz.
File metadata
- Download URL: acace_text_preprocessor-0.1.1.tar.gz
- Upload date:
- Size: 1.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.2 CPython/3.9.6 Darwin/24.3.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
89afbb849dcb1232127909381bc9919b4bba990a87f4bf1df6afd89b8d8527de
|
|
| MD5 |
7c3ab4fb499228dd3bb8ea607b496b63
|
|
| BLAKE2b-256 |
73a8685b5f418d76d1ea607f53e33c138ff32b89c60f36f3adf2db39611f8450
|
File details
Details for the file acace_text_preprocessor-0.1.1-py3-none-any.whl.
File metadata
- Download URL: acace_text_preprocessor-0.1.1-py3-none-any.whl
- Upload date:
- Size: 2.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.2 CPython/3.9.6 Darwin/24.3.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a0e78ac7c86eaecbcb2bdefb49e64c4d6455b8c7dc5c721c4851936c6ac7610b
|
|
| MD5 |
1211feb3dcf70a85fb0f0c09db09e26f
|
|
| BLAKE2b-256 |
f20d33d99624674e2c182d3b445a04c221f7038436de5f5bf5defa7831041741
|