CLI to modify text files.
Project description
txt-utils
CLI to modify text files.
Features
merge
: merge multiple text files into oneextract-vocabulary
: extract unit vocabularytranscribe
: transcribe unitsreplace
: replace textreplace-line
: replace text in a linetrim-units
: trim unitsremove-units
: remove unitscreate-unit-occurrence-stats
: create unit occurrence statistics
Roadmap
- add tests
- create n-grams
- map units
- merge units right/left
- calculate units TF-IDF
Installation
pip install txt-utils --user
Usage
txt-utils-cli
Dependencies
- pandas
- tqdm
- ordered-set >=4.1.0
- pronunciation-dictionary >=0.0.4
License
MIT License
Acknowledgments
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410
Citation
If you want to cite this repo, you can use this BibTeX-entry:
@misc{tstu22,
author = {Taubert, Stefan},
title = {txt-utils},
year = {2022},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/stefantaubert/txt-utils}}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
txt-utils-0.0.1.tar.gz
(13.6 kB
view hashes)
Built Distribution
txt_utils-0.0.1-py3-none-any.whl
(20.6 kB
view hashes)
Close
Hashes for txt_utils-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ba7838e1356bb7adf3bc1ea48212285807154b57375e94e0ee82963e9d229342 |
|
MD5 | bb23b818f0c6091eb7fb92cf930bf010 |
|
BLAKE2b-256 | 185cd308094b53d72e065fe8ce8fbbd01f7de865259505f28b82c5844db71c61 |