CLI to modify text files.
Project description
txt-utils
CLI to modify text files.
Features
merge
: merge multiple text files into oneextract-vocabulary
: extract unit vocabularytranscribe
: transcribe unitsreplace
: replace textreplace-line
: replace text in a linetrim-units
: trim unitsremove-units
: remove unitscreate-unit-occurrence-stats
: create unit occurrence statistics
Roadmap
- add tests
- create n-grams
- map units
- merge units right/left
- calculate units TF-IDF
Installation
pip install txt-utils --user
Usage
txt-utils-cli
Contributing
If you notice an error, please don't hesitate to open an issue.
Dependencies
- pandas
- tqdm
- ordered-set >=4.1.0
- pronunciation-dictionary >=0.0.4
License
MIT License
Acknowledgments
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410
Citation
If you want to cite this repo, you can use this BibTeX-entry generated by GitHub (see About => Cite this repository).
Changelog
- 0.0.2 (2023-05-30)
- Bugfix: Merge multiple files
- Added:
- Support for Python 3.11
- 0.0.1 (2022-05-30)
- Initial release
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
txt-utils-0.0.2.tar.gz
(13.9 kB
view hashes)
Built Distribution
txt_utils-0.0.2-py3-none-any.whl
(20.9 kB
view hashes)
Close
Hashes for txt_utils-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f2ebb8bdf82e972daa2cdbba8e3f33b73bfacd48119770951dc88f605017360a |
|
MD5 | 4e352e5c53b351ecd53f721ded404619 |
|
BLAKE2b-256 | 1e3980a19fc1bdc5972aa11740616ef0111b436ad55efc4269018634d7d8161a |