A script to count the frequencies of words, pairs, letters, Devanagari conjuncts and Bengali conjuncts in text.
Handles non-ASCII files well. 1 GB of data takes approximately 5-7 minutes.
- Install with pip or easy_install: ``pip install textfreq`` or ``easy_install textfreq``
In a shell or command prompt, run: ``textfreq <INPUT.txt> <OUTPUT.txt> <Commands: -w (words), -p (pairs), -l (letters), -dc (Deva Conjuncts), -bc (Bangla Conjuncts)>``
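To illustrate what the word, pair and letter counts could look like, here is a minimal Python sketch. It is not the code textfreq itself uses. The function names are hypothetical, tokens are assumed to be whitespace-separated, and "pairs" is assumed to mean adjacent word pairs; the tool may define these differently.

```python
from collections import Counter


def count_words(text):
    """Count whitespace-separated tokens (assumed tokenisation)."""
    return Counter(text.split())


def count_pairs(text):
    """Count adjacent word pairs (an assumed reading of 'pairs')."""
    words = text.split()
    return Counter(zip(words, words[1:]))


def count_letters(text):
    """Count alphabetic characters.

    str.isalpha() is Unicode-aware, so non-ASCII letters are counted;
    digits, punctuation and combining marks (for example vowel signs)
    are skipped.
    """
    return Counter(ch for ch in text if ch.isalpha())
```

For a large input, the same counters could be updated line by line from a file opened with ``encoding="utf-8"`` rather than reading the whole text at once.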
- Significantly improved speed
- Added Devanagari and Bengali Conjuncts finder
- Added words, pairs and letter counts
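One possible way to find Devanagari and Bengali conjuncts is sketched below. It is an illustration only, not textfreq's actual method: it assumes a conjunct is a run of consonants joined by the virama (halant) and ignores nukta and other modifiers. The names ``DEVA_CONJUNCT``, ``BANGLA_CONJUNCT`` and ``count_conjuncts`` are hypothetical.

```python
import re
from collections import Counter

# Assumption: conjunct = consonant (virama consonant)+.
# Devanagari consonants U+0915..U+0939, virama U+094D.
DEVA_CONJUNCT = re.compile("[\u0915-\u0939](?:\u094D[\u0915-\u0939])+")
# Bengali consonants U+0995..U+09B9, virama U+09CD.
BANGLA_CONJUNCT = re.compile("[\u0995-\u09B9](?:\u09CD[\u0995-\u09B9])+")


def count_conjuncts(text, pattern):
    """Count every non-overlapping match of the conjunct pattern."""
    return Counter(pattern.findall(text))
```

Calling ``count_conjuncts(text, DEVA_CONJUNCT)`` returns a ``Counter`` keyed by each conjunct string found in ``text``.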