A script to count the frequencies of the following in text:
Can handle non ascii files well. 1gb of data takes approximately 5-7 mins.
- pip or easy_install 'textfreq'
In Shell/Command-prompt: ``textfreq <INPUT.txt> <OUTPUT.txt> <Commands: -w (words), -p (pairs), -l (letters), -dc (Deva Conjuncts), -bc (Bangla Conjuncts)>``
- Significantly improved speed
- Added Devanagari and Bengali Conjuncts finder
- Added words, pairs and letter counts
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.