corrects English spelling mistakes and normalize. (e.g., "cooooooooooooooollllllllllllll" to "cool")
pytypo corrects English spelling mistakes. That feature is based on TYPO CORPUS (http://luululu.com/tweet/)
And this module normalizes also lengthened English expression having repeating letters. (e.g., this module converts “cooooooooooooooollllllllllllll” to “cool”)
That feature is based on the following paper: Samuel Brody and Nicholas Diakopoulos. Cooooooooooooooollllllllllllll!!!!!!!!!!!!!! using word lengthening to detect sentiment in microblogs. In EMNLP2011, pp. 562-570, 2011. http://aclweb.org/anthology//D/D11/D11-1052.pdf
Contributions are welcome!
$ pip install pytypo
>>> import pytypo
>>> pytypo.correct_sentence('you are coooolll!!!') you are cool!
>>> pytypo.correct('okayyyyy') okay
>>> pytypo.cut_repeat('mamisaaaaaan', 1) mamisan >>> pytypo.cut_repeat('okayyyyy', 2) okayy
Add many cases