a text mining framework in Python
tmpy is text minging framework in Python, modeled after the tm R package. Yet it makes good use of the existing scipy, numpy, pandas, nltk, and xmltodict packages, provides some pythonic ways to import, export, manage documents and compute the term document matrics.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.