Python framework for fast Vector Space Modelling
Tools for indexing gzip files to support random-like access.
Fast & simple summary for large CSV files
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Counter for large datasets
Persistent dict in Python, backed up by sqlite3 and pickle, multithread-safe.
Geographical queries made easy.
Uploads videos to liveleak.com
Performs ElasticSearch bulk and scroll tasks