7 projects
smart-open
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
gensim
Python framework for fast Vector Space Modelling
sqlitedict
Persistent dict in Python, backed up by sqlite3 and pickle, multithread-safe.
bounter
Counter for large datasets
sparsetools
UNKNOWN
sparsesvd
Python module that wraps SVDLIBC, a library for sparse Singular Value Decomposition.
simserver
Document similarity server