UNKNOWN
Project description
fowler.corpora is software to create vector space models for Distributional Semantics experiments.
It is possible to instantiate a vector space from
British National Corpus
Google Books N-gram corpus
The weighting schemes include:
TF-IDF
NMF
PMI
The implemented experiments are:
Word similarity (wordsim353)
Dialog act tagging, using the Switchboard corpus http://www.eecs.qmul.ac.uk/~dm303/cvsc14.html
Number of categorical composition experiments
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fowler.corpora-0.1.tar.gz
(24.1 kB
view hashes)