Phrase: generates phrases given a corpus

Project Description


A library that builds on nltk and gensim to automatically generate phrases.


Install the package with pip:

`pip install phrase`


To create a phrase dictionary and print out the top 25 phrases:

`create_phrase_dictionary <corpus_folder> <phrase_dictionary_output_filename>`

Be warned that this is not a lightweight process: it can take a lot of memory and time.
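To give a feel for what generating phrases from a corpus involves, here is a minimal standard-library sketch of bigram collocation scoring. It is not the package's own code or API: the function names (`tokenize`, `top_phrases`), the parameters and the sample corpus are hypothetical, and the score used, `(count(a, b) - min_count) * vocab_size / (count(a) * count(b))`, is assumed here only because it is a common collocation score for phrase detection.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def top_phrases(documents, min_count=2, top_n=25):
    """Return the top_n adjacent word pairs ranked by a collocation score.

    Hypothetical helper: the scoring formula is an assumption, not taken
    from the phrase package itself.
    """
    unigrams = Counter()
    bigrams = Counter()
    for doc in documents:
        tokens = tokenize(doc)
        unigrams.update(tokens)
        # Pair each token with its right-hand neighbour.
        bigrams.update(zip(tokens, tokens[1:]))

    vocab_size = len(unigrams)
    scores = {}
    for (a, b), n_ab in bigrams.items():
        if n_ab < min_count:
            continue  # ignore pairs that are too rare to trust
        scores[(a, b)] = (n_ab - min_count) * vocab_size / (unigrams[a] * unigrams[b])

    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [(" ".join(pair), score) for pair, score in ranked[:top_n]]


# Made-up sample corpus, for illustration only.
corpus = [
    "new york is a big city",
    "she moved to new york last year",
    "new york has a big harbor",
    "a big city needs a big harbor",
]

for phrase, score in top_phrases(corpus, min_count=1, top_n=3):
    print(f"{phrase}\t{score:.2f}")
```

The counters hold every distinct word and adjacent word pair in memory at once, which hints at why a real corpus can be expensive to process.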


To run all the tests, run py.test to pick up the unit tests:

`py.test`

Lettuce is currently used for the BDD tests. Because the tests use the units.helpers modules, it must be run either with tests/ added to the PYTHONPATH:

`PYTHONPATH=tests lettuce tests/features`

or from the tests folder:

`cd tests`

`lettuce features/`


Download Files


Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File name: phrase-0.0.10.macosx-10.5-x86_64.tar.gz (14.5 kB)
File type: Source
Upload date: Oct 2, 2014
