Skip to main content

Phrase: generates phrases given a corpus

Project description

Phrase

A library that builds on nltk and gensim to automatically generate phrases.

Installation

Add the package to your python path using pip:

`bash pip install phrase `

Usage

To create a phrase dictionary and print out the top 25 phrases:

`bash create_phrase_dictionary <corpus_folder> <phrase_dictionary_output_filename> `

This is not a light process, it can take a lot of memory and time, so be warned.

Tests

To run all the tests, you need to run py.test to pick up the unit tests. Lettuce is currently being used for BDD tests and needs to be run from the tests folder or with tests/ added to the PYTHONPATH (the tests utilize the units.helpers modules) `bash py.test PYTHONPATH=tests lettuce tests/features cd tests lettuce features/ `

Project details


Release history Release notifications

This version
History Node

0.0.10

History Node

0.0.9

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
phrase-0.0.10.macosx-10.5-x86_64.tar.gz (14.5 kB) Copy SHA256 hash SHA256 Source None Oct 2, 2014

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page