Project description

These n-grams are based on the largest publicly available, genre-balanced corpus of English: the 520-million-word Corpus of Contemporary American English (COCA). With this n-gram data (2-, 3-, 4-, and 5-word sequences, each with its frequency), you can run powerful queries offline, without going through the corpus's web interface.
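To make the idea of "word sequences with their frequency" concrete, here is a minimal sketch in plain Python. It does not use this package or the COCA data: the helper names `count_ngrams` and `starting_with` and the toy sentence are assumptions made purely for illustration.

```python
from collections import Counter


def count_ngrams(tokens, n):
    """Count every contiguous n-token sequence in `tokens` (hypothetical helper)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def starting_with(counts, word):
    """Return (ngram, frequency) pairs whose first token is `word`, most frequent first."""
    matches = [(gram, freq) for gram, freq in counts.items() if gram[0] == word]
    return sorted(matches, key=lambda pair: (-pair[1], pair[0]))


# Toy input standing in for a real corpus.
tokens = "the cat sat on the mat and the cat slept".split()

bigrams = count_ngrams(tokens, 2)
the_bigrams = starting_with(bigrams, "the")
print(the_bigrams)
```

Once the counts are in memory, a query such as "which sequences begin with a given word" is a filter over the table, which is the kind of lookup a precomputed n-gram list makes possible offline.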

Install

pip install ngrams

Example

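from ngrams.generate import Ngrams

number = 1

# `params` is not defined in this example; pass the arguments Ngrams expects.
ngrams = Ngrams(params)

# Use the print() function so the example runs on Python 3.
print(ngrams.result())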

Download files

Source Distribution

ngrams-0.0.1.tar.gz (1.1 kB)