Static Hash-Based Lookup for Google Ngram Frequencies

gngram-lookup

Requires Python 3.11+.

Word frequency from 500 years of books. O(1) lookup. 5 million words.
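The O(1) claim follows from the general idea of a static hash-based lookup: a word is hashed to a fixed bucket, and only that bucket is consulted. The sketch below illustrates that idea only. The bucket scheme (first two hex characters of a SHA-256 digest) and the names `bucket_for`, `build_buckets`, and `lookup` are assumptions made for illustration, not the package's actual storage layout or API.

```python
import hashlib

def bucket_for(word: str, prefix_len: int = 2) -> str:
    """Map a word to a bucket name: the first hex chars of its SHA-256 digest.

    Hypothetical scheme for illustration; the real package may differ.
    """
    digest = hashlib.sha256(word.lower().encode("utf-8")).hexdigest()
    return digest[:prefix_len]

def build_buckets(records: dict[str, dict[str, int]]) -> dict[str, dict[str, dict[str, int]]]:
    """Group word records by bucket once, ahead of time (the 'static' part)."""
    buckets: dict[str, dict[str, dict[str, int]]] = {}
    for word, stats in records.items():
        buckets.setdefault(bucket_for(word), {})[word.lower()] = stats
    return buckets

def lookup(buckets: dict[str, dict[str, dict[str, int]]], word: str):
    """Hash the word, open one bucket, and return its stats or None."""
    return buckets.get(bucket_for(word), {}).get(word.lower())

# Sample record copied from the README's own example output.
records = {
    "computer": {"peak_tf": 2000, "peak_df": 2000, "sum_tf": 892451, "sum_df": 312876},
}
buckets = build_buckets(records)

print(lookup(buckets, "computer"))
print(lookup(buckets, "xyznotaword"))
```

Because the bucket is computed from the word itself, the cost of a lookup does not grow with the size of the vocabulary, only with the size of a single bucket.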

Install

pip install gngram-lookup
python -m gngram_lookup.download_data

Python

import gngram_lookup as ng

ng.exists('computer')       # True
ng.exists('xyznotaword')    # False

ng.frequency('computer')
# {'peak_tf': 2000, 'peak_df': 2000, 'sum_tf': 892451, 'sum_df': 312876}
# peak_tf and peak_df are decades (the CLI labels them peak_tf_decade and peak_df_decade)

ng.batch_frequency(['the', 'algorithm', 'xyznotaword'])
# {'the': {...}, 'algorithm': {...}, 'xyznotaword': None}
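Since `batch_frequency` returns `None` for unknown words, callers will usually want to filter those out before using the counts. The following is a minimal sketch that works on a dictionary shaped like the output shown above. The helper `rank_known_words` is hypothetical, and the sample dictionary stands in for a real call so the snippet runs without the data files; the `placeholder_word` entry and its number are invented.

```python
from typing import Optional

Stats = dict[str, int]

def rank_known_words(results: dict[str, Optional[Stats]]) -> list[tuple[str, int]]:
    """Drop unknown words (None) and sort the rest by total term frequency."""
    known = [(word, stats["sum_tf"]) for word, stats in results.items() if stats is not None]
    return sorted(known, key=lambda pair: pair[1], reverse=True)

# Stand-in for: results = ng.batch_frequency([...])
results: dict[str, Optional[Stats]] = {
    # Values copied from the README example for 'computer'.
    "computer": {"peak_tf": 2000, "peak_df": 2000, "sum_tf": 892451, "sum_df": 312876},
    # Invented placeholder entry, not real data.
    "placeholder_word": {"peak_tf": 1900, "peak_df": 1900, "sum_tf": 10, "sum_df": 5},
    # Unknown words come back as None.
    "xyznotaword": None,
}

for word, total in rank_known_words(results):
    print(word, total)
```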

CLI

gngram-exists computer    # True, exit 0
gngram-exists xyznotaword # False, exit 1

gngram-freq computer
# peak_tf_decade: 2000
# peak_df_decade: 2000
# sum_tf: 892451
# sum_df: 312876
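The exit codes make `gngram-exists` usable from scripts. Below is a rough sketch of driving both commands from Python with `subprocess`. The helper names `exists_via_cli` and `parse_freq_output` are hypothetical, and the parser assumes `gngram-freq` prints plain `key: value` lines matching the labels shown above, which is not confirmed here.

```python
import subprocess
from typing import Sequence

def exists_via_cli(word: str, command: Sequence[str] = ("gngram-exists",)) -> bool:
    """Return True when the command exits with status 0 (word found)."""
    completed = subprocess.run(
        [*command, word], capture_output=True, text=True, check=False
    )
    return completed.returncode == 0

def parse_freq_output(text: str) -> dict[str, int]:
    """Parse 'key: value' lines into a dict, skipping anything non-numeric.

    Assumes the output format shown in the README; adjust if it differs.
    """
    stats: dict[str, int] = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip().isdigit():
            stats[key.strip()] = int(value.strip())
    return stats

# Parsing the output shown above for 'computer'.
sample_output = "peak_tf_decade: 2000\npeak_df_decade: 2000\nsum_tf: 892451\nsum_df: 312876\n"
print(parse_freq_output(sample_output))
```

With the package installed, `exists_via_cli('computer')` would run `gngram-exists computer` and report whether it exited with status 0. The `command` parameter exists only so the helper can be pointed at a different executable, for example in tests.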

Attribution

Data derived from the Google Books Ngram dataset.

License

Proprietary. See LICENSE.
