Skip to main content

Static Hash-Based Lookup for Google Ngram Frequencies

Project description

gngram-lookup

PyPI version Downloads Downloads/Month Tests Python 3.11+

Word frequency from 500 years of books. O(1) lookup. 5 million words.

Install

pip install gngram-lookup
python -m gngram_lookup.download_data

Python

import gngram_lookup as ng

ng.exists('computer')       # True
ng.exists('xyznotaword')    # False

ng.frequency('computer')
# {'peak_tf': 2000, 'peak_df': 2000, 'sum_tf': 892451, 'sum_df': 312876}

ng.batch_frequency(['the', 'algorithm', 'xyznotaword'])
# {'the': {...}, 'algorithm': {...}, 'xyznotaword': None}

CLI

gngram-exists computer    # True, exit 0
gngram-exists xyznotaword # False, exit 1

gngram-freq computer
# peak_tf_decade: 2000
# peak_df_decade: 2000
# sum_tf: 892451
# sum_df: 312876

Docs

See Also

Attribution

Data derived from the Google Books Ngram dataset.

License

Proprietary. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gngram_lookup-0.2.1.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gngram_lookup-0.2.1-py3-none-any.whl (7.5 kB view details)

Uploaded Python 3

File details

Details for the file gngram_lookup-0.2.1.tar.gz.

File metadata

  • Download URL: gngram_lookup-0.2.1.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.11.9 Darwin/24.6.0

File hashes

Hashes for gngram_lookup-0.2.1.tar.gz
Algorithm Hash digest
SHA256 d14edba161f4e293d6aeb046247de535e9ecf0dd40b794a21a12880685455701
MD5 5d54b73a4a6aeb4e51d089ff1b864210
BLAKE2b-256 9a51504910b535f3578c70ce830fc76864e1e175e972010fbce75db893c1a88f

See more details on using hashes here.

File details

Details for the file gngram_lookup-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: gngram_lookup-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 7.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.11.9 Darwin/24.6.0

File hashes

Hashes for gngram_lookup-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c8427bd2de6884f521b65a4bd0855614e1b6c8a02ce5b05a6a282654c5c6d097
MD5 68257afb5e6a9c415c8561494bc3eb53
BLAKE2b-256 2d6be1e55ead4e465b8a60c21aaf14f32c1a07860d21742a461e52f47a3ddfb1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page