Skip to main content

A set of functions that uses sklearn to conduct a TF-IDF analysis to generate keywords from event-based / grouped textual corpus.

Project description

evekeys: Isolate keywords from an event-based and custom-grouped textual corpus

By Chris Lindgren chris.a.lindgren@gmail.com

Distributed under the BSD 3-clause license. See LICENSE.txt or http://opensource.org/licenses/BSD-3-Clause for details.

Documentation: https://evekeys.readthedocs.io/en/latest/

Overview

A set of functions that uses scikit-learn to conduct a TF-IDF analysis to isolate keywords from event-based documents. It answers the following questions:

  1. What keywords represent a particular period of content?
  2. What keywords represent a particular group of content from a particular period?

It assumes you have:

  • imported your corpus as a pandas DataFrame,
  • included metadata information, such as a list of dates and list of groups to reorganize your corpus, and
  • pre-processed your documents.

It functions only with Python 3.x and is not backwards-compatible.

Warning: evekeys performs little to no custom error-handling, so make sure your inputs are formatted properly. If you have questions, please let me know via email.

System requirements

  • pandas
  • sklearn
  • tqdm

Installation

pip install evekeys

Known Issues or Limitations

  • Please contact me if you discover any issues.

Example notebooks

  • Coming soon.

Distribution update terminal commands

# Create new distribution of code for archiving
sudo python setup.py sdist bdist_wheel

# Distribute to Python Package Index
python -m twine upload --repository-url https://upload.pypi.org/legacy/ dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

evekeys-0.0.2.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

evekeys-0.0.2-py3-none-any.whl (5.3 kB view details)

Uploaded Python 3

File details

Details for the file evekeys-0.0.2.tar.gz.

File metadata

  • Download URL: evekeys-0.0.2.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3

File hashes

Hashes for evekeys-0.0.2.tar.gz
Algorithm Hash digest
SHA256 29ba57a6de1f235b1c9142c60b3468451ea6408da4eb43b53568e832db890b82
MD5 36b4a68e9bd470a05a4e66064bde238a
BLAKE2b-256 81589b8db48b1bde8c4b2645dc2fce71009c36ec2ead6e57a417fa40f464508c

See more details on using hashes here.

File details

Details for the file evekeys-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: evekeys-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 5.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3

File hashes

Hashes for evekeys-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 fd3244c88b5d1ee01c694c611a4ae0357d14d5760e96bbbfb9a7301e451dacb3
MD5 db665e2e363cb3433b17e73c226f3093
BLAKE2b-256 4c83fb74e688efd15b7bfcc2f2fb8d3e054b954d852ff257d0a428c3d05ecf4d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page