A set of functions that process and create topic models from a sample of community-detected Twitter networks' tweets. It can process and visualize network data across periods and communities.
Project description
NTTC (Name That Twitter Community!): Process and analyze community-detected data
by Chris Lindgren chris.a.lindgren@gmail.com Distributed under the BSD 3-clause license. See LICENSE.txt or http://opensource.org/licenses/BSD-3-Clause for details.
Documentation: https://nttc.readthedocs.io/en/latest/
Overview
A set of functions that process and create topic models from a sample of community-detected Twitter networks' tweets. It also analyzes if there are potential persistent community hubs (either/and by top mentioned or top RTers).
It assumes you seek an answer to the following questions:
- What communities persist or are ephemeral across periods in the corpora, and when?
- What can these communities be named, based on their top RTs and users, top mentioned users, as well as generated topic models?
- Of these communities, what are their topics over time?
Accordingly, it assumes you have a desire to investigate communities across periods and the tweets from each detected community across already defined periodic episodes with the goal of naming each community AND examining their respective topics over time in the corpus.
It functions only with Python 3.x and is not backwards-compatible (although one could probably branch off a 2.x port with minimal effort).
Warning: nttc
performs no custom error-handling, so make sure your inputs are formatted properly! If you have questions, please let me know via email.
System requirements
- arrow
- tsm
- nltk
- networkx
- matplot
- pandas
- numpy
- emoji
- pprint
- gensim
- spacy
- tqdm
- sklearn
- joblib
- MulticoreTSNE
- hdbscan
- seaborn
- stop_words
Installation
pip install nttc
Known Issues or Limitations
- Please contact me if you discover any issuess.
Example notebooks
- See the
assets/examples
folder for example uses.
Distribution update terminal commands
# Create new distribution of code for archiving sudo python setup.py sdist bdist_wheel # Distribute to Python Package Index python -m twine upload --repository-url https://upload.pypi.org/legacy/ dist/*
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nttc-0.6.1.tar.gz
.
File metadata
- Download URL: nttc-0.6.1.tar.gz
- Upload date:
- Size: 29.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 51261625d8c16d56aa159a70727c41a425a184349db817fe703a4a3e481655c9 |
|
MD5 | e4645f7b45d784207b557be9e4bf5db2 |
|
BLAKE2b-256 | b654ac88e328296cc75e3c5c91c84fd4d358260104575d10a2e04d5a65aab2cf |
File details
Details for the file nttc-0.6.1-py3-none-any.whl
.
File metadata
- Download URL: nttc-0.6.1-py3-none-any.whl
- Upload date:
- Size: 29.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 712c8e5733bf43748f3a4806757081f2f56b9b9e1501b3804a175c8c7ec32acb |
|
MD5 | d5cfbfacd4d03ff2742062bf7a0c6981 |
|
BLAKE2b-256 | e80825211da3e3742a54ae96104fcad946c9738734627c1c72ae63fd69f82565 |