A toolkit for calculating process entropy quickly. With specific applications to tweets.
Project description
ProcessEntropy
A toolkit for calculating process entropy quickly. With specific applications to tweets.
Example Usage
# Load in tweets between 2018/11/16 to 2019/01/01
import pandas as pd
with open("example_data/BBCWorld_Tweets_small.csv", 'r') as f:
BBC = pd.read_csv(f)
with open("example_data/BuzzFeedNews_Tweets_small.csv", 'r') as f:
BuzzFeed = pd.read_csv(f)
# Find process entropy of BuzzFeed tweets
from ProcessEntropy.CrossEntropy import tweet_self_entropy
print(tweet_self_entropy(BuzzFeed['tweet']))
# Find cross entropy between BuzzFeed and BBC World
from ProcessEntropy.CrossEntropy import timeseries_cross_entropy
target = list(zip(BuzzFeed['created_at'], BuzzFeed['tweet']))
source = list(zip(BBC['created_at'], BBC['tweet']))
print(timeseries_cross_entropy(target, source))
Requirements
- Python 3.x with packages:
- Numba
- NTLK
- Numpy
Installation
pip install ProcessEntropy
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for ProcessEntropy-0.3.dev0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0c629a90f0ff6faaaf121b6035e3d31015dab5b328ff4e459f0d5ca27a005d90 |
|
MD5 | dc99029916b186ae59423c5da28e9289 |
|
BLAKE2b-256 | e4dc85461c97be83d8419b2c530a5f3bda9937e33568d7f1d0a5a571a5dc6226 |