Skip to main content

Real Time Tweets Analysis.

Project description

Real-Time Tweets Sentiment Analysis Package

Overview

Retrieving real-time tweets using twitter API, Apache Kafka, and Apache Spark Streaming; then, using tensorflow deep learning model to classify the tweets wether they positive, negative, or neutral; all in a pypi package.

TweetsAnalysis

The streamer and model package, available on pypi TweetsAnalysis

Package Requirements

  • gensim
  • pandas
  • pyspark
  • kafka-python
  • streamlit
  • scikit-learn
  • seaborn
  • tensorflow
  • tweepy==3.9.0
  • pydantic
  • strictyaml
  • joblib


Model

The model architecture:

The model results in about 85.5% in the train set and 84.4% accuracy on the test set, which has 160000 tweets; therefore, there is no over-fitting here.


Run

First we need to install the requirements with:

 pip install TweetsAnalysis

To train the model run, but first we need to specifiy the model and data directories in the config file:

python train_model.py

Straming

Start kafka with:

bin/zookeeper-server-start.sh config/zookeeper.properties
bin/kafka-server-start.sh config/server.properties

then create a kafka topic (tweets_stream) with:

bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic tweets_stream

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TweetsAnalysis-1.1.8.tar.gz (17.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

TweetsAnalysis-1.1.8-py3-none-any.whl (12.3 kB view details)

Uploaded Python 3

File details

Details for the file TweetsAnalysis-1.1.8.tar.gz.

File metadata

  • Download URL: TweetsAnalysis-1.1.8.tar.gz
  • Upload date:
  • Size: 17.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for TweetsAnalysis-1.1.8.tar.gz
Algorithm Hash digest
SHA256 20a1c7b990a13e665c06353f61847369f5409a07ebcbec681e2045961253248d
MD5 2fa44f9e1e588361fa9db550f01d309b
BLAKE2b-256 9d0ff6a5368294edc7dec5c3db38a8c044367d1773cf90062f20fe67619179e9

See more details on using hashes here.

File details

Details for the file TweetsAnalysis-1.1.8-py3-none-any.whl.

File metadata

  • Download URL: TweetsAnalysis-1.1.8-py3-none-any.whl
  • Upload date:
  • Size: 12.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for TweetsAnalysis-1.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 62f1fbc34d64967055fad4d9cd11ed3ad4b68e3b86952f9720d42188752f736e
MD5 40b79784bb1f382870500caedd62aec5
BLAKE2b-256 bf3a916e9f6a58a574ab9f8d061518571ba259011037aead55fd69d20d5e6269

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page