Skip to main content

Twitter streaming and graph-based analysis framework

Project description

Twista

Twista is a Twitter streaming and analysis command line tool suite implemented in Python 3. It provides the following core features:

  • to record Tweets (statuses, replies, retweets, replies) from the public Twitter streaming API in a standardized way,
  • to import collected chunks of Tweets into a Neo4j graph database for analysis.
  • The graph database can be used for analysis. We recommand to make use of tools like Jupyter. Twista provides integrated support for Jupyter. Try the twista lab command to start Jupyter with the current config file.

Installation

Twista is hosted on PyPi. Therefore, it can be easily installed:

pip3 install twista

Type

$ twista

to get an overview of existing Twista commands.

Usage: twista [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  import  Imports Twitter records into a Neo4j graph analytical database ...
  init    Initializes a directory to be used with Twista ...
  lab     Starts Jupyter lab for analysis
  record  Records a Twitter stream
  stop    Stops the Neo4j analytical database

We recommend to study the Wiki on how to record and analyze public Twitter streams using Twista and graph databases.

Twista passed its acid-tests

Twista (0.3.0) is been used to record a sample of the complete German Twitter stream since April 2019. This dataset is open access, updated monthly, and available here: DOI

Twista (0.2.0) has been evaluated recording tweets during the German Federal Election Campaigns of 2017. Over four months Twista recorded 10 GB of data without any operator interaction! This dataset is open access and available here: DOI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for Twista, version 0.3.0rc0
Filename, size File type Python version Upload date Hashes
Filename, size Twista-0.3.0rc0-py3-none-any.whl (10.7 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size Twista-0.3.0rc0.tar.gz (8.4 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page