An advanced Twitter scraping & OSINT tool.

These details have not been verified by PyPI

Project links

Homepage

Project description

TWINT - Twitter Intelligence Tool

No authentication. No API. No limits.

Twint is an advanced Twitter scraping tool written in Python that allows for scraping Tweets from Twitter profiles without using Twitter's API.

Twint utilizes Twitter's search operators to let you scrape Tweets from specific users, scrape Tweets relating to certain topics, hashtags & trends, or sort out sensitive information from Tweets like e-mail and phone numbers. I find this very useful, and you can get really creative with it too.

Twint also makes special queries to Twitter allowing you to also scrape a Twitter user's followers, Tweets a user has liked, and who they follow without any authentication, API, Selenium, or browser emulation.

tl;dr Benefits

Some of the benefits of using Twint vs Twitter API:

Can fetch almost all Tweets (Twitter API limits to last 3200 Tweets only);
Fast initial setup;
Can be used anonymously and without Twitter sign up;
No rate limitations.

Limits imposed by Twitter

Twitter limits scrolls while browsing the user timeline. This means that with .Profile or with .Favorites you will be able to get ~3200 tweets.

Requirements

Python 3.6;
aiohttp;
aiodns;
beautifulsoup4;
cchardet;
elasticsearch;
pysocks;
pandas (>=0.23.0);
aiohttp_socks;
schedule;
geopy;
fake-useragent;
py-googletransx.

Installing

Git:

git clone https://github.com/twintproject/twint.git
cd twint
pip3 install . -r requirements.txt

Pip:

pip3 install twint

pip3 install --user --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint

Pipenv:

pipenv install -e git+https://github.com/twintproject/twint.git#egg=twint

CLI Basic Examples and Combos

A few simple examples to help you understand the basics:

twint -u username - Scrape all the Tweets from user's timeline.
twint -u username -s pineapple - Scrape all Tweets from the user's timeline containing pineapple.
twint -s pineapple - Collect every Tweet containing pineapple from everyone's Tweets.
twint -u username --year 2014 - Collect Tweets that were tweeted before 2014.
twint -u username --since "2015-12-20 20:30:15" - Collect Tweets that were tweeted since 2015-12-20 20:30:15.
twint -u username --since 2015-12-20 - Collect Tweets that were tweeted since 2015-12-20 00:00:00.
twint -u username -o file.txt - Scrape Tweets and save to file.txt.
twint -u username -o file.csv --csv - Scrape Tweets and save as a csv file.
twint -u username --email --phone - Show Tweets that might have phone numbers or email addresses.
twint -s "Donald Trump" --verified - Display Tweets by verified users that Tweeted about Donald Trump.
twint -g="48.880048,2.385939,1km" -o file.csv --csv - Scrape Tweets from a radius of 1km around a place in Paris and export them to a csv file.
twint -u username -es localhost:9200 - Output Tweets to Elasticsearch
twint -u username -o file.json --json - Scrape Tweets and save as a json file.
twint -u username --database tweets.db - Save Tweets to a SQLite database.
twint -u username --followers - Scrape a Twitter user's followers.
twint -u username --following - Scrape who a Twitter user follows.
twint -u username --favorites - Collect all the Tweets a user has favorited (gathers ~3200 tweet).
twint -u username --following --user-full - Collect full user information a person follows
twint -u username --profile-full - Use a slow, but effective method to gather Tweets from a user's profile (Gathers ~3200 Tweets, Including Retweets).
twint -u username --retweets - Use a quick method to gather the last 900 Tweets (that includes retweets) from a user's profile.
twint -u username --resume resume_file.txt - Resume a search starting from the last saved scroll-id.

More detail about the commands and options are located in the wiki

Module Example

Twint can now be used as a module and supports custom formatting. More details are located in the wiki

import twint

# Configure
c = twint.Config()
c.Username = "now"
c.Search = "fruit"

# Run
twint.run.Search(c)

Output

955511208597184512 2018-01-22 18:43:19 GMT <now> pineapples are the best fruit

import twint

c = twint.Config()

c.Username = "noneprivacy"
c.Custom["tweet"] = ["id"]
c.Custom["user"] = ["bio"]
c.Limit = 10
c.Store_csv = True
c.Output = "none"

twint.run.Search(c)

Storing Options

Write to file;
CSV;
JSON;
SQLite;
Elasticsearch.

Elasticsearch Setup

Details on setting up Elasticsearch with Twint is located in the wiki.

Graph Visualization

graph

Graph details are also located in the wiki.

We are developing a Twint Desktop App.

FAQ

I tried scraping tweets from a user, I know that they exist but I'm not getting them

Twitter can shadow-ban accounts, which means that their tweets will not be available via search. To solve this, pass --profile-full if you are using Twint via CLI or, if are using Twint as module, add config.Profile_full = True. Please note that this process will be quite slow.

More Examples

Followers/Following

To get only follower usernames/following usernames

twint -u username --followers

twint -u username --following

To get user info of followers/following users

twint -u username --followers --user-full

twint -u username --following --user-full

userlist

To get only user info of user

twint -u username --user-full

To get user info of users from a userlist

twint --userlist inputlist --user-full

tweet translation (experimental)

To get 100 english tweets and translate them to italian

twint -u noneprivacy --csv --output none.csv --lang en --translate --translate-dest it --limit 100

import twint

c = twint.Config()
c.Username = "noneprivacy"
c.Limit = 100
c.Store_csv = True
c.Output = "none.csv"
c.Lang = "en"
c.Translate = True
c.TranslateDest = "it"
twint.run.Search(c)

Notes:

Google translate has some quotas

Featured Blog Posts:

Contact

If you have any question, want to join in discussions, or need extra help, you are welcome to join our Twint focused channel at OSINT team

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

2.1.20

Apr 29, 2020

2.1.19

Apr 10, 2020

2.1.18

Apr 4, 2020

2.1.17

Apr 2, 2020

2.1.16

Mar 23, 2020

2.1.15

Feb 22, 2020

2.1.14

Feb 21, 2020

2.1.13

Feb 7, 2020

2.1.12

Jan 27, 2020

2.1.11

Dec 19, 2019

2.1.10

Dec 16, 2019

2.1.9

Dec 13, 2019

2.1.8

Dec 12, 2019

2.1.7

Nov 4, 2019

2.1.6

Oct 21, 2019

2.1.5

Oct 19, 2019

2.1.4

Oct 19, 2019

2.1.3

Oct 18, 2019

2.1.2

Sep 11, 2019

2.1.1

Aug 14, 2019

2.1.0

Aug 12, 2019

2.0.1

Aug 10, 2019

2.0.0

Aug 10, 2019

1.2.7

Aug 9, 2019

1.2.6

Aug 9, 2019

1.2.5

Aug 2, 2019

1.2.0.0

Nov 1, 2018

1.1.4.3

Jun 21, 2018

1.1.4.2

Jun 21, 2018

1.1.4.1

Jun 21, 2018

1.1.4

Jun 20, 2018

1.1.3.3

Jun 1, 2018

1.1.3.2

May 30, 2018

1.1.3.1

May 30, 2018

1.1.3

May 28, 2018

1.1.2.6

May 24, 2018

1.1.2.5

May 24, 2018

1.1.2.4

May 22, 2018

1.1.2.3

May 3, 2018

1.1.2.2

May 2, 2018

1.1.2.1

May 2, 2018

1.1.1

Apr 30, 2018

1.1.0

Apr 30, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twint-2.1.20.tar.gz (31.3 kB view details)

Uploaded Apr 29, 2020 Source

File details

Details for the file twint-2.1.20.tar.gz.

File metadata

Download URL: twint-2.1.20.tar.gz
Upload date: Apr 29, 2020
Size: 31.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6

File hashes

Hashes for twint-2.1.20.tar.gz
Algorithm	Hash digest
SHA256	`b3b7671997e31ea5dff9f4cca0f83add07c3163fd7ab1dc8e44f9110eeb1965d`
MD5	`6750dbf97206a88e924ed627fbd8b22a`
BLAKE2b-256	`69e14daa62fbae8a34558015c227a8274bb2598e0fc6e330bdeb8484ed154ce7`

See more details on using hashes here.

twint 2.1.20

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TWINT - Twitter Intelligence Tool

tl;dr Benefits

Limits imposed by Twitter

Requirements

Installing

CLI Basic Examples and Combos

Module Example

Storing Options

Elasticsearch Setup

Graph Visualization

FAQ

More Examples

Followers/Following

userlist

tweet translation (experimental)

Featured Blog Posts:

Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes