
twscrape

Twitter GraphQL and Search API implementation with SNScrape data models.

Install

pip install twscrape

Or install the development version:

pip install git+https://github.com/vladkens/twscrape.git

Features

  • Supports both the Search and GraphQL Twitter APIs
  • Async/await functions (multiple scrapers can run in parallel; see the sketch after this list)
  • Login flow (including retrieving the verification code from email)
  • Saving/restoring account sessions
  • Raw Twitter API responses & SNScrape models
  • Automatic account switching to smooth out Twitter API rate limits
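
Because every API method is a coroutine, several scrapers can run in parallel with asyncio.gather. A minimal sketch, assuming accounts have already been added and logged in (see Usage below):

import asyncio
from twscrape import AccountsPool, API, gather

async def parallel_example():
    api = API(AccountsPool())  # uses the default accounts.db

    # each call draws from the shared account pool, switching accounts as needed
    tweets, followers = await asyncio.gather(
        gather(api.search("python", limit=20)),
        gather(api.followers(2244994945, limit=20)),
    )
    print(len(tweets), len(followers))

asyncio.run(parallel_example())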

Usage

import asyncio
from twscrape import AccountsPool, API, gather
from twscrape.logger import set_log_level

async def main():
    pool = AccountsPool()  # or AccountsPool("path-to.db") - default is `accounts.db` 
    await pool.add_account("user1", "pass1", "user1@example.com", "email_pass1")
    await pool.add_account("user2", "pass2", "user2@example.com", "email_pass2")

    # log in to all new accounts
    await pool.login_all()

    api = API(pool)

    # search api (latest tab)
    await gather(api.search("elon musk", limit=20))  # list[Tweet]

    # graphql api
    tweet_id, user_id, user_login = 20, 2244994945, "twitterdev"

    await api.tweet_details(tweet_id)  # Tweet
    await gather(api.retweeters(tweet_id, limit=20))  # list[User]
    await gather(api.favoriters(tweet_id, limit=20))  # list[User]

    await api.user_by_id(user_id)  # User
    await api.user_by_login(user_login)  # User
    await gather(api.followers(user_id, limit=20))  # list[User]
    await gather(api.following(user_id, limit=20))  # list[User]
    await gather(api.user_tweets(user_id, limit=20))  # list[Tweet]
    await gather(api.user_tweets_and_replies(user_id, limit=20))  # list[Tweet]

    # note 1: `limit` is optional; the default is -1 (no limit)
    # note 2: all methods have a `raw` version, e.g.:

    async for tweet in api.search("elon musk"):
        print(tweet.id, tweet.user.username, tweet.rawContent)  # tweet is `Tweet` object

    async for rep in api.search_raw("elon musk"):
        print(rep.status_code, rep.json())  # rep is `httpx.Response` object

    # change the log level (default is INFO)
    set_log_level("DEBUG")

    # Tweet & User models can be converted to a plain dict or JSON string, e.g.:
    doc = await api.user_by_id(user_id)  # User
    doc.dict()  # -> python dict
    doc.json()  # -> json string

if __name__ == "__main__":
    asyncio.run(main())

Note on rate limits:

  • Search API – 250 requests per account / 15 minutes
  • GraphQL API – 500 requests per account per operation / 15 minutes
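
Limits are per account, so with automatic account switching a pool of N accounts yields roughly N × 250 search requests per 15-minute window. A sketch of bulk-adding accounts to grow the pool (the accounts.txt format here is an assumption for illustration, not part of the library):

import asyncio
from twscrape import AccountsPool

async def add_many():
    pool = AccountsPool()
    # hypothetical accounts.txt: username:password:email:email_password per line
    with open("accounts.txt") as f:
        for line in f:
            username, password, email, email_pass = line.strip().split(":")
            await pool.add_account(username, password, email, email_pass)
    await pool.login_all()

asyncio.run(add_many())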

Models
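
Tweets and users are returned as Tweet and User objects following the SNScrape data models. A short sketch using only the fields already shown in Usage above:

import asyncio
from twscrape import AccountsPool, API

async def models_example():
    api = API(AccountsPool())

    tweet = await api.tweet_details(20)  # Tweet
    print(tweet.id, tweet.user.username, tweet.rawContent)

    user = await api.user_by_login("twitterdev")  # User
    print(user.dict())  # plain Python dict
    print(user.json())  # JSON string

asyncio.run(models_example())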

Related

  • SNScrape – a scraper for social networking services (SNS)

