Recursively parse Telegram channels

:spider_web: Telegram scraper

Simple package (+ CLI) for scraping Telegram channels

Features

:heavy_check_mark: auto explore new channels
:heavy_check_mark: search text
:heavy_check_mark: search mentions
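
To illustrate what "auto explore" could look like, here is a minimal sketch of a breadth-first crawl over channels. It is not telespider's actual code: the `explore` function, the `fetch_messages` callback, and the idea that new channels are discovered through `@username` mentions are all assumptions made for this example, and the messages come from an in-memory dictionary rather than from Telegram.

```python
import re
from collections import deque
from itertools import islice
from typing import Callable, Iterable, List

# Assumption: new channels are discovered via @username mentions.
# Telegram usernames are 5-32 characters long: letters, digits, underscores.
MENTION_RE = re.compile(r"@([A-Za-z][A-Za-z0-9_]{4,31})")


def explore(
    entrypoints: Iterable[str],
    fetch_messages: Callable[[str], Iterable[str]],
    max_per_channel: int = 100,
) -> List[str]:
    """Visit channels breadth-first and return them in visiting order.

    `fetch_messages` is a hypothetical callback that yields message texts
    for a channel name; a real scraper would call the Telegram API here.
    """
    queue = deque(entrypoints)
    seen = set(queue)
    visited = []
    while queue:
        channel = queue.popleft()
        visited.append(channel)
        # Read at most `max_per_channel` messages from this channel.
        for text in islice(fetch_messages(channel), max_per_channel):
            for name in MENTION_RE.findall(text):
                if name not in seen:  # queue each channel only once
                    seen.add(name)
                    queue.append(name)
    return visited


# Usage with fake in-memory data instead of real Telegram channels.
FAKE_CHANNELS = {
    "start": ["see @channel_one and @channel_two"],
    "channel_one": ["back to @start", "also @channel_three"],
    "channel_two": [],
    "channel_three": [],
}
print(explore(["start"], lambda name: FAKE_CHANNELS.get(name, [])))
```

The `seen` set keeps the crawl from looping when channels mention each other, and the per-channel cap plays the same role as the `-n` option described under Usage.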

Usage

$ git clone https://github.com/andrewsapw/telespider.git
$ cd telespider
$ poetry install
$ tspider search -w "stonks" -n 100 # search for the word `stonks`
$ tspider search -u "andrewsap" -n 100 # search for mentions of user `andrewsap`

Configuration

The app uses these environment variables:

  • API_HASH and API_ID - required by Pyrogram (see the Pyrogram documentation)
  • ENTRYPOINT_CHANNELS - comma-separated list of channels to start the search from
  • MAX_PER_CHANNEL - maximum number of messages to parse from one channel (can also be set with the -n option in the CLI)
  • AUTO_EXPLORE_CHANNELS - automatically explore new channels and add them to the parsing queue (can also be set with the --explore/--no-explore options in the CLI)
  • SILENT - suppress all output
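
As a rough illustration of how these variables might be read, here is a minimal sketch that collects them into one settings object. The `Settings` class, the `load_settings` function, the accepted boolean spellings, and the defaults for the optional variables are assumptions made for this example; telespider's own parsing may differ.

```python
import os
from dataclasses import dataclass
from typing import List, Mapping, Optional


@dataclass
class Settings:
    api_id: int
    api_hash: str
    entrypoint_channels: List[str]
    max_per_channel: Optional[int]
    auto_explore_channels: bool
    silent: bool


def _as_bool(value: str) -> bool:
    # Assumption: these spellings count as "true"; anything else is false.
    return value.strip().lower() in {"1", "true", "yes", "on"}


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from environment variables (hypothetical helper)."""
    # Split the comma-separated channel list and drop empty entries.
    raw_channels = env.get("ENTRYPOINT_CHANNELS", "")
    channels = [c.strip() for c in raw_channels.split(",") if c.strip()]
    max_raw = env.get("MAX_PER_CHANNEL")
    return Settings(
        api_id=int(env["API_ID"]),    # required: raises KeyError if missing
        api_hash=env["API_HASH"],     # required: raises KeyError if missing
        entrypoint_channels=channels,
        max_per_channel=int(max_raw) if max_raw else None,
        # Assumed defaults: exploration and silent mode are both off.
        auto_explore_channels=_as_bool(env.get("AUTO_EXPLORE_CHANNELS", "false")),
        silent=_as_bool(env.get("SILENT", "false")),
    )


# Usage with an explicit mapping, so no real credentials are needed.
example = load_settings({
    "API_ID": "12345",
    "API_HASH": "0123456789abcdef",
    "ENTRYPOINT_CHANNELS": "channel_a, channel_b",
    "MAX_PER_CHANNEL": "100",
    "AUTO_EXPLORE_CHANNELS": "true",
})
print(example.entrypoint_channels, example.max_per_channel)
```

Passing the mapping as an argument keeps the function easy to test; calling `load_settings()` with no arguments reads the real process environment.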

