Telegram Channel Backup Crawler

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Project description

TelegramBackup

This is the Telegram adapter for tg-blog, a front end for displaying Telegram (or any compatible) channel data as an interactive web page.

Motivation

Telegram has become increasingly fragile: it recently revoked usernames for inactive channels, and it often bans regular users at random after misclassifying them as spam. During the username revocation, many channels of deceased individuals were removed from public space and are no longer accessible through t.me links. This tool helps to preserve Telegram channel data in case of such an event, and also to publicly display inactive channels whose usernames have been revoked.

Demos / Examples

You can add this to your blog so that it syncs with your Telegram channel (e.g. Azalea's Blog).
You can also use this to back up and display another person's channel (e.g. One Among Us (TODO)).

Usage

Installation

First, install Python >= 3.11. Then, run pip install tgc

Then, to support video, animation, and sticker conversion, install the following non-Python dependencies:

1. Install Node 19.2 and yarn 1.22
2. Run yarn global add puppeteer-lottie-cli
3. Install ffmpeg using your system package manager
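
Before converting media, it can help to confirm that the external tools are actually reachable. The following is a minimal sketch, not part of tgc: the helper names find_tools and missing_tools are hypothetical, and the executable names node, yarn, and ffmpeg are assumptions about how these tools appear on your PATH. The executable installed by puppeteer-lottie-cli is not named in this README, so it is left out of the default list.

```python
import shutil

# Executable names are assumptions; adjust them to match your system.
DEFAULT_TOOLS = ("node", "yarn", "ffmpeg")


def find_tools(names=DEFAULT_TOOLS):
    """Map each executable name to its resolved path, or None if not on PATH."""
    return {name: shutil.which(name) for name in names}


def missing_tools(names=DEFAULT_TOOLS):
    """Return the names that could not be found on PATH."""
    return [name for name, path in find_tools(names).items() if path is None]


if __name__ == "__main__":
    for name, path in find_tools().items():
        print(f"{name}: {path or 'NOT FOUND'}")
```

Running the script prints one line per tool, which makes a missing dependency easy to spot before a conversion fails partway through.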

Mode 1: Convert Telegram Export

If you only need a one-time export, you can use mode 1. To do this, you first need to export a channel using tdesktop (Telegram Desktop).

To convert an export file into a format supported by tg-blog, you can run tgce <export path>
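
As a sanity check before running tgce, you may want to confirm that the export contains what you expect. The sketch below is not part of tgc and rests on two assumptions: that the export was saved in JSON format, and that Telegram Desktop wrote it as a result.json file with a top-level "messages" list whose entries carry a "type" field. The function name summarize_export is hypothetical.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_export(export_dir):
    """Count entries by type in <export_dir>/result.json (assumed layout)."""
    result_file = Path(export_dir) / "result.json"
    if not result_file.is_file():
        raise FileNotFoundError(
            f"{result_file} not found; was the export saved as JSON?"
        )
    with result_file.open(encoding="utf-8") as f:
        data = json.load(f)
    messages = data.get("messages", [])
    return {
        "name": data.get("name"),
        "total": len(messages),
        # Entries without a "type" field are grouped under "unknown".
        "by_type": dict(Counter(m.get("type", "unknown") for m in messages)),
    }
```

If the counts look wrong (for example, zero messages), it is probably easier to redo the export than to debug the conversion afterwards.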

Mode 2: Crawl Channel using MTProto API

If you have permission to add a bot account to a channel, or to invite a self-bot account, you can use the MTProto crawler for automatic incremental export updates. (Please do not log into your own Telegram account for crawling: there is a very high chance of it being misclassified as spam and banned.)

This method updates the channel backup incrementally and automatically, and the exported information is more complete. However, it is more difficult to set up than mode 1.

Setup API Keys

1. Obtain api_id and api_hash by creating your Telegram application (Official Guide)
   1. Log into https://my.telegram.org/apps
   2. Fill out the form to create an application
2. Choose which type of account to log in with:
   1. Bot account: Create a bot using the @BotFather bot.
   2. Self-bot account: Leave bot_token blank, and the crawler will prompt you to log in. You should only use a self-bot when you're not the admin of the channel (because inviting a bot requires admin access).
3. Fill in the tokens in ~/.config/tgc/config.toml as shown below

# Telegram API id
api_id = 10000000

# Telegram API hash
api_hash = "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa"

# Telegram bot token (can be blank)
bot_token = "0000000000:aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa"

After setting up the keys, you can proceed to setting up the channel:

Setup Channels to be Crawled

1. Either invite your bot to the channel or join the channel with your self-bot account
2. Forward a channel message to @RawDataBot to obtain the channel ID. (You'll see a JSON response, and you can find the ID in the forward_from_chat field)
3. Fill in the channel info in ~/.config/tgc/config.toml as shown below

# One export entry in a list of exports
[[exports]]
# Telegram chat id
chat_id = -1001191767119
# Output Path
path = "exports/hykilp"
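
If you would rather not pick the ID out of the JSON by eye, the lookup can be sketched as follows. This is not part of tgc. It assumes the bot's reply is a JSON object in which forward_from_chat sits either at the top level or under a "message" key, and the helper names extract_channel_id and exports_entry are hypothetical.

```python
import json


def extract_channel_id(raw_json):
    """Return forward_from_chat.id from a forwarded-message JSON dump."""
    update = json.loads(raw_json)
    # The message may be the top-level object or nested under "message".
    message = update.get("message", update)
    chat = message.get("forward_from_chat")
    if chat is None:
        raise ValueError(
            "no forward_from_chat field; was the message forwarded from a channel?"
        )
    return int(chat["id"])


def exports_entry(chat_id, path):
    """Format one [[exports]] table for config.toml.

    Assumes path contains no double quotes or backslashes, which would
    need escaping in a TOML basic string.
    """
    return f'[[exports]]\nchat_id = {chat_id}\npath = "{path}"\n'
```

Paste the bot's reply into extract_channel_id as a string, then append the output of exports_entry to your config file. One such table is needed per channel.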

After all setup is complete, you can proceed to running the crawler.

Running the Crawler

Simply run the tgc command.

Automatic Updates using GitHub Actions

TODO

Project details

Release history

1.1.0 (Jul 15, 2023)
1.0.9 (Mar 6, 2023)
1.0.8 (Mar 6, 2023)
1.0.7 (Jan 14, 2023)
1.0.6 (Jan 13, 2023)
1.0.5 (Dec 24, 2022)
1.0.4 (Dec 24, 2022)
1.0.3 (Dec 22, 2022)
1.0.2 (Dec 20, 2022)
1.0.2rc2, pre-release (Dec 20, 2022)
1.0.2rc1, pre-release (Dec 20, 2022)
1.0.1 (Dec 20, 2022), this version

Download files

Source Distribution

tgc-1.0.1.tar.gz (32.0 kB), uploaded Dec 20, 2022 (Source)

Built Distribution

tgc-1.0.1-py3-none-any.whl (16.0 kB), uploaded Dec 20, 2022 (Python 3)

Hashes for tgc-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`d28c868f85a6a8af950b523b90735b7000b7dc2b866f2f7c51f20cd26815abeb`
MD5	`32c3b69eabc7ec52e66db6a059b48bf3`
BLAKE2b-256	`735dba5ec09b2c65ba2a8ba716150f1323d540911ae24878eefdd07d8ad8ef1f`

Hashes for tgc-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4b8605ea78331039def418c8fc1726aa218f8c69b15423d04fb4e687f0a12ce1`
MD5	`e943d9062cc8f497a69c1e9ccbd8d9ae`
BLAKE2b-256	`96f53884d3c11498dc79818df3c470633c957e92dcc122f27dba727896a3d664`