Asynchronous scraper to download YouTube Community posts

Project description

YoutubeCommunityScraper | yp-dl

yp-dl is an asynchronous scraper that downloads YouTube Community posts in JSON format.

Motivation

YouTube stops loading a channel's community posts after the 200 most recent ones. Older posts cannot be viewed unless you already have their link or ID.

Installation

pip install yp-dl

Features

  • Asynchronous support
  • For every post it retrieves:
    • post_link
    • time_since
    • utc_timestamp at download
    • video_link
    • image_link
    • text_content
    • poll_content
  • Update support: existing JSON files can be extended when new posts are published
  • Progress visualization during download
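
The per-post fields listed above can be pictured with a short Python sketch. It assumes that each downloaded file holds a JSON list of objects keyed by those field names; the actual layout written by yp-dl is not documented here, so treat the structure, the field name utc_timestamp, and the helper names load_posts and merge_new_posts as hypothetical. The merge function only illustrates the idea behind update support and is not yp-dl's own implementation.

```python
import json
from pathlib import Path

# Field names taken from the feature list above. The file layout
# (a JSON list of objects) is an assumption, not documented behaviour.
POST_FIELDS = (
    "post_link",
    "time_since",
    "utc_timestamp",
    "video_link",
    "image_link",
    "text_content",
    "poll_content",
)


def load_posts(path):
    """Read a list of post dicts from a JSON file (hypothetical helper)."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def merge_new_posts(existing, fetched):
    """Append fetched posts whose post_link is not already present.

    The order of `existing` is preserved; new posts go to the end.
    """
    seen = {post["post_link"] for post in existing}
    merged = list(existing)
    for post in fetched:
        if post["post_link"] not in seen:
            merged.append(post)
            seen.add(post["post_link"])
    return merged
```

Keying on post_link is a design choice for this sketch: it is the one field in the list that plausibly identifies a post uniquely.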

Usage

usage: yp-dl [-h] [-f FOLDER_PATH] [-r] [-u] [-v] link [link ...]

An asynchronous scraper that downloads youtube posts from youtube channels in json format.

positional arguments:
  link                  Provide any number of links. Link example: https://www.youtube.com/@3blue1brown

options:
  -h, --help            show this help message and exit
  -f FOLDER_PATH, --folder-path FOLDER_PATH
                        Provide the path of the folder you wish to store/update your json files. If it's in the
                        current working directory (CWD), just type the folder name. If none is provided, everything
                        will be stored/updated in the CWD.
  -r, --reverse         Reverses the order of the posts from oldest first to newest first. Be wary though, if you
                        use this option with --update, your post order will be messed up.
  -u, --update          Appends the existing json file(s) with the new posts.
  -v, --verbose         Gives more details about what's going on when the program runs.
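
For scripted use, the options above can be assembled into an argument list from Python. This is a minimal sketch: build_command is a hypothetical helper, not part of yp-dl, and it uses only the flags shown in the help text. The folder name "posts" is an arbitrary example.

```python
import shlex
import warnings


def build_command(links, folder_path=None, reverse=False, update=False, verbose=False):
    """Build an argv list for the yp-dl CLI (hypothetical helper)."""
    if not links:
        raise ValueError("at least one channel link is required")
    if reverse and update:
        # The help text warns that combining these options scrambles post order.
        warnings.warn("--reverse together with --update will mix up the post order")
    cmd = ["yp-dl"]
    if folder_path is not None:
        cmd += ["-f", str(folder_path)]
    if reverse:
        cmd.append("-r")
    if update:
        cmd.append("-u")
    if verbose:
        cmd.append("-v")
    cmd.extend(links)
    return cmd


command = build_command(
    ["https://www.youtube.com/@3blue1brown"],
    folder_path="posts",
    update=True,
)
# Print the equivalent shell command line.
print(shlex.join(command))
```

With yp-dl installed, the returned list can be passed to subprocess.run(command, check=True) to start the download.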

Download files

Source Distribution

yp_dl-0.9.6.tar.gz (5.4 kB)

Uploaded Source

Built Distribution

yp_dl-0.9.6-py3-none-any.whl (6.6 kB)

Uploaded Python 3

File details

Details for the file yp_dl-0.9.6.tar.gz.

File metadata

  • Download URL: yp_dl-0.9.6.tar.gz
  • Upload date:
  • Size: 5.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.0 Linux/6.2.0-1015-azure

File hashes

Hashes for yp_dl-0.9.6.tar.gz
Algorithm Hash digest
SHA256 f1a705070608da4c1a91b2403f482b4fa7119e23c65d3ea5a1d6dae5aa9c1556
MD5 385bcb55729d876497d6506cfbee1d28
BLAKE2b-256 ab4cf58246d09f2d6a63333d499ed0af2e14f8953c4b91c9d69aa70b6106ad9d
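
A downloaded archive can be checked against the SHA256 digest listed above with Python's standard hashlib module. This is a minimal sketch: sha256_of and matches_expected are hypothetical helper names, and the local file path is whatever location you saved the archive to.

```python
import hashlib

# SHA256 digest for yp_dl-0.9.6.tar.gz, copied from the listing above.
EXPECTED_SHA256 = "f1a705070608da4c1a91b2403f482b4fa7119e23c65d3ea5a1d6dae5aa9c1556"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the expected hex string."""
    return sha256_of(path) == expected.lower()
```

For example, matches_expected("yp_dl-0.9.6.tar.gz") should return True for an intact copy of the source distribution saved in the current directory.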

File details

Details for the file yp_dl-0.9.6-py3-none-any.whl.

File metadata

  • Download URL: yp_dl-0.9.6-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.0 Linux/6.2.0-1015-azure

File hashes

Hashes for yp_dl-0.9.6-py3-none-any.whl
Algorithm Hash digest
SHA256 00c1d9423ea217b5dca6bc868c3689e92c77a825f039bbb71a87f15a95bef118
MD5 6475ab78d9731df803a066339f4bc20a
BLAKE2b-256 0300ec7bd4cc1f8c5e6eba5dae3135d007bc0ba67d327283f9df76713ccd2ca0
