Skip to main content

A tool for extracting youtube video details

Project description

YouTube Scrapper

The YouTube scrapper is a data pipeline that allows you to extract data from videos posted on YouTube. The scrapper will extract information such as; video titles, descriptions, the number of views the video has and all the users or viewers' comments per each video. Information extracted is dumped into a csv file which can then be uploaded to an Amazon RDS database.

Getting Started

  1. Make sure pip and Chromedriver is installed.
  2. Download package by using pip install Youtube-scrapper in console
  3. Run 'python -m youtube_scrapper' on your terminal
  4. A prompt to enter the following information will pop up;
    1. channel - this takes the youtube channel you want to scrape data from
    2. scroll_range - this indicates how many video you want the scrapper to look and grab information from.
  5. Once the information is passed in, the scrapper will start grabbing all information requested.
  6. Once finished, the data is returned in a csv file which can be found at the directory level the package was run.

Note

  • When the scroll_range value is 0, 30 videos (The least amount of videos that will be scrapped) are returned from the channel specified. Each time the value goes up by 1, 30 more videos are returned.
  • Run time takes a while (approximately 30mins for 30 videos)

Dependencies

  • Python 3.7.9
  • selenium 3.141.0
  • pandas 1.3.2

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Youtube_scrapper-0.0.7.tar.gz (5.4 kB view details)

Uploaded Source

File details

Details for the file Youtube_scrapper-0.0.7.tar.gz.

File metadata

  • Download URL: Youtube_scrapper-0.0.7.tar.gz
  • Upload date:
  • Size: 5.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.5

File hashes

Hashes for Youtube_scrapper-0.0.7.tar.gz
Algorithm Hash digest
SHA256 3282883b2c4ed885a261b83fe528d7b007b9619e4b3e627461947b01c9ab29a1
MD5 9096df6ae38560aa44cbeadac3734cdf
BLAKE2b-256 3af6e149f71b226420233f49ea90ccd9c6fa07d43db922956e9ebcccb5d60351

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page