A tool for extracting YouTube video details
Project description
YouTube Scrapper
YouTube Scrapper is a data pipeline that extracts data from videos posted on YouTube. For each video, the scraper collects the title, the description, the number of views, and all viewer comments. The extracted information is written to a CSV file, which can then be uploaded to an Amazon RDS database.
Getting Started
- Make sure pip and ChromeDriver are installed.
- Install the package by running 'pip install Youtube-scrapper' in your console.
- Run 'python -m youtube_scrapper' in your terminal.
- A prompt will ask for the following information:
  - channel - the YouTube channel you want to scrape data from
  - scroll_range - how many additional batches of videos the scraper should scroll through and collect information from (see Note)
- Once the information is entered, the scraper starts collecting the requested data.
- When it finishes, the data is written to a CSV file in the directory from which the package was run.
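Since the output lands as a CSV file in the working directory, a short script can pick it up for inspection. This is a minimal sketch using only the standard library; the helper names `newest_csv` and `load_rows` are hypothetical, and it assumes the file has a header row and is UTF-8 encoded, neither of which is stated here.

```python
import csv
from pathlib import Path


def newest_csv(directory="."):
    """Return the most recently modified .csv file in `directory`, or None.

    Hypothetical helper: the output file name is not documented here,
    so this simply looks for the newest CSV in the run directory.
    """
    candidates = sorted(
        Path(directory).glob("*.csv"),
        key=lambda path: path.stat().st_mtime,
    )
    return candidates[-1] if candidates else None


def load_rows(csv_path):
    """Read a CSV into a list of dicts, one per row.

    Assumes a header row and UTF-8 encoding (both are assumptions).
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))
```

For example, `load_rows(newest_csv())` run from the same directory would return one dictionary per scraped row, keyed by whatever column names the file actually contains.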
Note
- When scroll_range is 0, 30 videos (the minimum the scraper collects) are returned from the specified channel. Each time the value goes up by 1, 30 more videos are returned.
- A run takes a while (approximately 30 minutes for 30 videos).
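The relationship between scroll_range and the number of videos can be written out as a small calculation. The sketch below is illustrative only: the function names are hypothetical, and the run-time estimate assumes the time scales linearly from the quoted figure of roughly 30 minutes per 30 videos, which is an assumption rather than a measured property.

```python
import math

VIDEOS_PER_BATCH = 30          # from the note: 30 videos per scroll step
MINUTES_PER_VIDEO = 30 / 30    # assumption: linear scaling of ~30 min per 30 videos


def expected_video_count(scroll_range):
    """Videos returned for a given scroll_range (0 -> 30, 1 -> 60, ...)."""
    if scroll_range < 0:
        raise ValueError("scroll_range must be 0 or greater")
    return VIDEOS_PER_BATCH * (scroll_range + 1)


def scroll_range_for(min_videos):
    """Smallest scroll_range that returns at least `min_videos` videos."""
    return max(0, math.ceil(min_videos / VIDEOS_PER_BATCH) - 1)


def estimated_minutes(scroll_range):
    """Rough run-time estimate, assuming time grows linearly with video count."""
    return expected_video_count(scroll_range) * MINUTES_PER_VIDEO
```

For instance, collecting at least 100 videos would need `scroll_range_for(100)`, which evaluates to 3 and corresponds to 120 videos.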
Dependencies
- Python 3.7.9
- selenium 3.141.0
- pandas 1.3.2
Source Distribution
File details
Details for the file Youtube_scrapper-0.0.7.tar.gz
File metadata
- Download URL: Youtube_scrapper-0.0.7.tar.gz
- Upload date:
- Size: 5.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 3282883b2c4ed885a261b83fe528d7b007b9619e4b3e627461947b01c9ab29a1
MD5 | 9096df6ae38560aa44cbeadac3734cdf
BLAKE2b-256 | 3af6e149f71b226420233f49ea90ccd9c6fa07d43db922956e9ebcccb5d60351
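The published digests can be used to check a manually downloaded archive. The following is a minimal sketch using Python's standard `hashlib`; the function names are hypothetical, and the local path to the downloaded Youtube_scrapper-0.0.7.tar.gz file is whatever you saved it as.

```python
import hashlib

# SHA256 digest listed above for Youtube_scrapper-0.0.7.tar.gz
EXPECTED_SHA256 = "3282883b2c4ed885a261b83fe528d7b007b9619e4b3e627461947b01c9ab29a1"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected digest."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_published_hash("Youtube_scrapper-0.0.7.tar.gz")` on the downloaded archive should return True if the file is intact and unmodified.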