Clark University, Package for YouTube crawler and cleaning data
Project description
clarku-youtube-crawler
Version 0.0.1->0.0.3
This is beta without testing since python packaging is a pain. Please don't install these versions.
Version 0.0.5
Finally figured out testing. It works okay. More documentation to come. To install:
pip install clarku-youtube-crawler
Example usage
import clarku_youtube_crawler.RawCrawler as cr
After running import, go to config.ini
to configure file paths. Make sure DEVELOPER_KEY.txt (or if the filename differs, configure also in config.ini
) is in the same folder. Then run:
test = cr.RawCrawler()
test.__build__()
test.crawl("asmr",start_date=1, start_month=1, start_year=2020, day_count=1)
test.crawl_videos_in_list(comment_page_count=1)
test.merge_all()
If missing requirements (I already include all dependencies so it shouldn't happen), download requirements.txt
here on this repo
and run
$ pip install -r requirements.txt
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for clarku_youtube_crawler-0.0.6.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 640035108a14b9bcf724082364ceb0d02dbed128f08bb78215fa1dc6d26ef4ed |
|
MD5 | cda4d6b1530e2d4d17b543f9150694ff |
|
BLAKE2b-256 | e91aae5a46902a84749f4d16fb1ab460a2c8c3699175802a7d26a60f6966328d |
Hashes for clarku_youtube_crawler-0.0.6-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1597bcd096e1699fa847fc7258b6307ba46b9d254c77feeadff974367c847950 |
|
MD5 | 3080c2fcd7ce7eb0fdbd8da721fb88d8 |
|
BLAKE2b-256 | bd1e47658832b6c85f9a2354238e08157c525ebecc51723a09a348d540a7515c |