Clark University, Package for YouTube crawler and cleaning data
Project description
clarku-youtube-crawler
Version 0.0.1->0.0.3
This is beta without testing since python packaging is a pain. Please don't install these versions.
Version 0.0.5
Finally figured out testing. It works okay. More documentation to come. To install:
pip install clarku-youtube-crawler
Example usage
import clarku_youtube_crawler.RawCrawler as cr
After running import, go to config.ini
to configure file paths. Make sure DEVELOPER_KEY.txt (or if the filename differs, configure also in config.ini
) is in the same folder. Then run:
test = cr.RawCrawler()
test.__build__()
test.crawl("asmr",start_date=1, start_month=1, start_year=2020, day_count=1)
test.crawl_videos_in_list(comment_page_count=1)
test.merge_all()
If missing requirements (I already include all dependencies so it shouldn't happen), download requirements.txt
here on this repo
and run
$ pip install -r requirements.txt
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for clarku_youtube_crawler-0.0.6.dev0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | fc46c012d9ae30bb0b99cb92c296795534ae37fc82246d0a2e5725521c9fd288 |
|
MD5 | 7427b6507dbbcda976e8c2fec15f5d63 |
|
BLAKE2b-256 | c1928d8e6f4c1279327371e0ab9dabb04ccc7bc6e2a46f19f159c040a67cffb0 |
Hashes for clarku_youtube_crawler-0.0.6.dev0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cb088c58dc8cac53b35d7313ca4a1e09f8c41649f42f4da0a842cddb450b45eb |
|
MD5 | ad1700ca380d7a3be4a68bf0059bc4b4 |
|
BLAKE2b-256 | ec12121dcc6c918f343c876846746cdcc87f574fff229b0c1702681fc4d6588b |