Skip to main content

A powerful, asynchronous web crawler designed specifically for YouTube.

Project description

Youcreep

A powerful, asynchronous web crawler designed specifically for YouTube. Built to efficiently interact with web pages in a headless mode, this tool facilitates the seamless extraction of YouTube video comments and video information directly from the platform.

Features

  • Asynchronous Crawling: Uses Python's asynchronous features for fast and efficient data extraction.
  • Headless Interaction: Operates seamlessly without opening a web browser.
  • Modular Design: Built with modularity in mind, allowing for easy extension and maintenance.

Modules

1. YoutubeCommentCrawler

Extract comments from any given YouTube video.

Usage:

from youcreep import YoutubeCommentCrawler

with YoutubeCommentCrawler() as crawler:
   comments = await crawler.crawl('YOUR_YOUTUBE_VIDEO_URL', n_target=NUMBER_OF_COMMENTS_TO_FETCH)

2. YoutubeVideoInfoCrawler

Search for videos based on a keyword and extract their information.

Usage:

from youcreep import YoutubeVideoInfoCrawler

with YoutubeVideoInfoCrawler() as crawler:
   videos_info = await crawler.crawl('YOUR_SEARCH_TERM', n_target=NUMBER_OF_VIDEOS_TO_FETCH,
                                     filter_options=YOUR_FILTER_OPTIONS)

Getting Started

  1. Clone the repository

    git clone https://github.com/stevieflyer/youtube_crawler.git
    
  2. Navigate to the project directory

    cd youcreep
    
  3. Install the dependencies

    pip install -r requirements.txt
    
  4. Start crawling

    Use the examples provided in the modules section above.

Contributing

Contributions are welcome! Please raise an issue or submit a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Happy crawling! :)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

youcreep-0.1.42.tar.gz (16.8 kB view details)

Uploaded Source

Built Distribution

youcreep-0.1.42-py3-none-any.whl (25.4 kB view details)

Uploaded Python 3

File details

Details for the file youcreep-0.1.42.tar.gz.

File metadata

  • Download URL: youcreep-0.1.42.tar.gz
  • Upload date:
  • Size: 16.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for youcreep-0.1.42.tar.gz
Algorithm Hash digest
SHA256 3d0a3fdf0c184c4564c98b2126a6025299c3d94798f495d2f56bf1e90a7a868e
MD5 af121488eefb73189415c5c6b2c98d0e
BLAKE2b-256 94b6299d89e2cfde8c82a2d396bcccfdb67aa9360365f56059cfaa9e44421f05

See more details on using hashes here.

File details

Details for the file youcreep-0.1.42-py3-none-any.whl.

File metadata

  • Download URL: youcreep-0.1.42-py3-none-any.whl
  • Upload date:
  • Size: 25.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for youcreep-0.1.42-py3-none-any.whl
Algorithm Hash digest
SHA256 26aefb72c29a273dc35ffee7c5fb06e0d66996ca26a44b935854af12bb6ec930
MD5 c6894d31259f2f288a948e76a67e5ec3
BLAKE2b-256 0844f31300420f9255145aff45fba7e1f31cbc4bf28bfa6e71c92dbbf49fd7b5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page