Video Crawler

vidcrawler

Crawls major video sites such as YouTube, Rumble, Bitchute, and Brighteon for video content and reports what it finds.

API

Command line

vidcrawler --input_crawl_json "fetch_list.json" --output_json "out_list.json"

Python

from vidcrawler import crawl_video_sites

crawl_list = [
    ["channel name", "source", "channel_id"]
]
output = crawl_video_sites(crawl_list)

"source" and "channel_id" are used to generate the platform-specific URLs from which data is fetched. The "channel name" is echoed back in the generated JSON feeds but does not affect the fetching process in any way.
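
Because each entry is a positional three-element list, a small validation pass before crawling can catch malformed rows early. The sketch below is illustrative only: `validate_crawl_list` is a hypothetical helper, not part of vidcrawler, and it assumes that all three fields should be non-empty strings.

```python
from typing import List, Sequence


def validate_crawl_list(entries: Sequence[Sequence[str]]) -> List[List[str]]:
    """Return a cleaned copy of entries, raising ValueError on malformed rows.

    Hypothetical helper (not part of vidcrawler). Assumes each row is
    [channel name, source, channel_id] and that all fields are non-empty strings.
    """
    cleaned = []
    for index, entry in enumerate(entries):
        # Each row must be a list or tuple with exactly three positional fields.
        if not isinstance(entry, (list, tuple)) or len(entry) != 3:
            raise ValueError(f"entry {index} must have exactly 3 fields: {entry!r}")
        if not all(isinstance(field, str) and field.strip() for field in entry):
            raise ValueError(f"entry {index} has an empty or non-string field: {entry!r}")
        # Strip surrounding whitespace so stray spaces do not end up in the fields.
        cleaned.append([field.strip() for field in entry])
    return cleaned


crawl_list = validate_crawl_list(
    [["Sydney Watson", "youtube", "UCSFy-1JrpZf0tFlRZfo-Rvw"]]
)
# With vidcrawler installed, the validated list can then be passed on:
# from vidcrawler import crawl_video_sites
# output = crawl_video_sites(crawl_list)
```

Failing fast here keeps a single bad row from surfacing later as a confusing fetch error.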

Testing

Install vidcrawler, and the vidcrawler_test command will become available.

$ pip install vidcrawler
$ vidcrawler_test

Example input fetch_list.json

[
    [
        "Health Ranger Report",
        "brighteon",
        "hrreport"
    ],
    [
        "Sydney Watson",
        "youtube",
        "UCSFy-1JrpZf0tFlRZfo-Rvw"
    ],
    [
        "Computing Forever",
        "bitchute",
        "hybm74uihjkf"
    ],
    [
        "ThePeteSantilliShow",
        "rumble",
        "ThePeteSantilliShow"
    ],
    [
        "Macroaggressions",
        "odysee",
        "Macroaggressions"
    ]
]
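
The input file can also be generated programmatically. The following sketch uses only the standard library to write a fetch list and assemble the command line shown in the "Command line" section. `write_fetch_list` and `build_command` are hypothetical names introduced for illustration, and the command is only constructed here, not executed.

```python
import json
import tempfile
from pathlib import Path
from typing import List


def write_fetch_list(entries: List[List[str]], path: Path) -> Path:
    """Write [channel name, source, channel_id] rows as a JSON array."""
    path.write_text(json.dumps(entries, indent=4), encoding="utf-8")
    return path


def build_command(input_path: Path, output_path: Path) -> List[str]:
    """Assemble the vidcrawler invocation from the two documented flags."""
    return [
        "vidcrawler",
        "--input_crawl_json", str(input_path),
        "--output_json", str(output_path),
    ]


entries = [
    ["Health Ranger Report", "brighteon", "hrreport"],
    ["ThePeteSantilliShow", "rumble", "ThePeteSantilliShow"],
]

with tempfile.TemporaryDirectory() as tmp:
    fetch_path = write_fetch_list(entries, Path(tmp) / "fetch_list.json")
    command = build_command(fetch_path, Path(tmp) / "out_list.json")
    # Read the file back to confirm it round-trips as the same list.
    round_trip = json.loads(fetch_path.read_text(encoding="utf-8"))
    # With vidcrawler installed: subprocess.run(command, check=True)
```

Passing the command as a list of arguments rather than a single string avoids shell-quoting problems if a path contains spaces.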

Releases

  • 1.0.2: Updates testing
  • 1.0.1: Improves command line
  • 1.0.0: Initial release

