
https://vods.co/ vod scraper


vodscrepe


Description

vodscrepe is a PyPI package for scraping the vods.co website. It currently supports only Super Smash Bros. Melee vods.

Installation

With Python installed, run the following command to add the package to your project:

python -m pip install vodscrepe

Usage

The following is an example usage of the package, which is also included in the repo as example.py:

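from tqdm import tqdm

from vodscrepe import Scraper

# Create a scraper for Super Smash Bros. Melee vods.
s = Scraper('melee')

try:
    # Iterate over the scraped vods while progress bars are displayed.
    for vod in s.scrape(show_progress=True):
        # tqdm.write prints a line without corrupting the progress bars.
        tqdm.write(str(vod))
except KeyboardInterrupt:
    # Ctrl+C stops the scrape cleanly.
    tqdm.write("Scraping terminated.")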

This example prints information about each vod, starting with the most recent and continuing to the last page. Run it with the output redirected to a file:

python example.py > sets.txt

Then, the sets.txt file becomes populated with vod information...

['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Hungrybox (Jigglypuff) - Grand Finals - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs n0ne (Captain Falcon) - Losers Finals - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Gahtzu (Captain Falcon) - Losers Semis - Bo5
['2020-03-15'] CEO Dreamland 2020 - Hungrybox (Jigglypuff) vs n0ne (Captain Falcon) - Winners Finals - Bo5
['2020-03-15'] CEO Dreamland 2020 - Panda (FL) (Fox) vs Gahtzu (Captain Falcon) - Losers Quarters - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Chef Rach (Captain Falcon) - Losers Quarters - Bo5
['2020-03-15'] CEO Dreamland 2020 - n0ne (Captain Falcon) vs Panda (FL) (Fox) - Winners Semis - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Hungrybox (Jigglypuff) - Winners Semis - Bo5
['2020-03-15'] CEO Dreamland 2020 - Krudo (Sheik) vs Gahtzu (Captain Falcon) - Losers Top 8 - Bo5
['2020-03-15'] CEO Dreamland 2020 - blankshooter744 (Fox) vs Chef Rach (Captain Falcon) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Leighton (Jigglypuff) vs Prof (Marth) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Sinbad (Sheik) vs Krudo (Sheik) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Wevans (Samus) vs Gahtzu (Captain Falcon) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Sinbad (Sheik) vs Dom (FL) (Marth) - Losers Round 4 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Gahtzu (Captain Falcon) - Winners Quarters - Bo5
Scraping terminated.

...while the terminal details the progress:

All vods:   0%|                                              | 0/331 [00:07<?, ?pages/s]
Page 0:  25%|██████████████                                  | 15/60 [00:07<00:12,  3.07vods/s]
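The lines written to sets.txt follow a regular pattern, so they can be turned back into structured records. The following is only a sketch based on the sample output above, not part of the vodscrepe API: the names VodLine, LINE_RE and parse_line are hypothetical, and the regular expression assumes that the event name contains no " - " separator and that each player's character is the last parenthesised group before "vs" or before the round name.

```python
import re
from typing import NamedTuple, Optional


class VodLine(NamedTuple):
    """One parsed line of sets.txt (hypothetical helper type)."""
    date: str
    event: str
    player1: str
    char1: str
    player2: str
    char2: str
    round_name: str
    best_of: str


# Pattern inferred from the sample output only; other line shapes are not handled.
LINE_RE = re.compile(
    r"^\['(?P<date>\d{4}-\d{2}-\d{2})'\] "
    r"(?P<event>.+?) - "
    r"(?P<player1>.+) \((?P<char1>[^()]+)\) vs "
    r"(?P<player2>.+) \((?P<char2>[^()]+)\) - "
    r"(?P<round_name>.+) - (?P<best_of>Bo\d+)$"
)


def parse_line(line: str) -> Optional[VodLine]:
    """Return a VodLine, or None for lines that do not match the pattern."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    return VodLine(**match.groupdict())


def parse_file(path: str) -> list:
    """Parse every matching line of a file such as sets.txt."""
    with open(path, encoding="utf-8") as handle:
        parsed = (parse_line(line) for line in handle)
        return [vod for vod in parsed if vod is not None]
```

Lines that do not match, such as the trailing "Scraping terminated." message, are skipped. Player tags that themselves contain parentheses, such as "Panda (FL)", stay intact because only the last parenthesised group is read as the character.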

Feel free to check out the docs for more information.

License

This software is released under the terms of the MIT license.
