https://vods.co/ vod scraper
Project description
vodscrepe
Description
This PyPI package is best described as a tool for scraping the vods.co website. Currently, the package only supports Super Smash Bros. Melee vods.
Installation
With Python installed, simply run the following command to add the package to your project.
python -m pip install vodscrepe
Usage
The following is an example usage of the package, which is also included in the repo as example.py
:
from tqdm import tqdm
from vodscrepe import Scraper
s = Scraper('melee')
try:
for vod in s.scrape(show_progress=True):
tqdm.write(str(vod))
except KeyboardInterrupt:
tqdm.write("Scraping terminated.")
This example lists information about the vods from the most recent to the last page in the following fashion:
python example.py > sets.txt
Then, the sets.txt
file becomes populated with vod information...
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Hungrybox (Jigglypuff) - Grand Finals - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs n0ne (Captain Falcon) - Losers Finals - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Gahtzu (Captain Falcon) - Losers Semis - Bo5
['2020-03-15'] CEO Dreamland 2020 - Hungrybox (Jigglypuff) vs n0ne (Captain Falcon) - Winners Finals - Bo5
['2020-03-15'] CEO Dreamland 2020 - Panda (FL) (Fox) vs Gahtzu (Captain Falcon) - Losers Quarters - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Chef Rach (Captain Falcon) - Losers Quarters - Bo5
['2020-03-15'] CEO Dreamland 2020 - n0ne (Captain Falcon) vs Panda (FL) (Fox) - Winners Semis - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Hungrybox (Jigglypuff) - Winners Semis - Bo5
['2020-03-15'] CEO Dreamland 2020 - Krudo (Sheik) vs Gahtzu (Captain Falcon) - Losers Top 8 - Bo5
['2020-03-15'] CEO Dreamland 2020 - blankshooter744 (Fox) vs Chef Rach (Captain Falcon) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Leighton (Jigglypuff) vs Prof (Marth) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Sinbad (Sheik) vs Krudo (Sheik) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Wevans (Samus) vs Gahtzu (Captain Falcon) - Losers Round 5 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Sinbad (Sheik) vs Dom (FL) (Marth) - Losers Round 4 - Bo5
['2020-03-15'] CEO Dreamland 2020 - Colbol (Fox) vs Gahtzu (Captain Falcon) - Winners Quarters - Bo5
Scraping terminated.
...while the terminal details the progress:
All vods: 0%| | 0/331 [00:07<?, ?pages/s]
Page 0: 25%|██████████████ | 15/60 [00:07<00:12, 3.07vods/s]
Feel free to check out the docs for more information.
License
This software is released under the terms of MIT license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file vodscrepe-2.1.0.tar.gz
.
File metadata
- Download URL: vodscrepe-2.1.0.tar.gz
- Upload date:
- Size: 8.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.4 CPython/3.7.10 Linux/5.4.0-1041-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 36c323974f698713b2841632cec4ff49154e567d2cae3ead157da90a1601c506 |
|
MD5 | c7df67ada937d645a291a4e907e639b5 |
|
BLAKE2b-256 | cf95cfbe8be2450337589397fc94c1b7eaddd8f0dc6fd134fd67349919c1f863 |
File details
Details for the file vodscrepe-2.1.0-py3-none-any.whl
.
File metadata
- Download URL: vodscrepe-2.1.0-py3-none-any.whl
- Upload date:
- Size: 9.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.4 CPython/3.7.10 Linux/5.4.0-1041-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f4122b9865d393029732bc17cb0088c59555f1d752dcee74d5b9044d1db67f47 |
|
MD5 | 125b085bcbcf66490e7a80896bc8b3b6 |
|
BLAKE2b-256 | cc166b642f250b464c7db3e16cc50a301bb9e272fac6ce977a8fae8490803bb3 |