https://vods.co/ vod scraper
Project description
vodscrepe
Description
This PyPI package is a tool for scraping the vods.co website. It currently supports only Super Smash Bros. Melee vods.
Installation
With Python installed, run the following command to add the package to your project:
pip install vodscrepe
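To confirm that the install succeeded, one option is to ask Python's standard library for the installed version. This is only a sketch: `installed_version` is a hypothetical helper name, not part of vodscrepe, and it assumes Python 3.8 or newer, where `importlib.metadata` is available.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints a version string if vodscrepe is installed, otherwise None.
    print(installed_version("vodscrepe"))
```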
Usage
The following is an example usage of the package, which is also included in the repo as example.py:
from vodscrepe import Scraper
from tqdm import tqdm

s = Scraper('melee', debug=True)
pages = range(300)  # pages 0 through 299

try:
    for vod in s.scrape(pages, show_progress=True):
        # Skip entries for which the scraper yields None.
        if vod is not None:
            # tqdm.write prints without disturbing the progress bars.
            tqdm.write(str(vod))
except KeyboardInterrupt:
    tqdm.write("Scraping terminated.")
This example lists information about the vods on the first 300 pages (pages 0 through 299), starting with the most recent, when run in the following fashion:
python example.py > sets.txt
Then, the sets.txt file becomes populated with vod information...
"['2020-01-26'] Genesis 7 - Zain (Marth) vs Hungrybox (Jigglypuff) - Grand Finals - Bo5"
"['2020-01-26'] Genesis 7 - Mango (Fox) vs Hungrybox (Jigglypuff) - Losers Finals - Bo5"
"['2020-01-26'] Genesis 7 - Hax (Fox) vs Hungrybox (Jigglypuff) - Losers Semis - Bo5"
"['2020-01-26'] Genesis 7 - Mango (Falco) vs Zain (Marth) - Winners Finals - Bo5"
"['2020-01-26'] Genesis 7 - Fiction (Fox) vs Hungrybox (Jigglypuff) - Losers Quarters - Bo5"
"['2020-01-26'] Genesis 7 - Hax (Fox) vs Leffen (Fox) - Losers Quarters - Bo5"
"['2020-01-26'] Genesis 7 - Hungrybox (Jigglypuff) vs Zain (Marth) - Winners Semis - Bo5"
"['2020-01-26'] Genesis 7 - Leffen (Fox) vs Mango (Falco) - Winners Semis - Bo5"
"['2020-01-26'] Genesis 7 - Fiction (Fox) vs n0ne (Captain Falcon) - Losers Top 8 - Bo5"
"['2020-01-26'] Genesis 7 - Hax (Fox) vs Shroomed (Sheik) - Losers Top 8 - Bo5"
"['2020-01-26'] Genesis 7 - n0ne (Captain Falcon) vs aMSa (Sheik) - Losers Round 6 - Bo5"
"['2020-01-26'] Genesis 7 - Fiction (Fox) vs Captain Faceroll (Sheik) - Losers Round 6 - Bo5"
"['2020-01-26'] Genesis 7 - Hax (Fox) vs PewPewU (Marth) - Losers Round 6 - Bo5"
"['2020-01-26'] Genesis 7 - Shroomed (Sheik) vs Swedish Delight (Sheik) - Losers Round 6 - Bo5"
"['2020-01-26'] Genesis 7 - iBDW (Fox) vs Captain Faceroll (Sheik) - Losers Round 5 - Bo5"
"['2020-01-26'] Genesis 7 - Ryobeat (Peach) vs S2J (Captain Falcon) - Losers Round 4 - Bo5"
"['2020-01-26'] Genesis 7 - Mew2King (Marth) vs ARMY (Ice Climbers) - Losers Round 4 - Bo5"
"['2020-01-26'] Genesis 7 - Trif (Peach) vs Swedish Delight (Sheik) - Losers Round 4 - Bo5"
"['2020-01-26'] Genesis 7 - SFAT (Fox) vs Panda (FL) (Fox) - Losers Round 3 - Bo5"
"['2020-01-26'] Genesis 7 - Shroomed (Sheik) vs Zain (Marth) - Winners Quarters - Bo5"
"['2020-01-26'] Genesis 7 - Mango (Falco) vs aMSa (Sheik) - Winners Quarters - Bo5"
"['2020-01-26'] Genesis 7 - Fiction (Fox) vs Leffen (Fox) - Winners Quarters - Bo3"
Scraping terminated.
...while the terminal details the progress:
All vods: 0%| | 0/300 [00:07<?, ?pages/s]
Page 0: 37%|████████████████████ | 22/60 [00:07<00:12, 3.07vods/s]
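The lines written to sets.txt follow a regular pattern, so they can be split back into fields for further analysis. The following is a minimal sketch, assuming every line looks like the sample output above (date, event, two players with a character in parentheses, round, and best-of count). `VodLine` and `parse_line` are hypothetical names, not part of vodscrepe, and the split on " - " is a heuristic that may misplace fields if an event name itself contains " - ".

```python
import re
from collections import Counter
from typing import NamedTuple, Optional


class VodLine(NamedTuple):
    date: str
    event: str
    player1: str
    character1: str
    player2: str
    character2: str
    round: str
    best_of: int


# Pattern inferred from the sample output; not an official format specification.
LINE_RE = re.compile(
    r"\['(?P<date>\d{4}-\d{2}-\d{2})'\] (?P<event>.+?) - "
    r"(?P<p1>.+) \((?P<c1>[^()]+)\) vs "
    r"(?P<p2>.+) \((?P<c2>[^()]+)\) - "
    r"(?P<round>.+?) - Bo(?P<best_of>\d+)"
)


def parse_line(line: str) -> Optional[VodLine]:
    """Parse one line of sets.txt; return None for lines that do not match."""
    text = line.strip().strip('"')  # drop the newline and surrounding quotes
    m = LINE_RE.fullmatch(text)
    if m is None:
        return None
    return VodLine(
        m["date"], m["event"], m["p1"], m["c1"],
        m["p2"], m["c2"], m["round"], int(m["best_of"]),
    )


def character_counts(lines) -> Counter:
    """Count how often each character appears across all parsed lines."""
    counts = Counter()
    for line in lines:
        vod = parse_line(line)
        if vod is not None:
            counts.update([vod.character1, vod.character2])
    return counts


if __name__ == "__main__":
    with open("sets.txt", encoding="utf-8") as f:
        for character, n in character_counts(f).most_common():
            print(character, n)
```

Non-matching lines such as "Scraping terminated." are skipped rather than raising an error, so the file can be processed as-is.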
License
This software is released under the terms of the MIT license.
Built Distribution
Hashes for vodscrepe-0.2.1-py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | c3895c2b879016c74ae5c952f5db83871bd69533b4e60ac31dd665be56d3a33f |
MD5 | f30146c6dd86429ad2ee94ef051ccc6b |
BLAKE2b-256 | 92fc13f548277e70a3ad5c5ad4af4ea372651e7654ea3abfc46518d6ecd52446 |