Utilities for scraping Yahoo pages
Project description
Python utilities for scraping Yahoo pages
The yahooscraper package is organized into modules and submodules. Each leaf module (i.e., module without submodules) contains functions that take a single argument – HTML text of the page represented by the module and its namespacing – and return some data parsed from the page.
If the data is not found, None is returned. Or, in cases where an iterable should be returned, an empty iterable may be returned.
Each leaf module also includes a url() function, which returns the URL of the page represented by the module. In cases where the module represents a set of URLs, this function takes parameters.
To get data from pages that require authentication, first obtain an authenticated session using the login module’s authenticated_session() method function.
See more detailed API documentation using pydoc:
$ pydoc yahooscraper $ pydoc yahooscraper.login $ pydoc yahooscraper.fantasy.team
Examples
Output Fantasy NBA team name:
import yahooscraper as ys
LEAGUE_ID = 237834
TEAM_ID = 8
session = ys.login.authenticated_session()
response = session.get(ys.fantasy.team.url('nba', LEAGUE_ID, TEAM_ID))
team = ys.fantasy.team.team(response.text)
print(team)
Install
$ pip install yahooscraper
Development
Test
$ python tests.py
Deploy
Bump version in setup.py
$ python setup.py register
$ python setup.py sdist upload
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.