A pure-python headless browser
Project description
A simple library for interacting with the web from python
Description
activesoup combines familiar python web capabilities for convenient headless “browsing” functionality:
Modern HTTP support with requests - connection pooling, sessions, …
Convenient access to the web page with beautifulsoup - convenient HTML navigation.
Robust HTML parsing with html5lib - parse the web like browsers do.
Use cases
Consider using activesoup when:
Usage examples
Log into a website, and download a CSV file that’s access-protected:
from activesoup import driver
d = driver.Driver()
login_page = d.get('https://my-site.com/login')
login_form = login_page.form
member_portal = login_form.submit({'username': secret_store['username'],
'password': secret_store['password']})
if member_portal.response.status_code not in range(200, 300):
raise RuntimeError("Couldn't log in")
# Logged in now
csv_report = d.get('/members_area/file.csv')
csv_report.save_to('~/interesting_resport.csv')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
activesoup-0.0.2.tar.gz
(4.2 kB
view hashes)
Built Distribution
Close
Hashes for activesoup-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 16c4944de7933562e3511c59b5e0761caf9c8ff9f052041769b69da09240a136 |
|
MD5 | 9cac28890bf6045043f130e40269eebb |
|
BLAKE2b-256 | 0082e6b9174f50aa04a99810bdb272f67abee1a802654ffef54eee8b5fe2e0d0 |