A pure-Python headless browser
A simple library for interacting with the web from Python
Description
activesoup combines familiar Python web libraries into convenient headless “browsing” functionality:
Modern HTTP support with requests - connection pooling, sessions, …
Convenient access to the web page through an interface inspired by beautifulsoup - straightforward HTML navigation.
Robust HTML parsing with html5lib - parse the web like browsers do.
Use cases
Consider using activesoup when:
Usage examples
Log into a website, and download a CSV file that’s access-protected:
from activesoup import driver

# secret_store is assumed to be a dict-like object holding your credentials
d = driver.Driver()
login_page = d.get('https://my-site.com/login')
login_form = login_page.form
member_portal = login_form.submit({'username': secret_store['username'],
                                   'password': secret_store['password']})
if member_portal.response.status_code not in range(200, 300):
    raise RuntimeError("Couldn't log in")

# Logged in now
csv_report = d.get('/members_area/file.csv')
csv_report.save_to('~/interesting_report.csv')
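To make the form-submission step above more concrete, here is a minimal standard-library sketch of what a headless browser typically does when a form is submitted: it collects the form's existing input values (hidden fields included), overlays the values you supply, and resolves the form's action against the page URL. This is not activesoup's implementation, and its internals may well differ. The names FormFieldCollector, build_submission, LOGIN_HTML and the csrf_token field are hypothetical and exist only for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode, urljoin


class FormFieldCollector(HTMLParser):
    """Hypothetical helper: record action, method and input defaults of the first <form>."""

    def __init__(self):
        super().__init__()
        self.action = ""
        self.method = "get"
        self.fields = {}
        self._in_form = False
        self._done = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form" and not self._done:
            self._in_form = True
            self.action = attrs.get("action") or ""
            self.method = (attrs.get("method") or "get").lower()
        elif tag == "input" and self._in_form and attrs.get("name"):
            # Inputs without a value attribute default to an empty string.
            self.fields[attrs["name"]] = attrs.get("value") or ""

    def handle_endtag(self, tag):
        if tag == "form" and self._in_form:
            self._in_form = False
            self._done = True


def build_submission(page_url, html, user_values):
    """Return (method, absolute target URL, url-encoded body) for the first form."""
    collector = FormFieldCollector()
    collector.feed(html)
    # Values supplied by the caller override the defaults found in the page.
    data = {**collector.fields, **user_values}
    target = urljoin(page_url, collector.action)
    return collector.method, target, urlencode(data)


# Hypothetical login page markup, assumed for this example only.
LOGIN_HTML = """
<form action="/session" method="post">
  <input type="hidden" name="csrf_token" value="abc123">
  <input type="text" name="username">
  <input type="password" name="password">
</form>
"""

method, url, body = build_submission(
    "https://my-site.com/login",
    LOGIN_HTML,
    {"username": "alice", "password": "s3cret"},
)
print(method, url)
print(body)
```

The same urljoin call shows how a relative path such as '/members_area/file.csv' can be resolved against the last page visited, which is presumably why the example above can pass a relative URL after logging in.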