Intuned Browser SDK
Project description
Intuned Browser SDK (Python)
Intuned's Python SDK for browser automation and web data extraction, designed to work seamlessly with the Intuned platform.
Installation
Using Poetry (Recommended)
poetry add intuned-browser
Using pip
pip install intuned-browser
Features
The Intuned Browser SDK provides a comprehensive set of tools for browser automation and data extraction:
🤖 AI-Powered Extraction
- Structured Data Extraction - Extract structured data from web pages using AI
- Schema Validation - Validate extracted data against JSON schemas
- Smart Page Loading Detection - Determine when pages have fully loaded
🌐 Web Automation Helpers
- Navigation - Advanced URL navigation with
go_to_url() - Content Loading - Scroll to load dynamic content with
scroll_to_load_content() - Network Monitoring - Wait for network activity with
wait_for_network_settled() - DOM Monitoring - Wait for DOM changes with
wait_for_dom_settled() - Click Automation - Click elements until exhausted with
click_until_exhausted()
📄 Content Processing
- HTML Sanitization - Clean and sanitize HTML with
sanitize_html() - Markdown Extraction - Convert HTML to markdown with
extract_markdown() - URL Resolution - Resolve relative URLs with
resolve_url() - Date Processing - Parse and process dates with
process_date()
📁 File Operations
- File Downloads - Download files with
download_file() - S3 Integration - Upload and save files to S3 with
upload_file_to_s3()andsave_file_to_s3()
✅ Data Validation
- Schema Validation - Validate data structures with
validate_data_using_schema() - Empty Value Filtering - Filter empty values with
filter_empty_values()
Quick Start
from intuned_browser import (
extract_markdown,
sanitize_html,
go_to_url,
wait_for_network_settled,
validate_data_using_schema
)
# Example: Extract and process web content
async def extract_content(page):
# Navigate to URL
await go_to_url(page, "https://example.com")
# Wait for network to settle
await wait_for_network_settled(page)
# Get and sanitize HTML
html = await page.content()
clean_html = sanitize_html(html)
# Extract markdown
markdown = extract_markdown(clean_html)
return markdown
AI-Powered Data Extraction
from intuned_browser.ai import extract_structured_data
from intuned_browser.ai.types import JsonSchema
# Define your data schema
schema: JsonSchema = {
"type": "object",
"properties": {
"title": {"type": "string"},
"price": {"type": "number"},
"description": {"type": "string"}
},
"required": ["title", "price"]
}
# Extract structured data using AI
async def extract_product_data(page):
result = await extract_structured_data(
page=page,
schema=schema,
prompt="Extract product information from this page"
)
return result
Documentation
For detailed documentation on all functions and types, see the documentation.
Support
For support, questions, or contributions, please contact the Intuned team at engineering@intunedhq.com.
About Intuned
Intuned provides powerful tools for browser automation, web scraping, and data extraction. Visit intunedhq.com to learn more.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file intuned_browser-0.1.9.tar.gz.
File metadata
- Download URL: intuned_browser-0.1.9.tar.gz
- Upload date:
- Size: 110.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fd3909612092af0bb89d4a6d05553d54e63d58534d698b7840032af16d8e36a8
|
|
| MD5 |
50e3b788d930a8faf2ca108ded35daf0
|
|
| BLAKE2b-256 |
6da1c3cc14c2c978b86af411d36c7dee0fcaee5ff5b9cb84da12425531fc1a87
|
File details
Details for the file intuned_browser-0.1.9-py3-none-any.whl.
File metadata
- Download URL: intuned_browser-0.1.9-py3-none-any.whl
- Upload date:
- Size: 142.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ed4dd4b5fd33d938629928519e474cce2ee7b7232df0491781675a8c9d42fafc
|
|
| MD5 |
c541b715dc25817c34b4d474dd5cee81
|
|
| BLAKE2b-256 |
36da806f5dd080d677a288a13a4c370b3b79ff710947fb1286e4e2cafd35261f
|