Browser automation service with AI-powered element identification and locator generation
Project description
Browser Service ๐ค
AI-powered browser automation made simple. Automatically identify web elements and generate reliable locators for your automation scripts.
โจ What is Browser Service?
Browser Service is an intelligent browser automation library that uses AI to automatically identify and generate reliable selectors for web elements. No more fragile XPath or CSS selectors that break when websites change!
Perfect for:
- ๐ค Automated testing
- ๐ Web scraping
- ๐ Data extraction
- ๐ฏ UI automation
- ๐ Robotic Process Automation (RPA)
๐ Quick Start
1. Install
pip install browser-service
2. Basic Usage
import asyncio
from browser_service.config import config
from browser_service.tasks import process_workflow_task
async def automate_website():
# Configure your API key (get from Google AI Studio)
config.GEMINI_API_KEY = "your-gemini-api-key-here"
# Define what you want to automate
workflow = {
"url": "https://example.com",
"steps": [
{"action": "click", "element": "Login button"},
{"action": "type", "element": "Email field", "text": "user@example.com"},
{"action": "click", "element": "Submit button"}
]
}
# Run the automation
result = await process_workflow_task(workflow)
print(f"โ
Automation completed: {result}")
# Run it
asyncio.run(automate_website())
That's it! The AI automatically finds the right elements on the page.
๐ฏ Key Features
๐ง AI-Powered Element Detection
- Natural Language: Describe elements in plain English ("Login button", "Search box")
- Smart Locators: Automatically generates reliable XPath, CSS, and Playwright selectors
- Self-Healing: Adapts when websites change their structure
๐ง Easy Configuration
from browser_service.config import config
# Configure once, use everywhere
config.GEMINI_API_KEY = "your-api-key"
config.HEADLESS = True # Run without browser window
config.TIMEOUT = 30 # Seconds to wait for elements
๐ Multiple Browser Support
- Chrome/Chromium
- Firefox
- Safari (macOS)
- Edge
๐ Built-in Monitoring
from browser_service.utils import record_workflow_metrics
# Track your automation performance
metrics = record_workflow_metrics(workflow_result)
print(f"โฑ๏ธ Execution time: {metrics['duration']}s")
print(f"๐ฏ Success rate: {metrics['success_rate']}%")
๐ Usage Examples
Form Filling Automation
workflow = {
"url": "https://myapp.com/contact",
"steps": [
{"action": "type", "element": "Name field", "text": "John Doe"},
{"action": "type", "element": "Email field", "text": "john@example.com"},
{"action": "select", "element": "Country dropdown", "value": "United States"},
{"action": "click", "element": "Submit button"}
]
}
Data Extraction
workflow = {
"url": "https://news-site.com",
"steps": [
{"action": "extract", "element": "Article titles", "multiple": True},
{"action": "extract", "element": "Publication dates", "multiple": True}
]
}
E-commerce Automation
workflow = {
"url": "https://store.com",
"steps": [
{"action": "type", "element": "Search box", "text": "wireless headphones"},
{"action": "click", "element": "Search button"},
{"action": "click", "element": "First product"},
{"action": "click", "element": "Add to cart button"}
]
}
๐ง Configuration
Required Setup
from browser_service.config import config
# Required: Your Google Gemini API key
config.GEMINI_API_KEY = "your-gemini-api-key-here"
# Optional: Customize behavior
config.HEADLESS = True # Run without browser UI
config.TIMEOUT = 30 # Element wait timeout (seconds)
config.BROWSER_TYPE = "chrome" # chrome, firefox, safari, edge
Advanced Configuration
from browser_service.config import BrowserServiceConfig
# Create custom config
custom_config = BrowserServiceConfig(
gemini_api_key="your-key",
headless=False,
timeout=60,
browser_type="firefox"
)
๐ฎ Action Types
| Action | Description | Example |
|---|---|---|
click |
Click an element | {"action": "click", "element": "Submit button"} |
type |
Type text into field | {"action": "type", "element": "Email field", "text": "user@email.com"} |
select |
Select dropdown option | {"action": "select", "element": "Country", "value": "USA"} |
extract |
Get element text | {"action": "extract", "element": "Price"} |
wait |
Wait for element | {"action": "wait", "element": "Loading spinner"} |
scroll |
Scroll to element | {"action": "scroll", "element": "Bottom of page"} |
๐๏ธ API Reference
Core Functions
from browser_service.tasks import process_workflow_task
from browser_service.locators import generate_locators_from_attributes
from browser_service.browser import BrowserSessionManager
# Process a complete workflow
result = await process_workflow_task(workflow)
# Generate locators for an element
locators = generate_locators_from_attributes(element_attributes)
# Manage browser sessions
browser = BrowserSessionManager()
await browser.start_session()
# ... use browser ...
await browser.cleanup()
Configuration Classes
from browser_service.config import (
config, # Global config instance
BrowserServiceConfig, # Main config class
BatchConfig, # Batch processing config
LocatorConfig, # Locator generation config
LLMConfig # AI model config
)
๐ Troubleshooting
Common Issues
โ "Element not found" errors
# Solution: Make element descriptions more specific
{"action": "click", "element": "Blue Login button on the right"} # Better
{"action": "click", "element": "Login button"} # Worse
โ "API key invalid" error
# Make sure you set your Gemini API key
config.GEMINI_API_KEY = "your-actual-api-key-from-google-ai-studio"
โ "Timeout" errors
# Increase timeout for slow pages
config.TIMEOUT = 60 # 60 seconds instead of default 30
Debug Mode
import logging
from browser_service.utils import setup_logging
# Enable detailed logging
setup_logging(level=logging.DEBUG)
# Now you'll see detailed logs about element detection
๐ Requirements
- Python: 3.8 or higher
- API Key: Google Gemini API key (free tier available)
- Browser: Chrome/Chromium recommended (others supported)
๐ค Contributing
Found a bug or want to improve Browser Service?
- ๐ Report Issues: GitHub Issues
- ๐ก Feature Requests: Open an issue with "Feature Request" label
- ๐ง Code Contributions: Fork โ Branch โ PR
๐ License
MIT License - see LICENSE file for details.
๐โโ๏ธ Support
- ๐ Documentation: More examples in
/examplesfolder - ๐ฌ Questions: GitHub Discussions
- ๐ Bug Reports: GitHub Issues
Ready to automate? ๐ Install now and start building intelligent browser automation!
pip install browser-service
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file browser_service-1.0.9.tar.gz.
File metadata
- Download URL: browser_service-1.0.9.tar.gz
- Upload date:
- Size: 85.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e91fc42f7977f4bf610c3fbc1979953f49eb6845d2b20160e71c3b2af0ff3f35
|
|
| MD5 |
ceac955359f4492fb3088122e308e1fd
|
|
| BLAKE2b-256 |
8ce0a2cfc298a38cf3d0214215194aabea5c6adae8f9cb2cf47c250848cde3e3
|
File details
Details for the file browser_service-1.0.9-py3-none-any.whl.
File metadata
- Download URL: browser_service-1.0.9-py3-none-any.whl
- Upload date:
- Size: 94.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ae2792c9cb93b438df4ee55c0704a9299ad01e0e16883ffe4eba2320f81c7edd
|
|
| MD5 |
10b4a92d8e321860f3056b9a21099921
|
|
| BLAKE2b-256 |
e372e8bb98c6a9fceaab30efee5c1062c1b808f727babc41f92366068e31b4f6
|