A browser for your agent, built on Chrome and Pyppeteer.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Development Status
- 2 - Pre-Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
Programming Language
- Python :: 3

Project description

agentbrowser

A browser for your agent, built on Chrome and Pyppeteer.

Installation

pip install agentbrowser

Usage

Importing into your project

from agentbrowser import (
    get_browser,
    init_browser,
    navigate_to,
    get_body_html,
    get_body_text,
    get_body_text_raw,
    get_document_html,
    create_page,
    close_page,
    evaluate_javascript,
)

Quickstart

from agentbrowser import (
    navigate_to,
    get_body_text,
)

# Navigate to a URL
page = navigate_to("https://google.com")

# Get the text from the page
text = get_body_text(page)

print(text)

Basic:

Create a new page

Equivalent of ctrl+t in Chrome. Makes a new blank page.

page = create_page()

Close a page

Equivalent of ctrl+w in Chrome. Closes the current page.

close_page(page)

Navigate to a URL

Equivalent of typing a URL into the address bar and hitting enter. If you haven't created a page yet, it will create one for you.

page = navigate_to("https://google.com")

Get the HTML of the page

Get the entire document HTML

html = get_document_html(page)

Get the HTML of the body

Get just the HTML of the body and inner. Useful for parsing out the content of the page.

html = get_body_html(page)

Get the text of the body

Get just the text of the body. Unlike the raw function, tries to remove some useless tags and divs and things. Not perfect, though.

text = get_body_text(page)

Get the raw text of the body

Get the raw text of the body. This will include all the tags and divs and things.

text = get_body_text_raw(page)

Advanced Usage

Get browser

This will give you a reference to the browser object, which you can use for advanced stuff. The browser object comes from Pyppeteer, so anything you can do with Pyppeteer, you can do with this.

browser = get_browser()

Evaluate Javascript

Call some Javascript on the page. Equivalent of opening the console and typing in some Javascript.

result = evaluate_javascript(page, "document.title")

Initialize browser

This will initialize the browser object. You can pass headless and executable_path. Headless will control whether the actual window appears on screen. Executable path will control which browser is used. By default, it will try to find Chrome first, then fall back to Chromium if it can't find Chrome.

The browser will be auto-initialized by default so you don't need to call this. The only reason you would is because you want to use headful or swap the browser.

init_browser(headless=True, executable_path="/path/to/chrome")

bash publish.sh --version=<version> --username=<pypi_username> --password=<pypi_password>

Contributions Welcome

If you like this library and want to contribute in any way, please feel free to submit a PR and I will review it. Please note that the goal here is simplicity and accesibility, using common language and few dependencies.

Questions, Comments, Concerns

If you have any questions, please feel free to reach out to me on Twitter or Discord.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Development Status
- 2 - Pre-Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.2.2

Aug 2, 2023

0.2.1

Aug 1, 2023

0.2.0

Aug 1, 2023

0.1.4

Jul 18, 2023

0.1.1

Jul 10, 2023

0.1.0

Jul 10, 2023

This version

0.0.0

Jul 18, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentbrowser-0.0.0.tar.gz (5.5 kB view hashes)

Uploaded Jul 18, 2023 Source

Built Distribution

agentbrowser-0.0.0-py3-none-any.whl (6.7 kB view hashes)

Uploaded Jul 18, 2023 Python 3

Hashes for agentbrowser-0.0.0.tar.gz

Hashes for agentbrowser-0.0.0.tar.gz
Algorithm	Hash digest
SHA256	`9a5d36ae3168dc20df721bf5da00c070f23d8bef76a8cc4d78284775a4ff5d8c`
MD5	`083421b2e8d59281e4a62df348e086a2`
BLAKE2b-256	`f7fd7cf1950fe06d8650e5abe76f610eab6da5e2e8fa3a0afbe5fa01de660326`

Hashes for agentbrowser-0.0.0-py3-none-any.whl

Hashes for agentbrowser-0.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3392015d71b393c9222c863144ec080de141bd05f424c93e15cf45fb66fac878`
MD5	`c243dba892bcb98e9fdcf5627683838a`
BLAKE2b-256	`12b7ef077adb8e8e9519f870a00b82e44e8ff72daf80962e733926d556a91625`