Get HTML elements locators using natural language.

These details have not been verified by PyPI

Project description

Locatr

Test

Locatr package helps you to find HTML locators on a webpage using prompts and llms.

Overview

LLM based HTML locator finder.
Re-rank support for improved accuracy.
Supports playwright, selenium, cdp.
Uses cache to reduce calls to llm apis.
Results/Statistics generation of api calls.

Example:

starButtonLocator, err := locatr.GetLocatr("Star button on the page")
starButtonLocator.click()

Install Locatr with

Golang

go get github.com/vertexcover-io/locatr

Python

pip install locatr

Quick Example
LLM Client
Re-ranking Client
Locatr Settings
Locatrs
Cache Schema & Management
Logging
Generate Statistics
Contributing

Quick Example

With python

Python example

# example assumes that there is already a page opened in the selenium session.
import os

from locatr import (
    LlmProvider,
    LlmSettings,
    Locatr,
    LocatrCdpSettings,
    LocatrSeleniumSettings,
    PluginType,
)

llm_settings = LlmSettings(
    llm_provider=LlmProvider.OPENAI,
    llm_api_key=os.environ.get("LLM_API_KEY"),
    model_name=os.environ.get("LLM_MODEL_NAME"),
    reranker_api_key=os.environ.get("RERANKER_API_KEY"),
)

locatr_settings_selenium = LocatrSeleniumSettings(
    plugin_type=PluginType.SELENIUM,
    llm_settings=llm_settings,
    selenium_url=os.environ.get("SELENIUM_URL"),
    selenium_session_id="e4c543363b9000a66073db7a39152719",
)

l = Locatr(locatr_settings_selenium, debug=True)

print(lib.get_locatr("H1 element with text Example Domain"))

Here's a quick example on how to use the project:

package main

import (
	"fmt"
	"log"
	"os"
	"time"

	"github.com/playwright-community/playwright-go"
	"github.com/vertexcover-io/locatr"
)

func main() {
	pw, err := playwright.Run()
	if err != nil {
		log.Fatalf("could not start playwright: %v", err)
	}
	defer pw.Stop()

	browser, err := pw.Chromium.Launch(
		playwright.BrowserTypeLaunchOptions{
			Headless: playwright.Bool(false),
		},
	)
	if err != nil {
		log.Fatalf("could not launch browser: %v", err)
	}
	defer browser.Close()

	page, err := browser.NewPage()
	if err != nil {
		log.Fatalf("could not create page: %v", err)
	}
	if _, err := page.Goto("https://hub.docker.com/"); err != nil {
		log.Fatalf("could not navigate to docker hub: %v", err)
	}
	time.Sleep(5 * time.Second) // wait for page to load

	llmClient, err := locatr.NewLlmClient(
		locatr.OpenAI, // (openai | anthropic),
		os.Getenv("LLM_MODEL_NAME"),
		os.Getenv("LLM_API_KEY"),
	)
	if err != nil {
		log.Fatalf("could not create llm client: %v", err)
	}
    options := locatr.BaseLocatrOptions{UseCache: true, LogConfig: locatr.LogConfig{Level: locatr.Silent}, LlmClient: llmClient}

	playWrightlocatr := locatr.NewPlaywrightLocatr(page, options)

	searchBarLocator, err := playWrightlocatr.GetLocatr("Search Docker Hub input field")
	if err != nil {
		log.Fatalf("could not get locator: %v", err)
	}
	fmt.Println(searchBarLocator.InnerHTML())
}

Please check the examples directory for more examples.

LLM Client

The LlmClient is a wrapper around the llm provider you want to use. Supported providers are locatr.OpenAI, locatr.Anthropic. It is optional; if not provided in the options, Locatr will automatically create an LlmClient using environment variables.

The following environment variables will be read to create a default LlmClient:
- LLM_PROVIDER: Defines which provider's LLM should be utilized (openai, anthropic).
- LLM_MODEL: Specifies the model to use
- LLM_API_KEY: The API key required to authenticate with the LLM provider.

To create a new llm client call the locatr.NewLlmClient function.

import (
	"github.com/vertexcover-io/locatr.
	"os"
)

llmClient, err := locatr.NewLlmClient(
	locatr.OpenAI, // Supported providers: "openai" | "anthropic"
	os.Getenv("LLM_MODEL_NAME"),
	os.Getenv("LLM_API_KEY"),
)
options := locatr.BaseLocatrOptions{
	LlmClient: llmClient,
}

Run without creating the llm client..

import (
	"github.com/vertexcover-io/locatr.
	"os"
)

options := locatr.BaseLocatrOptions{
	UseCache: true,
}

Re-ranking Client

ReRankClient is a wrapper around the ranking provider you want to use. Currently, we only support the cohere re-ranker. To create a cohere re-ranker, use the following code:

note: There is no support to create a re-ranking client by default if not provided in BaseLocatrOptions

Only re-ranked HTML chunks with a score greater than 0.9 are sent to the LLM.
The default cohere re-ranking model is rerank-english-v3.0.

import (
	"github.com/vertexcover-io/locatr"
	"os"
)

reRankClient, err := locatr.NewCohereClient(
	os.Getenv("COHERE_API_KEY"),
)
options := locatr.BaseLocatrOptions{
	ReRankClient: reRankClient,
}

Advantages of using re-ranking in Locatr

Using re-ranking reduces the input context sent to the LLM.
Re-ranked chunks will contain only the most relevant HTML chunks, improving the accuracy.
Sending less input context to the LLM reduces response time and lowers the cost per LLM call.

Locatr Options

locatr.BaseLocatrOptions is a struct with multiple fields used to configure caching, logging, and output file paths in locatr.

Fields

CachePath (string):
- Path where the cache will be saved.
- Example: "/path/to/cache/file"
UseCache (bool):
- Default is false. Set to true to enable caching.
LogConfig (LogConfig):
- Configuration for logging behavior.
- Level (LogLevel):
  - Sets the log level. Controls the verbosity of logging.
  - Example: locatr.Info to log errors, warnings, and info messages.
- Writer (Writer):
  - Destination for log output. Implement the Printf function for custom log handling.
ResultsFilePath (string):
- Path to the file where locatr results will be saved.
- If not provided, results will be saved to DEFAULT_LOCATR_RESULTS_FILE.
LlmClient (LlmClientInterface):
- Optional value; if not provided will be created by default (read more about llm client)
ReRankClient (ReRankInterface)
- The ReRankClient you want to use. When this is passed locatr will use the re-ranking client to re-rank the html chunks. (More about re-ranking).

Locatrs

Locatrs are a wrapper around the main plugin (playwright, selenium, cdp).

PlaywrightLocatr

Create an instance of PlayWrightLocatr using :

playWrightLocatr := locatr.NewPlaywrightLocatr(page, llmClient, options)

CdpLocatr

To use Locatr through CDP, we first need to start the browser with a CDP server. This can be achieved by running:

google-chrome --remote-debugging-port=9222

We can pass the same arguments when using Selenium or Playwright:

Selenium:

chrome_options = Options()
chrome_options.add_argument("--remote-debugging-port=9222")

Playwright:

browser = playwright.chromium.launch(headless=False, args=["--remote-debugging-port=9222"])

After starting the browser with CDP, we need the page ID. The page ID is essential to run Locatr scripts on the correct page. This can be achieved in two ways:

Directly getting it from the CDP server

Send a GET request to http://localhost:9222/json.
You will receive the following response:

[ {
"description": "",
"devtoolsFrontendUrl": "/devtools/inspector.html?ws=localhost:9222/devtools/page/215947B924E9C4D232ADE7331FDBEBA6",
"faviconUrl": "https://www.youtube.com/s/desktop/e718aa11/img/logos/favicon_32x32.png",
"id": "215947B924E9C4D232ADE7331FDBEBA6",
"title": "YouTube",
"type": "page",
"url": "https://www.youtube.com/",
"webSocketDebuggerUrl": "ws://localhost:9222/devtools/page/215947B924E9C4D232ADE7331FDBEBA6"
}]

The id field contains the page id.

Get it through playwright:

  const browser = await chromium.launch({ headless: false });
  const context = await browser.newContext();
  const page = await context.newPage();
  const cdpSession = await context.newCDPSession(page);
  const response = await cdpSession.send('Page.getFrameTree');
  const pageId = response.frameTree.frame.id;

Once we have the page ID, we can establish a connection with CDP:

	connectionOpts := locatr.CdpConnectionOptions{
		Port:   9222,
		PageId: "177AE4272FC8BBE48190C697A27942DA",
	}
	connection, err := locatr.CreateCdpConnection(connectionOpts)
	defer connection.Close()

Now we can create the CDP Locatr with:

	cdpLocatr, err := locatr.NewCdpLocatr(connection, options)

Selenium Locatr

Selenium Locatr can be created through two ways:

Through selenium server url:

	seleniumLocatr, err := locatr.NewRemoteConnSeleniumLocatr("http://localhost:4444/wd/hub", driver.SessionID(), options)

note: the path must always be /wd/hub

Directly passing the selenium driver:

	seleniumLocatr, err := locatr.NewSeleniumLocatr(driver, options)

Methods

GetLocatr: Locates an element using a descriptive string and returns a Locator object.

searchBarLocator, err := playWrightLocatr.GetLocatr("Search Docker Hub input field")

Cache

Cache Schema

The cache is stored in JSON format. The schema is as follows:

{
	"Page Full Url" : [
		{
			"locatr_name": "The description of the element you gave",
			"locatrs": [
				"input#search"
			]
		}
	]
}

Cache Management

To remove the cache, delete the file at the path specified in BaseLocatrOptions's CachePath.

Logging

Logging is enabled by default in locatr and it's set to Error log level. Pass the LogConfig value in the BaseLocatrOptions struct.

	options := locatr.BaseLocatrOptions{UseCache: true, LogConfig: locatr.LogConfig{Level: locatr.Debug}}

Available Log Levels

The following log levels are available, in increasing order of priority:

Debug: Logs all messages, info, warn, error.
Info : Logs informational messages, warnings, and errors.
Warning: Logs warnings and errors only.
Error (Default): Logs only error messages.

Locatr Results

Locatr provides a feature to get all the information about each locatr request made (call to GetLocatr function). The result has the following schema.

LocatrDescription (string): Description of the locatr passed to the request.
Url (string): The URL associated with the locatr.
CacheHit (bool): Indicates if the result was retrieved from the cache (true) or freshly generated (false).
Locatr (string): The locatr generated by the operation.
InputTokens (int): Number of input tokens processed by the LLM call.
OutputTokens (int): Number of tokens generated in the output by the LLM call.
TotalTokens (int): Sum of input and output tokens.
LlmErrorMessage (string): The error message from the LLM, if any.
ChatCompletionTimeTaken (int): Time taken for the LLM to complete locatr generation in seconds.
AttemptNo (int): An integer field to indicate the attempt number with re rank.
LocatrRequestInitiatedAt (time.Time): The timestamp when the request was initiated.
LocatrRequestCompletedAt (time.Time): The timestamp when the request was completed.
AllLocatrs ([]string): All the locatrs of each located elements.

Saving Results

Results can be saved to a file specified by locatr.BaseLocatrOptions.ResultsFilePath (locatr_results.json). If no file path is specified, results are written to locatr.DEFAULT_LOCATR_RESULTS_PATH.

To write results to a file: Use the playwrightLocatr.WriteResultsToFile function.

Schema of the json file:

{
    "locatr_description": "",
    "url": "",
    "cache_hit": false,
    "locatr": "",
    "input_tokens": 8399,
    "output_tokens": 22,
    "total_tokens": 8421,
    "llm_error_message": "",
    "llm_locatr_generation_time_taken": 1,
    "attempt_no": 0,
    "request_initiated_at": "",
    "request_completed_at": "",
	"all_locatrs": []
}

To retrieve results as a slice: Use the playwrightLocatr.GetLocatrResults function.

Contributing

We welcome contributions! Please read our CONTRIBUTING.md guide to get started.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.56.2

Nov 11, 2025

0.56.1

Nov 11, 2025

0.56.0

Apr 2, 2025

0.55.0

Mar 28, 2025

0.54.0

Mar 24, 2025

0.53.0

Mar 24, 2025

0.52.0

Mar 24, 2025

0.51.0

Mar 24, 2025

0.50.0

Mar 20, 2025

0.49.0

Mar 5, 2025

0.48.0

Mar 5, 2025

0.47.0

Mar 5, 2025

0.46.0

Mar 5, 2025

0.45.0

Mar 3, 2025

0.44.0

Feb 21, 2025

0.43.0

Feb 21, 2025

0.42.0

Feb 3, 2025

0.41.0

Feb 3, 2025

0.40.0

Feb 3, 2025

0.39.0

Feb 3, 2025

0.38.0

Jan 28, 2025

0.37.0

Jan 28, 2025

0.36.0

Jan 28, 2025

0.35.0

Jan 27, 2025

0.34.0

Jan 27, 2025

0.33.0

Jan 27, 2025

0.32.0

Jan 27, 2025

0.31.0

Jan 8, 2025

0.30.0

Jan 8, 2025

0.29.0

Jan 8, 2025

0.28.0

Jan 8, 2025

0.27.0

Jan 7, 2025

0.26.0

Jan 7, 2025

0.24.0

Jan 6, 2025

0.23.0

Dec 26, 2024

0.22.0

Dec 26, 2024

0.21.0

Dec 26, 2024

0.20.0

Dec 26, 2024

0.19.0

Dec 26, 2024

This version

0.18.0

Dec 26, 2024

0.17.0

Dec 26, 2024

0.16.0

Dec 26, 2024

0.15.0

Dec 26, 2024

0.14.0

Dec 26, 2024

0.13.0

Dec 26, 2024

0.12.0

Dec 26, 2024

0.11.0

Dec 26, 2024

0.10.0

Dec 26, 2024

0.9.0

Dec 26, 2024

0.8.0

Dec 26, 2024

0.7.0

Dec 26, 2024

0.6.0

Dec 26, 2024

0.5.0

Dec 26, 2024

0.4.0

Dec 26, 2024

0.2.0

Dec 26, 2024

0.1.0

Dec 24, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

test_locatr-0.18.0.tar.gz (5.9 MB view details)

Uploaded Dec 26, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

test_locatr-0.18.0-py3-none-any.whl (5.9 MB view details)

Uploaded Dec 26, 2024 Python 3

File details

Details for the file test_locatr-0.18.0.tar.gz.

File metadata

Download URL: test_locatr-0.18.0.tar.gz
Upload date: Dec 26, 2024
Size: 5.9 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.5 CPython/3.12.3 Linux/5.15.167.4-microsoft-standard-WSL2

File hashes

Hashes for test_locatr-0.18.0.tar.gz
Algorithm	Hash digest
SHA256	`8f14968b2bf7bcd5dc12788cbd974be51f7e3f5157b214cff80db72ab02acbc5`
MD5	`5edfdb5a0100ebd2593cd054e3c2bc03`
BLAKE2b-256	`d177b8c5cf44236bc00242a4083f5e55c2d14945af37b60b548cb2596e95187d`

See more details on using hashes here.

File details

Details for the file test_locatr-0.18.0-py3-none-any.whl.

File metadata

Download URL: test_locatr-0.18.0-py3-none-any.whl
Upload date: Dec 26, 2024
Size: 5.9 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.5 CPython/3.12.3 Linux/5.15.167.4-microsoft-standard-WSL2

File hashes

Hashes for test_locatr-0.18.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b9e2c715b5f4113763612ccc0a18a9061b051c3c186904df769132bf338e9dfc`
MD5	`8e509ca9c70494abb1cd395b07656b81`
BLAKE2b-256	`fbcbcb894919b329bb783cc0aaa3a890a887ed7ea5176b15a8d83b3cd9b28bd7`

See more details on using hashes here.

test_locatr 0.18.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Locatr

Overview

Install Locatr with

Golang

Python

Table of Contents

Quick Example

With python

Python example

LLM Client

Re-ranking Client

Locatr Options

Locatrs

PlaywrightLocatr

CdpLocatr

Selenium Locatr

Methods

Cache

Cache Schema

Cache Management

Logging

Available Log Levels

Locatr Results

Contributing

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes