Skip to main content

Get HTML/XML elements css/xpath selectors using natural language.

Project description

Locatr

Locatr package helps you to find HTML locators on a webpage using prompts and llms.

Overview

  • LLM based HTML element css path finder.
  • Re-rank support for improved accuracy.
  • Supports playwright, selenium, cdp.
  • Uses cache to reduce calls to llm apis.

Example:

print(locatr.get_locatr("Search input bar in the page"))
# output: 'html > div > input'

For more examples check the examples/python folder.

Install locatr with

pip install locatr

Table of Contents

Quick Example

# example assumes that there is already a page opened in the selenium session.
import os

from locatr import (
    LlmProvider,
    LlmSettings,
    Locatr,
    LocatrCdpSettings,
    LocatrSeleniumSettings,
    PluginType,
)

llm_settings = LlmSettings(
    llm_provider=LlmProvider.OPENAI,
    llm_api_key=os.environ.get("LLM_API_KEY"),
    model_name=os.environ.get("LLM_MODEL_NAME"),
    reranker_api_key=os.environ.get("COHERE_API_KEY"),
)

locatr_settings_selenium = LocatrSeleniumSettings(
    llm_settings=llm_settings,
    selenium_url=os.environ.get("SELENIUM_URL"), # url must end with `/wd/hub`
    selenium_session_id="e4c543363b9000a66073db7a39152719",
)

selenium_locatr = Locatr(locatr_settings_selenium, debug=True)

print(selenium_locatr.get_locatr("H1 element with text Example Domain"))

Locatr Settings

Locatr settings expects the following fields:

Cache Settings.

  • cache_path -> The path to the cache file. This file will be reused later to save llm api requests. If nothing is provided then it is stored in .locatr.cache.
  • use_cache -> Weather to use cache or not.

For cache schema see: link

Llm settings.

Contains all the settings required for llm and re-ranking.

from locatr import LlmSettings

It expects the following values:

  1. llm_provider -> The llm provider you want to use. Options are locatr.LlmProvider.OPENAI and locatr.LlmProvider.ANTHROPIC
  2. llm_api_key -> The provider's api key value.
  3. model_name -> Specify which llm model you want to use.
  4. reranker_api_key -> Api key for cohere reranker. It is optional if not provided reranking will not be used.

Example:

from locatr import LlmSettings

llm_settings = LlmSettings(
    llm_provider=LlmProvider.OPENAI,
    llm_api_key=os.environ.get("LLM_API_KEY"),
    model_name=os.environ.get("LLM_MODEL_NAME"),
    reranker_api_key=os.environ.get("COHERE_API_KEY"),
)

Note: If values are not provided in the settings then they will be read from the following env variables.

  • LLM_PROVIDER
  • LLM_MODEL
  • LLM_API_KEY
  • COHERE_API_KEY

Locatr settings is bound with the type of plugin you want to use (cdp/selenium).

To create settings for cdp use (use with playwright):

from locatr import LocatrCdpSettings

# .... create llm settings

locatr_setting_cdp = LocatrCdpSettings(
    llm_settings=llm_settings,
    cdp_url="http://localhost:9222", # You can get this port by passing the following argument to chromium based browsers: `--remote-debugging-port=9222`
)

To create settings for selenium use:

from locatr import LocatrSeleniumSettings

locatr_settings_selenium = LocatrSeleniumSettings(
    llm_settings=llm_settings,
    selenium_url=os.environ.get("SELENIUM_URL"), # url must end with `/wd/hub`
    selenium_session_id="e4c543363b9000a66073db7a39152719",
)

Get locatr

To get locatr string we need to import the Locatr class and pass the settings to it.

from locatr import Locatr

# ... create settings 

l = Locatr(locatr_settings_selenium)

By default, the Locatr class operates without logging. However, if you'd like to view the Locatr server logs for debugging purposes, you can enable the debug parameter by passing True as the second argument during initialization.

l.get_locatr("red 'yes button' in the form")

You can also get locatrs asynchronously. Just call get_locatr_async.

await l.get_locatr_async("red 'yes button' in the form")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

test_locatr-0.39.0.tar.gz (7.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

test_locatr-0.39.0-py3-none-any.whl (7.0 MB view details)

Uploaded Python 3

File details

Details for the file test_locatr-0.39.0.tar.gz.

File metadata

  • Download URL: test_locatr-0.39.0.tar.gz
  • Upload date:
  • Size: 7.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.24

File hashes

Hashes for test_locatr-0.39.0.tar.gz
Algorithm Hash digest
SHA256 ffed1259d9c403eef22325cc977ff27975d3030c64b667c9e5aaaadaa7be3d88
MD5 8d2e331a39330fd2e64f7df792065ed6
BLAKE2b-256 d1215ad19db64fd1a46dcd986c9f9896642cb2a8b52316c825e324fd0caceb38

See more details on using hashes here.

File details

Details for the file test_locatr-0.39.0-py3-none-any.whl.

File metadata

File hashes

Hashes for test_locatr-0.39.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bfb79aee795e8655ee287678a0d39d01ba0567bbc6a1610169b49a8ed6b47b87
MD5 fc8016efaaf532f0e0bac590b6b292d2
BLAKE2b-256 26a865b755a3de4daf28adf911a0f0f0d3033a700b2938edf2dfe32fe808ebba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page