Skip to main content

AI Vision library for Robot Framework

Project description

"Buy Me A Coffee"

Robot Framework AI Vision Library

AI VIsion Library for Robot Framework that verifies UI/screenshots (including template “look & feel”) by sending instructions plus one or more images to an OpenAI-compatible API (Ollama, OpenAI, Perplexity, Gemini, etc.).

The main keyword (Verify That) expects the model to return a strict RESULT: / EXPLANATION: format and will fail the test if the result is not pass.

Features

  • Visual assertions on one or more screenshots using natural-language instructions.
  • Template comparison keyword to validate “actual vs expected” look & feel (optionally creates a side-by-side image).
  • Image utilities built on Pillow: open/convert, watermark, combine images, auto-generate names, save into Robot Framework output directory.
  • Works with multiple providers via the openai Python client and OpenAI-compatible endpoints (base_url).

Installation

Install from PyPI (once published):

pip install -U robotframework-aivision

Runtime dependencies include Robot Framework, Pillow, and the openai Python client.

Configuration

Import the library in Robot Framework and choose a provider using platform plus optional overrides (base_url, api_key, model, image_detail).

Robot Framework import examples

Default (Ollama-like local setup):

*** Settings ***
Library  AIVision

OpenAI (API key required):

*** Settings ***
Library  AIVision
... platform=OpenAI
... api_key=%{OPENAI_API_KEY}
... model=gpt-5.2

Perplexity:

*** Settings ***
Library  AIVision
... platform=Perplexity
... api_key=%{PPLX_API_KEY}
... model=sonar-pro

Gemini (OpenAI-compatible endpoint):

*** Settings ***
Library  AIVision
... platform=Gemini
... api_key=%{GEMINI_API_KEY}
... model=gemini-2.5-flash

Supported platforms (defaults)

The library defines these platform presets (model and base_url) which you can override via import arguments.

Platform Default base_url Default model API key
Ollama http://localhost:11434/v1 qwen3-coder:480b-cloud Not required
DockerModel http://localhost:12434/engines/v1 ai/qwen3-vl:8B-Q8_K_XL Not required.
OpenAI https://api.openai.com/v1 gpt-5.2 Required.
Perplexity https://api.perplexity.ai sonar-pro Required.
Gemini https://generativelanguage.googleapis.com/v1beta/openai/ gemini-2.5-flash Required.
Manual None None Required.

Keywords

All keywords below are implemented in AIVision and are available after importing the library.

Keyword Purpose
Verify That Send one or more screenshots + instructions to the model, parse the RESULT and raise AssertionError on failure.
Verify Screenshot Matches Look And Feel Template Compare a screenshot against a reference template with a built-in instruction set; optional combined image creation.
Open Image Open an image (and optionally convert mode, default RGB).
Save Image Save a PIL image to a path (defaults to RF output directory) with optional watermark.
Generate Image Name Create a unique timestamp-based filename with prefix/extension.
Combine Images On Paths Side By Side Combine two image files side-by-side (optionally watermark) and optionally save.
Combine Images Side By Side Combine two in-memory PIL images side-by-side (optionally watermark).
Add Watermark To Image Add watermark text using the included font file.

Usage examples

Simple visual assertion

*** Settings ***
Library  AIVision  platform=Ollama

*** Test Cases ***
Login button is correct
   Verify That  ${CURDIR}/screens/login.png  Login button is visible and labeled as 'Sign In'

Compare screenshot to design template

*** Settings ***
Library  AIVision

*** Test Cases ***
Home page matches template
   Verify Screenshot Matches Look And Feel Template
   ...  ${CURDIR}/screens/home_actual.png
   ...  ${CURDIR}/templates/home_expected.png

Override template instructions

*** Settings ***
Library  AIVision

*** Test Cases ***
Home page matches template - custom rules
   Verify Screenshot Matches Look And Feel Template
   ...  ${CURDIR}/screens/home_actual.png
   ...  ${CURDIR}/templates/home_expected.png
   ...  override_instructions=Verify layout, spacing, typography, and brand colors match the template exactly.

Version history

0.2.0a1, 2026-02-02 -- Alpha1 version 0.2.0, 2026-01-29 -- AI System Prompt is configurable 0.1.0, 2025-12-19 -- Additional GenAI Providers added 0.0.1, 2024-05-11 -- Initial version

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robotframework_aivision-0.2.0a1.tar.gz (92.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

robotframework_aivision-0.2.0a1-py3-none-any.whl (95.9 kB view details)

Uploaded Python 3

File details

Details for the file robotframework_aivision-0.2.0a1.tar.gz.

File metadata

File hashes

Hashes for robotframework_aivision-0.2.0a1.tar.gz
Algorithm Hash digest
SHA256 ab2aaf60cd3e684d66d624de034472897eaa891aa557bba1fa94ac887cc20ea8
MD5 dd6ceab19bc352c413e8a4f0710f417f
BLAKE2b-256 fedb380d2f951ebb0ae698887b25eeea60e8ee21c45e45e1ccf750635a69f18a

See more details on using hashes here.

Provenance

The following attestation bundles were made for robotframework_aivision-0.2.0a1.tar.gz:

Publisher: python-publish.yml on robco/robotframework-aivision

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file robotframework_aivision-0.2.0a1-py3-none-any.whl.

File metadata

File hashes

Hashes for robotframework_aivision-0.2.0a1-py3-none-any.whl
Algorithm Hash digest
SHA256 3d292e13b4d399b22567671aaa42d390a12dad5616b36747c0617ad1a6d34e07
MD5 6d39cab880b7ea5ad06803483c348549
BLAKE2b-256 32bd88ecc3967eda9eec18b1373f8875c7856bac850e5ca86985f37d704be39b

See more details on using hashes here.

Provenance

The following attestation bundles were made for robotframework_aivision-0.2.0a1-py3-none-any.whl:

Publisher: python-publish.yml on robco/robotframework-aivision

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page