A library for Visual Document Testing

Project description

robotframework-doctestlibrary

Robot Framework DocTest library.
Simple Automated Visual Document Testing.

See keyword documentation for

*** Settings ***
Library    DocTest.VisualTest

*** Test Cases ***
Compare two Images and highlight differences
    Compare Images    Reference.jpg    Candidate.jpg

Installation instructions

pip install --upgrade robotframework-doctestlibrary

Only Python 3 is supported. Tested with Python 3.8, 3.11 and 3.12.

Install robotframework-doctestlibrary

Installation via `pip` from PyPI (recommended)

pip install --upgrade robotframework-doctestlibrary

Installation via `pip` from GitHub

pip install git+https://github.com/manykarim/robotframework-doctestlibrary.git

Alternatively, clone the repository and install it in editable mode:

git clone https://github.com/manykarim/robotframework-doctestlibrary.git
cd robotframework-doctestlibrary
pip install -e .

Install dependencies

Install the Tesseract, Ghostscript, GhostPCL and ImageMagick binaries and the barcode libraries (libdmtx, zbar) on your system.
Hint: Since 0.2.0, Ghostscript, GhostPCL and ImageMagick are only needed for rendering .ps and .pcl files.
Rendering and content parsing of .pdf files is done via MuPDF.
In the future there might be a separate PyPI package for .pcl and .ps files to get rid of those dependencies.

Linux

apt-get install imagemagick tesseract-ocr ghostscript libdmtx0b libzbar0

Windows

Some special instructions for Windows

Rename executable for GhostPCL to pcl6.exe (only needed for `.pcl` support)

The executable for GhostPCL gpcl6win64.exe needs to be renamed to pcl6.exe

Otherwise it will not be possible to render .pcl files successfully for visual comparison.

Add Tesseract, Ghostscript and ImageMagick to the system path in Windows (only needed for OCR, `.pcl` and `.ps` support)

C:\Program Files\ImageMagick-7.0.10-Q16-HDRI
C:\Program Files\Tesseract-OCR
C:\Program Files\gs\gs9.53.1\bin
C:\Program Files\gs\ghostpcl-9.53.1-win64

(The folder names and versions on your system might be different)

That means: When you open the CMD shell you can run the commands

magick.exe
tesseract.exe
gswin64.exe
pcl6.exe

successfully from any folder/location
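The PATH setup can also be checked from Python. This is a minimal sketch using only the standard library; the helper name `find_tools` is made up for this illustration and is not part of the library, and the executable names are simply the ones listed above.

```python
import shutil

# Executable names as listed above; adjust them if your installation differs.
REQUIRED_TOOLS = ["magick", "tesseract", "gswin64", "pcl6"]


def find_tools(names):
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    for name, path in find_tools(REQUIRED_TOOLS).items():
        print(f"{name}: {path or 'NOT FOUND on PATH'}")
```

Any tool reported as not found still needs its folder added to the system path (or, for GhostPCL, the rename described above).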

Windows error message regarding pylibdmtx

How to solve ImportError for pylibdmtx

If you see an ugly ImportError when importing pylibdmtx on Windows you will most likely need the Visual C++ Redistributable Packages for Visual Studio 2013. Install vcredist_x64.exe if using 64-bit Python, vcredist_x86.exe if using 32-bit Python.

ImageMagick

The library might return the error File could not be converted by ImageMagick to OpenCV Image: <path to the file> when comparing PDF files. This is due to ImageMagick permissions. Verify this as follows with the sample.pdf in the testdata directory:

convert sample.pdf sample.jpg 
convert-im6.q16: attempt to perform an operation not allowed by the security policy

The solution is to copy the policy.xml from the repository to the ImageMagick installation directory.

Docker

You can also use the Docker images or build your own Docker image:

docker build -t robotframework-doctest .

Afterwards you can, e.g., start the container and run the provided examples like this:

Windows
- docker run -t -v "%cd%":/opt/test -w /opt/test robotframework-doctest robot atest/Compare.robot
Linux
- docker run -t -v $PWD:/opt/test -w /opt/test robotframework-doctest robot atest/Compare.robot

Gitpod.io

Try out the library using Gitpod

Examples

Have a look at

for more examples.

Testing with Robot Framework

*** Settings ***
Library    DocTest.VisualTest

*** Test Cases ***
Compare two Images and highlight differences
    Compare Images    Reference.jpg    Candidate.jpg

Use masks/placeholders to exclude parts from visual comparison

*** Settings ***
Library    DocTest.VisualTest

*** Test Cases ***
Compare two Images and ignore parts by using masks
    Compare Images    Reference.jpg    Candidate.jpg    placeholder_file=masks.json

Compare two PDF Documents and ignore parts by using masks
    Compare Images    Reference.pdf    Candidate.pdf    placeholder_file=masks.json

Compare two Farm images with date pattern
    Compare Images    Reference.jpg    Candidate.jpg    placeholder_file=testdata/pattern_mask.json

Compare two Farm images with area mask as list
    ${top_mask}    Create Dictionary    page=1    type=area    location=top    percent=10
    ${bottom_mask}    Create Dictionary    page=all    type=area    location=bottom    percent=10
    ${masks}    Create List    ${top_mask}    ${bottom_mask}
    Compare Images    Reference.jpg    Candidate.jpg    mask=${masks}

Compare two Farm images with area mask as string
    Compare Images    Reference.jpg    Candidate.jpg    mask=top:10;bottom:10
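The two area-mask test cases above suggest that the string form is a shorthand for the list form. The sketch below only illustrates that apparent correspondence; it is not the library's parser. The function name `parse_area_mask` is hypothetical, and applying the mask to all pages is an assumption, since the string form does not name a page.

```python
def parse_area_mask(mask_string, page="all"):
    """Turn a string like 'top:10;bottom:10' into a list of area-mask dicts."""
    masks = []
    for part in mask_string.split(";"):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing ';'
        location, percent = part.split(":")
        masks.append({
            "page": page,  # assumption: the string form does not name a page
            "type": "area",
            "location": location.strip(),
            "percent": int(percent),
        })
    return masks


if __name__ == "__main__":
    print(parse_area_mask("top:10;bottom:10"))
```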

Different Mask Types to Ignore Parts When Comparing

Areas, Coordinates, Text Patterns

[
    {
        "page": "all",
        "name": "Date Pattern",
        "type": "pattern",
        "pattern": ".*[0-9]{2}-[a-zA-Z]{3}-[0-9]{4}.*"
    },
    {
        "page": "1",
        "name": "Top Border",
        "type": "area",
        "location": "top",
        "percent": 5
    },
    {
        "page": "1",
        "name": "Left Border",
        "type": "area",
        "location": "left",
        "percent": 5
    },
    {
        "page": 1,
        "name": "Top Rectangle",
        "type": "coordinates",
        "x": 0,
        "y": 0,
        "height": 10,
        "width": 210,
        "unit": "mm"
    }
]
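Before using a pattern mask, it can help to check which text the regular expression would actually match. The following sketch uses only the Python standard library; the helper `pattern_matches` is made up for this illustration, and using `re.match` on a single line of text is an assumption about how such a pattern is applied, not a description of the library's internals. It also shows that a mask list can be serialized with `json` and read back unchanged.

```python
import json
import re

# Two of the mask definitions shown above.
masks = [
    {"page": "all", "name": "Date Pattern", "type": "pattern",
     "pattern": ".*[0-9]{2}-[a-zA-Z]{3}-[0-9]{4}.*"},
    {"page": "1", "name": "Top Border", "type": "area",
     "location": "top", "percent": 5},
]


def pattern_matches(pattern, text):
    """Return True if the pattern matches the given line of text."""
    return re.match(pattern, text) is not None


# Serialize the masks the same way they would be stored in masks.json.
masks_json = json.dumps(masks, indent=4)

if __name__ == "__main__":
    date_pattern = masks[0]["pattern"]
    for line in ["Printed on 12-Jan-2024 at noon", "Printed on 2024-01-12"]:
        print(line, "->", pattern_matches(date_pattern, line))
```

The date pattern expects a day, a three-letter month and a four-digit year, so a date such as 2024-01-12 would not be covered by this mask.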

Accept visual differences by checking move distance or text content

*** Settings ***
Library    DocTest.VisualTest

*** Test Cases ***
Accept if parts are moved up to 20 pixels by pure visual check
    Compare Images    Reference.jpg    Candidate.jpg    move_tolerance=20

Accept if parts are moved up to 20 pixels by reading PDF Data
    Compare Images    Reference.pdf    Candidate.pdf    move_tolerance=20    get_pdf_content=${true}

Accept differences if text content is the same via OCR
    Compare Images    Reference.jpg    Candidate.jpg    check_text_content=${true}

Accept differences if text content is the same from PDF Data
    Compare Images    Reference.pdf    Candidate.pdf    check_text_content=${true}    get_pdf_content=${true}

Different options to detect moved parts/objects

*** Settings ***
Library    DocTest.VisualTest   movement_detection=orb

*** Test Cases ***
Accept if parts are moved up to 20 pixels by pure visual check
    Compare Images    Reference.jpg    Candidate.jpg    move_tolerance=20

*** Settings ***
Library    DocTest.VisualTest   movement_detection=template

*** Test Cases ***
Accept if parts are moved up to 20 pixels by pure visual check
    Compare Images    Reference.jpg    Candidate.jpg    move_tolerance=20

*** Settings ***
Library    DocTest.VisualTest   movement_detection=classic

*** Test Cases ***
Accept if parts are moved up to 20 pixels by pure visual check
    Compare Images    Reference.jpg    Candidate.jpg    move_tolerance=20

Options for taking additional screenshots, screenshot format and render resolution

Take additional screenshots of reference and candidate file.

*** Settings ***
Library    DocTest.VisualTest   take_screenshots=${true}    screenshot_format=png

Take diff screenshots to highlight differences

*** Settings ***
Library    DocTest.VisualTest   show_diff=${true}    DPI=300
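The render resolution matters when thinking about sizes in millimetres, such as the coordinates mask with "unit": "mm" shown earlier. The sketch below applies the standard conversion of 25.4 mm per inch; the function name `mm_to_pixels` is hypothetical, and how the library itself rounds values is not stated here, so treat the results as approximate.

```python
def mm_to_pixels(mm, dpi):
    """Convert millimetres to pixels at the given resolution (25.4 mm per inch)."""
    return round(mm / 25.4 * dpi)


if __name__ == "__main__":
    # An A4-wide strip of 210 mm x 10 mm rendered at 300 DPI.
    print(mm_to_pixels(210, 300), "x", mm_to_pixels(10, 300), "pixels")
```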

Experimental usage of OpenCV EAST text detection to improve OCR

*** Settings ***
Library    DocTest.VisualTest

*** Test Cases ***
Compare two Farm images with date pattern and east detection
    Compare Images    Reference.jpg    Candidate.jpg    placeholder_file=masks.json    ocr_engine=east

Check content of PDF files

*** Settings ***
Library    DocTest.PdfTest

*** Test Cases ***
Check if list of strings exists in PDF File
    @{strings}=    Create List    First String    Second String
    PDF Should Contain Strings    ${strings}    Candidate.pdf
    
Compare two PDF Files and only check text content
    Compare Pdf Documents    Reference.pdf    Candidate.pdf    compare=text

Compare two PDF Files and only check text content and metadata
    Compare Pdf Documents    Reference.pdf    Candidate.pdf    compare=text,metadata

Compare two PDF Files and check all possible content
    Compare Pdf Documents    Reference.pdf    Candidate.pdf

Ignore Watermarks for Visual Comparisons

Store the watermark in a separate B/W image or PDF.
Watermark area needs to be filled with black color.
Watermark content will be subtracted from Visual Comparison result.
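The idea of subtracting the watermark from the comparison result can be pictured with small boolean grids. This is a conceptual sketch only, not the library's implementation, which works on rendered images; the function names `subtract_watermark` and `has_differences` are made up for this illustration.

```python
def subtract_watermark(diff_mask, watermark_mask):
    """Keep only differences that fall outside the watermark area.

    Both arguments are 2D lists of booleans with the same shape:
    diff_mask[y][x] is True where reference and candidate differ,
    watermark_mask[y][x] is True where the watermark image is black.
    """
    return [
        [d and not w for d, w in zip(diff_row, wm_row)]
        for diff_row, wm_row in zip(diff_mask, watermark_mask)
    ]


def has_differences(mask):
    """Return True if any cell of the mask is still marked as different."""
    return any(any(row) for row in mask)


if __name__ == "__main__":
    diff = [[False, True, True], [False, False, False]]
    watermark = [[False, True, True], [False, False, False]]
    remaining = subtract_watermark(diff, watermark)
    print("differences left:", has_differences(remaining))
```

In this picture, a difference that lies entirely inside the black watermark area disappears, while a difference anywhere else is still reported.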

*** Settings ***
Library    DocTest.VisualTest

*** Test Cases ***
Compare two Images and ignore jpg watermark
    Compare Images    Reference.jpg    Candidate.jpg    watermark_file=Watermark.jpg

Compare two Images and ignore pdf watermark
    Compare Images    Reference.pdf    Candidate.pdf    watermark_file=Watermark.pdf

Compare two Images and ignore watermark folder
    Compare Images    Reference.pdf    Candidate.pdf    watermark_file=${CURDIR}${/}watermarks

Watermarks can also be passed on library import. This setting then applies to all test cases in the test suite.

*** Settings ***
Library    DocTest.VisualTest   watermark_file=${CURDIR}${/}watermarks

*** Test Cases ***
Compare two Images and ignore watermarks
    Compare Images    Reference.jpg    Candidate.jpg

Get Text From Documents or Images

*** Settings ***
Library    DocTest.VisualTest
Library    Collections

*** Test Cases ***
Get Text Content And Compare
    ${text}    Get Text From Document    Reference.pdf
    List Should Contain Value    ${text}    Test String

Get Barcodes From Documents or Images

*** Settings ***
Library    DocTest.VisualTest
Library    Collections

*** Test Cases ***
Get Barcodes And Compare
    ${barcodes}    Get Barcodes From Document    reference.jpg
    List Should Contain Value    ${barcodes}    123456789

Using pabot to run tests in parallel

Document Testing can be run in parallel using pabot.
However, you need to pass the additional arguments --artifacts and --artifactsinsubfolders to the pabot command, to move the screenshots to the correct subfolder.
Otherwise the screenshots will not be visible in the log.html

pabot --testlevelsplit --processes 8 --artifacts png,jpg,pdf,xml --artifactsinsubfolders /path/to/your/tests/
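If the pabot call is assembled from a script, for example in a CI job, the arguments can be built as a list. This is a minimal sketch; the function name `build_pabot_command` and its parameters are hypothetical, while the flags are the ones from the command above.

```python
import shlex


def build_pabot_command(test_path, processes=8,
                        artifacts=("png", "jpg", "pdf", "xml")):
    """Build the pabot argument list with the artifact options set."""
    return [
        "pabot",
        "--testlevelsplit",
        "--processes", str(processes),
        "--artifacts", ",".join(artifacts),
        "--artifactsinsubfolders",
        test_path,
    ]


command = build_pabot_command("/path/to/your/tests/")

if __name__ == "__main__":
    # Print the command line; pass the list to subprocess.run() to execute it.
    print(shlex.join(command))
```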

Visual Testing of Web Applications

I experimented a bit and tried to use this library for Visual Testing of Web Applications.
Please have a look at this pilot example here

Development

Feel free to create issues or pull requests.
I'm always happy to receive feedback.

Core team

In order of appearance.

Many Kasiriha
April Wang

Contributors

This project is community driven and becomes a reality only through the work of all the people who contribute.

Release history

This version

0.22.0

Jan 26, 2024

0.21.0

Jan 25, 2024

0.20.0

Nov 22, 2023

0.19.0

Oct 2, 2023

0.18.1

Aug 24, 2023

0.18.0

Aug 2, 2023

0.16.0

May 25, 2023

0.15.0

May 24, 2023

0.14.0

May 24, 2023

0.14.0.dev0 pre-release

May 10, 2023

0.13.0

Apr 12, 2023

0.12.1

Apr 6, 2023

0.12.0

Apr 6, 2023

0.11.0

Apr 4, 2023

0.10.1

Mar 28, 2023

0.10.0

Mar 23, 2023

0.9.1

Mar 9, 2023

0.9.0

Feb 9, 2023

0.8.1

Jan 2, 2023

0.8.0

Oct 11, 2022

0.7.0

Oct 9, 2022

0.6.0

Oct 9, 2022

0.5.0

Oct 5, 2022

0.4.2

Sep 24, 2022

0.4.1

Sep 22, 2022

0.4.0 yanked

Sep 22, 2022

Reason this release was yanked:

Included eastModel and other files by mistake

0.3.1

Jul 26, 2022

0.3.1a0 pre-release

Jul 26, 2022

0.2.0.20220325161430

Mar 25, 2022

0.2.0.20220325161421

Mar 25, 2022

0.2.0.20211223210536

Dec 23, 2021

0.2.0.20211223210530

Dec 23, 2021

0.2.0.20211029113431

Oct 29, 2021

0.2.0.20211029113425

Oct 29, 2021

0.2.0.20211026095904

Oct 26, 2021

0.2.0.20211026095857

Oct 26, 2021

0.2.0.20211025234825

Oct 25, 2021

0.2.0.20211025234818

Oct 25, 2021

0.2.0.20211025185729

Oct 25, 2021

0.2.0.20211025185722

Oct 25, 2021

0.2.0.dev20211025125143 pre-release

Oct 25, 2021

0.2.0.dev20211025125135 pre-release

Oct 25, 2021

0.2.0.dev20211025120600 pre-release

Oct 25, 2021

0.2.0.dev20211025120554 pre-release

Oct 25, 2021

0.2.0.dev20211008192948 pre-release

Oct 8, 2021

0.2.0.dev20211008192942 pre-release

Oct 8, 2021

0.1.2

Mar 10, 2021

0.1.1

Mar 9, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robotframework_doctestlibrary-0.22.0.tar.gz (41.4 kB)

Uploaded Jan 26, 2024 Source

Built Distribution

robotframework_doctestlibrary-0.22.0-py3-none-any.whl (43.6 kB)

Uploaded Jan 26, 2024 Python 3

Hashes for robotframework_doctestlibrary-0.22.0.tar.gz

Algorithm	Hash digest
SHA256	`474893e8299316955da54f96eb33a1499bf1f428068e0edf65b1e7dc81809565`
MD5	`963917b92dab4d5017f32f9501270120`
BLAKE2b-256	`d56c89f1c87bb84310d0c27a941d12677bcaf4dfb32212a3f1acfc03011e45d7`

Hashes for robotframework_doctestlibrary-0.22.0-py3-none-any.whl

Algorithm	Hash digest
SHA256	`1032a3aa6ec0ca8807edd6d53a1b28f2f9a1380aadc5d1491e42b68c5ed0c53b`
MD5	`3cc2b1a716dfb4d82820853525297ead`
BLAKE2b-256	`b41d6c44bc2a7901c29fe1e1297af0c6ab0642dd23ec2a314e94a438dc09f6f7`
