
ATEX = Ad-hoc Test EXecutor

A collection of Python APIs to provision operating systems, collect and execute FMF-style tests, gather and organize their results, and generate reports from those results.

The name comes from an approach (fairly unique within the FMF/TMT ecosystem) that provisions a pool of systems and schedules tests on them the way one would on an ad-hoc pool of thread/process workers: once a worker becomes free, it receives the next test to run.
This is in contrast to pre-splitting a large list of N tests across M workers in N/M chunks, which incurs significant time penalties because test runtimes vary widely.
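
As an illustrative sketch (not the actual atex API - run_test, the test names and the worker count here are made up), the scheduling model maps naturally onto a thread pool whose workers pull tests from a shared queue:

import random
import time
from concurrent.futures import ThreadPoolExecutor, as_completed

def run_test(test):
    # placeholder for "take a free machine, execute one FMF-style test"
    time.sleep(random.uniform(0.1, 2.0))  # runtimes vary widely
    return (test, "pass")

tests = [f"/test/{i}" for i in range(100)]  # large list of N tests

# M workers; whichever worker finishes first picks up the next test,
# so nobody idles while another worker is stuck on a long-running test
with ThreadPoolExecutor(max_workers=10) as pool:
    futures = [pool.submit(run_test, t) for t in tests]
    for future in as_completed(futures):
        test, result = future.result()
        print(test, result)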

Above all, this project is meant to be a toolbox, not a silver-bullet solution. Use its Python APIs to build a CLI tool for your specific use case.
The CLI tool provided here is just for demonstration / testing, not for serious use - we want to avoid huge modular CLIs for Every Possible Scenario. That's the job of the Python API; any CLI should be simple by nature.


THIS PROJECT IS HEAVILY WIP, THINGS WILL MOVE AROUND, CHANGE AND OTHERWISE BREAK. DO NOT USE IT (for now).


License

Unless specified otherwise, any content within this repository is distributed under the GNU GPLv3 license; see the COPYING.txt file for details.

Testing this project

There are some limited sanity tests provided via pytest, although:

  • Some require additional variables (e.g. for Testing Farm) and will ERROR without them.
  • Some take a long time (e.g. Testing Farm) because system provisioning is slow, so install pytest-xdist and run with a large -n.

Currently, the recommended approach is to split the execution:

# synchronously, because podman CLI has concurrency issues
pytest tests/provision/test_podman.py

# in parallel, because provisioning takes a long time
export TESTING_FARM_API_TOKEN=...
export TESTING_FARM_COMPOSE=...
pytest -n 20 tests/provision/test_testing_farm.py

# fast enough for synchronous execution
pytest tests/fmf

Parallelism and cleanup

There are effectively 3 methods of running things in parallel in Python:

  • threading.Thread (and related concurrent.futures classes)
  • multiprocessing.Process (and related concurrent.futures classes)
  • asyncio

and there is no clear winner (in terms of cleanup on SIGTERM or Ctrl-C):

  • Thread has signal handlers only in the main thread and is unable to interrupt any running threads without very ugly workarounds, like a sleep(1) loop in every thread checking some "please exit" variable (see the sketch after this list)
  • Process is too heavyweight and makes sharing native Python objects hard, but it does handle signals in each process individually
  • asyncio handles interruption perfectly (every try/except/finally completes just fine, KeyboardInterrupt is raised in every async context), but async Python is still (as of 3.14) too weird and unsupported
    • asyncio effectively re-implements subprocess with a slightly different API; likewise, asyncio.Transport and its derivatives reimplement socket
    • 3rd party libraries like requests or urllib3 don't support it, so one needs to resort to spawning these in separate threads anyway
    • the same goes for os.* functions and syscalls
    • everything exposed via an API needs to exist in two copies, async and non-async, making it unbearable
    • other stdlib bugs, e.g. "large" reads sometimes returning BlockingIOError
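
For illustration, the "please exit" workaround mentioned for Thread above looks roughly like this (the names are made up; threading.Event stands in for the bare variable so the sleep can wake up early):

import threading

stop = threading.Event()  # the "please exit" variable

def worker():
    while not stop.is_set():
        ...  # do one small chunk of work
        stop.wait(1)  # the ugly part: wake up every second just to re-check

t = threading.Thread(target=worker)
t.start()
try:
    while t.is_alive():
        t.join(0.5)  # join with a timeout keeps the main thread interruptible
except KeyboardInterrupt:
    stop.set()  # the worker may take up to ~1s to notice
    t.join()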

The approach chosen by this project is to use threading.Thread and implement thread safety for the classes and functions that need it.
For example:

import subprocess
import threading
import time

class MachineReserver:
    def __init__(self):
        self.lock = threading.RLock()
        self.job = None   # handle to a job on the external service
        self.proc = None  # the connected ssh subprocess

    def reserve(self, ...):
        try:
            ...
            job = schedule_new_job_on_external_service()
            with self.lock:
                self.job = job
            ...
            while not reserved(self.job):
                time.sleep(60)
            ...
            with self.lock:
            self.proc = subprocess.Popen(["ssh", f"{user}@{host}", ...])
            ...
            return machine
        except Exception:
            self.abort()
            raise

    def abort(self):
        with self.lock:
            if self.job:
                cancel_external_service(self.job)
                self.job = None
            if self.proc:
                self.proc.kill()
                self.proc = None

Here, .reserve() is expected to be called in a long-running thread that provisions a new machine on some external service, waits for it to be installed and reserved, connects an ssh session to it, and returns the machine.

But equally, .abort() can be called from an external thread to clean up any non-pythonic resources (external jobs, processes, temporary files, etc.). At that point we don't care what happens to .reserve(); it will probably fail with some exception, but does no harm.

Here is where daemon=True threads come in handy - we can simply call .abort() from a KeyboardInterrupt (or SIGTERM) handler in the main thread and just exit, automatically killing any leftover threads that are uselessly sleeping.
(Realistically, we might want to spawn new threads to run many .abort()s in parallel, but the main thread can wait for those just fine, as sketched below.)
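
A minimal sketch of that shutdown pattern, reusing MachineReserver from above (the argument-less .reserve() call and the reserver count are illustrative):

import threading

reservers = [MachineReserver() for _ in range(10)]
threads = [threading.Thread(target=r.reserve, daemon=True) for r in reservers]
for t in threads:
    t.start()

try:
    for t in threads:
        while t.is_alive():
            t.join(1)  # join with a timeout so KeyboardInterrupt gets through
except KeyboardInterrupt:
    # run the .abort()s in parallel; these threads do real cleanup,
    # so the main thread waits for them before exiting
    aborters = [threading.Thread(target=r.abort) for r in reservers]
    for a in aborters:
        a.start()
    for a in aborters:
        a.join()
    # exiting now kills any .reserve() threads still uselessly sleeping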

It is not perfect, but it's probably the best Python can do.

Note that races can still occur between a resource being reserved and it being written to self.* where .abort() can free it, so resource de-allocation is not 100% guaranteed - but single-threaded interrupting has the same issue.
Do have fallbacks (e.g. maximum reservation times on the external service).

Also note that .reserve() and .abort() could equally be called by a context manager as __enter__ and __exit__, i.e. by a non-threaded caller running everything in the main thread.
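
For example, a thin wrapper along these lines (hypothetical, not part of the atex API) would give single-threaded callers the same cleanup guarantees:

class ReservedMachine:
    def __init__(self, reserver):
        self.reserver = reserver

    def __enter__(self):
        return self.reserver.reserve()

    def __exit__(self, exc_type, exc, tb):
        self.reserver.abort()

# everything runs in the main thread; a KeyboardInterrupt raised inside
# the with-block still runs __exit__ and therefore .abort()
with ReservedMachine(MachineReserver()) as machine:
    ...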

Unsorted notes

TODO: codestyle from contest

- this is not tmt; the goal is to make a python toolbox *for* easily building
  runcontest-style tools, not to replace those tools with tmt-style CLI syntax

  - the whole point is to make use-case-targeted, easy-to-use tools that don't
    intimidate users with a 1 KB long command line, and runcontest is a nice example

  - TL;DR - use a modular pythonic approach, not a gluetool-style long CLI
