Skip to main content

Cache assertion data to simplify regression testing of complex serializable data

Project description

pytest_cache_assert

Cache assertion data to simplify regression testing of complex serializable data

Installation

poetry add pytest_assert_check --dev

Quick Start

The primary use case of this package is regression testing of large dictionaries. You may have some parameterized test cases where you need to assert that a resulting dictionary is the same, but you don’t want to manually generate the expected fields and values and couple the test case to the source code. Instead you can cache or “record” the expected serialized data structure, check the expected dictionary into version control, then regenerate on changes

This package can minimize test case logic, while improving test thoroughness

This project was heavily inspired by the excellent pytest-recording

Basic Example

You've created a new project called package_a with one file package_a/source_file.py and test tests/test_file.py

"""package_a/source_file.py"""

import sys
from datetime import datetime
from typing import Any, Dict, List, Optional

from beartype import beartype
from pydantic import BaseModel


class User(BaseModel):  # noqa: H601
    """Example from pydantic documentation."""

    id: int  # noqa: A003,VNE003
    name = 'John Doe'
    signup_ts: Optional[datetime] = None
    friends: List[int] = []


@beartype
def create_data(name: str) -> Dict[str, Any]:
    """Arbitrary function that returns a dictionary.

    This demonstration uses pydantic, but any dictionary can be tested!

    """
    return User(id=sys.maxsize, name=name).dict()
"""tests/test_file.py"""

import pytest

from package_a.source_file import create_data


@pytest.mark.parametrize('name', ['Test Name 1', 'Test Name 2'])
def test_create_data(name, assert_against_cache):
    """Basic test of create_data()."""
    result = create_data(name=name)

    # One could manually create the expected dictionary
    cache = {'id': 9223372036854775807, 'signup_ts': None, 'friends': [], 'name': name}
    assert result == cache
    # ----------------------------------------------------------------------------------
    # Or utilize the pytest_cache_assert fixture to compare against the last cached version
    assert_against_cache(result)

pytest_cache_assert will automatically create: tests/cache-assert/source_file/test_file/test_create_data-[Test Name 1].json (and test_create_data[Test Name 2].json) for each of the parameters when first run by caching the result. Below is the example for test_create_data-[Test Name 1].json

{
  "_info": [
    {
      "func_args": {
        "name": "Test Name 1"
      },
      "test_file": "test_readme.py",
      "test_name": "test_create_data"
    }
  ],
  "_json": {
    "friends": [],
    "id": 9223372036854775807,
    "name": "Test Name 1",
    "signup_ts": null
  }
}

The cached JSON files must be checked into version control and if needed, can be manually edited or deleted so that they will be regenerated when the test suite is next run

More Examples

In your cached dictionary, you may have variable values with more complex logic to verify, such as dates, UUIDs, etc. These can be selectively ignored, matched-if-null, or some other user-specified check:

"""tests/test_main.py."""

from uuid import uuid4
from datetime import datetime, timedelta

from pytest_cache_assert import KeyRule, check_suppress, check_type, Wildcards


def test_assert_against_cache_key_rules(assert_against_cache):
    """Demonstrate use of `key_rules`."""
    # NOTE: all data passed to assert_against_cache must be JSON Serializable (i.e. from Marshmallow or Pydantic, etc.)
    now = datetime.now()
    cached_data = {'date': str(now), {'nested': {'uuid': str(uuid4())}}, {'ignored': {'a': 1, 'b': 2}}}
    test_data = {'date': str(now + timedelta(hours=3)), {'nested': {'uuid': str(uuid4())}}}
    #
    key_rules = [
      # Suppress keys 'ignored.a' and 'ignored.b' with the SINGLE wildcard
      # If there were a 'ignored.c.something', the Wildcard.RECURSIVE could be used
      KeyRule(pattern=['ignored', Wildcards.SINGLE], func=check_suppress),
      # Ensure that values are of the same type. This is useful if you expect datetime objects and want to raise
      #   if one value is Null (or you have a number stored as a string or UUID, etc.)
      KeyRule(pattern=['date'], func=check_type),
      KeyRule(pattern=['nested', 'uuid'], func=check_type),
      # The KeyRule function can be replaced with any function that accepts keyword arguments 'old' and 'new'
    ]
    #
    # Run twice to ensure that the initial cache file exists and that the key_rules suppress any errors
    assert_against_cache(cached_data, key_rules=key_rules)
    assert_against_cache(test_data, key_rules=key_rules)

    # Verify that without key rules, an AssertionError is raised
    with pytest.raises(AssertionError, match=DEF_ERROR_MESSAGE):
        assert_against_cache(test_data)  # act

Or you may want to write your own custom checks against the serialized data, such as with Cerberus or another library. This is possible with the validator callable. The default validator is a no-op and that may be replaced with any custom function that raises an Exception on error.

"""tests/test_main.py."""

import re

import pytest
from beartype import beartype
from cerberus import Validator
from cerberus.schema import SchemaError


@beartype
def cerberus_validator(test_data: TEST_DATA_TYPE) -> None:
    """Cerberus custom validator example."""
    validator = Validator({'result': {'type': 'int'}})
    assert validator.validate(test_data)


def test_assert_against_cache_validator(assert_against_cache):
    """Test the validator."""
    expected = re.escape("{'result': [{'type': ['Unsupported types: int']}]}")

    with pytest.raises(SchemaError, match=expected):
        assert_against_cache({'result': False}, validator=cerberus_validator)  # act

Even More Example

For more examples, see Scripts or Tests

Global Configuration Options

  • DEF_CACHE_DIR_KEY: set a custom relative path from the tests/ directory. Default is assert-cache/
import pytest

from pytest_cache_assert import DEF_CACHE_DIR_KEY


@pytest.fixture(scope='module')
def cache_assert_config():
    return {
        DEF_CACHE_DIR_KEY: 'custom/cache/dir',
    }

Planned Global Configuration Options

These are ideas for future options that are not currently implemented, but could be if there is enough interest:

  • PLANNED: Consider a record mode that will always-write to regenerate the cache while working on development
    • The other edge case where a mode might be helpful is when file names or test names are changed and the cache metadata has too many duplicates and needs to be refreshed
  • PLANNED: Provide CLI arguments like pytest-recording (request.config.getoption("--record-mode") or "none") for one-time changes to configuration
  • PLANNED: Consider filters to prevent secrets from being cached: filter_headers=[['authorization', 'id'], ['authorization', 'cookies']]

Roadmap

See the Open Issues and Milestones for current status and ./docs/CODE_TAG_SUMMARY.md for annotations in the source code.

For release history, see the ./docs/CHANGELOG.md

Contributing

See the Developer Guide, Contribution Guidelines, etc

License

LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytest_cache_assert-1.0.0rc0.tar.gz (16.3 kB view hashes)

Uploaded Source

Built Distribution

pytest_cache_assert-1.0.0rc0-py3-none-any.whl (15.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page