Skip to main content

Simplify the testing of SQL data models and queries by allowing users to mock input data and create tests for various scenarios. It provides a consistent and convenient way to test the execution of your query without the need to process a massive amount of data.

Project description

SQL Mock: Python Library for Mocking SQL Queries with Dictionary Inputs

License: MIT

The primary purpose of this library is to simplify the testing of SQL data models and queries by allowing users to mock input data and create tests for various scenarios. It provides a consistent and convenient way to test the execution of your query without the need to process a massive amount of data.

The library currently supports the following databases. Database specific documentations are provided in the links:

Installation

The library can be installed from [PyPI][pypi-project] using pip:

pip install --upgrade sql-mock

To install database specific versions, you can use the following:

# BigQuery
pip install --upgrade sql-mock[bigquery]

# Clickhouse
pip install --upgrade sql-mock[clickhouse]

If you need to modify this source code, install the dependencies using poetry:

poetry install --all-extras

Usage

How it works

Before diving into specific database scenarios, let's start with a simplified example of how SQL Mock works behind the scenes.

  1. You have an original SQL query, for instance:

    SELECT id FROM data.table1
    
  2. Using SQL Mock, you define mock tables. You can use the built-in column types provided by SQL Mock. Available column types include Int, String, Date, and more. Each database type has their own column types. Define your tables by subclassing a mock table class that fits your database (e.g. BigQueryMockTable) and specifying the column types along with default values. In our example we use the ClickhouseTableMock class

    from sql_mock.clickhouse import column_mocks as col
    from sql_mock.clickhouse.table_mocks import ClickHouseTableMock, table_meta
    
    @table_meta(table_ref='data.table1)
    class Table(ClickHouseTableMock):
        id = col.Int(default=1)
        name = col.String(default='Peter')
    
    @table_meta(table_ref='data.result_table')
    class ResultTable(ClickhouseTableMock):
        id = col.Int(default=1)
    
  3. Creating mock data: Define mock data for your tables using dictionaries. Each dictionary represents a row in the table, with keys corresponding to column names. Table column keys that don't get a value will use the default.

    user_data = [
        {}, # This will use the defaults for both id and name
        {'id': 2, 'name': 'Martin'},
        {'id': 3}, # This will use defaults for the name
    ]
    
    input_table_mock = Table.from_dicts(user_data)
    
  4. Getting results for a table mock: Use the from_inputs method of the table mock object to generate mock query results based on your mock data.

    res = ResultTable.from_mocks(query='SELECT id FROM data.table1', input_data=[input_table_mock])
    
  5. Behind the scene SQL Mock replaces table references (e.g. data.table1) in your query with Common Table Expressions (CTEs) filled with dummy data. It can roughly be compared to something like this:

    WITH data__table1 AS (
        -- Mocked inputs
        SELECT 
            cast('1' AS 'String') AS id, 
            cast('Peter' AS 'String') AS name
        UNION ALL 
        SELECT 
            cast('2' AS 'String') AS id, 
            cast('Martin' AS 'String') AS name
        UNION ALL 
        SELECT 
            cast('3' AS 'String') AS id, 
            cast('Peter' AS 'String') AS name
    )
    
    result AS (
        -- Original query with replaced references
        SELECT id FROM data__table1 
    )
    
    SELECT 
        cast(id AS 'String') AS id
    FROM result
    
  6. Finally, you can compare your results to some expected results using the assert_equal method.

    expected = [{'id': '1'},{'id': '2'},{'id': '3'}]
    res.assert_equal(expected)
    

Setup for Pytest

If you are using pytest, make sure to add a conftest.py file to the root of your project. In the file add the following lines:

import pytest
pytest.register_assert_rewrite('sql_mock')

This allows you to get a rich comparison when using the .assert_equal method on the table mock instances.

We also recommend using pytest-icdiff for better visibility on diffs of failed tests.

Examples

You can find some examples in the examples folder.

Contributing

We welcome contributions to improve and enhance this open-source project. Whether you want to report issues, suggest new features, or directly contribute to the codebase, your input is valuable. To ensure a smooth and collaborative experience for both contributors and maintainers, please follow these guidelines:

Reporting Issues

If you encounter a bug, have a feature request, or face any issues with the project, we encourage you to report them using the project's issue tracker. When creating an issue, please include the following information:

  • A clear and descriptive title.
  • A detailed description of the problem or suggestion.
  • Steps to reproduce the issue (if applicable).
  • Any error messages or screenshots that help clarify the problem.

Feature Requests

If you have ideas for new features or improvements, please use the project's issue tracker to submit a feature request. We appreciate well-documented feature requests that explain the motivation and potential use cases.

Contributing Code

If you're interested in contributing code, follow these steps:

  1. Fork the Repository: Fork the project's repository to your GitHub account.

  2. Create a Branch: Create a new branch for your contribution, preferably with a name that describes the feature or fix you're working on.

  3. Code and Test: Write your code, and make sure to test it thoroughly to ensure it functions as expected.

  4. Documentation: If your contribution involves code changes, update the relevant documentation to reflect those changes.

  5. Submit a Pull Request: Submit a pull request to the project's repository. Be sure to provide a clear and concise description of your changes. Include a reference to any related issues.

  6. Code Review: Your pull request will undergo code review by maintainers and contributors. Be prepared to address any feedback and make necessary changes.

  7. Merge: Once your contribution is approved and passes all checks, it will be merged into the project.

Coding Standards

When contributing code, adhere to the following coding standards:

  • Follow the project's coding style, including code formatting and naming conventions.
  • Ensure your code is well-documented and includes comments where necessary.
  • Write clear commit messages that describe the purpose of each commit.

Local Setup

To set up your local development environment for this project, follow these steps:

1. Clone the Repository

git clone https://github.com/your-project/repository.git
cd repository

2. Install Dependencies

We use Poetry for dependency management. If you don't have Poetry installed, you can get it from here.

Once you have Poetry, you can install the project's dependencies:

poetry install

3. Pre-Commit Hooks

This project uses pre-commit hooks to ensure code quality. To install the hooks, run:

poetry run pre-commit install

This will set up the necessary hooks to check code formatting, linting, and other code quality checks before each commit.

4. Running Tests

We use pytest for running tests. You can run all the tests with:

poetry run pytest tests/

5. Environment Variables

If you're working with database-specific sections (e.g., BigQuery or ClickHouse), make sure to set the required environment variables for your chosen database. Refer to the respective "Usage" sections for details on these variables.

6. Development Workflow

Before you start contributing, create a new branch for your work:

git checkout -b your-feature-branch

Make your code changes, commit them, and create a pull request to the project's repository following the Contributing Guidelines.

7. Code Formatting and Linting

As part of the pre-commit hooks, code formatting and linting will be automatically checked before each commit. Be sure to address any issues reported by the hooks.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sql_mock-0.1.2.tar.gz (12.7 kB view hashes)

Uploaded Source

Built Distribution

sql_mock-0.1.2-py3-none-any.whl (13.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page