Skip to main content

Simple, extendable configuration files, using Pydantic for validation.

Project description

Validating config

Simple, extendable configuration objects

Quick start

Installation

As a package:

  • pip install valconfig
    

Or as inlined source:

  • Copy the valconfig.py file into your project and install Pydantic.

Example usage

# config.py
from valconfig import ValConfig
from pydantic import HttpUrl
from scityping.numpy import Array

class Config(ValConfig):
    n_units: int=5
    connectivites: Array[float, 2]  # 2D array of floats
    url: HttpUrl="example.com"

config = Config()

For more detailed usage, see the documentation.

Motivation

Many types of projects make use of some system to set configuration options, and while the type of project may influence that system, its basic requirements are pretty much always the same:

  • It should be easy to read.
  • It should be easy to extend.

Validating config came about to fulfill these needs for scientific programming, which is a bit of an extreme case: On the one hand, scientists are not professional programmers, so the system needs to be so simple that it needs no documentation at all. On the other hand, they are also constantly developing new methods, and so also need to be able to add new configuration options and new variable types. Moreover, code is often shared (at least within a group) between users with different machines and different needs. A key use of config objects in a scientific context are specifying machine-local settings, like enabling the GPU, defining paths for data and figures, or defining MPI flags.

Thus we needed a way to

  • Set version-controlled defaults, that are packaged with the repository.
  • Make it easy to know what the configurable options are.
  • Make it easy to change those options, without touching the defaults.

And we want to that with nothing more complicated than a git push.

In addition, projects often depend on other projects, so we need a way to set their options as well. Ideally all within the same configuration file.

Finally, not all parameters are strings or simple numerical values. Some might need to be cast to NumPy arrays; others might need to be pre-processed, or checked for consistency with other parameters, or inferred from other parameters. These are all roles that conceptually belong in a configuration parser, not the core parts of our program. As scientist programmers, we already have the bad habit of mixing up our programming logic into a big ball of spaghetti – let’s give ourselves one less reason to do so.

This adds two additional desiderata:

  • Support for composition: have one configuration for multiple packages.
  • Validation of values based on provided type information.

v1.0 rewrite

This version is a complete rewrite of ValConfig v0.1, now built on top of Pydantic v2. The logic for resolving resolving relative paths in configuration files is now both simpler and more powerful. This version also switches to using TOML files for configuration, making use of the built-in support for TOML added in Python 3.11: this allows us to remove all the logic for dealing with the limitations of configparser, thus improving reliability and reducing code size by about 20%.

The old version is still available using from valconfig.v1 import ValConfig.

What a ValidatingConfig provides

  • A mechanism to define simple lists of configuration options with zero boilerplate: just list the parameter names, their types, and optionally their default values.
  • Built-in validation provided by Pydantic.
  • Additional validation for scientific types, when used in conjunction with Scitying.
  • The ability to compose configuration objects from multiple packages into a single central configuration object – even when those packages don’t use ValidatingConfig. This ensures all packages have access to a single source of truth.
  • The ability to extend the functionality of a config object with all the functionality Python has to offer: read environment variables with os.getenv, get command line parameters with sys.argv, etc.
    • Other packages typically need to add features like these because their config objects are more limited. We avoid this by defining config with a standard class. This keeps this package lean, and your code simple.

Other features

  • Zero onboarding:

    • Validating config uses only standard Python: No custom function calls, no boilerplate. If you know modern Python, you already know how to use this module.
    • Just define a Config class with the parameter names and their types.
    • Use the Config class as any other Python class.
  • Define your own custom types with either Pydantic or Scityping.[^new-types][^scityping-v1]

  • Pydantic provides validators which can be used to add per-field validation and pre-processing. These are defined with plain Python, not a restricted minilanguage or a sanitized eval.

  • Use standard properties to define computed (inferred) values.

  • Store configuration values in TOML files

    • This is a highly flexible format, supporting deep parameter structure while remaining easy for humans to read.
    • Support for YAML or JSON may be added in the future, but for now TOML is the way to go.
  • Relative paths in config files are correctly resolved.

    • Relative paths can be specified as relative either to the configuration file, the project root, or the current directory.
  • Automatic discovery of the project root and local configuration files.

    • To identify the project root, ValConfig will recurse up the directory tree until it finds a file recognized as indicating the root of a project.[^root-filenames]
    • While recursing up the tree, ValConfig will also recognize any file named local.toml as a local configuration file.
    • All local configuration files up to the project are loaded, with those closest to the current directory (i.e. those with a longer path) given higher precedence.
  • Hierarchical: Organize your parameters into categories by defining nested classes.

    from valconfig import ValConfig
    
    class Config(ValConfig):
    
      class figures:
        width: float
        format: str
        class curves:
          colormap: str | list[str]
        class heatmaps:
          colormap: str | list[str]
    
      class run:
        cache: bool
    
      class model:
        n_units: int
    
  • Composable

    • Want multiple config objects for each of your project’s subpackages ? Just import them.

    • Want to combine them into a single root config file ? Just import them.

      from valconfig import ValConfig
      from .subpkg1 import Config as Config1
      from .subpkg2 import Config as Config2
      
      class Config(ValConfig):
        pkg1: Config1
        pkg2: Config2
      
    • Does your project depend on another ? Define the parameters you need to modify and add a function which updates the config with those parameters.

      from valconfig import ValConfig, ValidationInfo
      from other_package import config as OtherConfig
      from pydantic import field_validator
      
      class Config(ValConfig):
        # List the parameters you want to modify
        other_package_option1: int
        other_package_option2: float
        # Include the 3rd party config
        other_package: OtherConfig
      
        @field_validator("other_package", mode="after")
        def update_other_package_options(cls, other_config, info: ValidationInfo):
          # The exact details here will depend on the 3rd party package
          other_config.opt1 = info.data["other_package_option1"]
          other_config.opt2 = info.data["other_package_option2"]
      

[^new-types]: Scityping was developed as an extension of Pydantic to allow the use of (abstract) base classes in type definitions, for example defining a field of type Model which accepts any subclass of Model. (In plain Pydantic values are always coerced to the target type.) Whether it is best to define new types with either Scityping or Pydantic largely depends on whether this use as abstract classes is needed. [^scityping-v1]: Scityping was originally developed against Pydantic v1, whereas ValConfig v1.0 now uses Pydantic v2. Eventually I hope to update Scityping, but in the mean time it should continue it work, as most of the old Pydantic hooks are deprecated but not removed in v2.

[^root-filenames]: By default, these are .git, .hg, .smt, pyproject.toml, setup.cfg, setup.py and poetry.lock. This can be changed by overriding the class attribute __projectroot_filenames__.

In relation to other projects

What’s wrong with the gazillion other config file projects out there ? Why not use one of those ?

In short, because we can now do in 200 lines of code what used to take thousands. Most existing packages for config files were developed many years ago, before the standardization of type annotations. The can all perform the basic task of converting a file to a Python dictionary of strings, but fulfilling all of our aforementioned desiderata was difficult without creating a bloated package. Understandably therefore, they focused on the features required by their own use cases, which means that I found them all unsatisfactory in some respect:

  • Some introduce a new API with custom functions or decorators to define parameters. This makes it more difficult to learn and extend. (vyper-config, hydra)
  • Many provide no validation (one-config, config2).
  • When validations functions are provided, the target type is often not specified by the config object, but in the calling code – if your configuration library allows to define the name but not the type of the parameters, that’s only 50% of the information and 20% of the work. (configparser, vyper-config)
  • For the examples I know which provides validation at the config level, the set of supported types is very limited and basically hard-coded. (OmegaConf, CFG API)
  • Some approaches even define their own file formats, substantially raising the barrier to adoption. (CFG API)
  • The package configobj is probably the most closely aligned with our goals: it provides a simple, declarative format for both parameter and types specification, as well as key features like hierarchical parameters. (But not, to my knowledge, the configuration of subpackages.) It is also mature, which unfortunately means that it pre-dates widespread use of validation libraries and therefore must package its own custom validation library.

A simple implementation

With the standardization of type annotations in Python 3.6–3.8 and the availability of classes like Pydantic’s BaseModel, defining classes with validation logic has become almost trivial, and converting a BaseModel into a full-featured config parser basically only needs three things:

  • functionality for reading values from files;
  • functionality for resolving relative paths from different source locations;
  • ability to compose configuration classes.

In effect therefore, ValConfig is just a subclass of BaseModel with some simple methods for reading config files. The biggest difference is that each subclass of ValConfig is made a singleton. We use this pattern for exactly the same reasons as one-config: it solves a host of corner cases, makes it trivial to support composing configs, and ensures that config objects are a single source of truth.

By relying on Pydantic, ValConfig can be extremely lean, while still providing functionality that is on-par with the much heavier packages listed above. Pydantic itself is highly mature and actively maintained.

The ValConfig class and all its utility functions clock in at less than 400 lines of code and are contained in a single, easily maintained module. If desired, the module can even be included as part of your project’s source code, thus removing valconfig as package dependency and giving you full control over the config object.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

valconfig-1.0.0rc2.tar.gz (61.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

valconfig-1.0.0rc2-py3-none-any.whl (43.5 kB view details)

Uploaded Python 3

File details

Details for the file valconfig-1.0.0rc2.tar.gz.

File metadata

  • Download URL: valconfig-1.0.0rc2.tar.gz
  • Upload date:
  • Size: 61.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for valconfig-1.0.0rc2.tar.gz
Algorithm Hash digest
SHA256 1f507ccbf8b26335d71b2824b57436c20cb20b652e1f7b11e3978575ee665264
MD5 24334cc3d2aabc3363a525357c132c88
BLAKE2b-256 6873af48772d8bd4272a8821231e52b07be071402c3c19533e1a56850f11c863

See more details on using hashes here.

Provenance

The following attestation bundles were made for valconfig-1.0.0rc2.tar.gz:

Publisher: publish-on-pypi.yml on alcrene/valconfig

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file valconfig-1.0.0rc2-py3-none-any.whl.

File metadata

  • Download URL: valconfig-1.0.0rc2-py3-none-any.whl
  • Upload date:
  • Size: 43.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for valconfig-1.0.0rc2-py3-none-any.whl
Algorithm Hash digest
SHA256 f71857a9149cb9f18549c70cb29a4869be98dd9a9782f53ed1c87edd8ff8d334
MD5 c32bc8d42a926e3492d42886268a2e47
BLAKE2b-256 bbdcadf24e64783ad2ece2b225ac29b2b3f306ab212887fc5420977a550842be

See more details on using hashes here.

Provenance

The following attestation bundles were made for valconfig-1.0.0rc2-py3-none-any.whl:

Publisher: publish-on-pypi.yml on alcrene/valconfig

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page