Simple, extendable configuration files, using Pydantic for validation.
Project description
Validating config
Simple, extendable configuration objects
Quick start
Installation
As a package:
-
pip install valconfig
Or as inlined source:
- Copy the
valconfig.pyfile into your project and install Pydantic.
Example usage
# config.py
from valconfig import ValConfig
from pydantic import HttpUrl
from scityping.numpy import Array
class Config(ValConfig):
n_units: int=5
connectivites: Array[float, 2] # 2D array of floats
url: HttpUrl="example.com"
config = Config()
For more detailed usage, see the documentation.
Motivation
Many types of projects make use of some system to set configuration options, and while the type of project may influence that system, its basic requirements are pretty much always the same:
- It should be easy to read.
- It should be easy to extend.
Validating config came about to fulfill these needs for scientific programming, which is a bit of an extreme case: On the one hand, scientists are not professional programmers, so the system needs to be so simple that it needs no documentation at all. On the other hand, they are also constantly developing new methods, and so also need to be able to add new configuration options and new variable types. Moreover, code is often shared (at least within a group) between users with different machines and different needs. A key use of config objects in a scientific context are specifying machine-local settings, like enabling the GPU, defining paths for data and figures, or defining MPI flags.
Thus we needed a way to
- Set version-controlled defaults, that are packaged with the repository.
- Make it easy to know what the configurable options are.
- Make it easy to change those options, without touching the defaults.
And we want to that with nothing more complicated than a git push.
In addition, projects often depend on other projects, so we need a way to set their options as well. Ideally all within the same configuration file.
Finally, not all parameters are strings or simple numerical values. Some might need to be cast to NumPy arrays; others might need to be pre-processed, or checked for consistency with other parameters, or inferred from other parameters. These are all roles that conceptually belong in a configuration parser, not the core parts of our program. As scientist programmers, we already have the bad habit of mixing up our programming logic into a big ball of spaghetti – let’s give ourselves one less reason to do so.
This adds two additional desiderata:
- Support for composition: have one configuration for multiple packages.
- Validation of values based on provided type information.
v1.0 rewrite
This version is a complete rewrite of ValConfig v0.1, now built on top of Pydantic v2.
The logic for resolving resolving relative paths in configuration files is now both simpler and more powerful.
This version also switches to using TOML files for configuration, making use of the built-in support for TOML added in Python 3.11: this allows us to remove all the logic for dealing with the limitations of configparser, thus improving reliability and reducing code size by about 20%.
The old version is still available using from valconfig.v1 import ValConfig.
What a ValidatingConfig provides
- A mechanism to define simple lists of configuration options with zero boilerplate: just list the parameter names, their types, and optionally their default values.
- Built-in validation provided by Pydantic.
- Additional validation for scientific types, when used in conjunction with Scitying.
- The ability to compose configuration objects from multiple packages into a single central configuration object – even when those packages don’t use ValidatingConfig. This ensures all packages have access to a single source of truth.
- The ability to extend the functionality of a config object with all the
functionality Python has to offer: read environment variables with
os.getenv, get command line parameters withsys.argv, etc.- Other packages typically need to add features like these because their config objects are
more limited. We avoid this by defining
configwith a standard class. This keeps this package lean, and your code simple.
- Other packages typically need to add features like these because their config objects are
more limited. We avoid this by defining
Other features
-
Zero onboarding:
- Validating config uses only standard Python: No custom function calls, no boilerplate. If you know modern Python, you already know how to use this module.
- Just define a
Configclass with the parameter names and their types. - Use the
Configclass as any other Python class.
-
Define your own custom types with either Pydantic or Scityping.[^new-types][^scityping-v1]
-
Pydantic provides validators which can be used to add per-field validation and pre-processing. These are defined with plain Python, not a restricted minilanguage or a sanitized
eval. -
Use standard properties to define computed (inferred) values.
-
Store configuration values in TOML files
- This is a highly flexible format, supporting deep parameter structure while remaining easy for humans to read.
- Support for YAML or JSON may be added in the future, but for now TOML is the way to go.
-
Relative paths in config files are correctly resolved.
- Relative paths can be specified as relative either to the configuration file, the project root, or the current directory.
-
Automatic discovery of the project root and local configuration files.
- To identify the project root, ValConfig will recurse up the directory tree until it finds a file recognized as indicating the root of a project.[^root-filenames]
- While recursing up the tree, ValConfig will also recognize any file named
local.tomlas a local configuration file. - All local configuration files up to the project are loaded, with those closest to the current directory (i.e. those with a longer path) given higher precedence.
-
Hierarchical: Organize your parameters into categories by defining nested classes.
from valconfig import ValConfig class Config(ValConfig): class figures: width: float format: str class curves: colormap: str | list[str] class heatmaps: colormap: str | list[str] class run: cache: bool class model: n_units: int
-
Composable
-
Want multiple config objects for each of your project’s subpackages ? Just import them.
-
Want to combine them into a single root config file ? Just import them.
from valconfig import ValConfig from .subpkg1 import Config as Config1 from .subpkg2 import Config as Config2 class Config(ValConfig): pkg1: Config1 pkg2: Config2
-
Does your project depend on another ? Define the parameters you need to modify and add a function which updates the config with those parameters.
from valconfig import ValConfig, ValidationInfo from other_package import config as OtherConfig from pydantic import field_validator class Config(ValConfig): # List the parameters you want to modify other_package_option1: int other_package_option2: float # Include the 3rd party config other_package: OtherConfig @field_validator("other_package", mode="after") def update_other_package_options(cls, other_config, info: ValidationInfo): # The exact details here will depend on the 3rd party package other_config.opt1 = info.data["other_package_option1"] other_config.opt2 = info.data["other_package_option2"]
-
[^new-types]: Scityping was developed as an extension of Pydantic to allow
the use of (abstract) base classes in type definitions, for example defining
a field of type Model which accepts any subclass of Model. (In plain
Pydantic values are always coerced to the target type.) Whether it is best
to define new types with either Scityping or Pydantic largely depends on
whether this use as abstract classes is needed.
[^scityping-v1]: Scityping was originally developed against Pydantic v1, whereas ValConfig v1.0 now uses Pydantic v2. Eventually I hope to update Scityping, but in the mean time it should continue it work, as most of the old Pydantic hooks are deprecated but not removed in v2.
[^root-filenames]: By default, these are .git, .hg, .smt, pyproject.toml, setup.cfg, setup.py and poetry.lock. This can be changed by overriding the class attribute __projectroot_filenames__.
In relation to other projects
What’s wrong with the gazillion other config file projects out there ? Why not use one of those ?
In short, because we can now do in 200 lines of code what used to take thousands. Most existing packages for config files were developed many years ago, before the standardization of type annotations. The can all perform the basic task of converting a file to a Python dictionary of strings, but fulfilling all of our aforementioned desiderata was difficult without creating a bloated package. Understandably therefore, they focused on the features required by their own use cases, which means that I found them all unsatisfactory in some respect:
- Some introduce a new API with custom functions or decorators to define parameters. This makes it more difficult to learn and extend. (vyper-config, hydra)
- Many provide no validation (one-config, config2).
- When validations functions are provided, the target type is often not specified by the config object, but in the calling code – if your configuration library allows to define the name but not the type of the parameters, that’s only 50% of the information and 20% of the work. (configparser, vyper-config)
- For the examples I know which provides validation at the config level, the set of supported types is very limited and basically hard-coded. (OmegaConf, CFG API)
- Some approaches even define their own file formats, substantially raising the barrier to adoption. (CFG API)
- The package configobj is probably the most closely aligned with our goals: it provides a simple, declarative format for both parameter and types specification, as well as key features like hierarchical parameters. (But not, to my knowledge, the configuration of subpackages.) It is also mature, which unfortunately means that it pre-dates widespread use of validation libraries and therefore must package its own custom validation library.
A simple implementation
With the standardization of type annotations in Python 3.6–3.8 and the availability of classes like Pydantic’s BaseModel, defining classes with validation
logic has become almost trivial, and converting a BaseModel into a full-featured
config parser basically only needs three things:
- functionality for reading values from files;
- functionality for resolving relative paths from different source locations;
- ability to compose configuration classes.
In effect therefore, ValConfig is just a subclass of BaseModel with some
simple methods for reading config files. The biggest difference is that each
subclass of ValConfig is made a singleton. We use this pattern for exactly
the same reasons as one-config: it
solves a host of corner cases, makes it trivial to support composing configs,
and ensures that config objects are a single source of truth.
By relying on Pydantic, ValConfig can be extremely lean, while still providing
functionality that is on-par with the much heavier packages listed above.
Pydantic itself is highly mature and actively maintained.
The
ValConfigclass and all its utility functions clock in at less than 400 lines of code and are contained in a single, easily maintained module. If desired, the module can even be included as part of your project’s source code, thus removing valconfig as package dependency and giving you full control over the config object.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file valconfig-1.0.0rc2.tar.gz.
File metadata
- Download URL: valconfig-1.0.0rc2.tar.gz
- Upload date:
- Size: 61.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1f507ccbf8b26335d71b2824b57436c20cb20b652e1f7b11e3978575ee665264
|
|
| MD5 |
24334cc3d2aabc3363a525357c132c88
|
|
| BLAKE2b-256 |
6873af48772d8bd4272a8821231e52b07be071402c3c19533e1a56850f11c863
|
Provenance
The following attestation bundles were made for valconfig-1.0.0rc2.tar.gz:
Publisher:
publish-on-pypi.yml on alcrene/valconfig
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
valconfig-1.0.0rc2.tar.gz -
Subject digest:
1f507ccbf8b26335d71b2824b57436c20cb20b652e1f7b11e3978575ee665264 - Sigstore transparency entry: 582857386
- Sigstore integration time:
-
Permalink:
alcrene/valconfig@07c44667516092de00a4f3a86fc2f765efc983f0 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/alcrene
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-on-pypi.yml@07c44667516092de00a4f3a86fc2f765efc983f0 -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file valconfig-1.0.0rc2-py3-none-any.whl.
File metadata
- Download URL: valconfig-1.0.0rc2-py3-none-any.whl
- Upload date:
- Size: 43.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f71857a9149cb9f18549c70cb29a4869be98dd9a9782f53ed1c87edd8ff8d334
|
|
| MD5 |
c32bc8d42a926e3492d42886268a2e47
|
|
| BLAKE2b-256 |
bbdcadf24e64783ad2ece2b225ac29b2b3f306ab212887fc5420977a550842be
|
Provenance
The following attestation bundles were made for valconfig-1.0.0rc2-py3-none-any.whl:
Publisher:
publish-on-pypi.yml on alcrene/valconfig
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
valconfig-1.0.0rc2-py3-none-any.whl -
Subject digest:
f71857a9149cb9f18549c70cb29a4869be98dd9a9782f53ed1c87edd8ff8d334 - Sigstore transparency entry: 582857390
- Sigstore integration time:
-
Permalink:
alcrene/valconfig@07c44667516092de00a4f3a86fc2f765efc983f0 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/alcrene
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-on-pypi.yml@07c44667516092de00a4f3a86fc2f765efc983f0 -
Trigger Event:
workflow_dispatch
-
Statement type: