Pydargs allows configuring a dataclass through command line arguments.
Project description
pydargs
Pydargs allows configuring a (Pydantic) dataclass through command line arguments.
Installation
Pydargs can be installed with your favourite package manager. For example:
pip install pydargs
Usage
A minimal usage example would be:
from dataclasses import dataclass
from pydargs import parse
@dataclass
class Config:
number: int
some_string: str = "abc"
if __name__ == "__main__":
config = parse(Config)
After which this entrypoint can be called with
entrypoint --number 42
or
entrypoint --number 42 --some-string abcd
ArgumentParser arguments
It's possible to pass additional arguments to the underlying argparse.ArgumentParser
instance by providing them
as keyword arguments to the parse
function. For example:
config = parse(Config, prog="myprogram", allow_abbrev=False)
will disable abbreviations for long options and set the program name to myprogram
in help messages. For an extensive list of accepted arguments, see the argparse docs.
Supported Field Types
The dataclass can have fields of the base types: int
, float
, str
, bool
, as well as:
- Literals comprised of those types.
- Enums, although these are not recommended as they do not play nice in the help messages. Only the enum name is accepted as a valid input, not the value.
- Bytes, with an optional
encoding
metadata field:a_value: bytes = field(metadata=dict(encoding="ascii"))
, which defaults to utf-8. - Date and datetime, with an optional
date_format
metadata field:your_date: date = field(metadata=dict(date_format="%m-%d-%Y"))
. When not provided dates in ISO 8601 format are accepted. - Lists of those types, either denoted as e.g.
list[int]
orSequence[int]
. Multiple arguments to anumbers: list[int]
field can be provided as--numbers 1 2 3
. A list-field without a default will require at least a single value to be provided. If a default is provided, it will be completely replaced by any arguments, if provided. - Optional types, denoted as e.g.
typing.Optional[int]
orint | None
(for Python 3.10 and above). Any argument passed is assumed to be of the provided type and can never beNone
. - Unions of types, denoted as e.g.
typing.Union[int, str]
orint | str
. Each argument will be parsed into the first type that returns a valid result. Note that this means thatstr | int
will always result in a value of typestr
. - Any other type that can be instantiated from a string, such as
Path
. - Dataclasses that, in turn, contain fields of supported types. See Nested Dataclasses.
- A union of multiple dataclasses, that in turn contain fields of supported types, which will be parsed in Subparsers.
Metadata
Additional options can be provided to the dataclass field metadata.
The following metadata fields are supported:
positional
Set positional=True
to create a positional argument instead of an option.
from dataclasses import dataclass, field
@dataclass
class Config:
argument: str = field(metadata=dict(positional=True))
as_flags
Set as_flags=True
for a boolean field:
from dataclasses import dataclass, field
@dataclass
class Config:
verbose: bool = field(default=False, metadata=dict(as_flags=True))
which would create the arguments --verbose
and --no-verbose
to
set the value of verbose
to True
or False
respectively, instead
of a single option that requires a value like --verbose True
.
parser
Provide a custom type converter that parses the argument into the desired type. For example:
from dataclasses import dataclass, field
from json import loads
@dataclass
class Config:
list_of_numbers: list[int] = field(metadata=dict(parser=loads))
This would parse --list-of-numbers [1, 2, 3]
into the list [1, 2, 3]
. Note that the error message returned
when providing invalid input is lacking any details. Also, no validation is performed to verify that the returned
type matches the field type. In the above example, --list-of-numbers '{"a": "b"}'
would result in list_of_numbers
being the dictionary {"a": "b"}
without any kind of warning.
short_option
Provide a short option for a field, which can be used as an alternative to the long option. For example,
from dataclasses import dataclass, field
@dataclass
class Config:
a_field_with_a_long_name: int = field(metadata=dict(short_option="-a"))
would allow using -a 42
as an alternative to --a-field-with-a-long-name 42
.
Ignoring fields
Fields can be ignored by adding the ignore_arg
metadata field:
@dataclass
class Config:
number: int
ignored: str = field(metadata=dict(ignore_arg=True))
When indicated, this field is not added to the parser and cannot be overridden with an argument.
help
Provide a brief description of the field, used in the help messages generated by argparse.
For example, calling your_program -h
with the dataclass below,
from dataclasses import dataclass, field
@dataclass
class Config:
an_integer: int = field(metadata=dict(help="any integer you like"))
would result in a message like:
usage: your_program [-h] [--an-integer AN_INTEGER]
optional arguments:
-h, --help show this help message and exit
--an-integer AN_INTEGER any integer you like
metavar
Override the displayed name of an argument in the help messages generated by argparse, as documented here.
For example, with the following dataclass,
from dataclasses import dataclass, field
@dataclass
class Config:
an_integer: int = field(metadata=dict(metavar="INT"))
calling your_program -h
would result in a message like:
usage: your_program [-h] [--an-integer INT]
optional arguments:
-h, --help show this help message and exit
--an-integer INT
Nested Dataclasses
Dataclasses may be nested; the type of a dataclass field may be another dataclass type:
from dataclasses import dataclass
@dataclass
class Config:
field_a: int
field_b: str = "abc"
@dataclass
class Base:
config: Config
verbose: bool = False
Argument names of fields of the nested dataclass are prefixed with the field name of the nested dataclass in the base
dataclass. Calling pydargs.parse(Base, ["-h"])
will result in something like:
usage: your_program.py [-h] --config-field-a CONFIG_FIELD_A
[--config-field-b CONFIG_FIELD_B]
[--verbose VERBOSE]
options:
-h, --help show this help message and exit
--verbose VERBOSE (default: False)
config:
--config-field-a CONFIG_FIELD_A
--config-field-b CONFIG_FIELD_B
(default: abc)
Please be aware of the following:
- The default (factory) of fields with a dataclass type is ignored by pydargs, which may yield unexpected results.
E.g., in the example above,
config: Config = field(default_factory=lambda: Config(field_b="def"))
will not result in a default of "def" for field_b when parsed by pydargs. Instead, setfield_b: str = "def"
in the definition ofConfig
. If you must add a default, for example for instantiating your dataclass elsewhere, doconfig: Config = field(default_factory=Config)
, assuming that all fields inConfig
have a default. - Nested dataclasses can not be positional (although fields of the nested dataclass can be).
- Argument names must not collide. In the example above, the
Base
class should not contain a field namedconfig_field_a
.
Subparsers
Dataclasses can contain a field with a union-of-dataclasses type, e.g.:
from dataclasses import dataclass, field
from typing import Union
@dataclass
class Command1:
field_a: int
field_b: str = "abc"
@dataclass
class Command2:
field_c: str = field(metadata=dict(positional=True))
@dataclass
class Base:
command: Union[Command1, Command2]
verbose: bool = False
This will result in sub commands
which allow calling your entrypoint as entrypoint --verbose Command1 --field-a 12
.
Calling pydargs.parse(Base, ["-h"])
will result in something like:
usage: your_program.py [-h] [--verbose VERBOSE] {Command1,command1,Command2,command2} ...
options:
-h, --help show this help message and exit
--verbose VERBOSE (default: False)
action:
{Command1,command1,Command2,command2}
Note that:
- Also lower-case command names are accepted.
- Any dataclass can not contain more than one subcommand-field.
- Sub-commands can be nested and mixed with nested dataclasses.
- Any positional fields defined after a subcommand-field can not be parsed.
- Subparsers handle all arguments that come after the command; so all global arguments must come before the command.
In the above example this means that
entrypoint --verbose Command2 string
is valid butentrypoint Command2 string --verbose
is not.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pydargs-0.9.0.tar.gz
.
File metadata
- Download URL: pydargs-0.9.0.tar.gz
- Upload date:
- Size: 20.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e469cbed74b2ea3038cc52658bb77720795d33e2f26e27da6641223052f8ce32 |
|
MD5 | c915ba00a439500c5b214fb1d3e73d62 |
|
BLAKE2b-256 | 60940a448ef09febaec77b84ab1ee676c32b699a83ac3bb2865cee81d0e1cf44 |
Provenance
File details
Details for the file pydargs-0.9.0-py3-none-any.whl
.
File metadata
- Download URL: pydargs-0.9.0-py3-none-any.whl
- Upload date:
- Size: 9.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fae9e81a6f146733a63128fc90127ead9891b566efd210440d16f1e25e449d1a |
|
MD5 | a7128fdc8df0030b7c11523115daad4b |
|
BLAKE2b-256 | bcf7f2701e2e0fbe273ee93e66f205a6cbc945704214fd2d565285f4f3eb45a4 |