Easier Configuration

These details have not been verified by PyPI

Project links

Project description

CHANfiG

Introduction

CHANfiG aims to make your configuration easier.

There are tons of configurable parameters in training a Machine Learning model. To configure all these parameters, researchers usually need to write gigantic config files, sometimes even thousands of lines. Most of the configs are just replicates of the default arguments of certain functions, resulting in many unnecessary declarations. It is also very hard to alter the configurations. One needs to navigate and open the right configuration file, make changes, save and exit. These had wasted an uncountable[^uncountable] amount of precious time ~~and are no doubt a crime~~. Using argparse could relieve the burdens to some extent. However, it takes a lot of work to make it compatible with existing config files, and its lack of nesting limits its potential.

CHANfiG would like to make a change.

You just type the alternations in the command line, and leave everything else to CHANfiG.

CHANfiG is highly inspired by YACS. Different from the paradigm of YACS( your code + a YACS config for experiment E (+ external dependencies + hardware + other nuisance terms ...) = reproducible experiment E), The paradigm of CHANfiG is:

your code + command line arguments (+ optional CHANfiG config + external dependencies + hardware + other nuisance terms ...) = reproducible experiment E (+ optional CHANfiG config for experiment E)

Components

A Config is basically a nested dict structure.

However, the default Python dict is hard to manipulate.

The only way to access a dict member is through dict['name'], which is obviously extremely complex. Even worse, if the dict is nested like a config, member access could be something like dict['parent']['children']['name'].

Enough is enough, it is time to make a change.

We need attribute-style access, and we need it now. dict.name and dict.parent.children.name are all you need.

Although there have been some other works that achieve a similar functionality of attribute-style access to dict members. Their Config objects either use a separate dict to store information from attribute-style access (EasyDict), which may lead to inconsistency between attribute-style access and dict-style access; or re-use the existing __dict__ and redirect dict-style access (ml_collections), which may result in confliction between attributes and members of Config.

To overcome the aforementioned limitations, we inherit the Python built-in dict to create FlatDict, NestedDict, and Config objects.

FlatDict

FlatDict improves the default dict in 3 aspects.

FlatDict also accepts default_factory, and can be easily used as defaultdict.

Dict Operations

FlatDict extends the update method of the original dict, allows passing another Mapping, Iterable or a path.

Moreover, FlatDict comes with difference and intersection, which makes it very easy to compare a FlatDict with other Mapping, Iterable, or a path.

ML Operations

FlatDict supports the to method similar to PyTorch Tensors. You can simply convert all member values of FlatDict to a certain type or pass to a device in the same way.

FlatDict also integrates cpu, gpu (cuda), and tpu methods for easier access.

IO Operations

FlatDict provides json, jsons, yaml and yamls methods to dump FlatDict object to a file or string. It also provides from_json, from_jsons, from_yaml and from_yamls methods to build a FlatDict object from a string or file.

FlatDict also includes dump and load methods which determines the type by its extension and dump/load FlatDict object to/from a file.

NestedDict

Since most Configs are in a nested structure, we further propose a NestedDict.

Based on FlatDict, NestedDict provides all_keys, all_values, and all_items methods to allow iterating over the whole nested structure at once.

NestedDict also comes with apply method, which made it easier to manipulate nested structures.

Config

Config extends the functionality by supporting freeze and defrost the dict, and by adding a built-in ConfigParser to pare command line arguments.

Note that Config also has default_factory=Config() by default for convenience.

Variable

Have one value for multiple names at multiple places? We got you covered.

Just wrap the value with Variable, and one alteration will be reflected everywhere.

Usage

CHANfiG has great backward compatibility with previous configs.

No matter if your old config is json or yaml, you could directly read from them.

And if you are using yacs, just replace CfgNode with Config and enjoy all the additional benefits that CHANfiG provides.

from chanfig import Config, Variable


class Model:
    def __init__(self, encoder, dropout=0.1, activation='ReLU'):
        self.encoder = Encoder(**encoder)
        self.dropout = Dropout(dropout)
        self.activation = getattr(Activation, activation)

def main(config):
    model = Model(**config.model)
    optimizer = Optimizer(**config.optimizer)
    scheduler = Scheduler(**config.scheduler)
    dataset = Dataset(**config.dataset)
    dataloader = Dataloader(**config.dataloader)


class TestConfig(Config):
    def __init__(self):
        super().__init__()
        dropout = Variable(0.1)
        self.name = "CHANfiG"
        self.seed = 1013
        self.data.batch_size = 64
        self.model.encoder.num_layers = 6
        self.model.decoder.num_layers = 6
        self.model.dropout = dropout
        self.model.encoder.dropout = dropout
        self.model.decoder.dropout = dropout
        self.activation = "GELU"
        self.optim.lr = 1e-3

    def post(self):
        self.id = f"{self.name}_{self.seed}"


if __name__ == '__main__':
    # config = Config.load('config.yaml')  # in case you want to read from a yaml
    # config = Config.load('config.json')  # in case you want to read from a json
    # existing_configs = {'data.batch_size': 64, 'model.encoder.num_layers': 8}
    # config = Config(**existing_configs)  # in case you have some config in dict to load
    config = TestConfig()
    config = config.parse()
    # config.update('dataset.yaml')  # in case you want to merge a yaml
    # config.update('dataset.json')  # in case you want to merge a json
    # note that the value of merge will override current values
    config.model.decoder.num_layers = 8
    config.freeze()
    print(config)
    # main(config)
    # config.yaml('config.yaml')  # in case you want to save a yaml
    # config.json('config.json')  # in case you want to save a json

All you need to do is just run a line:

python main.py --model.encoder.num_layers 8 --model.dropout=0.2

You could also load a default configure file and make changes based on it:

Note, you must specify config.parse(default_config='config') to correctly load the default config.

python main.py --config meow.yaml --model.encoder.num_layers 8 --model.dropout=0.2

If you have made it dump current configurations, this should be in the written file:

activation: GELU
data:
  batch_size: 64
id: CHANfiG_1013
model:
  decoder:
    dropout: 0.1
    num_layers: 6
  dropout: 0.1
  encoder:
    dropout: 0.1
    num_layers: 6
name: CHANfiG
optim:
  lr: 0.001
seed: 1013

{
  "name": "CHANfiG",
  "seed": 1013,
  "data": {
    "batch_size": 64
  },
  "model": {
    "encoder": {
      "num_layers": 6,
      "dropout": 0.1
    },
    "decoder": {
      "num_layers": 6,
      "dropout": 0.1
    },
    "dropout": 0.1
  },
  "activation": "GELU",
  "optim": {
    "lr": 0.001
  },
  "id": "CHANfiG_1013"
}

Define the default arguments in function, put alterations in CLI, and leave the rest to CHANfiG.

Installation

Install the most recent stable version on pypi:

pip install chanfig

Install the latest version from source:

pip install git+https://github.com/ZhiyuanChen/CHANfiG

It works the way it should have worked.

License

CHANfiG is multi-licensed under the following licenses:

Unlicense
GNU GPL 2.0 (or any later version)
MIT
Apache 2.0
BSD 2-Clause
BSD 3-Clause
BSD 4-Clause

You can choose any (one or more) of these license if you use this work.

SPDX-License-Identifier: Unlicense OR GPL-2.0-or-later OR MIT OR Apache-2.0 OR BSD-2-Clause OR BSD-3-Clause OR BSD-4-Clause

[^uncountable]: fun fact: time is always uncountable.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.106

Sep 20, 2024

0.0.105

Aug 27, 2024

0.0.104

Aug 20, 2024

0.0.103

Aug 2, 2024

0.0.102

Aug 1, 2024

0.0.101

Jul 18, 2024

0.0.100

Jun 26, 2024

0.0.99

Apr 15, 2024

0.0.98

Mar 28, 2024

0.0.96

Feb 23, 2024

0.0.95

Nov 9, 2023

0.0.94

Nov 7, 2023

0.0.93

Oct 19, 2023

0.0.92

Sep 14, 2023

0.0.91

Sep 4, 2023

0.0.90

Aug 29, 2023

0.0.89

Aug 7, 2023

0.0.88

Aug 4, 2023

0.0.87

Jul 18, 2023

0.0.86

Jul 14, 2023

0.0.85

Jul 7, 2023

0.0.84.post1

Jun 30, 2023

0.0.84

Jun 29, 2023

0.0.83 yanked

Jun 26, 2023

Reason this release was yanked:

Merge error with property

0.0.82

Jun 6, 2023

0.0.81

May 20, 2023

0.0.80

May 12, 2023

0.0.79

May 11, 2023

0.0.78

May 4, 2023

0.0.77

Apr 25, 2023

0.0.76

Apr 25, 2023

0.0.75

Apr 25, 2023

0.0.74

Apr 24, 2023

0.0.73

Apr 23, 2023

0.0.72

Apr 20, 2023

0.0.71

Apr 14, 2023

0.0.70 yanked

Apr 8, 2023

Reason this release was yanked:

The __repr__ of NestedDict is broken and raises RecursionError when calling it.

This version

0.0.69

Mar 29, 2023

0.0.68

Mar 24, 2023

0.0.67

Mar 15, 2023

0.0.66

Mar 7, 2023

0.0.66a1 pre-release

Mar 4, 2023

0.0.65

Mar 3, 2023

0.0.64

Mar 1, 2023

0.0.63

Mar 1, 2023

0.0.62

Feb 28, 2023

0.0.61

Feb 26, 2023

0.0.60

Feb 23, 2023

0.0.59

Feb 21, 2023

0.0.58

Feb 10, 2023

0.0.57

Feb 10, 2023

0.0.56

Feb 9, 2023

0.0.55

Feb 8, 2023

0.0.54

Feb 3, 2023

0.0.53

Jan 18, 2023

0.0.52

Jan 17, 2023

0.0.51

Jan 16, 2023

0.0.50

Jan 16, 2023

0.0.49

Jan 13, 2023

0.0.48

Jan 13, 2023

0.0.47

Jan 13, 2023

0.0.46

Jan 13, 2023

0.0.45

Jan 4, 2023

0.0.44

Jan 2, 2023

0.0.43

Nov 28, 2022

0.0.42

Nov 28, 2022

0.0.41

Nov 17, 2022

0.0.40

Nov 7, 2022

0.0.39

Nov 7, 2022

0.0.37

Nov 4, 2022

0.0.36

Nov 4, 2022

0.0.35

Nov 4, 2022

0.0.34

Nov 2, 2022

0.0.33

Nov 1, 2022

0.0.32

Nov 1, 2022

0.0.31

Nov 1, 2022

0.0.26

Oct 26, 2022

0.0.24.post1

Oct 20, 2022

0.0.24

Oct 20, 2022

0.0.23

Oct 18, 2022

0.0.22

Oct 18, 2022

0.0.21

Oct 18, 2022

0.0.20

Oct 13, 2022

0.0.19.post1

Oct 13, 2022

0.0.19

Oct 13, 2022

0.0.18.post1

Oct 11, 2022

0.0.18

Oct 11, 2022

0.0.17

Oct 9, 2022

0.0.16

Sep 14, 2022

0.0.15

Sep 11, 2022

0.0.14

Sep 6, 2022

0.0.13

Sep 3, 2022

0.0.12 yanked

Sep 2, 2022

Reason this release was yanked:

major flaw in nested declearation in config

0.0.11

Jul 27, 2022

0.0.10

Jul 4, 2022

0.0.9.post7

May 17, 2022

0.0.9.post5

May 16, 2022

0.0.9.post4

May 13, 2022

0.0.9.post3

May 13, 2022

0.0.9

May 13, 2022

0.0.8

May 12, 2022

0.0.7

May 12, 2022

0.0.6

May 12, 2022

0.0.5.post1

May 7, 2022

0.0.4.post1

May 6, 2022

0.0.4

May 5, 2022

0.0.3.post3

May 5, 2022

0.0.3.post2

May 5, 2022

0.0.3.post1

May 5, 2022

0.0.3

May 5, 2022

0.0.2.post1

May 5, 2022

0.0.2

May 5, 2022

0.0.1

May 4, 2022

0.0

May 4, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chanfig-0.0.69.tar.gz (6.3 MB view details)

Uploaded Mar 29, 2023 Source

Built Distribution

chanfig-0.0.69-py3-none-any.whl (27.1 kB view details)

Uploaded Mar 29, 2023 Python 3

File details

Details for the file chanfig-0.0.69.tar.gz.

File metadata

Download URL: chanfig-0.0.69.tar.gz
Upload date: Mar 29, 2023
Size: 6.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.2

File hashes

Hashes for chanfig-0.0.69.tar.gz
Algorithm	Hash digest
SHA256	`d6dc872ecb6611500acc828e15771992cf397b52b06100eabc653fba8115cb9d`
MD5	`56f21bd3f5e7fc45d80b461cbfc20186`
BLAKE2b-256	`6494c58cb8aac4b2662e7888b4c73974c3d7141b209396c8fd2fe1908a6d4648`

See more details on using hashes here.

File details

Details for the file chanfig-0.0.69-py3-none-any.whl.

File metadata

Download URL: chanfig-0.0.69-py3-none-any.whl
Upload date: Mar 29, 2023
Size: 27.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.2

File hashes

Hashes for chanfig-0.0.69-py3-none-any.whl
Algorithm	Hash digest
SHA256	`013629b267bef04ae9c72814f62e5cc6cd22a7a275e62d8d2d4066233d475ea4`
MD5	`82b1e91e87a6b653e39b63d1682c689e`
BLAKE2b-256	`3c7525a493af0411957912dc67c1ef6861bb46cd1c84915134c6366ebaa5f381`