Skip to main content

Configuration layer for DataDesigner synthetic data generation

Project description

data-designer-config

Configuration layer for NeMo Data Designer synthetic data generation framework.

This package provides the configuration API for defining synthetic data generation pipelines. It's a lightweight dependency that can be used standalone for configuration management.

Installation

pip install data-designer-config

Usage

import data_designer.config as dd

# Initialize config builder with model config(s)
config_builder = dd.DataDesignerConfigBuilder(
    model_configs=[
        dd.ModelConfig(
            alias="my-model",
            model="nvidia/nemotron-3-nano-30b-a3b",
            provider="nvidia",
            inference_parameters=dd.ChatCompletionInferenceParams(temperature=0.7),
        ),
    ]
)

# Add columns
config_builder.add_column(
    dd.SamplerColumnConfig(
        name="user_id",
        sampler_type=dd.SamplerType.UUID,
        params=dd.UUIDSamplerParams(prefix="user-"),
    )
)
config_builder.add_column(
    dd.LLMTextColumnConfig(
        name="description",
        prompt="Write a product description",
        model_alias="my-model",
    )
)

# Build configuration
config = config_builder.build()

See main README.md for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data_designer_config-0.6.0rc5.tar.gz (143.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

data_designer_config-0.6.0rc5-py3-none-any.whl (123.1 kB view details)

Uploaded Python 3

File details

Details for the file data_designer_config-0.6.0rc5.tar.gz.

File metadata

  • Download URL: data_designer_config-0.6.0rc5.tar.gz
  • Upload date:
  • Size: 143.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for data_designer_config-0.6.0rc5.tar.gz
Algorithm Hash digest
SHA256 e2c49329e8792c8026449a0f11e36501c429e117d3c719e063804fe48b94f11a
MD5 07126b9505d60f339331d4c3354e5778
BLAKE2b-256 561fe8a6331890a538fd54d4e82ae7d06c38860f5ce1b6ae2db69cc266e660ee

See more details on using hashes here.

File details

Details for the file data_designer_config-0.6.0rc5-py3-none-any.whl.

File metadata

File hashes

Hashes for data_designer_config-0.6.0rc5-py3-none-any.whl
Algorithm Hash digest
SHA256 29cb15abc4b73f7ef916c9c9d15e15a2b0f7165a1401986c48e713d2a412affe
MD5 4c4f51cbc8de781e64b6f93b67dbb0fd
BLAKE2b-256 882014d25de3820dd3db6785dd311ec946834f1953f5f19b39290ceb99be7daf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page