Skip to main content

An agentic framework for building Data transformations from natural language

Project description

Aiden

An agentic framework for building data transformations from natural language.

Overview

Aiden is a Python framework that enables you to build data transformations using natural language. It leverages AI agents to simplify data engineering tasks, making them more accessible and efficient.

Installation

You can install Aiden using pip:

pip install aiden-ai

Or using Poetry:

poetry add aiden-ai

For development installation:

git clone https://github.com/getaiden/aiden.git
cd aiden
poetry install

Example Usage

Aiden makes it easy to transform data using natural language instructions. Here's a simple example of cleaning email addresses:

from aiden import Transformation
from pandas import DataFrame

# Create a validation dataset
validation_df = DataFrame(
    {
        "email":
        ["test", "test2", "test@test.com", "test@test.com"]
    }
)

# Define your transformation with natural language intent
tr = Transformation(
    intent="""Clean the email column by:
    1. Removing any leading or trailing whitespace
    2. Converting all emails to lowercase
    3. Validating email format (must contain @ and a valid domain)
    4. Removing duplicate email addresses
    5. Filtering out emails from disposable domains
    Return only the valid, cleaned email addresses.""",
    input_schema={"email": str},
    output_schema={"email": str},
)

# Build the transformation with the validation dataset
tr.build(validation_dataset=validation_df)

# Print the transformation description
print(tr.describe().as_markdown())

This example demonstrates how to create a transformation that cleans email addresses, validates the transformation with sample data, and then applies it to your actual dataset.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aiden_ai-0.1.1.tar.gz (42.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aiden_ai-0.1.1-py3-none-any.whl (57.9 kB view details)

Uploaded Python 3

File details

Details for the file aiden_ai-0.1.1.tar.gz.

File metadata

  • Download URL: aiden_ai-0.1.1.tar.gz
  • Upload date:
  • Size: 42.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.10 Linux/6.11.0-1013-azure

File hashes

Hashes for aiden_ai-0.1.1.tar.gz
Algorithm Hash digest
SHA256 897a42970f117b18f69c5fd943fe5008cce6bb3bb2187a4823651cbffbc96e93
MD5 36757305683511881c5912daa182c7cb
BLAKE2b-256 119c03bc5fe4b9d48a7d5e4e0349399a5d9791a566ab50ba62b13d596a90f90e

See more details on using hashes here.

File details

Details for the file aiden_ai-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: aiden_ai-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 57.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.10 Linux/6.11.0-1013-azure

File hashes

Hashes for aiden_ai-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2152c1a94292db99000df65dcb0cdf9ec5a8b0d1c5c5f774344f273135054aa9
MD5 e8106f5c2f53d90cc0b0b5005c44519a
BLAKE2b-256 f25eb730eec44c7efd98d524883fc4258e9a1c0df5f9e4cf5c9da6ec0ba59076

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page