An agentic framework for building data transformations from natural language
Aiden
An agentic framework for building data transformations from natural language.
Overview
Aiden is a Python framework for building data transformations from natural-language instructions. It uses AI agents to handle data engineering tasks, with the aim of making them more accessible and efficient.
Installation
You can install Aiden using pip:
pip install aiden-ai
Or using Poetry:
poetry add aiden-ai
For development installation:
git clone https://github.com/getaiden/aiden.git
cd aiden
poetry install
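After installing, it can be useful to confirm that the package is visible to your interpreter. The sketch below uses only the standard library; the distribution name `aiden-ai` comes from the install commands above, while the helper name `installed_version` is hypothetical and chosen for illustration.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution: str = "aiden-ai"):
    """Return the installed version string, or None if the package is absent.

    `installed_version` is a hypothetical helper, not part of Aiden.
    """
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(f"aiden-ai {found}" if found else "aiden-ai is not installed")
```

If this reports that the package is not installed, check that you are running the same Python environment that pip or Poetry installed into.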
Example Usage
Aiden makes it easy to transform data using natural language instructions. Here's a simple example of cleaning email addresses:
from aiden import Transformation
from pandas import DataFrame

# Create a validation dataset
validation_df = DataFrame(
    {
        "email": ["test", "test2", "test@test.com", "test@test.com"]
    }
)

# Define your transformation with natural language intent
tr = Transformation(
    intent="""Clean the email column by:
    1. Removing any leading or trailing whitespace
    2. Converting all emails to lowercase
    3. Validating email format (must contain @ and a valid domain)
    4. Removing duplicate email addresses
    5. Filtering out emails from disposable domains
    Return only the valid, cleaned email addresses.""",
    input_schema={"email": str},
    output_schema={"email": str},
)

# Build the transformation with the validation dataset
tr.build(validation_dataset=validation_df)

# Print the transformation description
print(tr.describe().as_markdown())
This example defines a transformation that cleans email addresses, builds it against a small validation dataset, and prints a Markdown description of the result. Applying the built transformation to your actual dataset is not shown here.
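To make the intent above concrete, here is a hand-written, dependency-free sketch of the five steps it lists. It is not the code Aiden generates and does not use the Aiden API; it is only a reference you might compare a built transformation against. The function name `clean_emails`, the simplified regular expression, and the contents of `DISPOSABLE_DOMAINS` are all assumptions made for illustration.

```python
import re

# Illustrative blocklist only; a real list would be far longer.
DISPOSABLE_DOMAINS = {"mailinator.com"}

# Simplified check: a local part, "@", and a domain containing at least one dot.
# This is an assumption about what "valid format" means, not a full RFC parser.
EMAIL_RE = re.compile(r"^[^@\s]+@([^@\s]+\.[^@\s.]+)$")


def clean_emails(emails):
    """Apply the five steps from the intent to a plain list of strings."""
    seen = set()
    cleaned = []
    for raw in emails:
        email = raw.strip().lower()          # steps 1 and 2
        match = EMAIL_RE.match(email)
        if match is None:                    # step 3: invalid format
            continue
        if match.group(1) in DISPOSABLE_DOMAINS:  # step 5
            continue
        if email in seen:                    # step 4: duplicates
            continue
        seen.add(email)
        cleaned.append(email)
    return cleaned


print(clean_emails(["test", "test2", "test@test.com", "test@test.com"]))
# prints ['test@test.com']
```

Run against the validation data from the example, only one address survives: the two entries without an `@` fail validation and the repeated address is dropped as a duplicate.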
License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
File details
Details for the file aiden_ai-0.1.1.tar.gz.
File metadata
- Download URL: aiden_ai-0.1.1.tar.gz
- Upload date:
- Size: 42.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.10 Linux/6.11.0-1013-azure
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 897a42970f117b18f69c5fd943fe5008cce6bb3bb2187a4823651cbffbc96e93 |
| MD5 | 36757305683511881c5912daa182c7cb |
| BLAKE2b-256 | 119c03bc5fe4b9d48a7d5e4e0349399a5d9791a566ab50ba62b13d596a90f90e |
File details
Details for the file aiden_ai-0.1.1-py3-none-any.whl.
File metadata
- Download URL: aiden_ai-0.1.1-py3-none-any.whl
- Upload date:
- Size: 57.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.10 Linux/6.11.0-1013-azure
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 2152c1a94292db99000df65dcb0cdf9ec5a8b0d1c5c5f774344f273135054aa9 |
| MD5 | e8106f5c2f53d90cc0b0b5005c44519a |
| BLAKE2b-256 | f25eb730eec44c7efd98d524883fc4258e9a1c0df5f9e4cf5c9da6ec0ba59076 |
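If you download either file manually, you can compare its SHA256 digest with the values listed above. The following is a minimal sketch using the standard library; the digests and file names are copied from this page, while the helper names `sha256_of_file` and `matches_published_hash` are hypothetical.

```python
import hashlib
from pathlib import Path

# SHA256 digests as listed in the file details above.
EXPECTED_SHA256 = {
    "aiden_ai-0.1.1.tar.gz":
        "897a42970f117b18f69c5fd943fe5008cce6bb3bb2187a4823651cbffbc96e93",
    "aiden_ai-0.1.1-py3-none-any.whl":
        "2152c1a94292db99000df65dcb0cdf9ec5a8b0d1c5c5f774344f273135054aa9",
}


def sha256_of_file(path, chunk_size=65536):
    """Hash a file in chunks so large downloads need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path):
    """Return True only if the file name is known and its digest matches."""
    expected = EXPECTED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of_file(path) == expected
```

For example, `matches_published_hash("aiden_ai-0.1.1.tar.gz")` should return `True` for an unmodified download of the source distribution.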