Skip to main content

AI-powered target variable creation and synthesis agent

Project description

Target Synthesis Agent

An intelligent AI-powered agent for generating and synthesizing target variables for machine learning tasks. This tool analyzes data characteristics and business context to create optimal target variables for various ML applications.

🚀 Features

  • AI-Powered Analysis: Leverages advanced LLM models to analyze data and business context
  • Multiple Data Sources: Works with both SQL databases and pandas DataFrames
  • Customizable Workflows: Supports various ML approaches and synthesis strategies
  • Comprehensive Testing: Includes a complete test suite for reliability
  • Extensible Architecture: Easy to extend with custom components and integrations

📦 Installation

Prerequisites

  • Python 3.10+
  • Git
  • uv package manager (recommended)

Setup

  1. Clone the repository

    git clone https://github.com/stepfnAI/target_synthesis_agent.git
    cd target_synthesis_agent/
    git checkout review
    
  2. Set up the virtual environment and install dependencies

    uv venv --python=3.10 venv
    source venv/bin/activate
    uv pip install -e ".[dev]"
    
  3. Clone and install the blueprint dependency

    cd ..
    git clone https://github.com/stepfnAI/sfn_blueprint.git
    cd sfn_blueprint
    git switch dev
    uv pip install -e .
    cd ../target_synthesis_agent
    
  4. Set up environment variables

    # Optional: Configure LLM provider (default: openai)
    export LLM_PROVIDER="your_llm_provider"
    
    # Optional: Configure LLM model (default: gpt-4)
    export LLM_MODEL="your_llm_model"
    
    # Required: Your LLM API key
    export LLM_API_KEY="your_llm_api_key"
    

🛠️ Usage

Basic SQL Usage

python examples/sql_basic_usage.py

🧪 Testing

Run the complete test suite:

pytest tests/ -s

Or run individual test files:

pytest tests/conftest.py -s
pytest tests/test_agent.py -s
pytest tests/test_utils.py -s

🏗️ Architecture

The Target Synthesis Agent is built with a modular architecture:

  • Core Components:

    • agent.py: Main SQL-based implementation
    • models.py: Data models and schemas
    • utils.py: Utility functions and helpers
    • constants.py: Configuration and prompts
  • Dependencies:

    • sfn-blueprint: Core framework and utilities
    • pandas: Data manipulation
    • sqlalchemy: Database interactions
    • scikit-learn: ML utilities

🤝 Contributing

We welcome contributions! Please follow these steps:

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📧 Contact

For questions or support, please contact support@stepfunction.ai

🙏 Acknowledgments

  • Built with ❤️ by StepFunction AI
  • Uses sfn-blueprint for core functionality
  • Inspired by modern MLOps best practices

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

target_synthesis_agent-1.0.9.tar.gz (28.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

target_synthesis_agent-1.0.9-py3-none-any.whl (25.1 kB view details)

Uploaded Python 3

File details

Details for the file target_synthesis_agent-1.0.9.tar.gz.

File metadata

File hashes

Hashes for target_synthesis_agent-1.0.9.tar.gz
Algorithm Hash digest
SHA256 314d987e8f3baf66b2f036bf62c45c576d5ebad2ece68eadf4447d7bb5505c45
MD5 0d6f1be4328480057da317405d30d4be
BLAKE2b-256 4ce533cb9ab85f0fd9da6c4d98eb2429091b394e6bacebf49c61a75c34f20b83

See more details on using hashes here.

File details

Details for the file target_synthesis_agent-1.0.9-py3-none-any.whl.

File metadata

File hashes

Hashes for target_synthesis_agent-1.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 f076eb71ea5416278df29b6a22f2fc78d756a3d03916416c30306f8495be5c81
MD5 6705edd1bb7c0b2d0b24ebd16004d6ba
BLAKE2b-256 f46d4237f53dd68844e8c37de1e06d3cf8e7d4f2b5dab175520303adec708b88

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page