A tool for agentic recursive model improvement

These details have not been verified by PyPI

Project links

Project description

Introducing the Co-DataScientist

Version License

Beat the competition.

Why is everyone talking about the Tropiflo?

Idea Explosion — Launches a swarm of models, feature recipes & hyper-parameters you never knew existed.
Full-Map Exploration — Charts the entire optimization galaxy so you can stop guessing and start winning.
Hands-Free Mode — Hit run and the search party works through the night.
KPI Fanatic — Every evolutionary step is focused on improving your target metric.
Data Stays Home — Your training and testing data never leaves your server; everything runs locally.

Quickstart — 5 Minutes

Prerequisites: Docker must be installed (Get Docker)

1. Install Tropiflo

pip install tropiflo

2. Set Your API Token (Required!)

tropiflo set-token --token YOUR_TOKEN  # Get token from your account

⚠️ You MUST set your token before running tropiflo! Get your token from your account dashboard.

3. Create Your Project Files

Your Python code (xor.py):

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
import numpy as np

# XOR toy-set
X = np.array([[0,0],[0,1],[1,0],[1,1]])
y = np.array([0,1,1,0])

# CO_DATASCIENTIST_BLOCK_START

pipe = Pipeline([
    ("scale", StandardScaler()),
    ("clf", LogisticRegression(random_state=0))
])

pipe.fit(X, y)
acc = accuracy_score(y, pipe.predict(X))

# CO_DATASCIENTIST_BLOCK_END


print(f"KPI: {acc:.4f}")  # Tag your metric!

# comments
# This is the classic XOR problem — it's not linearly separable!
# A linear model like LogisticRegression can't solve it perfectly,
# because no straight line can separate the classes in 2D.
# This makes it a great test for feature engineering or non-linear models.

Dockerfile:

FROM python:3.10-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
CMD ["python", "xor.py"]

requirements.txt:

scikit-learn
numpy

config.yaml:

mode: "local"       # Required: "local", "gcloud", "aws", or "databricks"
parallel: 3         # Optional: evolve N versions simultaneously

4. Run & Watch It Evolve

tropiflo run --config config.yaml

Tropiflo will:

✅ Build a Docker container with your code
✅ Run your baseline (should print "KPI: 0.5000")
✅ Evolve your code over multiple generations
✅ Save best results to co_datascientist_checkpoints/

Best results saved to co_datascientist_checkpoints/best_X/

Pro tips:

Add data_volume: "/path/to/data" in config.yaml to mount data directories
Use --use-cached-qa to skip questions on subsequent runs
Cloud configs available in /docs/ for GCloud, AWS, Databricks

How It Works

Mark code blocks: Use # CO_DATASCIENTIST_BLOCK_START and # CO_DATASCIENTIST_BLOCK_END
Print your KPI: Add print(f"KPI: {score}") to output your metric (can be accuracy, F1, loss, anything!)
Run: Tropiflo evolves your code and saves best results
Deploy: Use tropiflo deploy co_datascientist_checkpoints/best_X/ to create a production-ready project

📁 Data Access Patterns

Option 1: Hardcoded Paths (Simple)

For quick local runs, hardcode data paths directly in your script:

data = np.loadtxt("/full/path/to/my_data.csv")

Option 2: Volume Mounting (Recommended for Docker)

For Docker-based execution, use environment variables with volume mounting:

config.yaml:

mode: "local"
data_volume: "/path/to/your/data"  # Host directory containing data

Your code:

import os
INPUT_URI = os.environ.get("INPUT_URI", "/default/path")
data = pd.read_csv(os.path.join(INPUT_URI, "train.csv"))

When you run with --config config.yaml, Co-DataScientist automatically:

Mounts your data directory into the Docker container
Sets the INPUT_URI environment variable
Your code accesses data without hardcoding paths

Note: Avoid input() or interactive prompts - Co-DataScientist needs to run your code automatically.

🧬 Blocks to evolve

As you will see in the XOR example, Co-DataScientist uses # CO_DATASCIENTIST_BLOCK_START and # CO_DATASCIENTIST_BLOCK_END tags to identify the parts of the system you want it to improve. Make sure to tag parts of your system you care about improving! It will help to Co-DataScientist stay focused on its job.

📂 Project Structure

Co-DataScientist supports both single-file scripts and multi-file projects:

Single File: tropiflo run python my_script.py
Multi-File: Auto-detects run.sh, main.py, or run.py in your project root
Custom Entry Point: Just wrap your command: tropiflo run bash custom_script.sh

The system automatically detects which files contain CO_DATASCIENTIST_BLOCK markers and evolves them intelligently.

Add Domain-Specific Notes for Best Results

After your code, add comments with any extra context, known issues, or ideas you have about your problem. This helps Co-DataScientist understand your goals and constraints! The Co-Datascientist UNDERSTANDS your problem. It's not just doing a blind search!

🎯 How Multi-File Evolution Works

When you run Co-DataScientist on a multi-file project:

Scanning: It scans all .py files in your project for CO_DATASCIENTIST_BLOCK markers
Selection: Each generation, it randomly picks ONE file to evolve
Evolution: The AI generates hypotheses and modifies the selected block
Stitching: Modified code is integrated back into your full project
Testing: Your entire project runs with the new code using your run.sh or custom command
Checkpointing: Best results are saved as complete directories with all files

This means:

✅ You can have complex multi-file ML pipelines
✅ Each file evolves independently but is tested as a complete system
✅ Your project structure is preserved
✅ Dependencies between files are maintained

Example Evolution Flow:

Generation 1: Evolve model.py → Test full project → KPI: 30.0
Generation 2: Evolve data_loader.py → Test full project → KPI: 45.0
Generation 3: Evolve main.py → Test full project → KPI: 60.0

Other helpful stuff

Skip Q&A on Repeat Runs

For faster iterations, use cached answers from your previous run:

tropiflo run --use-cached-qa python xor.py

This skips the interactive questions and uses your previous answers, jumping straight to the optimization process.

Deploy Checkpoints to Production

The deploy command makes it easy to take your best checkpoint and create a production-ready project:

# Basic usage - auto-detects original project location
tropiflo deploy co_datascientist_checkpoints/best_6_explore/

# Specify original project path manually
tropiflo deploy best_6_explore/ --original-path /path/to/my_project

# Use custom output directory name
tropiflo deploy best_6_explore/ --output-dir my_optimized_v2

What it does:

Copies your entire original project (including data, configs, assets)
Integrates the evolved code from the checkpoint
Excludes Co-DataScientist artifacts (checkpoints, cache, etc.)
Creates a deployment_info.json with checkpoint metadata

The result is a complete, standalone project ready to deploy to production!

📝 Before vs After

Before _{KPI ≈ 0.50}	After _{KPI 1.00}
from sklearn.ensemble import RandomForestClassifier from sklearn.preprocessing import StandardScaler from sklearn.pipeline import Pipeline from sklearn.metrics import accuracy_score import numpy as np # XOR data X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]]) y = np.array([0, 1, 1, 0]) pipeline = Pipeline([ ('scaler', StandardScaler()), ('clf', RandomForestClassifier(n_estimators=10, random_state=0)) ]) pipeline.fit(X, y) preds = pipeline.predict(X) accuracy = accuracy_score(y, preds) print(f'Accuracy: {accuracy:.2f}') print(f'KPI: {accuracy:.4f}')	import numpy as np from sklearn.base import TransformerMixin, BaseEstimator from sklearn.ensemble import RandomForestClassifier from sklearn.preprocessing import StandardScaler from sklearn.pipeline import Pipeline from sklearn.metrics import accuracy_score from tqdm import tqdm class ChebyshevPolyExpansion(BaseEstimator, TransformerMixin): def __init__(self, degree=3): self.degree = degree def fit(self, X, y=None): return self def transform(self, X): X = np.asarray(X) X_scaled = 2 * X - 1 n_samples, n_features = X_scaled.shape features = [] for f in tqdm(range(n_features), desc='Chebyshev features'): x = X_scaled[:, f] T = np.empty((self.degree + 1, n_samples)) T[0] = 1 if self.degree >= 1: T[1] = x for d in range(2, self.degree + 1): T[d] = 2 * x * T[d - 1] - T[d - 2] features.append(T.T) return np.hstack(features) X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]]) y = np.array([0, 1, 1, 0]) pipeline = Pipeline([ ('cheb', ChebyshevPolyExpansion(degree=3)), ('scaler', StandardScaler()), ('clf', RandomForestClassifier(n_estimators=10, random_state=0)) ]) pipeline.fit(X, y) preds = pipeline.predict(X) accuracy = accuracy_score(y, preds) print(f'Accuracy: {accuracy:.2f}') print(f'KPI: {accuracy:.4f}')

Before
_{KPI ≈ 0.50}

After
_{KPI 1.00}

from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import Pipeline
from sklearn.metrics import accuracy_score
import numpy as np

# XOR data
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 1, 1, 0])

pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('clf', RandomForestClassifier(n_estimators=10, random_state=0))
])

pipeline.fit(X, y)
preds = pipeline.predict(X)
accuracy = accuracy_score(y, preds)
print(f'Accuracy: {accuracy:.2f}')
print(f'KPI: {accuracy:.4f}')

import numpy as np
from sklearn.base import TransformerMixin, BaseEstimator
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import Pipeline
from sklearn.metrics import accuracy_score
from tqdm import tqdm

class ChebyshevPolyExpansion(BaseEstimator, TransformerMixin):
    def __init__(self, degree=3):
        self.degree = degree
    def fit(self, X, y=None):
        return self
    def transform(self, X):
        X = np.asarray(X)
        X_scaled = 2 * X - 1
        n_samples, n_features = X_scaled.shape
        features = []
        for f in tqdm(range(n_features), desc='Chebyshev features'):
            x = X_scaled[:, f]
            T = np.empty((self.degree + 1, n_samples))
            T[0] = 1
            if self.degree >= 1:
                T[1] = x
            for d in range(2, self.degree + 1):
                T[d] = 2 * x * T[d - 1] - T[d - 2]
            features.append(T.T)
        return np.hstack(features)

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 1, 1, 0])

pipeline = Pipeline([
    ('cheb', ChebyshevPolyExpansion(degree=3)),
    ('scaler', StandardScaler()),
    ('clf', RandomForestClassifier(n_estimators=10, random_state=0))
])

pipeline.fit(X, y)
preds = pipeline.predict(X)
accuracy = accuracy_score(y, preds)
print(f'Accuracy: {accuracy:.2f}')
print(f'KPI: {accuracy:.4f}')

We now support Databricks

Databricks setup

Download the databricks CLI package

curl -fsSL https://raw.githubusercontent.com/databricks/setup-cli/main/install.sh | sudo sh

Get a databricks token : and test the CLI works Get your Databricks token here
Prepare a config file with all of your compute/environmental requirements in databricks_config.yaml example below

# Enable Databricks integration
databricks: true

# Databricks configuration for XOR demo
databricks:
  cli: "databricks"  # databricks CLI command (optional, defaults to "databricks")
  volume_uri: "dbfs:/Volumes/workspace/default/volume"  # DBFS volume URI for file uploads
  code_path: "dbfs:/Volumes/workspace/default/volume/xor.py"  # Specific code path (optional, overrides volume_uri + temp filename)
  timeout: "30m"  # Job timeout duration
  
  job:
    name: "run-<script-stem>-<timestamp>"  # Job name template (supports <script-stem> and <timestamp>)
    tasks:
      - task_key: "t"
        spark_python_task:
          python_file: "<remote_path>"  # Will be automatically replaced with actual remote path
        environment_key: "default"
    environments:
      - environment_key: "default"
        spec:
          client: "1"
          dependencies:
            - "scikit-learn>=1.0.0"
            - "numpy>=1.20.0"

Then run the co-datascientist with:

tropiflo run --config databricks_config.yaml

Now your new optimized model checkpoints will save in : dbfs:/Volumes/workspace/default/volume/co-datascientist-checkpoints

🐳 Local Docker Execution with Volume Mounting

Run your code in Docker containers locally with automatic data volume mounting. Perfect for reproducible environments and large datasets.

Setup

Create a config file (e.g., config.yaml):

mode: "local"
data_volume: "/absolute/path/to/your/data"  # Host directory with your data files
parallel: 1  # Number of parallel executions

Update your code to use environment variables:

import os
import pandas as pd

# Co-DataScientist automatically sets INPUT_URI to /data in the container
INPUT_URI = os.environ.get("INPUT_URI")
df = pd.read_csv(os.path.join(INPUT_URI, "train.csv"))

Add a Dockerfile to your project:

FROM python:3.10-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
CMD ["python", "your_script.py"]

Run Co-DataScientist:

tropiflo run --working-directory . --config config.yaml

What Happens

Co-DataScientist will:

Build a Docker image from your project
Mount your data_volume directory to /data inside the container
Set the INPUT_URI=/data environment variable automatically
Execute your code in the container with access to your data
Extract KPIs and manage the evolution process

Benefits

Reproducible: Same environment every time
Isolated: Dependencies don't conflict with your system
Scalable: Easy to move to cloud later with minimal changes
Clean: No need to copy large datasets into Docker images

📖 See the complete demo: /demos/docker_demo/

☁️ Google Cloud Run Jobs Integration

Execute and optimize your Python code at scale using Google Cloud Run Jobs.

Setup

Prerequisites:
- Google Cloud project with Cloud Run enabled
- Authenticated gcloud CLI: gcloud auth login
- A Cloud Run Job template (e.g., test-job-clean)
Create a config file (e.g., gcloud_config.yaml):

mode: "gcloud"
project_id: "my-gcp-project"
region: "us-central1"
repo: "co-datascientist-repo"
job_name: "co-datascientist-job"
cleanup_job: false
cleanup_remote_image: true

# Optional: Mount GCS bucket for data access
data_volume: "gs://my-data-bucket"  # or just "my-data-bucket"

For GCS volume mounting:

Specify your GCS bucket in data_volume (with or without gs:// prefix)
Your bucket will be mounted to /data inside the Cloud Run container
INPUT_URI=/data environment variable is automatically set
Your code accesses data the same way as local Docker execution

Your code:

import os
INPUT_URI = os.environ.get("INPUT_URI")
df = pd.read_csv(os.path.join(INPUT_URI, "data.csv"))

Run Co-DataScientist:

tropiflo run --config gcloud_config.yaml

Your code will be executed in Google Cloud Run Jobs, with results and KPIs retrieved automatically. Perfect for scaling compute-intensive optimizations!

Important Notes for GCS Volume Mounting:

Requires Cloud Run 2nd generation execution environment
Your Cloud Run service account needs storage.objectViewer permission on the bucket
GCS buckets are mounted read-only via Cloud Storage FUSE
Files appear as regular files in the container at /data

📖 See the complete demo: /demos/gcloud/

☁️ AWS ECS Fargate Integration

Execute and optimize your Python code at scale using AWS ECS Fargate.

Setup

Prerequisites:
- AWS account with ECS Fargate enabled
- Authenticated AWS CLI: aws configure
- An ECS cluster and task definition configured for your needs
Create a config file (e.g., aws_config.yaml):

aws:
  enabled: true
  script_path: "/path/to/your/script.py"
  cluster: "my-cluster"
  task_definition: "my-job-taskdef"
  launch_type: "FARGATE"
  region: "us-east-1"
  network_configuration:
    subnets: ["subnet-abc123", "subnet-def456"]
    security_groups: ["sg-123456"]
    assign_public_ip: "ENABLED"
  timeout: 1800  # seconds

Run Co-DataScientist:

tropiflo run --config aws_config.yaml

Your code will be executed in AWS ECS Fargate containers, with results and KPIs retrieved automatically. Perfect for serverless compute scaling!

Analysis and Visualization Tools

Co-DataScientist includes built-in visualization tools to help you analyze your optimization results and compare different versions of your code.

Plot KPI Progression

Visualize how your KPI improves over iterations from checkpoint JSON files:

# Basic usage - plot KPI progression from checkpoints directory
tropiflo plot-kpi --checkpoints-dir /path/to/co_datascientist_checkpoints

# Advanced usage with custom options
tropiflo plot-kpi \
  --checkpoints-dir /path/to/checkpoints \
  --max-iteration 350 \
  --title "AUC Training Progress" \
  --kpi-label "AUC" \
  --output my_kpi_plot.png

Options:

--checkpoints-dir, -c: Directory containing checkpoint JSON files (required)
--max-iteration, -m: Maximum iteration to include in plot
--title, -t: Custom title for the plot
--output, -o: Output file path (auto-generated if not specified)
--kpi-label, -k: Label for the KPI metric (default: "RMSE")

Generate PDF Code Diffs

Create beautiful PDF reports comparing two versions of your Python code:

# Basic usage - compare two Python files
tropiflo diff-pdf baseline.py improved.py

# Advanced usage with custom options
tropiflo diff-pdf \
  baseline.py \
  optimized.py \
  --output "optimization_report.pdf" \
  --title "XOR Problem Optimization Results"

Options:

file1: Path to the baseline/original file (required)
file2: Path to the modified/new file (required)
--output, -o: Output PDF file path (auto-generated if not specified)
--title, -t: Custom title for the diff report

Example workflow:

# 1. Run optimization
tropiflo run --parallel 3 python xor.py

# 2. Plot the KPI progression
tropiflo plot-kpi --checkpoints-dir co_datascientist_checkpoints --title "XOR Optimization"

# 3. Compare best result with baseline
tropiflo diff-pdf xor.py co_datascientist_checkpoints/best_iteration_50.py --title "XOR Improvements"

These tools help you understand your optimization journey and create professional reports showing the improvements Co-DataScientist achieved.

Need help

We’d love to chat: oz.kilim@tropiflo.io

All set? Run your pipelines and track the results.

⚠️ Disclaimer: Co-DataScientist executes your scripts on your own machine. Make sure you trust the code you feed it!

Made by the Tropiflo team

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.0.9

May 7, 2026

2.0.8

Apr 7, 2026

2.0.7

Apr 3, 2026

2.0.6

Apr 3, 2026

2.0.5

Apr 3, 2026

2.0.4

Apr 2, 2026

2.0.3

Mar 26, 2026

2.0.2

Mar 26, 2026

2.0.1

Mar 26, 2026

2.0.0

Mar 26, 2026

1.1.9

Mar 25, 2026

1.1.8

Mar 25, 2026

1.1.7

Mar 25, 2026

1.1.6

Mar 24, 2026

1.1.5

Mar 24, 2026

1.1.4

Mar 2, 2026

1.1.3

Feb 11, 2026

1.1.2

Feb 11, 2026

1.1.1

Feb 5, 2026

1.1.0

Feb 5, 2026

1.0.9

Feb 5, 2026

1.0.8

Jan 22, 2026

1.0.7

Jan 22, 2026

1.0.6

Jan 21, 2026

1.0.5

Jan 20, 2026

1.0.4

Jan 8, 2026

1.0.3

Dec 10, 2025

1.0.2

Dec 8, 2025

1.0.1

Dec 8, 2025

1.0.0

Dec 8, 2025

0.0.9

Dec 8, 2025

0.0.8

Dec 8, 2025

0.0.7

Nov 18, 2025

0.0.6

Oct 28, 2025

0.0.5

Oct 28, 2025

This version

0.0.4

Oct 28, 2025

0.0.3

Oct 28, 2025

0.0.2

Oct 28, 2025

0.0.1

Oct 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tropiflo-0.0.4.tar.gz (396.8 kB view details)

Uploaded Oct 28, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tropiflo-0.0.4-py3-none-any.whl (76.9 kB view details)

Uploaded Oct 28, 2025 Python 3

File details

Details for the file tropiflo-0.0.4.tar.gz.

File metadata

Download URL: tropiflo-0.0.4.tar.gz
Upload date: Oct 28, 2025
Size: 396.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for tropiflo-0.0.4.tar.gz
Algorithm	Hash digest
SHA256	`7aa9467de554ae3cca41f254fd2e979a2368884f1880ec52d1049c9f707083d2`
MD5	`f703213d0b0e21ddae3a2d824bf1f840`
BLAKE2b-256	`41d1d51cad8c1d8469eef4165cc47c2c6e8607801398b9f9bf7fed40dc6bf13e`

See more details on using hashes here.

File details

Details for the file tropiflo-0.0.4-py3-none-any.whl.

File metadata

Download URL: tropiflo-0.0.4-py3-none-any.whl
Upload date: Oct 28, 2025
Size: 76.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for tropiflo-0.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b746740a5c32e2b203f3d4ba9feb8b7d16749eb4a5905b6d39e1618341006b1a`
MD5	`da186191c55b4760d71f367c9fb6362c`
BLAKE2b-256	`f3badfd1b964b314ba74662db54838d4c9def77451360d8c9cd967e736e31309`

See more details on using hashes here.

tropiflo 0.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Introducing the Co-DataScientist

Why is everyone talking about the Tropiflo?

Quickstart — 5 Minutes

1. Install Tropiflo

2. Set Your API Token (Required!)

3. Create Your Project Files

4. Run & Watch It Evolve

How It Works

📁 Data Access Patterns

Option 1: Hardcoded Paths (Simple)

Option 2: Volume Mounting (Recommended for Docker)

🧬 Blocks to evolve

📂 Project Structure

Add Domain-Specific Notes for Best Results

🎯 How Multi-File Evolution Works

Skip Q&A on Repeat Runs

Deploy Checkpoints to Production

📝 Before vs After

We now support Databricks

🐳 Local Docker Execution with Volume Mounting

Setup

What Happens

Benefits

☁️ Google Cloud Run Jobs Integration

Setup

☁️ AWS ECS Fargate Integration

Setup

Analysis and Visualization Tools

Plot KPI Progression

Generate PDF Code Diffs

Need help

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes