AI Agent with dynamic planning and persistent Jupyter kernel execution for data analysis

DSAgent

An AI-powered autonomous agent for data science with persistent Jupyter kernel execution, session management, and a conversational interface.

    ____  _____  ___                    __
   / __ \/ ___/ /   | ____ ____  ____  / /_
  / / / /\__ \ / /| |/ __ `/ _ \/ __ \/ __/
 / /_/ /___/ // ___ / /_/ /  __/ / / / /_
/_____//____//_/  |_\__, /\___/_/ /_/\__/
                   /____/

Features

Conversational Interface: Interactive chat with persistent context and sessions
Dynamic Planning: Agent creates and follows plans with step tracking
Persistent Execution: Code runs in a Jupyter kernel with variable persistence across messages
Session Management: Save and resume conversations with full kernel state
Multi-Provider LLM: Supports OpenAI, Anthropic, Google, and Ollama via LiteLLM
MCP Tools: Connect to external tools (web search, databases, etc.) via Model Context Protocol
Human-in-the-Loop: Configurable checkpoints for plan and code approval
Notebook Generation: Automatically generates clean, runnable Jupyter notebooks

Installation

pip install datascience-agent

With optional features:

pip install "datascience-agent[api]"   # FastAPI server support
pip install "datascience-agent[mcp]"   # MCP tools support

For development:

git clone https://github.com/nmlemus/dsagent
cd dsagent
uv sync --all-extras

Docker

# Run API server
docker run -d -p 8000:8000 \
  -e OPENAI_API_KEY=sk-your-key \
  nmlemus/dsagent:latest

# Run interactive CLI
docker run -it \
  -e OPENAI_API_KEY=sk-your-key \
  nmlemus/dsagent:latest \
  dsagent chat

For Docker deployment details, see docs/DOCKER.md.

Quick Start

1. Setup (First Time)

Run the setup wizard to configure your LLM provider:

dsagent init

This will:

Ask for your LLM provider (OpenAI, Anthropic, Google, local, etc.)
Store your API key securely in ~/.dsagent/.env
Automatically select a default model based on provider:
- OpenAI → gpt-4o
- Anthropic → claude-sonnet-4-5
- Google → gemini/gemini-2.5-flash
- Local → ollama/llama3
Optionally configure MCP tools (web search, etc.)
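
The provider-to-model defaults listed above can be mirrored in a small lookup table, for example when a script needs to pick a model without running the wizard. This is a minimal sketch: the `DEFAULT_MODELS` dict and the `default_model` function are hypothetical names for illustration and are not part of DSAgent; only the provider and model strings come from the list above.

```python
# Defaults copied from the setup wizard list above.
# DEFAULT_MODELS and default_model are illustrative names, not DSAgent APIs.
DEFAULT_MODELS = {
    "openai": "gpt-4o",
    "anthropic": "claude-sonnet-4-5",
    "google": "gemini/gemini-2.5-flash",
    "local": "ollama/llama3",
}


def default_model(provider: str) -> str:
    """Return the listed default model for a provider (case-insensitive)."""
    key = provider.strip().lower()
    if key not in DEFAULT_MODELS:
        raise ValueError(f"no default model listed for provider {provider!r}")
    return DEFAULT_MODELS[key]
```

The returned string could then be passed to the `--model` flag described below.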

To use a different model, edit ~/.dsagent/.env or use the --model flag:

dsagent --model gpt-4o-mini

2. Start Chatting

dsagent

This starts an interactive session where you can:

Chat naturally with the agent
Execute Python code with persistent variables
Analyze data files
Generate visualizations
Resume previous sessions

3. One-Shot Tasks

For batch processing or scripts:

dsagent run "Analyze sales trends" --data ./sales.csv
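
When one-shot tasks are launched from another Python program, it can help to build the argument list explicitly instead of concatenating a shell string. The sketch below assumes only the `dsagent run "task" --data path` form shown above; `build_run_command` and `run_task` are hypothetical helper names, and `dsagent` must be installed and on `PATH` for `run_task` to work.

```python
import subprocess
from typing import List, Optional


def build_run_command(task: str, data: Optional[str] = None) -> List[str]:
    """Build the argv list for `dsagent run`, using only the flags shown above."""
    cmd = ["dsagent", "run", task]
    if data is not None:
        cmd += ["--data", data]
    return cmd


def run_task(task: str, data: Optional[str] = None) -> int:
    """Run the task as a subprocess and return its exit code."""
    # Passing a list (not a shell string) avoids quoting problems in the task text.
    return subprocess.run(build_run_command(task, data), check=False).returncode
```

For example, `run_task("Analyze sales trends", "./sales.csv")` would invoke the same command as the line above.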

CLI Commands

Command	Description
`dsagent`	Start interactive chat (default)
`dsagent chat`	Same as above, with explicit options
`dsagent run "task"`	Execute a one-shot task
`dsagent init`	Setup wizard for configuration
`dsagent mcp list`	List configured MCP servers
`dsagent mcp add <template>`	Add an MCP server

Examples

# Interactive chat with specific model
dsagent --model claude-sonnet-4-5

# One-shot analysis
dsagent run "Find patterns in this data" --data ./dataset.csv

# Resume a previous session
dsagent --session abc123

# With MCP tools (web search)
dsagent --mcp-config ~/.dsagent/mcp.yaml

# Human-in-the-loop mode
dsagent --hitl plan

For complete CLI documentation, see docs/CLI.md.

Python API

DSAgent provides two agents for different use cases:

ConversationalAgent (Interactive)

For building chat interfaces and interactive applications:

from dsagent import ConversationalAgent, ConversationalAgentConfig

config = ConversationalAgentConfig(model="gpt-4o")
agent = ConversationalAgent(config)
agent.start()

# Chat with persistent context
response = agent.chat("Load the iris dataset")
print(response.content)

response = agent.chat("Train a classifier on it")
print(response.content)  # Has access to previous variables

agent.shutdown()
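
If a `chat()` call raises, the example above would skip `agent.shutdown()`. One way to guard against that is a small context manager. This is a sketch that assumes only the `start()` and `shutdown()` methods used in the example; the `running` helper is a hypothetical name, not something DSAgent provides.

```python
from contextlib import contextmanager


@contextmanager
def running(agent):
    """Start an agent and always call shutdown(), even if the body raises.

    Works with any object exposing start() and shutdown(); `running` is an
    illustrative helper, not a DSAgent API.
    """
    agent.start()
    try:
        yield agent
    finally:
        agent.shutdown()
```

Usage would look like `with running(ConversationalAgent(config)) as agent: agent.chat("Load the iris dataset")`.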

PlannerAgent (Batch)

For one-shot tasks and automated pipelines:

from dsagent import PlannerAgent

with PlannerAgent(model="gpt-4o", data="./data.csv") as agent:
    result = agent.run("Analyze this dataset and create visualizations")
    print(result.answer)
    print(f"Notebook: {result.notebook_path}")

For complete API documentation, see docs/PYTHON_API.md.

Supported Models

DSAgent uses LiteLLM to support 100+ LLM providers:

Provider	Models	API Key
OpenAI	`gpt-4o`, `o1`, `o3-mini`	`OPENAI_API_KEY`
Anthropic	`claude-sonnet-4-5`, `claude-opus-4`	`ANTHROPIC_API_KEY`
Google	`gemini-2.5-pro`, `gemini-2.5-flash`	`GOOGLE_API_KEY`
DeepSeek	`deepseek/deepseek-r1`	`DEEPSEEK_API_KEY`
Ollama	`ollama/llama3.2`	None (local)
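
Before starting a session, a script can check that the API key variable for the chosen model is set. The sketch below derives the variable from the model-name prefixes in the table above; the prefix matching is an assumption made for illustration (DSAgent and LiteLLM may resolve providers differently), and `required_key` and `missing_key` are hypothetical names.

```python
import os
from typing import Mapping, Optional

# (model-name prefix, environment variable) pairs based on the table above.
# Prefix matching is an illustrative assumption, not DSAgent's own logic.
_KEY_BY_PREFIX = [
    ("gpt-", "OPENAI_API_KEY"),
    ("o1", "OPENAI_API_KEY"),
    ("o3", "OPENAI_API_KEY"),
    ("claude-", "ANTHROPIC_API_KEY"),
    ("gemini", "GOOGLE_API_KEY"),
    ("deepseek/", "DEEPSEEK_API_KEY"),
    ("ollama/", None),  # local models need no key
]


def required_key(model: str) -> Optional[str]:
    """Return the env var name a model needs, or None for local models."""
    for prefix, var in _KEY_BY_PREFIX:
        if model.startswith(prefix):
            return var
    raise ValueError(f"model {model!r} is not in the table")


def missing_key(model: str, env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return the name of the required variable if it is unset, else None."""
    env = os.environ if env is None else env
    var = required_key(model)
    return var if var and not env.get(var) else None
```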

For detailed model setup, see docs/MODELS.md.

MCP Tools

Connect to external tools via the Model Context Protocol:

# Add web search capability
dsagent mcp add brave-search

# Use it in chat
dsagent --mcp-config ~/.dsagent/mcp.yaml

Available templates: brave-search, filesystem, github, memory, fetch, bigquery

For MCP configuration details, see docs/MCP.md.

Session Management

Sessions persist your conversation history and kernel state:

# List sessions
dsagent chat
> /sessions

# Resume a session
dsagent --session <session-id>

# Export session to notebook
> /export myanalysis.ipynb

Output Structure

Each run writes its output to its own directory under the workspace:

workspace/
└── runs/{run_id}/
    ├── data/           # Input data (copied)
    ├── notebooks/      # Generated Jupyter notebooks
    ├── artifacts/      # Charts, models, exports
    └── logs/           # Execution logs
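
Given the layout above, the notebooks produced by a run can be collected with `pathlib`. This is a sketch under two assumptions: the workspace path is whatever directory the run was started with, and generated notebooks use the standard `.ipynb` extension. `run_notebooks` is a hypothetical helper name.

```python
from pathlib import Path
from typing import List


def run_notebooks(workspace: str, run_id: str) -> List[Path]:
    """List notebooks under workspace/runs/{run_id}/notebooks, sorted by name."""
    nb_dir = Path(workspace) / "runs" / run_id / "notebooks"
    if not nb_dir.is_dir():
        # Unknown run id or a run that produced no notebooks directory.
        return []
    return sorted(nb_dir.glob("*.ipynb"))
```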

Included Libraries

DSAgent comes with essential data science libraries pre-installed:

Category	Libraries
Core	numpy, pandas, scipy
DataFrames	polars, pyarrow
Visualization	matplotlib, seaborn, plotly
Machine Learning	scikit-learn, xgboost, lightgbm, pycaret
Feature Selection	boruta
Statistics	statsmodels

Documentation

CLI Reference (docs/CLI.md) - Complete command-line options
Python API (docs/PYTHON_API.md) - Detailed API documentation
Model Configuration (docs/MODELS.md) - LLM provider setup
MCP Tools (docs/MCP.md) - External tools integration
Docker Guide (docs/DOCKER.md) - Container deployment

License

MIT

Project details

Maintainers

nmlemus

Release history

Version	Release date
0.9.1	Feb 19, 2026
0.9.0	Feb 19, 2026
0.8.4	Feb 5, 2026
0.8.3	Jan 29, 2026
0.8.2	Jan 27, 2026
0.8.1	Jan 21, 2026
0.8.0 (this version)	Jan 20, 2026
0.7.0	Jan 11, 2026
0.6.2	Jan 11, 2026
0.6.1	Jan 9, 2026
0.5.1	Jan 2, 2026
0.5.0	Dec 31, 2025
0.4.0	Dec 31, 2025
0.3.0	Dec 31, 2025

Download files

Source Distribution

datascience_agent-0.8.0.tar.gz (7.0 MB)

Uploaded Jan 20, 2026 Source

Built Distribution

datascience_agent-0.8.0-py3-none-any.whl (155.1 kB)

Uploaded Jan 20, 2026 Python 3

File details

Details for the file datascience_agent-0.8.0.tar.gz.

File metadata

Download URL: datascience_agent-0.8.0.tar.gz
Upload date: Jan 20, 2026
Size: 7.0 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datascience_agent-0.8.0.tar.gz
Algorithm	Hash digest
SHA256	`3c3b36292ac5d1bdaa65e299114fc04ce3a12333107aa46d27a770aac47b045f`
MD5	`8f7129dafbfc2375add3eaca28266d98`
BLAKE2b-256	`ef3167dbf3215d4dfca36a2fe69bd3f49ede84a670eab879558df09325826409`
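
A downloaded file can be checked against the SHA256 digest in the table above using only the standard library. This is a minimal sketch; `sha256_of` and `matches` are hypothetical helper names, and the file path is whatever location the archive was saved to.

```python
import hashlib
import hmac


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex string."""
    # Normalise case and whitespace so a pasted digest still compares cleanly.
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

For the source distribution, `expected_hex` would be the SHA256 value listed in the table above.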

Provenance

The following attestation bundles were made for datascience_agent-0.8.0.tar.gz:

Publisher: python-publish.yml on nmlemus/dsagent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: datascience_agent-0.8.0.tar.gz
- Subject digest: 3c3b36292ac5d1bdaa65e299114fc04ce3a12333107aa46d27a770aac47b045f
- Sigstore transparency entry: 836085947
- Sigstore integration time: Jan 20, 2026
Source repository:
- Permalink: nmlemus/dsagent@c0e0f39c9531701c3586d69544b280dad595881b
- Branch / Tag: refs/tags/v0.8.0
- Owner: https://github.com/nmlemus
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@c0e0f39c9531701c3586d69544b280dad595881b
- Trigger Event: release

File details

Details for the file datascience_agent-0.8.0-py3-none-any.whl.

File metadata

Download URL: datascience_agent-0.8.0-py3-none-any.whl
Upload date: Jan 20, 2026
Size: 155.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datascience_agent-0.8.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`41e8afe78a5621c2d7c7baa804c4b5c1134706b6e5aabe7e8c86853803538c0e`
MD5	`21ab643c66c65856db50c58b03d726c9`
BLAKE2b-256	`2e7e2c20ee784a30f36ceca80c7de7fd10bee4417642d95832025f44264ddbb0`

Provenance

The following attestation bundles were made for datascience_agent-0.8.0-py3-none-any.whl:

Publisher: python-publish.yml on nmlemus/dsagent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: datascience_agent-0.8.0-py3-none-any.whl
- Subject digest: 41e8afe78a5621c2d7c7baa804c4b5c1134706b6e5aabe7e8c86853803538c0e
- Sigstore transparency entry: 836085949
- Sigstore integration time: Jan 20, 2026
Source repository:
- Permalink: nmlemus/dsagent@c0e0f39c9531701c3586d69544b280dad595881b
- Branch / Tag: refs/tags/v0.8.0
- Owner: https://github.com/nmlemus
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@c0e0f39c9531701c3586d69544b280dad595881b
- Trigger Event: release
