AI chat and code completion plugin for Spyder IDE using local Ollama models

spyder-ai-assistant

A local-first AI assistant for Spyder IDE. Chat with a model about your code, get Copilot-style inline completions, inspect live variables and tracebacks, and browse your conversation history. Everything runs on your own machine through Ollama (GPU-accelerated when available), with optional support for OpenAI-compatible endpoints.

[Screenshot: chat panel with AI explaining a Python script]

Quick start

1. Install Ollama and pull a model

curl -fsSL https://ollama.com/install.sh | sh
ollama pull qwen2.5:7b          # chat model (~5 GB VRAM)

Pick a model that fits your GPU. See model recommendations below for more options.

2. Install the plugin

pip install spyder-ai-assistant

Install into the same Python environment where Spyder lives (e.g. your conda env).

3. Restart Spyder and open the chat

The plugin registers automatically. After restart:

Open the chat panel: View > Panes > AI Chat
Pick your model from the toolbar dropdown
Start typing — inline completions appear automatically as ghost text

To manually trigger an inline suggestion: press Ctrl+Shift+Space.

That's it. Everything runs locally and works offline.

Optional: to use cloud or self-hosted chat endpoints, open AI Chat > More > Provider Profiles... and add one or more OpenAI-compatible profiles.

Features

Inline code completions

Copilot-style ghost text that appears as you type, powered by Ollama's Fill-in-Middle (FIM) API.

[Screenshot: ghost text inline completions]
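
For readers curious what a fill-in-the-middle request can look like, here is a minimal sketch. It is not the plugin's code. It assumes Ollama's `/api/generate` endpoint with its `suffix` field, the default server address `http://localhost:11434`, and a FIM-capable model; the helper names and the option values are hypothetical.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # assumed default Ollama address


def build_fim_payload(model: str, text: str, cursor: int, max_tokens: int = 64) -> dict:
    """Split the buffer at the cursor into a prompt (before) and a suffix (after)."""
    return {
        "model": model,
        "prompt": text[:cursor],   # code before the cursor
        "suffix": text[cursor:],   # code after the cursor
        "stream": False,           # ask for one JSON object instead of a stream
        "options": {"num_predict": max_tokens, "temperature": 0.2},  # illustrative values
    }


def request_completion(payload: dict, base_url: str = OLLAMA_URL, timeout: float = 10.0) -> str:
    """POST the payload to /api/generate and return the generated middle text."""
    req = urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read())["response"]
```

Building the payload is kept separate from sending it, so the split at the cursor can be checked without a running Ollama server.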

Shortcut	Action
`Ctrl+Shift+Space`	Manually trigger a suggestion
`Tab`	Accept the full suggestion
`Alt+Right`	Accept next word
`Alt+Shift+Right`	Accept next line
`Escape`	Dismiss

The provider is tuned beyond a basic API call: it caches recent prompts, trims suffix overlap so brackets aren't duplicated, filters repetitive output, pulls relevant snippets from other open files for richer context, suppresses Spyder's native popup when a ghost suggestion is active, and cycles through alternative candidates locally without extra model round-trips. The status bar shows the active completion model and its state (generating, offline, or ready).
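
To illustrate the suffix-overlap trimming mentioned above, here is one possible approach, sketched under the assumption that overlap means "the end of the completion repeats the start of the text after the cursor". The function name is hypothetical and the plugin's own heuristic may differ.

```python
def trim_suffix_overlap(completion: str, suffix: str) -> str:
    """Drop the tail of `completion` that repeats the start of `suffix`."""
    max_len = min(len(completion), len(suffix))
    # Try the longest possible overlap first so the whole duplicated run is removed.
    for size in range(max_len, 0, -1):
        if completion.endswith(suffix[:size]):
            return completion[:-size]
    return completion  # no overlap found, keep the completion unchanged


# Cursor sits inside "add(|)": the model returns "a, b)" but ")" already exists.
print(trim_suffix_overlap("a, b)", ")"))
```

This is a plain string heuristic. It cannot tell a duplicated bracket from one that legitimately belongs to the completion, so a real provider would likely combine it with other checks.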

Chat panel

A dockable pane for talking to a model about your code. Open it from View > Panes > AI Chat.

Multi-tab sessions — each conversation lives in its own tab
Streaming responses — tokens arrive in real time
Syntax-highlighted code blocks — with copy, insert-at-cursor, and replace-selection actions
Thinking/reasoning display — models that emit <think> blocks (QwQ, DeepSeek-R1, etc.) show their reasoning in a dimmed section
Per-tab chat modes — switch between Coding, Debugging, Review, Data Analysis, Explanation, or Documentation presets
Per-tab inference settings — override temperature and max tokens for individual tabs
Mid-conversation model switching — change models from the toolbar without losing context
Stop and regenerate — cancel a response mid-stream, or rerun the last turn
Delete individual exchanges — remove any saved turn from the conversation
Export to Markdown — save any session with full metadata
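
As an illustration of the thinking/reasoning display, the sketch below separates `<think>` blocks from the visible answer in a finished response. It is an assumption about how such a split could be done, not the plugin's implementation, and `split_reasoning` is a hypothetical name.

```python
import re

# Non-greedy match so several <think> blocks in one response are handled separately.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) for a complete model response."""
    reasoning = "\n".join(block.strip() for block in THINK_RE.findall(text))
    answer = THINK_RE.sub("", text).strip()
    return reasoning, answer


reasoning, answer = split_reasoning("<think>check loop bounds</think>\nUse range(n).")
print(reasoning)  # text that would go in the dimmed section
print(answer)     # text shown as the normal reply
```

The sketch only handles complete responses. During streaming, a `<think>` tag may still be unclosed, which needs extra state that is left out here.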

Kernel integration and runtime inspection

The chat panel has read-only access to your active Spyder IPython session. It can inspect:

Tracebacks and errors — the latest exception from your kernel
Console output — recent visible output from your IPython session
Live variables — the current variable list and targeted inspection of specific variables by name
Structured runtime values — arrays, images, DataFrames, Series, and bounded nested-container previews
Kernel state — shown in the chat toolbar so you always know what's running
Multiple consoles — the toolbar can follow the active console or pin runtime inspection to a different open console

This is on-demand, not automatic — ordinary questions stay file-focused and lean. The AI only pulls runtime state when the question actually depends on it, and it never executes code on your behalf.
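
The idea of a bounded nested-container preview can be sketched as follows. This is a generic illustration, assuming limits on both the number of items and the nesting depth; the function name, the default limits, and the placeholder strings are all hypothetical.

```python
def bounded_preview(value, max_items=3, max_depth=2, _depth=0):
    """Return a small, JSON-friendly preview of a possibly large nested value."""
    # Scalars are returned unchanged.
    if value is None or isinstance(value, (bool, int, float, str)):
        return value
    # Anything that is not a plain container is summarized by its type name.
    if not isinstance(value, (dict, list, tuple, set)):
        return f"<{type(value).__name__}>"
    # Past the depth limit, report only the type and size.
    if _depth >= max_depth:
        return f"<{type(value).__name__} with {len(value)} items>"
    hidden = max(len(value) - max_items, 0)
    if isinstance(value, dict):
        shown = {
            str(key): bounded_preview(item, max_items, max_depth, _depth + 1)
            for key, item in list(value.items())[:max_items]
        }
        if hidden:
            shown["..."] = f"{hidden} more keys"
        return shown
    shown = [
        bounded_preview(item, max_items, max_depth, _depth + 1)
        for item in list(value)[:max_items]
    ]
    if hidden:
        shown.append(f"... {hidden} more")
    return shown


data = {"scores": [1, 2, 3, 4, 5], "meta": {"nested": {"deep": [1, 2]}}}
print(bounded_preview(data))
```

Capping both width and depth keeps the text sent to the model small no matter how large the live object is.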

Quick-action buttons for common debugging workflows:

Control	What it does
Debug	Opens runtime-aware actions such as Explain Error, Fix Traceback, Use Variables, and Use Console
Regenerate	Reruns the last turn on the active tab

When more than one IPython console is open, the runtime target selector in the chat toolbar lets you choose Follow Active Console or pin the debugging context to a specific console. The runtime tooltip shows which console is currently active and which one is actually being inspected.

Editor integration

The AI automatically sees your current file, cursor position, selection, other open tabs, and your project's file tree. Right-click any selection for AI actions:

Action	What it does
Ask AI	Opens chat with your selection as context
Explain	Explains the selected code
Fix	Finds and fixes bugs in the selection
Add Docstring	Generates a docstring for the selected function or class

Code blocks in chat responses expose Copy and Apply... actions. Apply... opens a preview dialog where you choose between insert and replace, inspect the diff, and confirm or cancel before the editor changes. The final edit is grouped into a single undo step.
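
A preview-before-apply step like the one described above can be sketched with Python's standard `difflib`. This only illustrates the idea; the function names and the offset-based editing model are assumptions, and the plugin's dialog is not necessarily built this way.

```python
import difflib


def apply_block(text: str, start: int, end: int, block: str) -> str:
    """Replace text[start:end] with block; start == end inserts at the cursor."""
    return text[:start] + block + text[end:]


def preview_diff(original: str, proposed: str, filename: str = "editor") -> str:
    """Return a unified diff between the current text and the proposed text."""
    diff = difflib.unified_diff(
        original.splitlines(keepends=True),
        proposed.splitlines(keepends=True),
        fromfile=f"{filename} (current)",
        tofile=f"{filename} (proposed)",
    )
    return "".join(diff)


original = "x = 1\ny = 2\n"
proposed = apply_block(original, 0, 5, "x = 10")  # replace the selection "x = 1"
print(preview_diff(original, proposed))
```

Computing the proposed text first and diffing it against the original means the editor itself is untouched until the user confirms.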

Session history and persistence

Chat sessions save automatically. When a Spyder project is open, conversations persist in .spyproject/ai-assistant/chat-sessions.json and restore when the project reopens. Without a project, sessions fall back to a global store.
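
The project-versus-global fallback can be pictured with the sketch below. Only the project-relative path comes from this README; the global store location, the function names, and the JSON layout (a plain list) are hypothetical.

```python
import json
from pathlib import Path
from typing import Optional

# Project-relative location, as documented above.
PROJECT_STORE = Path(".spyproject") / "ai-assistant" / "chat-sessions.json"
# Hypothetical global location; the real fallback path is not stated in this README.
GLOBAL_STORE = Path.home() / ".config" / "spyder-ai-assistant" / "chat-sessions.json"


def sessions_path(project_root: Optional[Path]) -> Path:
    """Use the project store when a project is open, else the global store."""
    if project_root is not None:
        return project_root / PROJECT_STORE
    return GLOBAL_STORE


def save_sessions(path: Path, sessions: list) -> None:
    """Write sessions as JSON, creating parent folders if needed."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(sessions, indent=2), encoding="utf-8")


def load_sessions(path: Path) -> list:
    """Return saved sessions, or an empty list when nothing has been saved yet."""
    if not path.exists():
        return []
    return json.loads(path.read_text(encoding="utf-8"))
```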

The Sessions button keeps session actions in one place. Its history browser lets you search, filter, sort, reopen, duplicate, or delete saved sessions. Per-tab chat modes and inference overrides persist with each session.

Multi-provider support

By default, everything runs through Ollama. For chat, you can also manage multiple named OpenAI-compatible profiles from AI Chat > More > Provider Profiles....

Each profile has its own label, endpoint, API key, and enabled state
The shared model selector groups entries by provider/profile and keeps endpoint details in the tooltip
The status label reports provider issues without hiding working models
Removing a stale profile falls back cleanly to another available model
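
A profile with these four fields could be modeled as in the sketch below. The class and method names are hypothetical, and the sketch assumes the common OpenAI-compatible conventions of a `/v1/models` listing route and a `Bearer` authorization header; whether the plugin expects the endpoint with or without a trailing `/v1` is not stated here.

```python
from dataclasses import dataclass


@dataclass
class ProviderProfile:
    label: str
    endpoint: str          # base URL of the OpenAI-compatible server
    api_key: str = ""
    enabled: bool = True

    def models_url(self) -> str:
        """Build the model-listing URL, accepting endpoints with or without /v1."""
        base = self.endpoint.rstrip("/")
        if not base.endswith("/v1"):
            base += "/v1"
        return f"{base}/models"

    def headers(self) -> dict:
        """Send the API key as a Bearer token only when one is configured."""
        headers = {"Accept": "application/json"}
        if self.api_key:
            headers["Authorization"] = f"Bearer {self.api_key}"
        return headers


def active_profiles(profiles):
    """Keep only the profiles the user has left enabled."""
    return [profile for profile in profiles if profile.enabled]
```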

Inline completions stay Ollama-backed for low latency.

Model recommendations

Chat models

Pick one that fits your GPU:

VRAM	Model	Command
8 GB	Qwen 2.5 7B	`ollama pull qwen2.5:7b`
12 GB	Qwen 2.5 14B	`ollama pull qwen2.5:14b`
16 GB+	Qwen 3.5 27B	`ollama pull huihui_ai/qwen3.5-abliterated:27b`

Completion models (optional)

A smaller, faster model for inline suggestions. Recommended but not required — the chat model handles completions if no separate model is configured.

VRAM	Model	Command
8 GB	Qwen 2.5 Coder 3B	`ollama pull qwen2.5-coder:3b`
12 GB+	Qwen3 Coder 30B (3B active)	`ollama pull qooba/qwen3-coder-30b-a3b-instruct:q3_k_m`

GPU and memory

Ollama uses your GPU automatically if CUDA or ROCm drivers are installed. Rough VRAM requirements for Q4_K_M quantized models:

Model size	VRAM needed	Typical use
3B	~2.5 GB	Completions
7B	~5 GB	Basic chat
14B	~9 GB	Good chat
27B	~15 GB	Excellent chat

Running chat and completions simultaneously keeps both models in VRAM. A 7B chat + 3B completion model needs about 7.5 GB total. Without a GPU, Ollama falls back to CPU (slower but functional).
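
The arithmetic behind that estimate, as a small sketch. The figures are the rough Q4_K_M values from the table above; the function names are hypothetical, and real usage also depends on context length and other runtime overhead, so treat the result as a lower bound.

```python
# Approximate VRAM in GB for Q4_K_M models, copied from the table above.
VRAM_GB = {"3B": 2.5, "7B": 5.0, "14B": 9.0, "27B": 15.0}


def vram_needed(*sizes: str) -> float:
    """Sum the rough VRAM needs of all models kept loaded at once."""
    return sum(VRAM_GB[size] for size in sizes)


def fits(gpu_vram_gb: float, *sizes: str) -> bool:
    """True when the combined estimate does not exceed the GPU's VRAM."""
    return vram_needed(*sizes) <= gpu_vram_gb


print(vram_needed("7B", "3B"))   # 7B chat + 3B completion
print(fits(8, "14B", "3B"))      # 14B chat + 3B completion on an 8 GB card
```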

Configuration

All settings live in Preferences:

Preferences > AI Chat — default chat provider, Ollama server URL, model names, temperature, max tokens, keyboard shortcuts, system prompt, and action prompt templates (with {filename} and {code} placeholders)
Preferences > Completion and linting > AI Chat — completion toggle, model, temperature, max tokens, debounce delay
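
To show how the `{filename}` and `{code}` placeholders might be filled in, here is a minimal sketch. The template text and the function name are hypothetical; only the two placeholder names come from this README.

```python
# Hypothetical template text; only the placeholder names are documented.
EXPLAIN_TEMPLATE = "Explain the following code from {filename}:\n\n{code}"


def render_action_prompt(template: str, filename: str, code: str) -> str:
    """Fill the two documented placeholders and leave everything else as typed."""
    # Plain replacement is used instead of str.format so that any other
    # braces a user writes in a custom template do not raise an error.
    return template.replace("{filename}", filename).replace("{code}", code)


print(render_action_prompt(EXPLAIN_TEMPLATE, "stats.py", "d = {'a': 1}"))
```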

OpenAI-compatible chat endpoints are managed directly from the chat pane through More > Provider Profiles.... Existing single-endpoint settings are imported automatically the first time you open that dialog.

Per-tab chat modes and inference overrides are set directly in the chat pane and persist with the session.

Troubleshooting

"No models found" in the dropdown — Run curl http://localhost:11434/api/tags to check Ollama, or pull a model with ollama pull qwen2.5:7b. For OpenAI-compatible profiles, open More > Provider Profiles... and confirm the endpoint responds on /v1/models.

Completions aren't appearing — Enable them in Preferences > Completion and linting > AI Chat. The status bar should show AI: model-name. If it says AI: offline, Ollama isn't reachable.
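
The same reachability check can be done from Python. The sketch below assumes Ollama's `/api/tags` endpoint on the default address `http://localhost:11434` and a response of the form `{"models": [{"name": ...}, ...]}`; the function names are hypothetical.

```python
import json
import urllib.request


def parse_model_names(payload: dict) -> list:
    """Extract sorted model names from an /api/tags response body."""
    return sorted(model["name"] for model in payload.get("models", []))


def list_ollama_models(base_url: str = "http://localhost:11434", timeout: float = 5.0) -> list:
    """Ask the Ollama server which models are installed."""
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
        return parse_model_names(json.load(resp))


def ollama_status(base_url: str = "http://localhost:11434", timeout: float = 5.0) -> str:
    """Summarize whether Ollama is reachable and whether any model is installed."""
    try:
        names = list_ollama_models(base_url, timeout)
    except OSError as exc:  # connection errors and timeouts are OSError subclasses
        return f"Ollama not reachable at {base_url}: {exc}"
    if not names:
        return "Ollama is running but no models are installed"
    return "Models available: " + ", ".join(names)
```

Calling `print(ollama_status())` distinguishes an unreachable server from a running server that has no models pulled yet.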

Chat panel doesn't show up — Check View > Panes for "AI Chat". If missing, the plugin may be in a different Python env than Spyder. Verify: python -c "import spyder_ai_assistant; print('OK')".
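
A slightly more informative version of that check also reports which interpreter was tested. This is a sketch with hypothetical function names. Run it with the Python from the environment that launches Spyder; note that Spyder's consoles can be configured to use a different interpreter, so a result from the IPython console may not reflect Spyder's own environment.

```python
import importlib.util
import sys


def plugin_installed(module_name: str = "spyder_ai_assistant") -> bool:
    """True when the module can be found by the current interpreter."""
    return importlib.util.find_spec(module_name) is not None


def describe_environment(module_name: str = "spyder_ai_assistant") -> str:
    """Report whether the module is importable and which Python was checked."""
    state = "found" if plugin_installed(module_name) else "NOT found"
    return f"{module_name} {state} in {sys.executable}"


print(describe_environment())
```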

Slow responses — Try a smaller model. Check nvidia-smi for GPU usage. First requests are always slower while the model loads into VRAM.

Too much VRAM — Run ollama ps to see loaded models and ollama stop <model> to unload.
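
Loaded models can also be inspected programmatically. The sketch below assumes Ollama's `/api/ps` endpoint and a `size_vram` field (in bytes) on each entry of its `models` list; verify both against your Ollama version. The function names are hypothetical.

```python
import json
import urllib.request


def summarize_loaded(payload: dict) -> list:
    """Return (model name, VRAM in GiB) pairs from an /api/ps response body."""
    gib = 1024 ** 3
    return [
        (model["name"], round(model.get("size_vram", 0) / gib, 1))
        for model in payload.get("models", [])
    ]


def loaded_models(base_url: str = "http://localhost:11434", timeout: float = 5.0) -> list:
    """Query the running Ollama server for currently loaded models."""
    with urllib.request.urlopen(f"{base_url}/api/ps", timeout=timeout) as resp:
        return summarize_loaded(json.load(resp))
```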

Runtime inspection returns generic answers — Use a stronger instruction-following model. The runtime bridge requires the model to emit structured inspection requests. Qwen-based models handle this reliably.

Roadmap

Active development. Rough priority order:

Session management — pinning, labeling, bulk operations
More providers — adapters beyond OpenAI-compatible, profile import/export, provider health checks
Smart setup — one-click Ollama install, guided model downloads, hardware-aware recommendations
Smarter completions — scope-aware truncation, rename-aware suggestions, multi-site edits
Polish the apply workflow further — richer inline diff rendering, smarter multi-block edit handling
Agent workflows — multi-step task execution with approval gates, git-aware context

Contributing

See CONTRIBUTING.md for development setup, architecture overview, validation workflow, and release process.

License

Creative Commons Attribution-NonCommercial 4.0 International. Free to use, share, and adapt for non-commercial purposes with attribution. See LICENSE.

Project details

Maintainers

costantinoai

Release history

Version	Released
0.6.0	Mar 13, 2026
0.5.0	Mar 13, 2026
0.4.0 (this version)	Mar 12, 2026
0.3.0	Mar 11, 2026
0.2.0	Mar 11, 2026
0.1.1	Mar 11, 2026
0.1.0	Mar 11, 2026

Download files

Source Distribution

spyder_ai_assistant-0.4.0.tar.gz (667.0 kB)

Uploaded Mar 12, 2026 Source

Built Distribution

spyder_ai_assistant-0.4.0-py3-none-any.whl (130.1 kB)

Uploaded Mar 12, 2026 Python 3

File details

Details for the file spyder_ai_assistant-0.4.0.tar.gz.

File metadata

Download URL: spyder_ai_assistant-0.4.0.tar.gz
Upload date: Mar 12, 2026
Size: 667.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for spyder_ai_assistant-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`a55b9251cd9164cb117a662752dc1ba70092c38f1689617629d4b2326b156dc4`
MD5	`3a3097cfe77309b9c95bfa9b66ab6354`
BLAKE2b-256	`c0c9d827c44df35e24154fb94d4721956cbf20fbe27fa3bb199313c0c6005f9e`
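
A downloaded file can be checked against the SHA256 digest above with Python's standard `hashlib`. This is a minimal sketch with hypothetical function names; the expected value is copied from the table above, and the file path in the usage note is a placeholder for wherever you saved the download.

```python
import hashlib

# SHA256 digest for spyder_ai_assistant-0.4.0.tar.gz, copied from the table above.
EXPECTED_SHA256 = "a55b9251cd9164cb117a662752dc1ba70092c38f1689617629d4b2326b156dc4"


def sha256_of_chunks(chunks) -> str:
    """Hash an iterable of byte chunks and return the hex digest."""
    digest = hashlib.sha256()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def read_chunks(path, chunk_size: int = 1 << 20):
    """Yield the file in 1 MiB pieces so large files are not read into memory at once."""
    with open(path, "rb") as handle:
        while chunk := handle.read(chunk_size):
            yield chunk


def verify_download(path, expected: str = EXPECTED_SHA256) -> bool:
    """True when the file's SHA256 digest matches the expected value."""
    return sha256_of_chunks(read_chunks(path)) == expected.lower()
```

For example, `verify_download("spyder_ai_assistant-0.4.0.tar.gz")` should return `True` for an intact copy of the source distribution.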

Provenance

The following attestation bundles were made for spyder_ai_assistant-0.4.0.tar.gz:

Publisher: publish.yml on costantinoai/spyder-ai-assistant

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: spyder_ai_assistant-0.4.0.tar.gz
- Subject digest: a55b9251cd9164cb117a662752dc1ba70092c38f1689617629d4b2326b156dc4
- Sigstore transparency entry: 1091191824
- Sigstore integration time: Mar 12, 2026
Source repository:
- Permalink: costantinoai/spyder-ai-assistant@f23980d9013a7f60bbacc52dbb4d6391a246944b
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/costantinoai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f23980d9013a7f60bbacc52dbb4d6391a246944b
- Trigger Event: push

File details

Details for the file spyder_ai_assistant-0.4.0-py3-none-any.whl.

File metadata

Download URL: spyder_ai_assistant-0.4.0-py3-none-any.whl
Upload date: Mar 12, 2026
Size: 130.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for spyder_ai_assistant-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e414ae46bf70adf440ae9464eaf2972297c9f4e2a4927ebd8b468e19130081a6`
MD5	`c19df3997eff31976a004f6dc3ff3baa`
BLAKE2b-256	`c6b507ddebc0f70ff0dc3e829c7a02961f3be1db2adcd4e81c9a33a55e3f5f24`

Provenance

The following attestation bundles were made for spyder_ai_assistant-0.4.0-py3-none-any.whl:

Publisher: publish.yml on costantinoai/spyder-ai-assistant

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: spyder_ai_assistant-0.4.0-py3-none-any.whl
- Subject digest: e414ae46bf70adf440ae9464eaf2972297c9f4e2a4927ebd8b468e19130081a6
- Sigstore transparency entry: 1091191838
- Sigstore integration time: Mar 12, 2026
Source repository:
- Permalink: costantinoai/spyder-ai-assistant@f23980d9013a7f60bbacc52dbb4d6391a246944b
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/costantinoai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f23980d9013a7f60bbacc52dbb4d6391a246944b
- Trigger Event: push
