A comprehensive desktop application for visualizing, querying, and managing vector database data

Project description

Vector Inspector

A comprehensive desktop application for visualizing, querying, and managing vector database data. Similar to SQL database viewers, Vector Inspector provides an intuitive GUI for exploring vector embeddings, metadata, and performing similarity searches across multiple vector database providers.

Vector Inspector bridges the gap between vector databases and user-friendly data exploration tools. While vector databases are powerful for semantic search and AI applications, they often lack the intuitive inspection and management tools that traditional SQL databases have. This project aims to provide that missing layer.

Key Features

1. Multi-Provider Support

Connect to vector databases:
- ChromaDB (persistent local storage)
- Qdrant (remote server or embedded local)
Unified interface regardless of backend provider
Automatically saves last connection configuration

2. Data Visualization

Metadata Explorer: Browse and filter vector entries by metadata fields
Vector Dimensionality Reduction: Visualize high-dimensional vectors in 2D/3D using:
- t-SNE
- UMAP
- PCA
Cluster Visualization: Color-code vectors by metadata categories or clustering results
Interactive Plots: Zoom, pan, and select vectors for detailed inspection
Data Distribution Charts: Histograms and statistics for metadata fields

3. Search & Query Interface

Similarity Search:
- Text-to-vector search (with embedding model integration)
- Vector-to-vector search
- Find similar items to selected entries
- Adjustable top-k results and similarity thresholds
Metadata Filtering:
- SQL-like query builder for metadata
- Combine vector similarity with metadata filters
- Advanced filtering: ranges, IN clauses, pattern matching
Hybrid Search: Combine semantic search with keyword search
Query History: Save and reuse frequent queries

4. Data Management

Browse Collections/Indexes: View all available collections with statistics
CRUD Operations:
- View individual vectors and their metadata
- Add new vectors (with auto-embedding options)
- Update metadata fields
- Delete vectors (single or batch)
Bulk Import/Export:
- Import from CSV, JSON, Parquet
- Export query results to various formats
- Backup and restore collections
Schema Inspector: View collection configuration, vector dimensions, metadata schema

5. SQL-Like Experience

Query Console: Write queries in a familiar SQL-like syntax (where supported)
Results Grid:
- Sortable, filterable table view
- Pagination for large result sets
- Column customization
Data Inspector: Click any row to see full details including raw vector
Query Execution Plans: Understand how queries are executed
Auto-completion: Intelligent suggestions for collection names, fields, and operations

6. Advanced Features

Embedding Model Integration:
- Use OpenAI, Cohere, HuggingFace models for text-to-vector conversion
- Local model support (sentence-transformers)
- Custom model integration
Vector Analysis:
- Compute similarity matrices
- Identify outliers and anomalies
- Cluster analysis with k-means, DBSCAN
Embedding Inspector:
- For similar collections or items, automatically identify which vector dimensions (activations) most contribute to the similarity
- Map key activations to interpretable concepts (e.g., 'humor', 'sadness', 'anger') using metadata or labels
- Generate human-readable explanations for why items are similar
Performance Monitoring:
- Query latency tracking
- Index performance metrics
- Connection health monitoring

Architecture

Technology Stack

Frontend (GUI)

Framework: PySide6 (Qt for Python) - native desktop application
UI Components: Qt Widgets for forms, dialogs, and application structure
Visualization:
- Plotly for interactive charts (embedded via QWebEngineView)
- matplotlib for static visualizations
Data Grid: QTableView with custom models for high-performance data display

Backend

Language: Python 3.12
Core Libraries:
- Vector DB clients: chromadb, qdrant-client (implemented), pinecone-client, weaviate-client, pymilvus (planned)
- Embeddings: sentence-transformers, fastembed (implemented), openai, cohere (planned)
- Data processing: pandas, numpy
- Dimensionality reduction: scikit-learn, umap-learn
API Layer: FastAPI (planned for programmatic access) or direct Python integration

Data Layer

Connection Management: Provider-specific connection classes with unified interface
Query Abstraction: Base connection interface that each provider implements
Storage Modes:
- ChromaDB: Persistent local storage
- Qdrant Remote: Connect via host/port (e.g., localhost:6333)
- Qdrant Embedded: Local path storage without separate server
Caching: Redis or in-memory cache for frequently accessed data (planned)
Settings Persistence: User settings saved to ~/.vector-viewer/settings.json

Application Structure

vector-viewer/
├── src/
│   └── vector_viewer/
│       ├── core/
│       │   └── connections/   # Connection managers for each provider
│       ├── ui/
│       │   ├── components/    # Reusable UI components
│       │   └── views/         # Main application views
│       ├── services/          # Business logic services
│       └── main.py            # Application entry point
├── tests/
├── docs/
├── data/                      # Local database storage
│   ├── chroma_db/
│   └── qdrant/
├── run.sh / run.bat           # Launch scripts
└── pyproject.toml

User settings are saved to ~/.vector-viewer/settings.json

Use Cases

AI/ML Development: Inspect embeddings generated during model development
RAG System Debugging: Verify what documents are being retrieved
Data Quality Assurance: Identify poorly embedded or outlier vectors
Production Monitoring: Check vector database health and data consistency
Data Migration: Transfer data between vector database providers
Education: Learn and experiment with vector databases interactively

Feature Access (Free vs Pro)

Feature	Access
Connection to ChromaDB	Free
Basic metadata browsing and filtering	Free
Simple similarity search interface	Free
2D vector visualization (PCA/t-SNE)	Free
Basic CRUD operations	Free
Metadata filtering (advanced)	Free
Item editing	Free
Import/export (CSV, JSON, Parquet)	Free
Provider abstraction layer	Free
Pinecone support	Free
Weaviate support	Free
Qdrant support (basic/experimental)	Free
Milvus support	Pro
ChromaDB advanced support	Pro
FAISS (local files) support	Pro
pgvector (PostgreSQL extension) support	Pro
Elasticsearch with vector search support	Pro
Advanced query builder	Free
3D visualization	Free
Embedding model integration (basic)	Free
Query history and saved queries	Free
Model Comparison Mode	Pro
Cluster Explorer	Pro
Embedding Inspector	Pro
Embedding Provenance Graph	Pro
Semantic Drift Timeline	Pro
Cross-Collection Similarity	Pro
Vector Surgery	Pro
Custom plugin system	Pro
Team collaboration features	Pro

Note: Qdrant support is available for free users in the open source version (basic/experimental). Advanced Qdrant features (e.g., payload filtering, geo, cloud auth) may be reserved for Pro in the future.

Planned Roadmap

Phase 1: Foundation (MVP)

Connection to ChromaDB
Basic metadata browsing and filtering
Simple similarity search interface
2D vector visualization (PCA/t-SNE)
Basic CRUD operations

Phase 2: Core Features

Metadata filtering (advanced filtering, combine with search)
Item editing (update metadata and documents)
Import/export (CSV, JSON, Parquet, backup/restore)
Provider abstraction layer (unified interface for all supported vector DBs)
Qdrant support (basic/experimental, free)

Phase 3: UX & Professional Polish

Unified Information Panel (new "Info" tab as default view)
Database and collection metadata display
Connection health and version information
Schema visualization and index configuration display

Phase 4: Modular/Plugin System & Hybrid Model

Implement modular/plugin system for feature extensions
Migrate paid/advanced features to commercial modules
Add licensing/access control for commercial features

Phase 5: Provider Expansion (Incremental)

Pinecone support (free)
Weaviate support (free)
Qdrant support (paid)

Future/Backlog Providers

Milvus support (paid)
ChromaDB advanced support (paid)
FAISS (local files) support (paid)
pgvector (PostgreSQL extension) support (paid)
Elasticsearch with vector search support (paid)

Phase 6A: Advanced Usability & Visualization

Advanced query builder (free)
3D visualization (free)
Embedding model integration (free)
Query history and saved queries (free)
Metadata Type Detection & Rich Media Preview (free)

Phase 6B: Analytical & Comparison Tools

Model Comparison Mode (paid)
Cluster Explorer (paid)
Embedding Inspector (paid)
Embedding Provenance Graph (paid)

Phase 6C: Temporal & Cross-Collection Analytics

Semantic Drift Timeline (paid)
Cross-Collection Similarity (paid)

Phase 6D: Experimental & Power Features

Vector Surgery (paid)
Custom plugin system (paid)
Team collaboration features (paid)

Phase 7: Enterprise Features

Multi-user support with auth
Audit logging
Advanced security features
Custom reporting
API for programmatic access (FastAPI backend)
Caching layer (Redis/in-memory) for performance
Connection pooling and optimization

Installation

# Clone the repository
git clone https://github.com/anthonypdawson/vector-viewer.git
cd vector-viewer

# Install dependencies using PDM
pdm install

# Launch application
./run.sh     # Linux/macOS
./run.bat    # Windows

Configuration

Paths are resolved relative to the project root (where pyproject.toml is). For example, entering ./data/chroma_db will use the absolute path resolved from the project root.

The application automatically saves your last connection configuration to ~/.vector-viewer/settings.json. The next time you launch the application, it will attempt to reconnect using the last saved settings.

Example settings structure:

{
  "last_connection": {
    "provider": "chromadb",
    "connection_type": "persistent",
    "path": "./data/chroma_db"
  }
}

Development Setup

# Install PDM if you haven't already
pip install pdm

# Install dependencies with development tools (PDM will create venv automatically)
pdm install -d

# Run tests
pdm run pytest

# Run application in development mode
./run.sh     # Linux/macOS
./run.bat    # Windows

# Or use Python module directly from src directory:
cd src
pdm run python -m vector_viewer

Contributing

Contributions are welcome! Areas where help is needed:

Additional vector database provider integrations
UI/UX improvements
Performance optimizations
Documentation
Test coverage

Please see CONTRIBUTING.md for guidelines.

License

MIT License - See LICENSE file for details.

Acknowledgments

This project draws inspiration from:

DBeaver (SQL database viewer)
MongoDB Compass (NoSQL database GUI)
Pinecone Console
Various vector database management tools

Status: ✅ Phase 2 Complete - Advanced Features Implemented!

What's New in Phase 2:

🔍 Advanced metadata filtering with customizable filter rules (AND/OR logic)
✏️ Double-click to edit items directly in the data browser
📥 Import data from CSV, JSON, and Parquet files
📤 Export filtered data to CSV, JSON, and Parquet formats
💾 Comprehensive backup and restore system for collections
🔄 Metadata filters integrated with search for powerful queries

See GETTING_STARTED.md for usage instructions and IMPLEMENTATION_SUMMARY.md for technical details.

Contact: Anthony Dawson

Project details

Release history Release notifications | RSS feed

0.7.2

Apr 15, 2026

0.7.1

Apr 13, 2026

0.7.0

Apr 7, 2026

0.6.2

Mar 31, 2026

0.6.1

Mar 22, 2026

0.6.0

Mar 14, 2026

0.5.4

Mar 9, 2026

0.5.3

Mar 2, 2026

0.5.1

Feb 27, 2026

0.5.0

Feb 24, 2026

0.4.2

Feb 21, 2026

0.4.1

Feb 18, 2026

0.4.0

Feb 15, 2026

0.3.16

Feb 15, 2026

0.3.15

Feb 12, 2026

0.3.14

Feb 10, 2026

0.3.13

Feb 7, 2026

0.3.12

Feb 4, 2026

0.3.11

Jan 31, 2026

0.3.9

Jan 28, 2026

0.3.8

Jan 27, 2026

0.3.7

Jan 27, 2026

0.3.6

Jan 27, 2026

0.3.5

Jan 27, 2026

0.3.4

Jan 26, 2026

0.3.3

Jan 26, 2026

0.3.2

Jan 24, 2026

0.3.1

Jan 24, 2026

0.2.7

Jan 23, 2026

0.2.6

Jan 22, 2026

0.2.5

Jan 22, 2026

0.2.4.3

Jan 21, 2026

0.2.4

Jan 21, 2026

0.2.3

Jan 21, 2026

0.2.2

Jan 20, 2026

0.2.1

Jan 20, 2026

This version

0.2.0

Jan 20, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vector_inspector-0.2.0.tar.gz (48.1 kB view details)

Uploaded Jan 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vector_inspector-0.2.0-py3-none-any.whl (55.9 kB view details)

Uploaded Jan 20, 2026 Python 3

File details

Details for the file vector_inspector-0.2.0.tar.gz.

File metadata

Download URL: vector_inspector-0.2.0.tar.gz
Upload date: Jan 20, 2026
Size: 48.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.5

File hashes

Hashes for vector_inspector-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`f33c080f106b2d49a691e7904706ab477c05dd076b05c7e2cbfbfe02784cc9db`
MD5	`35adde514f62a17282fc20f365eb318a`
BLAKE2b-256	`142515e79b9584cefd4280275bef827626330b6013ecda508ded0bd0adf27798`

See more details on using hashes here.

File details

Details for the file vector_inspector-0.2.0-py3-none-any.whl.

File metadata

Download URL: vector_inspector-0.2.0-py3-none-any.whl
Upload date: Jan 20, 2026
Size: 55.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.5

File hashes

Hashes for vector_inspector-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`99a81c33115a8685ecb12c34146cf9e27cd64b328d8686d2141b2053c350990f`
MD5	`83a3eca446f7d7b4db3334f8e089be37`
BLAKE2b-256	`e7636138a0a74d88722a9d3d430b4bfcac80192fb002bbc57be5aa5f2d2a5d49`

See more details on using hashes here.

vector-inspector 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Vector Inspector

Overview

Table of Contents

Key Features

1. Multi-Provider Support

2. Data Visualization

3. Search & Query Interface

4. Data Management

5. SQL-Like Experience

6. Advanced Features

Architecture

Technology Stack

Frontend (GUI)

Backend

Data Layer

Application Structure

Use Cases

Feature Access (Free vs Pro)

Planned Roadmap

Phase 1: Foundation (MVP)

Phase 2: Core Features

Phase 3: UX & Professional Polish

Phase 4: Modular/Plugin System & Hybrid Model

Phase 5: Provider Expansion (Incremental)

Future/Backlog Providers

Phase 6A: Advanced Usability & Visualization

Phase 6B: Analytical & Comparison Tools

Phase 6C: Temporal & Cross-Collection Analytics

Phase 6D: Experimental & Power Features

Phase 7: Enterprise Features

Installation

Configuration

Development Setup

Contributing

License

Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes