Knowledge graph and vector search platform on InterSystems IRIS for financial services and biomedical applications

These details have not been verified by PyPI

Project links

Project description

IRIS Vector Graph

A knowledge graph system built on InterSystems IRIS that combines graph traversal, vector similarity search, and full-text search in a single database.

NEW: Interactive Demo Server showcasing fraud detection + biomedical capabilities

Proven at Scale Across Industries:

Financial Services: Real-time fraud detection (130M+ transactions), bitemporal audit trails, <10ms queries
Biomedical Research: Protein interaction networks (100K+ proteins), drug discovery, <50ms multi-hop queries

Same IRIS platform. Different domains. Powerful results.

Quick Start
- Option A: Fraud Detection (Financial Services)
- Option B: Biomedical Graph (Life Sciences)
Use Cases by Industry
Architecture
Key Features
Performance
Documentation

Quick Start

Two Deployment Modes:

External (DEFAULT - simpler): Python app connects to IRIS via iris.connect()
Embedded (ADVANCED - optional): Python app runs INSIDE IRIS container

Option A: Fraud Detection (Financial Services)

External Mode (Default - Simpler)

# 1. Start IRIS database
docker-compose up -d

# 2. Install Python dependencies
pip install iris-vector-graph        # Core features
pip install iris-vector-graph[ml]    # + Machine learning (fraud scoring models)

# 3. Load fraud schema
docker exec -i iris /usr/irissys/bin/irissession IRIS -U USER < sql/fraud/schema.sql

# 4. Start fraud API (external Python)
PYTHONPATH=src python -m iris_fraud_server

# Test fraud scoring API
curl -X POST http://localhost:8000/fraud/score \
  -H 'Content-Type: application/json' \
  -d '{"mode":"MLP","payer":"acct:test","device":"dev:laptop","amount":1000.0}'

Embedded Mode (Advanced - Optional)

# Run FastAPI INSIDE IRIS container (licensed IRIS required)
docker-compose -f docker-compose.fraud-embedded.yml up -d

# Test fraud scoring API (~2 min startup)
curl -X POST http://localhost:8100/fraud/score \
  -H 'Content-Type: application/json' \
  -d '{"mode":"MLP","payer":"acct:test","device":"dev:laptop","amount":1000.0}'

What you get:

FastAPI fraud scoring (external :8000 or embedded :8100)
Bitemporal data (track when transactions happened vs. when you learned about them)
Complete audit trails (regulatory compliance: SOX, MiFID II)
Direct IRIS queries (no middleware)

Learn more: examples/bitemporal/README.md - Fraud scenarios, chargeback defense, model tracking

Option B: Biomedical Graph (Life Sciences)

External Mode (Default - Simpler)

# 1. Start IRIS database
docker-compose up -d

# 2. Install dependencies
curl -LsSf https://astral.sh/uv/install.sh | sh
uv sync && source .venv/bin/activate

# 3. Load STRING protein database (10K proteins, ~1 minute)
python scripts/performance/string_db_scale_test.py --max-proteins 10000

# 4. Start interactive demo server (external Python)
PYTHONPATH=src python -m iris_demo_server.app

# 5. Open browser
open http://localhost:8200/bio

Embedded Mode (Advanced - Optional)

# Run demo server INSIDE IRIS container (licensed IRIS required)
# Coming soon - currently only external mode supported for biomedical demo

What you get:

Interactive protein search with vector similarity (EGFR, TP53, etc.)
D3.js graph visualization with click-to-expand nodes showing interaction networks
Pathway analysis between proteins using BFS graph traversal
Real STRING DB data (10K proteins, 37K interactions)
<100ms queries powered by direct IRIS integration (no API middleware)
20/20 contract tests passing - production-ready biomedical demo

Learn more:

docs/biomedical-demo-setup.md - Complete setup guide with scaling options
biomedical/README.md - Architecture and development patterns

Use Cases by Industry

Financial Services (IDFS)

Use Case	Features	Performance
Real-Time Fraud Detection	Graph-based scoring, MLP models, device fingerprinting	<10ms scoring, 130M+ transactions
Bitemporal Audit Trails	Valid time vs. system time, chargeback defense, compliance	<10ms time-travel queries
Late Arrival Detection	Settlement delay analysis, backdated transaction flagging	Pattern detection across 130M events
Regulatory Compliance	SOX, GDPR, MiFID II, Basel III reporting	Complete audit trail preservation

Files:

examples/bitemporal/ - Fraud scenarios, audit queries, Python API
sql/bitemporal/ - Schema (2 tables, 3 views, 8 indexes)
src/iris_fraud_server/ - FastAPI fraud scoring server
docker-compose.fraud-embedded.yml - Licensed IRIS + embedded Python

Quick Links:

Biomedical Research

Use Case	Features	Performance
Protein Interaction Networks	STRING DB integration, pathway analysis, vector similarity	<50ms multi-hop queries (100K+ proteins)
Drug Discovery	Compound similarity, target identification, graph analytics	<10ms vector search (HNSW)
Literature Mining	Hybrid search (embeddings + BM25), entity extraction	RRF fusion, sub-second queries
Pathway Analysis	Multi-hop traversal, PageRank, connected components	NetworkX integration, embedded Python

Files:

biomedical/ - Protein queries, pathway examples
sql/schema.sql - Graph schema (nodes, edges, properties, embeddings)
iris_vector_graph/ - Core Python graph engine
docker-compose.acorn.yml - ACORN-1 with HNSW optimization

Quick Links:

Graph Algorithms (TSP Examples)

Two standalone implementations of the Traveling Salesman Problem demonstrating graph algorithms on IRIS:

Option A: Python + NetworkX (Biomedical)

Find optimal pathways through protein interaction networks:

# Test with 10 cancer-related proteins
python scripts/algorithms/tsp_demo.py --proteins 10 --compare-methods

Algorithms: Greedy (1ms), Christofides (15ms), 2-opt (8ms) Use case: Optimize order to study protein interactions in cancer pathways

Option B: ObjectScript (Healthcare Interoperability)

Optimize caregiver routes for home healthcare:

# Load sample data (8 patients, 26 travel edges)
docker exec -i iris /usr/irissys/bin/irissession IRIS -U USER < sql/caregiver_routing_demo.sql

# Run optimization demo (IRIS Terminal)
Do ^TestCaregiverRouter

Performance: <2ms for 8-patient routes Integration: Direct Business Process method calls Impact: 53% travel time reduction (75min → 35min)

What you get:

Python approach: NetworkX integration, multiple algorithms, FastAPI endpoint example
ObjectScript approach: Zero dependencies, Interoperability production integration, bitemporal audit
Comprehensive docs: Neo4j comparison, performance benchmarks, real-world use cases

Files:

scripts/algorithms/tsp_demo.py - Python demo (works with STRING protein data)
iris/src/Graph/CaregiverRouter.cls - ObjectScript TSP optimizer
iris/src/Graph/ScheduleOptimizationProcess.cls - Business Process integration
sql/caregiver_routing_demo.sql - Sample healthcare data

Learn more:

docs/algorithms/TSP_ANALYSIS.md - Deep dive and Neo4j comparison
docs/algorithms/TSP_IMPLEMENTATION_SUMMARY.md - Overview and benchmarks
docs/examples/CAREGIVER_ROUTING_DEMO.md - Step-by-step tutorial

Architecture

Deployment Options:

External (Default): Python app connects to IRIS via iris.connect() - simpler setup, easier debugging
Embedded (Advanced): Python app runs inside IRIS container - maximum performance, requires licensed IRIS

External Deployment (DEFAULT)        Embedded Deployment (OPTIONAL)
┌────────────────────────┐          ┌──────────────────────────────┐
│ FastAPI Server         │          │ IRIS Container               │
│ (external Python)      │          │ ┌──────────────────────────┐ │
│                        │          │ │ FastAPI Server           │ │
│  iris.connect()   ─────┼──────────┤►│ (/usr/irissys/bin/       │ │
│  to localhost:1972     │          │ │  irispython)             │ │
└────────────────────────┘          │ └──────────────────────────┘ │
                                    │ ┌──────────────────────────┐ │
                                    │ │ IRIS Database Engine     │ │
                                    │ │ (Bitemporal/Graph/Vector)│ │
                                    │ └──────────────────────────┘ │
                                    └──────────────────────────────┘

         Same Platform: InterSystems IRIS
         Same Features: Vector Search, Graph Traversal, Bitemporal Audit
         Different Domains: Finance vs. Life Sciences

Core Components:

IRIS Globals: Append-only storage (perfect for audit trails + graph data)
Embedded Python: Run ML models and graph algorithms in-database
SQL Procedures: kg_KNN_VEC (vector search), kg_RRF_FUSE (hybrid search)
HNSW Indexing: 100x faster vector similarity (requires IRIS 2025.3+ or ACORN-1)

Key Features

Cross-Domain Capabilities

Feature	Financial Services Use	Biomedical Use
Embedded Python	Fraud ML models in-database	Graph analytics (PageRank, etc.)
Personalized PageRank	Entity importance scoring	Document ranking, pathway analysis
Temporal Queries	Bitemporal audit ("what did we know when?")	Time-series biomarker analysis
Graph Traversal	Fraud ring detection (multi-hop)	Protein interaction pathways
Vector Search	Transaction similarity	Protein/compound similarity
Partial Indexes	`WHERE system_to IS NULL` (10x faster)	`WHERE label = 'protein'`

IRIS-Native Optimizations

Globals Storage: Append-only (no UPDATE contention)
Partial Indexes: Filter at index level (WHERE system_to IS NULL)
Temporal Views: Pre-filter current versions
Foreign Key Constraints: Referential integrity across graph
HNSW Vector Index: 100x faster than flat search (ACORN-1)
PPR Functional Index: ObjectScript $LISTBUILD + $LISTNEXT for 8.9x faster PageRank at scale (10K nodes: 184ms vs 1,631ms Python)

Performance

Financial Services (Fraud Detection)

Metric	Community IRIS	Licensed IRIS
Transactions	30M	130M
Database Size	5.3GB	22.1GB
Fraud Scoring	<10ms	<10ms
Bitemporal Queries	<10ms (indexed)	<10ms (indexed)
Time-Travel Queries	<50ms	<50ms
Late Arrival Detection	Pattern search across 30M	Pattern search across 130M

Biomedical (Protein Networks)

Metric	Pure Python	ObjectScript Native
Vector Search	5800ms (flat) → 1.7ms (HNSW)	Same (HNSW index)
Multi-hop Queries	<50ms	<50ms
Hybrid Search (RRF)	<100ms	<20ms
Personalized PageRank (1K)	14.5ms	14.3ms
Personalized PageRank (10K)	1,631ms	184ms (8.9x faster) ✨
Graph Analytics	NetworkX integration	Zero-copy Global access

Tested At Scale:

✅ 130M fraud transactions (licensed IRIS)
✅ 100K+ protein interactions (STRING DB)
✅ 768-dimensional embeddings (biomedical models)

Usage Examples

Personalized PageRank (PPR)

Compute entity importance scores for knowledge graph ranking:

from iris_vector_graph import IRISGraphEngine
import iris

# Connect to IRIS
conn = iris.connect("localhost", 1972, "USER", "_SYSTEM", "SYS")
engine = IRISGraphEngine(conn)

# Compute PPR scores from seed entity
scores = engine.kg_PERSONALIZED_PAGERANK(
    seed_entities=["PROTEIN:TP53"],  # Seed with cancer protein
    damping_factor=0.85,              # Standard PageRank parameter
    top_k=20                          # Return top 20 scored entities
)

# Results: {'PROTEIN:TP53': 0.152, 'PROTEIN:MDM2': 0.087, ...}

# Rank documents by PPR scores
docs = engine.kg_PPR_RANK_DOCUMENTS(
    seed_entities=["PROTEIN:TP53"],
    top_k=10
)

# Results: [{document_id, score, top_entities, entity_count}, ...]

Performance: <25ms for 1K entities, ~200ms for 10K entities (Python implementation)

Documentation

Getting Started

Architecture & Design

API Reference

Examples

Repository Structure

sql/
  schema.sql              # Core graph schema
  bitemporal/             # Fraud detection schema
  fraud/                  # Transaction tables

examples/
  bitemporal/             # Financial services (fraud, audit)

biomedical/               # Life sciences (proteins, pathways)

iris_vector_graph/   # Python graph engine

src/iris_fraud_server/    # FastAPI fraud API

scripts/
  fraud/                  # 130M loader, benchmarks
  migrations/             # NodePK migration

docker/
  Dockerfile.fraud-embedded      # Licensed IRIS + fraud API
  start-fraud-server.sh          # Embedded Python startup

License

MIT License - See LICENSE

Contributing

We welcome contributions! This repo demonstrates IRIS versatility across:

Financial Services: Fraud detection, bitemporal data, regulatory compliance
Biomedical Research: Protein networks, drug discovery, literature mining

Feel free to add examples from other domains or improve existing implementations.

Production-Ready: Proven with 130M+ financial transactions and 100K+ biomedical interactions on InterSystems IRIS.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.54.1

Apr 19, 2026

1.54.0

Apr 19, 2026

1.53.1

Apr 19, 2026

1.53.0

Apr 19, 2026

1.52.1

Apr 19, 2026

1.52.0

Apr 19, 2026

1.51.1

Apr 19, 2026

1.51.0

Apr 19, 2026

1.50.2

Apr 18, 2026

1.50.1

Apr 18, 2026

1.50.0

Apr 18, 2026

1.49.0

Apr 18, 2026

1.48.0

Apr 18, 2026

1.47.1

Apr 11, 2026

1.47.0

Apr 11, 2026

1.46.0

Apr 7, 2026

1.45.3

Apr 5, 2026

1.45.2

Apr 3, 2026

1.45.1

Apr 3, 2026

1.45.0

Apr 3, 2026

1.44.0

Apr 3, 2026

1.43.0

Apr 3, 2026

1.42.0

Apr 3, 2026

1.41.0

Apr 3, 2026

1.40.0

Apr 2, 2026

1.39.0

Apr 2, 2026

1.38.0

Apr 1, 2026

1.37.0

Apr 1, 2026

1.36.0

Mar 31, 2026

1.35.0

Mar 31, 2026

1.34.0

Mar 31, 2026

1.33.0

Mar 31, 2026

1.32.0

Mar 31, 2026

1.31.0

Mar 31, 2026

1.30.0

Mar 31, 2026

1.29.0

Mar 31, 2026

1.28.0

Mar 30, 2026

1.27.0

Mar 29, 2026

1.26.4

Mar 29, 2026

1.26.3

Mar 29, 2026

1.26.2

Mar 29, 2026

1.26.1

Mar 29, 2026

1.26.0

Mar 29, 2026

1.25.1

Mar 29, 2026

1.25.0

Mar 29, 2026

1.24.1

Mar 29, 2026

1.24.0

Mar 29, 2026

1.23.0

Mar 29, 2026

1.22.1

Mar 29, 2026

1.22.0

Mar 29, 2026

1.21.1

Mar 29, 2026

1.21.0

Mar 29, 2026

1.20.2

Mar 29, 2026

1.20.1

Mar 28, 2026

1.20.0

Mar 28, 2026

1.19.2

Mar 28, 2026

1.19.1

Mar 28, 2026

1.19.0

Mar 28, 2026

1.18.0

Mar 28, 2026

1.17.0

Mar 27, 2026

1.16.2

Mar 23, 2026

1.16.1

Mar 19, 2026

1.16.0

Mar 19, 2026

1.15.0

Mar 19, 2026

1.14.1

Mar 19, 2026

1.14.0

Mar 19, 2026

1.13.1

Mar 19, 2026

1.13.0

Mar 19, 2026

1.12.0

Mar 19, 2026

1.11.0

Mar 18, 2026

1.10.2

Mar 18, 2026

1.10.1

Mar 18, 2026

1.10.0

Mar 18, 2026

1.9.0

Feb 28, 2026

1.8.2

Feb 24, 2026

1.8.1

Feb 24, 2026

1.8.0

Feb 24, 2026

1.7.0

Feb 22, 2026

1.6.5

Feb 20, 2026

1.6.4

Feb 20, 2026

1.6.3

Feb 18, 2026

1.6.2

Feb 18, 2026

1.6.1

Feb 18, 2026

1.6.0

Feb 8, 2026

1.5.4

Feb 8, 2026

1.5.3

Feb 8, 2026

1.5.2

Feb 1, 2026

1.5.1

Feb 1, 2026

1.5.0

Feb 1, 2026

1.4.9

Feb 1, 2026

1.4.8

Feb 1, 2026

1.4.7

Feb 1, 2026

1.4.6

Feb 1, 2026

1.4.5

Feb 1, 2026

1.4.4

Feb 1, 2026

1.4.3

Feb 1, 2026

1.4.1

Feb 1, 2026

1.4.0

Feb 1, 2026

1.3.4

Feb 1, 2026

1.3.3

Jan 26, 2026

1.3.2

Jan 26, 2026

1.3.1

Jan 26, 2026

1.3.0

Jan 26, 2026

1.2.0

Jan 26, 2026

1.1.9

Dec 19, 2025

1.1.8

Dec 17, 2025

1.1.7

Nov 23, 2025

1.1.6

Nov 15, 2025

1.1.5

Nov 15, 2025

1.1.4

Nov 9, 2025

1.1.3

Nov 9, 2025

1.1.2

Nov 8, 2025

This version

1.1.1

Nov 8, 2025

1.1.0

Nov 8, 2025

1.0.0

Nov 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iris_vector_graph-1.1.1.tar.gz (730.6 kB view details)

Uploaded Nov 8, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

iris_vector_graph-1.1.1-py3-none-any.whl (44.6 kB view details)

Uploaded Nov 8, 2025 Python 3

File details

Details for the file iris_vector_graph-1.1.1.tar.gz.

File metadata

Download URL: iris_vector_graph-1.1.1.tar.gz
Upload date: Nov 8, 2025
Size: 730.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for iris_vector_graph-1.1.1.tar.gz
Algorithm	Hash digest
SHA256	`df3c2572c5e19ea2ffe9d564ee2f468a560efd36a10fa3e543c00db6927edf25`
MD5	`f350d566b99907a8c6adb90aaee357be`
BLAKE2b-256	`3eda136f4d6b728382528b35adfbd41791407fde58e70f3b2ba1162799e964d0`

See more details on using hashes here.

File details

Details for the file iris_vector_graph-1.1.1-py3-none-any.whl.

File metadata

Download URL: iris_vector_graph-1.1.1-py3-none-any.whl
Upload date: Nov 8, 2025
Size: 44.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for iris_vector_graph-1.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3e449c1f49857b0b361611f066d2db5ded5bc6a9a0d1fde463b4c2b498071122`
MD5	`ca1b6160ad706e33724da2a28abfd1a4`
BLAKE2b-256	`bbfe899fa155e382e5ac2778b1a669a564e88c89fc65f36b193946f159510abc`

See more details on using hashes here.

iris-vector-graph 1.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

IRIS Vector Graph

Table of Contents

Quick Start

Option A: Fraud Detection (Financial Services)

External Mode (Default - Simpler)

Embedded Mode (Advanced - Optional)

Option B: Biomedical Graph (Life Sciences)

External Mode (Default - Simpler)

Embedded Mode (Advanced - Optional)

Use Cases by Industry

Financial Services (IDFS)

Biomedical Research

Graph Algorithms (TSP Examples)

Option A: Python + NetworkX (Biomedical)

Option B: ObjectScript (Healthcare Interoperability)

Architecture

Key Features

Cross-Domain Capabilities

IRIS-Native Optimizations

Performance

Financial Services (Fraud Detection)

Biomedical (Protein Networks)

Usage Examples

Personalized PageRank (PPR)

Documentation

Getting Started

Architecture & Design

API Reference

Examples

Repository Structure

License

Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes