Skip to main content

A temporal graph building library

Project description

Graphiti-ts-small

Temporal Knowledge Graphs for Agentic Applications


Discord Lint Unit Tests MyPy Check Open in GitHub Codespaces


Graphiti builds dynamic, temporally aware Knowledge Graphs that represent complex, evolving relationships between entities over time. Graphiti ingests both unstructured and structured data, and the resulting graph may be queried using a fusion of time, full-text, semantic, and graph algorithm approaches.


Graphiti temporal walkthrough


Graphiti helps you create and query Knowledge Graphs that evolve over time. A knowledge graph is a network of interconnected facts, such as “Kendra loves Adidas shoes.” Each fact is a “triplet” represented by two entities, or nodes (”Kendra”, “Adidas shoes”), and their relationship, or edge (”loves”). Knowledge Graphs have been explored extensively for information retrieval. What makes Graphiti unique is its ability to autonomously build a knowledge graph while handling changing relationships and maintaining historical context.

With Graphiti, you can build LLM applications such as:

  • Assistants that learn from user interactions, fusing personal knowledge with dynamic data from business systems like CRMs and billing platforms.
  • Agents that autonomously execute complex tasks, reasoning with state changes from multiple dynamic sources.

Graphiti supports a wide range of applications in sales, customer service, health, finance, and more, enabling long-term recall and state-based reasoning for both assistants and agents.

Why Graphiti?

We were intrigued by Microsoft’s GraphRAG, which expanded on RAG text chunking by using a graph to better model a document corpus and making this representation available via semantic and graph search techniques. However, GraphRAG did not address our core problem: It's primarily designed for static documents and doesn't inherently handle temporal aspects of data.

Graphiti is designed from the ground up to handle constantly changing information, hybrid semantic and graph search, and scale:

  • Temporal Awareness: Tracks changes in facts and relationships over time, enabling point-in-time queries. Graph edges include temporal metadata to record relationship lifecycles.
  • Episodic Processing: Ingests data as discrete episodes, maintaining data provenance and allowing incremental entity and relationship extraction.
  • Hybrid Search: Combines semantic and BM25 full-text search, with the ability to rerank results by distance from a central node e.g. “Kendra”.
  • Scalable: Designed for processing large datasets, with parallelization of LLM calls for bulk processing while preserving the chronology of events.
  • Supports Varied Sources: Can ingest both unstructured text and structured JSON data.

Graphiti structured + unstructured demo

Graphiti and Zep Memory

Graphiti powers the core of Zep's memory layer for LLM-powered Assistants and Agents.

We're excited to open-source Graphiti, believing its potential reaches far beyond memory applications.

Installation

Requirements:

  • Python 3.10 or higher
  • Neo4j 5.21 or higher
  • OpenAI API key (for LLM inference and embedding)

Optional:

  • Anthropic or Groq API key (for alternative LLM providers)

[!TIP] The simplest way to install Neo4j is via Neo4j Desktop. It provides a user-friendly interface to manage Neo4j instances and databases.

pip install graphiti-core

or

poetry add graphiti-core

Quick Start

[!IMPORTANT] Graphiti uses OpenAI for LLM inference and embedding. Ensure that an OPENAI_API_KEY is set in your environment. Support for Anthropic and Groq LLM inferences is available, too.

from graphiti_core import Graphiti
from graphiti_core.nodes import EpisodeType
from datetime import datetime

# Initialize Graphiti
graphiti = Graphiti("bolt://localhost:7687", "neo4j", "password")

# Initialize the graph database with Graphiti's indices. This only needs to be done once.
graphiti.build_indices_and_constraints()

# Add episodes
episodes = [
    "Kamala Harris is the Attorney General of California. She was previously "
    "the district attorney for San Francisco.",
    "As AG, Harris was in office from January 3, 2011 – January 3, 2017",
]
for i, episode in enumerate(episodes):
    await graphiti.add_episode(
        name=f"Freakonomics Radio {i}",
        episode_body=episode,
        source=EpisodeType.text,
        source_description="podcast",
        reference_time=datetime.now()
    )

# Search the graph
# Execute a hybrid search combining semantic similarity and BM25 retrieval
# Results are combined and reranked using Reciprocal Rank Fusion
results = await graphiti.search('Who was the California Attorney General?')
[
    EntityEdge(
   uuid = '3133258f738e487383f07b04e15d4ac0',
   source_node_uuid = '2a85789b318d4e418050506879906e62',
   target_node_uuid = 'baf7781f445945989d6e4f927f881556',
   created_at = datetime.datetime(2024, 8, 26, 13, 13, 24, 861097),
   name = 'HELD_POSITION',
# the fact reflects the updated state that Harris is
# no longer the AG of California
   fact = 'Kamala Harris was the Attorney General of California',
   fact_embedding = [
      -0.009955154731869698,
       ...
      0.00784289836883545
],
   episodes = ['b43e98ad0a904088a76c67985caecc22'],
   expired_at = datetime.datetime(2024, 8, 26, 20, 18, 1, 53812),
# These dates represent the date this edge was true.
   valid_at = datetime.datetime(2011, 1, 3, 0, 0, tzinfo= < UTC >),
   invalid_at = datetime.datetime(2017, 1, 3, 0, 0, tzinfo= < UTC >)
)
]

# Rerank search results based on graph distance
# Provide a node UUID to prioritize results closer to that node in the graph.
# Results are weighted by their proximity, with distant edges receiving lower scores.
await graphiti.search('Who was the California Attorney General?', center_node_uuid)

# Close the connection
graphiti.close()

Graph Service

The server directory contains an API service for interacting with the Graphiti API. It is built using FastAPI.

Please see the server README for more information.

Documentation

Status and Roadmap

Graphiti is under active development. We aim to maintain API stability while working on:

  • Implementing node and edge CRUD operations
  • Improving performance and scalability
  • Achieving good performance with different LLM and embedding models
  • Creating a dedicated embedder interface
  • Supporting custom graph schemas:
    • Allow developers to provide their own defined node and edge classes when ingesting episodes
    • Enable more flexible knowledge representation tailored to specific use cases
  • Enhancing retrieval capabilities with more robust and configurable options
  • Expanding test coverage to ensure reliability and catch edge cases

Contributing

We encourage and appreciate all forms of contributions, whether it's code, documentation, addressing GitHub Issues, or answering questions in the Graphiti Discord channel. For detailed guidelines on code contributions, please refer to CONTRIBUTING.

Support

Join the Zep Discord server and make your way to the #Graphiti channel!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

graphiti_core-0.3.12.tar.gz (49.9 kB view details)

Uploaded Source

Built Distribution

graphiti_core-0.3.12-py3-none-any.whl (75.2 kB view details)

Uploaded Python 3

File details

Details for the file graphiti_core-0.3.12.tar.gz.

File metadata

  • Download URL: graphiti_core-0.3.12.tar.gz
  • Upload date:
  • Size: 49.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for graphiti_core-0.3.12.tar.gz
Algorithm Hash digest
SHA256 7b48560d0f12fd5bcdb92645eea76562958f0e8ada34a5a6742ecf5f90ea3227
MD5 0a4a49aca04c931e3d9e8b821bb77741
BLAKE2b-256 1af77bf77e45032f9fa89456665e4dd48af3f19266bdf76719f3bee3bdc9055e

See more details on using hashes here.

File details

Details for the file graphiti_core-0.3.12-py3-none-any.whl.

File metadata

File hashes

Hashes for graphiti_core-0.3.12-py3-none-any.whl
Algorithm Hash digest
SHA256 8ca6c09b5139c6d78f3b42162a7ae2e1118e91caaeb5c29eb9af6a8d2efcf580
MD5 64da629d4a1e31b51f25d10a7fc79e19
BLAKE2b-256 2d5cfbb0ac160029ae89fa0a59b7bccb4ba1c4b95cb59f2702b5f115fd7bf32f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page