GraphRetrieval

GraphRetrieval is a Python library for text retrieval and knowledge graph querying. It provides GraphRAG for building and querying graphs from text documents, KnowledgeRAG for querying a Neo4j knowledge graph (optionally combined with vector search), and ImageGraphRAG for finding similar images.

Installation

pip install -e git+https://github.com/jayavibhavnk/GraphRetrieval.git#egg=GraphRetrieval

or

pip install GraphRetrieval
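
After installing, it can help to confirm which version ended up in the environment, since behaviour differs between releases. The following is a minimal sketch using only the standard library. The helper name `installed_version` is hypothetical, and the distribution name is assumed to be `GraphRetrieval`, as in the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str = "GraphRetrieval"):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(found if found else "GraphRetrieval is not installed")
```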

Usage

Setting Up Environment Variables

Before using the library, set up the necessary environment variables for Neo4j and OpenAI:

import os

os.environ["NEO4J_URI"] = "add your Neo4j URI here"
os.environ["NEO4J_USERNAME"] = "add your Neo4j username here"
os.environ["NEO4J_PASSWORD"] = "add your Neo4j password here"
os.environ['OPENAI_API_KEY'] = "add your OpenAI API key here"
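
A missing or blank variable tends to surface later as a connection or authentication error, so a quick check up front can save some debugging. This is a minimal sketch, not part of the library: the helper `missing_env_vars` is hypothetical, and the variable names are the four listed above.

```python
import os

# Names taken from the setup snippet above.
REQUIRED_VARS = ("NEO4J_URI", "NEO4J_USERNAME", "NEO4J_PASSWORD", "OPENAI_API_KEY")


def missing_env_vars(environ=None, required=REQUIRED_VARS):
    """Return the required variable names that are unset or blank."""
    env = os.environ if environ is None else environ
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        print("Missing environment variables:", ", ".join(missing))
```

Passing a plain dictionary as `environ` makes the check easy to try without touching the real environment.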

GraphRAG

GraphRAG is used to create and query graphs based on text documents.

Example

import GraphRetrieval
from GraphRetrieval import GraphRAG

grag = GraphRAG()
grag.create_graph_from_file('add file path here')

# Query using the default A* search
print(grag.queryLLM("Ask your query here")) 

# Switch to greedy search
grag.retrieval_model = "greedy"
print(grag.queryLLM("Ask your query here"))
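
To illustrate the difference between the two strategies named in the comments above, here is a toy sketch of A* and greedy best-first search on a small weighted graph. It is not the library's implementation, whose internals are not shown in this README. The function `best_first_path`, the graph, and the heuristic values are all made up for illustration, and the heuristic is assumed to be consistent.

```python
import heapq


def best_first_path(graph, heuristic, start, goal, use_path_cost=True):
    """Return (path, cost) from start to goal, or (None, inf) if unreachable.

    use_path_cost=True  -> A*: priority is cost so far + heuristic.
    use_path_cost=False -> greedy best-first: priority is the heuristic alone.
    """
    frontier = [(heuristic[start], 0.0, start, [start])]
    best_cost = {start: 0.0}
    expanded = set()
    while frontier:
        _, cost, node, path = heapq.heappop(frontier)
        if node == goal:
            return path, cost
        if node in expanded:
            continue
        expanded.add(node)
        for neighbor, weight in graph.get(node, {}).items():
            new_cost = cost + weight
            if use_path_cost:
                # A*: keep a neighbor only if this route to it is cheaper.
                if new_cost >= best_cost.get(neighbor, float("inf")):
                    continue
                best_cost[neighbor] = new_cost
                priority = new_cost + heuristic[neighbor]
            else:
                # Greedy: ignore accumulated cost, skip already expanded nodes.
                if neighbor in expanded:
                    continue
                priority = heuristic[neighbor]
            heapq.heappush(frontier, (priority, new_cost, neighbor, path + [neighbor]))
    return None, float("inf")


# Hypothetical toy graph: edge weights are traversal costs.
GRAPH = {"A": {"B": 1, "C": 4}, "B": {"D": 5}, "C": {"D": 1}, "D": {}}
# Hypothetical heuristic: estimated remaining cost to reach "D".
HEURISTIC = {"A": 1.5, "B": 0.5, "C": 1.0, "D": 0.0}

astar_path, astar_cost = best_first_path(GRAPH, HEURISTIC, "A", "D", use_path_cost=True)
greedy_path, greedy_cost = best_first_path(GRAPH, HEURISTIC, "A", "D", use_path_cost=False)
```

On this graph A* returns the cheaper route through C, while greedy follows the node that looks closest at each step and ends up on the more expensive route through B. Greedy typically expands fewer nodes, which is the usual reason to choose it.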

KnowledgeRAG

KnowledgeRAG integrates with a knowledge graph and supports hybrid searches combining structured and unstructured data.

Example

from GraphRetrieval import KnowledgeRAG
from langchain_community.graphs import Neo4jGraph

graph = Neo4jGraph()
gr = KnowledgeRAG()

# Initialize graph
gr.init_graph(graph)

# Create the graph chain
gchain = gr.graphChain()

# Query the graph chain
print(gchain.invoke({"question": "Ask your query here"}))

# Hybrid search using Neo4j vector index
gr.init_neo4j_vector_index()
gr.hybrid = True
print(gchain.invoke({"question": "Ask your query here"}))
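
Conceptually, a hybrid search hands the language model both kinds of evidence at once: facts from the graph and passages from a vector search. The sketch below shows one plausible way to merge them into a single context string. It is an assumption for illustration only; the README does not describe how KnowledgeRAG assembles its context, and `build_hybrid_context` is a hypothetical helper.

```python
def build_hybrid_context(structured_facts, passages, max_passages=3):
    """Join graph facts and retrieved passages into one prompt context string.

    structured_facts: iterable of (subject, relation, object) triples.
    passages: list of text snippets, assumed to be ordered best first.
    """
    lines = []
    if structured_facts:
        lines.append("Structured data:")
        lines.extend(f"- {s} {p} {o}" for s, p, o in structured_facts)
    if passages:
        lines.append("Unstructured data:")
        lines.extend(f"- {text}" for text in passages[:max_passages])
    return "\n".join(lines)


# Made-up example inputs.
facts = [("Alice", "WORKS_AT", "Acme")]
snippets = ["Alice joined Acme in 2020.", "Acme builds rockets."]
context = build_hybrid_context(facts, snippets, max_passages=1)
```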

Ingesting Data into Graph

Ingest large text data into the knowledge graph.

text = "enter text here"

from langchain_text_splitters import CharacterTextSplitter

text_splitter = CharacterTextSplitter(
    separator="\n\n",
    chunk_size=1000,
    chunk_overlap=200,
    length_function=len,
    is_separator_regex=False,
)

docs1 = text_splitter.create_documents([text])
docs = gr.generate_graph_from_text(docs1)
gr.ingest_data_into_graph(docs)

gr.init_neo4j_vector_index()
print(gchain.invoke({"question": "Ask your query here"}))
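
To make the `chunk_size` and `chunk_overlap` settings above more concrete, here is a simplified sketch of overlapping chunking in plain Python. Note the simplification: `CharacterTextSplitter` first splits on the separator and then merges pieces up to the size limit, whereas this hypothetical `sliding_chunks` helper just slides a fixed-width window over the characters.

```python
def sliding_chunks(text, chunk_size=1000, chunk_overlap=200):
    """Split text into windows of chunk_size characters sharing chunk_overlap characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")
    step = chunk_size - chunk_overlap  # how far each window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
    return chunks


example = sliding_chunks("abcdefghij", chunk_size=4, chunk_overlap=2)
```

Each chunk repeats the last `chunk_overlap` characters of the previous one, so a sentence that straddles a boundary still appears intact in at least one chunk when it is shorter than the overlap.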

Hybrid Search with GraphRetrieval and Knowledge Base

Combine GraphRAG and KnowledgeRAG for hybrid search.

gr.vector_index = grag
gr.hybrid = True
print(gchain.invoke({"question": "Ask your query here"}))

Image Graph RAG

Build a graph from a directory of images and search it for images similar to a given one.

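# Assumption: ImageGraphRAG is exported at the top level, like GraphRAG and KnowledgeRAG
from GraphRetrieval import ImageGraphRAG

image_graph_rag = ImageGraphRAG()
image_paths = image_graph_rag.create_graph_from_directory('/content/images')
similar_images = image_graph_rag.similarity_search('/content/images/car.jpg', k=5)

# Print the path of each similar image
for doc in similar_images:
    print(doc.metadata["path"])

# Visualize the graph
image_graph_rag.visualize_graph()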
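
The README does not say how similarity between images is computed. A common approach, assumed here purely for illustration, is to embed each image as a vector and rank candidates by cosine similarity. The sketch below uses hand-written vectors in place of real image embeddings, and both function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(query_vec, embeddings, k=5):
    """Return up to k (path, score) pairs ranked by cosine similarity, best first."""
    scored = [(path, cosine_similarity(query_vec, vec)) for path, vec in embeddings.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Made-up two-dimensional "embeddings" keyed by image path.
toy_embeddings = {"car.jpg": [1.0, 0.0], "truck.jpg": [0.9, 0.1], "tree.jpg": [0.0, 1.0]}
ranked = top_k_similar([1.0, 0.0], toy_embeddings, k=2)
```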

Note: This version does not include parallelization; use 0.1.5>= for parallelization.

Contributing

Contributions are welcome! Please submit a pull request or open an issue to discuss what you would like to change.

License

This project is licensed under the MIT License. See the LICENSE file for details.

