A FastAPI application with Langchain, Weaviate, and MinIO integrations
Project description
This will guide users on how to deploy the application using Docker, which simplifies the setup process by handling dependencies and configurations through Docker containers. Here's an updated version of the README.md
with Docker Compose deployment instructions:
README.md
My FastAPI Application with Langchain, Weaviate, and MinIO Integrations
This FastAPI application demonstrates an integrated text processing pipeline using Weaviate and MinIO. It's designed to efficiently process, store, and retrieve documents through a Dockerized environment.
Features
- Process documents from a MinIO bucket.
- Use OpenAI's LangChain for advanced text processing.
- Index and manage documents in Weaviate.
- Easy deployment with Docker Compose.
Requirements
- Docker
- Docker Compose
Installation and Deployment
-
Clone the Repository:
git clone https://github.com/Cdaprod/minio-weaviate-langchain cd minio-weaviate-langchain
-
Start the Application with Docker Compose:
Run the following command to start all services defined in your
docker-compose.yaml
:docker-compose up
This command will build and start containers for your FastAPI app, Weaviate, and MinIO.
Proof of Concept
The application showcases the following workflow:
-
Load Documents from MinIO: The application retrieves documents stored in a MinIO bucket.
-
Document Processing: Documents are processed via LangChain's OpenAI integration.
-
Indexing in Weaviate: Processed documents are indexed in Weaviate for full-text search and data retrieval.
-
API Endpoints:
POST /process_documents
: Processes and indexes documents from MinIO into Weaviate.GET /
: Health check endpoint.POST /index_from_minio
: Indexes documents from MinIO into Weaviate.POST /query
: Queries documents in Weaviate.POST /update/{uuid}
: Updates a document in Weaviate.DELETE /delete/{uuid}
: Deletes a document from Weaviate.
Example Usage
-
Index Documents from MinIO:
import requests response = requests.post('http://localhost:8000/index_from_minio') print(response.json())
-
Query Documents in Weaviate:
query = '{"query": "your_query_here"}' response = requests.post('http://localhost:8000/query', json=query) print(response.json())
Contributing
Contributions are welcome! Fork the repository, make your changes, and submit a pull request.
Further Development for Notebook Environment
To adapt your FastAPI application with Weaviate and MinIO integration for use in a Jupyter notebook, you'll need to structure it as a Python package that can be installed and used interactively. The structure of the package should facilitate easy installation and usage within a notebook environment.
Here's a basic outline of steps to restructure your application:
1. Organize Your Application as a Package
Your project directory might look something like this:
minio-weaviate-langchain/
my_fastapi_lib/
__init__.py
main.py
weaviate_operations.py
minio_operations.py
# ... other modules ...
setup.py
README.md
LICENSE
In minio-weaviate-langchain/my_fastapi_lib/__init__.py
, you can import the main components:
from .main import app
from .weaviate_operations import WeaviateOperations
from .minio_operations import load_documents_from_minio
2. Setup.py for Installation
Your setup.py
allows the package to be installed via pip. Here's an example:
from setuptools import setup, find_packages
setup(
name='my_fastapi_lib',
version='0.1.0',
packages=find_packages(),
include_package_data=True,
install_requires=[
'fastapi',
'uvicorn',
'weaviate-client',
'minio',
'langchain',
# ... other dependencies ...
],
)
3. Using the Library in a Jupyter Notebook
Once your library is structured and installable, you can create a Jupyter notebook to demonstrate its usage. For example:
Cell 1: Install the library
!pip install git+https://github.com/yourusername/my_fastapi_lib.git
Cell 2: Import and Initialize Components
from my_fastapi_lib import WeaviateOperations, load_documents_from_minio
# Setup Weaviate and MinIO clients
weaviate_ops = WeaviateOperations(WEAVIATE_ENDPOINT)
minio_documents = load_documents_from_minio(MINIO_BUCKET, MINIO_ENDPOINT, MINIO_ACCESS_KEY, MINIO_SECRET_KEY)
Cell 3: Use the Library
# Example operation
for doc in minio_documents:
processed_doc = process_document(doc) # Define this function as per your processing logic
weaviate_ops.index_document(MINIO_BUCKET, "DocumentName", processed_doc)
4. Documentation and Examples
Provide clear instructions and examples in your README.md
and include Jupyter notebooks that demonstrate the use of your library.
5. Running the FastAPI App
Since the FastAPI app is designed to be run as a server, it might not be directly applicable in a Jupyter notebook for interactive use. Instead, focus on demonstrating how to interact with the server (which could be running locally or remotely) using requests or similar in the notebook.
Note
Adapting a FastAPI application for use as a library in a Jupyter notebook involves a shift in focus. The interactive notebook environment is more suited for demonstrating API calls to the running FastAPI server, rather than running the server itself within the notebook.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file minio-weaviate-langchain-0.0.0.tar.gz
.
File metadata
- Download URL: minio-weaviate-langchain-0.0.0.tar.gz
- Upload date:
- Size: 4.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | acebe7cfc0db06ffb40955b7f459d8006028f4a08f07517b4b0fa39eba716445 |
|
MD5 | 49a9a18f833f9bdf65e8d70f4560511e |
|
BLAKE2b-256 | 94750ecd2f8e10d54c16ce595771c604a0e09a2c2f196bc0156b1e2ad2b1918e |
File details
Details for the file minio_weaviate_langchain-0.0.0-py3-none-any.whl
.
File metadata
- Download URL: minio_weaviate_langchain-0.0.0-py3-none-any.whl
- Upload date:
- Size: 3.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0aabe43e4ad4ba688dd06cc451a5d6e63d291de3346de570f5434a707ba7a414 |
|
MD5 | 9ad74c614f107b1ffe6f4107b50c6ca2 |
|
BLAKE2b-256 | cfdea323b2f28f6ee3323ba9d7271ab07de20512a0222c17dacc9d803a4febc2 |