Skip to main content

Chat with your docs using langchain in a streamlit app with mistral or llama in ollama.

Project description

DocsChat 📚🗣️

docschat is a command-line interface that let's you start a local streamlit server and interact with your documents.

The chatbot utilizes a conversational retrieval chain to answer user queries based on the content of embedded documents. It leverages various NLP techniques, including language models and embeddings, to provide relevant responses.

Features

  • Document Embedding: Embeds PDF documents for efficient retrieval of information.
  • Conversational Interface: Allows users to interact with documents through a chat interface.
  • Settings: Provides customizable settings for configuring document retrieval and model parameters.

Installation

To run the application locally, follow these steps for installation.

pip install DocsChat

Pulll Ollama llm:

ollama pull llama3
ollama pull llama2
ollama pull gemma
ollama pull mistral
ollama pull codellama

Start the Ollama server:

ollama run llama3

Run the application:

docschat

Configure

DocsChat

PDF sources

  • Configure the PDF source directory from which all PDFs should be read in recusively.
  • Select a splitter, this has an influence on the chunks that we will make available to the LLM and thus also on the answers. By default no splitter is selected, this means a larger context.

Vector store

Vector store

  • Chroma DB in memory is used as a vector store, which stores the data in a Persit directory, so the data in the DB is also available after the restart.
  • The Retriever search type has and the various parameters influence the search of documents in the Vectore Store.

Ollama

Ollama

  • Configure the ollama server connection and the model with which the server was started.
  • the LLM parameters influence the embedding of the PDFs but also the answering of questions in the RAG pipeline.

Actions

Actions

There are two functions available, the sync of PDF documents into the Vectore Store. This can take some time depending on the system resources, embedding and splitter. The Delete DB function deletes the Chroma Collection.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docschat-5.0.0.tar.gz (10.6 kB view details)

Uploaded Source

Built Distribution

DocsChat-5.0.0-py3-none-any.whl (11.3 kB view details)

Uploaded Python 3

File details

Details for the file docschat-5.0.0.tar.gz.

File metadata

  • Download URL: docschat-5.0.0.tar.gz
  • Upload date:
  • Size: 10.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for docschat-5.0.0.tar.gz
Algorithm Hash digest
SHA256 4e0d95ef02b1a0589d898f80967a864062aeeffa57608306958b98918dd5951b
MD5 f1eacc3d74a174242c07739df4c3bdc0
BLAKE2b-256 87c1761df07ee056667a82e3cbb71297e42e5d40f006c21b38c55b9dbf1769f3

See more details on using hashes here.

File details

Details for the file DocsChat-5.0.0-py3-none-any.whl.

File metadata

  • Download URL: DocsChat-5.0.0-py3-none-any.whl
  • Upload date:
  • Size: 11.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for DocsChat-5.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fe6b0553d7fcb86a841c30be024435baffc75d48f9de1c0009289ef5a8a01187
MD5 ebc7782dd62bb559d62d8c664f824658
BLAKE2b-256 3ae430ea653f426f25629b76d415c5b4343637b3b53a049b28e6740ba9628506

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page