Skip to main content

Chat with your docs using langchain in a streamlit app with mistral or llama in ollama.

Project description

DocsChat 📚🗣️

docschat is a command-line interface that let's you start a local streamlit server and interact with your documents.

The chatbot utilizes a conversational retrieval chain to answer user queries based on the content of embedded documents. It leverages various NLP techniques, including language models and embeddings, to provide relevant responses.

Features

  • Document Embedding: Embeds PDF documents for efficient retrieval of information.
  • Conversational Interface: Allows users to interact with documents through a chat interface.
  • Settings: Provides customizable settings for configuring document retrieval and model parameters.

Installation

To run the application locally, follow these steps for installation.

pip install DocsChat

Pulll Ollama llm:

ollama pull llama3
ollama pull llama2
ollama pull gemma
ollama pull mistral
ollama pull codellama

Start the Ollama server:

ollama run llama3

Run the application:

docschat

Configure

DocsChat

PDF sources

  • Configure the PDF source directory from which all PDFs should be read in recusively.
  • Select a splitter, this has an influence on the chunks that we will make available to the LLM and thus also on the answers. By default no splitter is selected, this means a larger context.

Vector store

Vector store

  • Chroma DB in memory is used as a vector store, which stores the data in a Persit directory, so the data in the DB is also available after the restart.
  • The Retriever search type has and the various parameters influence the search of documents in the Vectore Store.

Ollama

Ollama

  • Configure the ollama server connection and the model with which the server was started.
  • the LLM parameters influence the embedding of the PDFs but also the answering of questions in the RAG pipeline.

Actions

Actions

There are two functions available, the sync of PDF documents into the Vectore Store. This can take some time depending on the system resources, embedding and splitter. The Delete DB function deletes the Chroma Collection.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docschat-7.0.0.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

DocsChat-7.0.0-py3-none-any.whl (11.5 kB view details)

Uploaded Python 3

File details

Details for the file docschat-7.0.0.tar.gz.

File metadata

  • Download URL: docschat-7.0.0.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for docschat-7.0.0.tar.gz
Algorithm Hash digest
SHA256 3c6abaefb1099cc02385e156da75abb2717109595226d0a4c22cd60c18021281
MD5 9df7946eac3bee335fd85c972ef4eb92
BLAKE2b-256 2672ee51d0a335852b5da321d9a98b8ee1fe70788ba02e6334f89aada54604f6

See more details on using hashes here.

Provenance

The following attestation bundles were made for docschat-7.0.0.tar.gz:

Publisher: main.yaml on flojud/DocsChat

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file DocsChat-7.0.0-py3-none-any.whl.

File metadata

  • Download URL: DocsChat-7.0.0-py3-none-any.whl
  • Upload date:
  • Size: 11.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for DocsChat-7.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4878058e8757808e135955277d8293cc53e4799f88e4a3c54b4b30e9227a9eb3
MD5 9cc4258def17efd0453b166e7623ce84
BLAKE2b-256 637bc1d05f78abf474ec8163000a31f2882a46b906df6dfdb0b1e26ccdaa5be9

See more details on using hashes here.

Provenance

The following attestation bundles were made for DocsChat-7.0.0-py3-none-any.whl:

Publisher: main.yaml on flojud/DocsChat

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page