
Document QA (RAG) package using LangChain and ChromaDB


akasha

akasha simplifies document-based Question Answering (QA) and Retrieval Augmented Generation (RAG) by harnessing the power of Large Language Models to accurately answer your queries while searching through your provided documents.

With akasha, you have the flexibility to choose from a variety of language models, embedding models, and search types. Adjusting these parameters is straightforward, allowing you to optimize your approach and discover the most effective methods for obtaining accurate answers from Large Language Models.

For the Chinese manual, please visit the manual.

Quick Start (Local Development)

If you are developing in this repository (instead of only installing from PyPI), use an editable install:

cd akasha
uv venv --python 3.10
source .venv/bin/activate  # Windows PowerShell: .venv\Scripts\Activate.ps1
uv pip install -e .

Set at least one model API key:

export OPENAI_API_KEY="your_key"

# or
export GEMINI_API_KEY="your_key"

Run a quick example:

python examples/ex_rag.py
python examples/ex_agent.py

Change log

  • 1.1

    1. fixed keep_logs consistency for ask, RAG, summary, websearch, and eval
    2. added INFO-level runtime logging for main execution flows
    3. improved exception-path logging to ensure ERROR entries are written to log files
  • 1.0

    1. bug fixes
    2. added a lightweight installation option (API-call-only mode)
    3. upgraded LangChain to 1.2
  • 0.9.14

    1. added function calling support
    2. added MCP agent support

Installation

We recommend using Python 3.10 to run our akasha package. You can use uv or Anaconda to create a virtual environment.

Standard Installation

### create environment
uv venv --python 3.10

### install akasha
uv pip install akasha-terminal

Lightweight Installation (API-call-only, v1.0+)

### create environment
uv venv --python 3.10
source .venv/bin/activate  # Windows PowerShell: .venv\Scripts\Activate.ps1

### install lightweight mode (API-call-only)
uv pip install "akasha-terminal[light]"

If you are developing in this repository:

uv pip install -e .

or install only the lightweight (API-call-only) extras:

uv pip install -e ".[light]"
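To verify the installation in either mode, you can inspect the installed package metadata:

uv pip show akasha-terminal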

API Keys

OPENAI

If you want to use OpenAI models or embeddings, go to OpenAI to get an API key. You can either save OPENAI_API_KEY=your api key in a .env file in your current working directory, or set it as an environment variable, using export in bash or os.environ in Python.

# set an environment variable

export OPENAI_API_KEY="your api key"
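Or, as mentioned above, a minimal sketch of setting the key from inside Python (do this before constructing any akasha object):

import os

# make the key visible to the current process only
os.environ["OPENAI_API_KEY"] = "your api key"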

GEMINI

If you want to use Gemini models, set GEMINI_API_KEY in your .env file or export it in your shell.

.env example:

GEMINI_API_KEY=your_gemini_api_key

Shell example:

export GEMINI_API_KEY="your_gemini_api_key"

AZURE OPENAI

If you want to use Azure OpenAI, go to Azure OpenAI to get your own Language API base URL and key. Also, remember to deploy all the models in Azure OpenAI Studio; each deployment name should be the same as its model name. Save OPENAI_API_KEY=your azure key, OPENAI_API_BASE=your Language API base url, OPENAI_API_TYPE=azure, and OPENAI_API_VERSION=2023-05-15 in a .env file in your current working directory. If you want to save both an OpenAI key and an Azure key at the same time, you can instead use AZURE_API_KEY, AZURE_API_BASE, AZURE_API_TYPE, and AZURE_API_VERSION:

## .env file
AZURE_API_KEY={your azure key}
AZURE_API_BASE={your Language API base url}
AZURE_API_TYPE=azure
AZURE_API_VERSION=2023-05-15

Now we can run akasha in Python:

# Python 3.10+
import akasha

# simple QA
ak = akasha.ask(model="gemini:gemini-2.5-flash")
response = ak(
    prompt="akasha 是什麼?",
    info=["https://github.com/iii-org/akasha"],
)

And then run a RAG example:

# Python 3.10+
import akasha
data_source = "doc/mic"
prompt = "五軸是什麼?"
ak = akasha.RAG(model="gemini:gemini-2.5-flash")
response = ak(data_source, prompt)

Some models you can use

Please note that for OpenAI models you need to set the environment variable OPENAI_API_KEY, and most Hugging Face models require a GPU to run. For .gguf models, however, you can run them on a CPU.

openai_model = "openai:gpt-3.5-turbo"  # need environment variable "OPENAI_API_KEY" or "AZURE_API_KEY"
openai4_model = "openai:gpt-4"  # need environment variable "OPENAI_API_KEY" or "AZURE_API_KEY"
gemini_flash_model = "gemini:gemini-2.5-flash"  # need environment variable "GEMINI_API_KEY"
huggingface_model = "hf:meta-llama/Llama-2-7b-chat-hf"  # need environment variable "HUGGINGFACEHUB_API_TOKEN" to download meta-llama model
quantized_ch_llama_model = "gptq:FlagAlpha/Llama2-Chinese-13b-Chat-4bit"
taiwan_llama_gptq = "gptq:weiren119/Taiwan-LLaMa-v1.0-4bits-GPTQ"
mistral = "hf:Mistral-7B-Instruct-v0.2"
mediatek_Breeze = "hf:MediaTek-Research/Breeze-7B-Instruct-64k-v0.1"

### If you want to use llama-cpp to run models on CPU, you can download gguf versions of models,
### e.g. from https://huggingface.co/TheBloke/Llama-2-7b-Chat-GGUF
### or https://huggingface.co/TheBloke/CodeUp-Llama-2-13B-Chat-HF-GGUF;
### the name behind "llama-gpu:" or "llama-cpu:" is the path of the downloaded .gguf file
llama_cpp_model = "llama-cpp:model/llama-2-13b-chat-hf.Q5_K_S.gguf"
llama_cpp_model = "llama-cpp:model/llama-2-7b-chat.Q5_K_S.gguf"
llama_cpp_chinese_alpaca = "llama-cpp:model/chinese-alpaca-2-7b.Q5_K_S.gguf"
chatglm_model = "chatglm:THUDM/chatglm2-6b"
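Any of these identifier strings can be passed as the model argument shown earlier. For instance, mirroring the akasha.ask example above with the model swapped:

import akasha

# requires OPENAI_API_KEY (or AZURE_API_KEY), per the note above
ak = akasha.ask(model="openai:gpt-4")
response = ak(
    prompt="akasha 是什麼?",
    info=["https://github.com/iii-org/akasha"],
)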

Some embeddings you can use

Please note that each embedding model has a different window size; text longer than the max sequence length will be truncated and will not be represented by the embedding model.

rerank_base and rerank_large are not embedding models; instead, they compare the query to each chunk of the documents and return similarity scores. As a result, they offer higher accuracy than embedding models but may be slower.

openai_emd = "openai:text-embedding-ada-002"  # need environment variable "OPENAI_API_KEY"  # 8192 max seq length
huggingface_emd = "hf:all-MiniLM-L6-v2" 
text2vec_ch_emd = "hf:shibing624/text2vec-base-chinese"   # 128 max seq length 
text2vec_mul_emd = "hf:shibing624/text2vec-base-multilingual"  # 256 max seq length
text2vec_ch_para_emd = "hf:shibing624/text2vec-base-chinese-paraphrase" # perform better for long text, 256 max seq length
bge_en_emd = "hf:BAAI/bge-base-en-v1.5"  # 512 max seq length
bge_ch_emd = "hf:BAAI/bge-base-zh-v1.5"  # 512 max seq length

rerank_base = "rerank:BAAI/bge-reranker-base"    # 512 max seq length
rerank_large = "rerank:BAAI/bge-reranker-large"  # 512 max seq length
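A hedged sketch of selecting one of these embedding models in the RAG example from above (the embeddings parameter name is an assumption based on akasha's manual; consult it for the exact signature):

import akasha

# hypothetical usage: the `embeddings` parameter name is assumed,
# not confirmed by this page
ak = akasha.RAG(
    model="gemini:gemini-2.5-flash",
    embeddings="hf:BAAI/bge-base-zh-v1.5",  # 512 max seq length
)
response = ak("doc/mic", "五軸是什麼?")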

File Summarization

To create a summary of a text file in various formats like .pdf, .txt, or .docx, you can use the summary class. For example, the following code uses the map_reduce method to generate a summary.

There are two summary types, map_reduce and refine. map_reduce summarizes each chunk first, then produces a final summary from all chunk summaries. refine summarizes chunk-by-chunk and uses the previous summary as context for the next chunk, which often improves consistency.

import akasha

summ = akasha.summary(
    model="gemini:gemini-2.5-flash",
    sum_type="map_reduce",
    chunk_size=1000,
    sum_len=1000,
    language="en",
    keep_logs=True,
    verbose=True,
    max_input_tokens=8000,
)

# Content can be a URL, file, or plain text
ret = summ(content=["https://github.com/iii-org/akasha"])
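For the refine strategy, the same call with sum_type="refine" should work (a minimal variation of the example above; the file path is hypothetical):

# refine: summarize chunk-by-chunk, carrying the running summary forward
summ_refine = akasha.summary(
    model="gemini:gemini-2.5-flash",
    sum_type="refine",
    chunk_size=1000,
    sum_len=1000,
)
ret = summ_refine(content=["doc/mic/report.pdf"])  # hypothetical local file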

Agent

By implementing an agent, you empower the LLM with the capability to use tools to accomplish tasks more effectively. You can allocate tools for tasks such as file editing, conducting Google searches, and enlisting the LLM's assistance in task execution, rather than relying on it solely to respond to your questions.

Use Built-in Tools

import akasha
import akasha.agent.agent_tools as at

# Use built-in web search and JSON save tools
tool_list = [at.websearch_tool(search_engine="brave"), at.saveJSON_tool()]

agent = akasha.agents(
    tools=tool_list,
    model="gemini:gemini-2.5-flash",
    temperature=1.0,
    max_input_tokens=8000,
    verbose=True,
    keep_logs=True,
)

# Ask a question and let the agent use tools to answer
response = agent("Search for Industry 4.0 on the web and save the result to iii.json")
print(response)

# Save logs
agent.save_logs("logs.json")

Define and Use a Custom Tool

import akasha
from datetime import datetime

# Define a tool to get today's date
def today_f():
    now = datetime.now()
    return "today's date: " + str(now.strftime("%Y-%m-%d %H:%M:%S"))

# Create the tool
today_tool = akasha.create_tool(
    "This is the tool to get today's date, the tool doesn't have any input parameter.",
    today_f,
    "today_date_tool",
)

# Create an agent with the tool
agent = akasha.agents(
    tools=[today_tool],
    model="gemini:gemini-2.5-flash",
    temperature=1.0,
    verbose=True,
    keep_logs=True,
)

# Ask a question and let the agent use the tool
response = agent("What is today's date?")
print(response)

# Save logs
agent.save_logs("logs.json")

Use Tools from MCP Servers

import akasha

MODEL = "gemini:gemini-2.5-flash"

# Define MCP server connection info
connection_info = {
    "math": {
        "command": "python",
        "args": ["cal_server.py"],
        "transport": "stdio",
    },
    "weather": {
        "url": "http://localhost:8000/sse",
        "transport": "sse",
    },
}
prompt = "tell me the weather in Taipei"

# Use MCP tools
agent = akasha.agents(
    model=MODEL,
    temperature=1.0,
    verbose=True,
    keep_logs=True,
)
response = agent.mcp_agent(connection_info, prompt)
print(response)
agent.save_logs("logs_agent.json")
