
codeqai


Search your codebase semantically or chat with it from the CLI. 100% local support without any data leaks.
Built with langchain, treesitter, sentence-transformers, instructor-embedding, faiss, llama.cpp, and Ollama.


✨ Features

  • 🔎  Semantic code search
  • 💬  GPT-like chat with your codebase
  • 💻  100% local embeddings and LLMs
    • sentence-transformers, instructor-embeddings, llama.cpp, Ollama
  • 🌐  OpenAI and Azure OpenAI support

[!NOTE]
Results are better when the code is well documented. You might consider doc-comments-ai for generating code documentation.

🚀 Usage

Start semantic search:

codeqai search

Start chat dialog:

codeqai chat

📋 Requirements

  • Python >= 3.9

🔧 Installation

pipx install codeqai

On first usage, you will be asked to install faiss-cpu or faiss-gpu. faiss-gpu is recommended if your hardware supports CUDA 7.5+. If local embeddings and LLMs are used, you will later also be asked to install sentence-transformers, instructor, or llama.cpp.
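
If you prefer to pre-install the FAISS dependency yourself instead of answering the prompt, the pipx-managed environment can also be injected manually. This is only an optional shortcut, not part of codeqai's own setup flow:

pipx inject codeqai faiss-cpu

Use faiss-gpu instead of faiss-cpu if your GPU supports CUDA 7.5+.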

⚙️ Configuration

On first usage, or by running

codeqai configure

the configuration process is started, where the embedding model and LLM can be chosen.

🌐 Remote models

If remote models are preferred over local ones, some environment variables need to be set in advance.

OpenAI

export OPENAI_API_KEY="your OpenAI API key"

Azure OpenAI

export OPENAI_API_TYPE="azure"
export OPENAI_API_BASE="https://<your-endpoint>.openai.azure.com/"
export OPENAI_API_KEY="your Azure OpenAI API key"
export OPENAI_API_VERSION="2023-05-15"

💡 How it works

The entire git repository is parsed with treesitter to extract all methods together with their documentation. These are embedded with sentence-transformers, instructor-embeddings, or OpenAI's text-embedding-ada-002 and stored in a local FAISS vector database. The database is saved to a file on your system and reloaded on subsequent usage.
Afterwards, semantic search over the codebase can be performed based on the chosen embedding model.
To chat with the codebase locally, llama.cpp or Ollama is used with the model you specify. With llama.cpp, the specified model must already be available on the system. With Ollama, the Ollama container with the desired model must already be running locally on port 11434. Alternatively, OpenAI or Azure OpenAI can be used as remote chat models.
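
For example, preparing Ollama for a local chat session might look like this (a minimal sketch, assuming Ollama is installed; codellama is just an example, any model from the Ollama library works):

ollama serve
ollama pull codellama

ollama serve starts the local server on port 11434 (it may already be running as a service), and ollama pull downloads the model you will later select in codeqai.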

📚 Supported Languages

  • Python
  • TypeScript
  • JavaScript
  • Java
  • Rust
  • Kotlin
  • Go
  • C++
  • C
  • C#

FAQ

Where do I get models for llama.cpp?

Install the huggingface-cli and download your desired model from the Hugging Face Hub. For example,

huggingface-cli download TheBloke/CodeLlama-13B-Python-GGUF codellama-13b-python.Q5_K_M.gguf

will download the codellama-13b-python.Q5_K_M model. After the download has finished, the absolute path of the model's .gguf file is printed to the console.

[!IMPORTANT]
llama.cpp compatible models must be in the .gguf format.

