Skip to main content

A Python package for embedding and analyzing code changes in git repositories.

Project description

PyPI version License: MIT Downloads LinkedIn

Beatrica Embedding

beatrica-embedding is a Python package designed to embed and analyze code changes in git repositories, utilizing Language Learning Models (LLMs) for enhanced insights. It allows seamless switching between different LLMs, making it highly adaptable for various code analysis needs.

Installation

To install beatrica-embedding, use pip:

pip install beatrica-embedding

Usage

beatrica-embedding offers a flexible way to analyze commit changes and generate insights using different LLMs. Below is an example demonstrating how to extract commit changes from a repository and process them with a chosen LLM.

Extracting and Embedding Code Changes with LLM

from beatrica_git.recent_change_inspector import BeatricaDiffTracker
from beatrica_embedding.embedding_generator import BeatricaCodeChangeProcessor
from langchain_openai import ChatOpenAI
from langchain_mistralai.chat_models import ChatMistralAI


# Extract commit changes from a git repository
beatrica_diff_tracker = BeatricaDiffTracker(base_branch="main")

# Analyze the commits
beatrica_diff_tracker.analyze_commits()

# Get the commit changes
commit_changes = beatrica_diff_tracker.commit_changes.items()

# Choose the LLM for processing
# language_model = ChatOpenAI(model_name="gpt-4-0125-preview", api_key=os.getenv("OPENAI_API_KEY"), max_tokens=1000)
language_model = ChatMistralAI(model="mistral-medium-latest", mistral_api_key=os.getenv("MISTRAL_API_KEY"), max_tokens=500)

code_change_processor = BeatricaCodeChangeProcessor(commit_changes, language_model=language_model)

# Process the commit changes
retrieval_chain = code_change_processor.process()

# Example query to analyze the changes
question = "What are the changes in the following code?"
print(question)
result = retrieval_chain(question)

answer = result['answer']
print(answer)

Features

  • Flexible integration with multiple LLMs for code change analysis.
  • Easy to switch between models like OpenAI's GPT or MistralAI for different levels of analysis.
  • Embedding and analyzing commit changes in git repositories for enhanced insights.
  • Supports detailed analysis of code changes, leveraging the power of conversational models.

Contributing

Contributions, issues, and feature requests are welcome! Feel free to check the issues page.

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

beatrica_embedding-2025.5.180922.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

beatrica_embedding-2025.5.180922-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file beatrica_embedding-2025.5.180922.tar.gz.

File metadata

File hashes

Hashes for beatrica_embedding-2025.5.180922.tar.gz
Algorithm Hash digest
SHA256 09a437e8aa8a88b550fc98d317ae37db75b932e6ceeb3731bddad981c9bb10df
MD5 f5058175a768f4c97fd9ba593724d731
BLAKE2b-256 bc7f3dd9cdd79e74201351c5ff61dd23392c892c0d525bac9d3dd079c4d951a4

See more details on using hashes here.

File details

Details for the file beatrica_embedding-2025.5.180922-py3-none-any.whl.

File metadata

File hashes

Hashes for beatrica_embedding-2025.5.180922-py3-none-any.whl
Algorithm Hash digest
SHA256 f6cfee5d6df8d7e1342772a4a7928a9bfeb7f7e6bd2018c37f5956ea62f837c9
MD5 aeb45a37aa5e63e9af1a8f66aac20c9c
BLAKE2b-256 74f55e87e302a786ffac33e2fbf67bd95ad4747dd1dbf6b720047f58543c085e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page