No project description provided
Project description
tock-genai-core
Composants principaux d'IA générative : models, factories, gestion des erreurs utilisés dans les composants Gen AI python de tock.
Architecture
Le projet est structuré en trois composants principaux :
- Models : Contient les définitions des classes et modèles de données utilisés dans l'application
- Services/Factories : Regroupe la logique métier et les fonctions utilisées par les routes
Technologies
- Backend : Python
- Base de données :
- pgvector pour la base de données vectorielle
- LLM & RAG :
- Langchain pour l'orchestration
- Support de guardrails
- Reranker pour l'amélioration des résultats
- Langfuse pour le monitoring
Providers disponibles
-
LLMProvider (LLM)
- TGI = "HuggingFaceTextGenInference"
- OpenAI = "OpenAI"
- Vllm = "Vllm"
-
GuardrailProvider (Guardrail)
- BloomZ = "BloomzGuardrail"
-
EMProvider (Embedding)
- BloomZ = "BloomzEmbeddings"
- OpenAI = "OpenAI"
- Vllm = "Vllm"
-
VectorDBProvider (Database)
- OpenSearch = "OPENSEARCH"
- PGVector = "PGVECTOR"
-
ContextualCompressorProvider (Contetual Compressor)
- BloomZ = "BloomzRerank"
Secret Keys
Types disponibles :
- RawSecretKey : stocke directement la valeur du secret.
- Champ principal :
secret - Alias rétro-compatible :
value(encore utilisable pour compatibilité, mais à éviter dans les nouvelles configurations).
- Champ principal :
- AwsSecretKey : référence un secret dans AWS Secrets Manager.
- KubernetesSecretKey : référence un secret dans Kubernetes Secrets.
- GcpSecretKey : référence un secret dans GCP Secret Manager.
GCP Secret Manager
Pour utiliser GcpSecretKey, un project_id GCP doit être disponible.
Il est résolu automatiquement de la façon suivante :
- Si la variable d’environnement
GCP_PROJECT_IDest définie -> elle est utilisée directement. - Sinon, le
project_idest automatiquement détecté à partir des credentials Google (GOOGLE_APPLICATION_CREDENTIALS).
Settings
-
Embedding
-
Classe parente
BaseEMSetting: provider: EMProvider model: Optional[str] api_key: Optional[SecretKey] api_base: str pooling: Optional[str] space_type: Optional[str] -
Classes enfants
BloomZEMSetting(BaseEMSetting): provider: Literal[EMProvider.BloomZ]VLLMEMSetting(BaseEMSetting): provider: Literal[EMProvider.Vllm] model: strOpenAIEMSetting(BaseEMSetting): provider: Literal[EMProvider.OpenAI] api_base: str api_version: str deployment: str
-
-
Contextual compressor
-
Classe parente
BaseCompressorSetting: provider: ContextualCompressorProvider endpoint: str api_key: Optional[SecretKey] -
Classe enfant
BloomZCompressorSetting(BaseCompressorSetting): provider: Literal[ContextualCompressorProvider.BloomZ] min_score: float max_documents: Optional[int] label: Optional[str]
-
-
Database
-
Classe parente
BaseVectorDBSetting: index: Optional[str] provider: VectorDBProvider db_url: str -
Classes enfants
OpenSearchSetting(BaseVectorDBSetting): provider: Literal[VectorDBProvider.OpenSearch] username: SecretKey password: SecretKey use_ssl: bool verify_certs: boolclass PGVectorSetting(BaseVectorDBSetting): provider: Literal[VectorDBProvider.PGVector] username: SecretKey password: SecretKey db_name: str sslmode: Optional[str] namespace: str
-
-
Guardrail
-
Classe parente
BaseGuardrailSetting: provider: GuardrailProvider api_base: str max_score: Optional[float] api_key: Optional[SecretKey] -
Classe enfant
BloomZGuardrailSetting(BaseGuardrailSetting): provider: Literal[GuardrailProvider.BloomZ]
-
-
Langfuse
LangfuseSetting: host: Optional[str] public_key: Optional[SecretKey] secret_key: Optional[SecretKey] metadata: Optional[Dict[str, Any]] -
LLM
-
Classe parente
BaseLLMSetting: provider: LLMProvider model: Optional[str] api_key: Optional[SecretKey] temperature: float -
Classes enfants
OpenAILLMSetting(BaseLLMSetting): provider: Literal[LLMProvider.OpenAI] api_base: str api_version: str deployment: strHuggingFaceTextGenInferenceLLMSetting(BaseLLMSetting): provider: Literal[LLMProvider.TGI] repetition_penalty: float max_new_tokens: int api_base: str streaming: boolVllmSetting(BaseLLMSetting): provider: Literal[LLMProvider.Vllm] api_base: str max_new_tokens: int additional_model_kwargs: Optional[Dict[str, Any]]
-
Fonctionnement
Chaque outil utilisé (database, embedding, llm, langfuse, ...) a besoin d'un certains nombre de paramètres qui sont référencés dans les models (classes de settings)
Ces classes sont ensuite héritées par des services ou des factories afin de pouvoir répondre au besoin.
Exemple de get_vector_db_factory qui crée une factory de base vectorielle basée sur le nom de l'application et les paramètres d'embedding fournis
from tock-genai-core import get_vector_db_factory
from tock-genai-core import PGVectorSetting, VLLMEMSetting
from tock-genai-core import DBSetting, EMSetting
db_settings = PGVectorSetting(
index = "first_index",
provider = "PGVECTOR",
db_url = "127.0.0.1:XXXX",
db_name = "rag_sandbox_db",
sslmode = "disable",
username = {
type = "Raw",
value = "admin"
},
password = {
type = "Raw",
value = "example"
},
namespace = "test-name"
)
em_settings = VLLMEMSetting(
provider = "Vllm",
model = "model_name",
api_base = "https://continue.com/v1"
)
def function_name(db_settings: DBSetting, em_settings: EMSetting):
# do somethings
vector = get_vector_db_factory(db_settings: DBSetting, em_settings: BaseEMSetting)
# do somethings
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tock_genai_core-1.6.0.tar.gz.
File metadata
- Download URL: tock_genai_core-1.6.0.tar.gz
- Upload date:
- Size: 25.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.1 CPython/3.11.14 Linux/6.11.0-1018-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
db028ffdf2a2f9c4bf7c3b958dffd8bf5949f29acf35fa6b3fee1941cc007cd8
|
|
| MD5 |
df1bbf16355d5ba8ff1351cf8e528825
|
|
| BLAKE2b-256 |
77af74f1723ddd5bae405025429a95321b9e0b58cfd2e603145319e472cc2c6e
|
File details
Details for the file tock_genai_core-1.6.0-py3-none-any.whl.
File metadata
- Download URL: tock_genai_core-1.6.0-py3-none-any.whl
- Upload date:
- Size: 62.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.1 CPython/3.11.14 Linux/6.11.0-1018-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
600434140636339212730ca9197f09e92403e3f67bd2faab89bf670faa49d676
|
|
| MD5 |
04cd0054e1dc7ad98a74f6e95fc84414
|
|
| BLAKE2b-256 |
e111788c7ab02fce1f1af8b35e2098cfb2f280e8cdccbd6ca52613b1ec8c3b4f
|