Lightweight wrapper for cortecs.ai enabling ⚡️ instant provisioning

These details have not been verified by PyPI

Project links

Project description

cortecs-py

PyPI Version Python Versions Downloads Workflow Status

Lightweight wrapper for the cortecs.ai enabling instant provisioning.

⚡Quickstart

Dynamic provisioning allows you to run LLM-workflows on dedicated compute. The LLM and underlying resources are automatically provisioned for the duration of use, providing maximum cost-efficiency. Once the workflow is complete, the infrastructure is automatically shut down.

This library starts and stops your resources. The logic can be implemented using popular frameworks such as LangChain or crewAI.

Start your LLM
Execute your (batch) jobs
Shutdown your LLM

from cortecs_py.client import Cortecs
from cortecs_py.integrations import DedicatedLLM

cortecs = Cortecs()

with DedicatedLLM(client=cortecs, model_id='neuralmagic--Meta-Llama-3.1-8B-Instruct-FP8') as llm:
    essay = llm.invoke('Write an essay about dynamic provisioning')
    print(essay.content)

Example

Install

pip install cortecs-py

Summarizing documents

First, set up the environment variables. Use your credentials from cortecs.ai.

export OPENAI_API_KEY="<YOUR_CORTECS_API_KEY>"
export CORTECS_CLIENT_ID="<YOUR_ID>"
export CORTECS_CLIENT_SECRET="<YOUR_SECRET>"

This example shows how to use LangChain to configure a simple summarization chain. The llm is dynamically provisioned and the chain is executed in parallel.

from langchain_community.document_loaders import ArxivLoader
from langchain_core.prompts import ChatPromptTemplate

from cortecs_py.client import Cortecs
from cortecs_py.integrations import DedicatedLLM

cortecs = Cortecs()
loader = ArxivLoader(
    query="reasoning",
    load_max_docs=40,
    get_ful_documents=True,
    doc_content_chars_max=25000,  # ~6.25k tokens, make sure the models supports that context length
    load_all_available_meta=False
)

prompt = ChatPromptTemplate.from_template("{text}\n\n Explain to me like I'm five:")
docs = loader.load()

with DedicatedLLM(client=cortecs, model_id='neuralmagic--Meta-Llama-3.1-8B-Instruct-FP8') as llm:
    chain = prompt | llm

    print("Processing data batch-wise ...")
    summaries = chain.batch([{"text": doc.page_content} for doc in docs])
    for summary in summaries:
        print(summary.content + '-------\n\n\n')

This simple example showcases the power of dynamic provisioning. We summarized 224.2k input tokens into 12.9k output tokens in 55 seconds. The llm can be fully utilized in those 55 seconds enabling better cost efficiency. Comparing to serverless open source model providers we observe the following:

Price Comparison per Million Tokens (USD)

Use Cases

Low latency -> How to process reddit in realtime
Multi-agents -> How to use CrewAI without request limits
Batch processing
High-security

For more information see our docs or join our discord.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.2

Mar 19, 2025

0.1.1

Feb 28, 2025

0.1.0

Jan 10, 2025

0.0.11

Dec 12, 2024

This version

0.0.10

Dec 4, 2024

0.0.2

Nov 12, 2024

0.0.1

Oct 12, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cortecs_py-0.0.10-py3-none-any.whl (15.0 kB view details)

Uploaded Dec 4, 2024 Python 3

File details

Details for the file cortecs_py-0.0.10-py3-none-any.whl.

File metadata

Download URL: cortecs_py-0.0.10-py3-none-any.whl
Upload date: Dec 4, 2024
Size: 15.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.13.0

File hashes

Hashes for cortecs_py-0.0.10-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4060af80f0d183cc1ef7f52d3c12e0b36ef3d6974a5ff138143eff0b0ae2c91a`
MD5	`ef45b215b4b95be899b97843cf8b288b`
BLAKE2b-256	`d1cad930835a8cee21072030c3939d7146bb81b5c611aa7d55f8147094a90a50`

See more details on using hashes here.

cortecs-py 0.0.10

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

cortecs-py

⚡Quickstart

Example

Install

Summarizing documents

Use Cases

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes