Wrapped API for Neural Condense Subnet - Bittensor

🚀 Organic API Usage for Neural Condense Subnet 🌐

Empowered by Bittensor


🌟 Overview

The Neural Condense Subnet (NCS) library provides an efficient, intuitive interface for compressing long input contexts into concise, high-relevance representations. This is especially useful with large language models (LLMs) that have token limits, since it lets you fit more useful information into the model's context window and improves inference efficiency.

📦 Installation

Install the library using pip:

pip install neural-condense

🛠️ Usage

Quick Start in Python

This example demonstrates how to initialize the CondenseClient, define a message context, generate condensed tokens, and apply them in an LLM pipeline.

  1. Condense your long messages into condensed tokens.
from neural_condense import CondenseClient, SAT_TOKEN

# Initialize the client with your API key
client = CondenseClient(
    api_key="your_api_key",
    model_name="mistralai/Mistral-7B-Instruct-v0.2"
)

# Define a long context and focused prompt
messages = [
  {
    "role": "user",
    "content": "Many of you think that EPL and other salary levels are similar, but you are wrong. In EPL, the media glosses over pre-tax salary information, while in Serie A they deal with salary. That means the salary that Milan must pay Donnarumma if they agree to sign the contract is 24m/season + 20m in salary. No one pays that much money for a goalkeeper... What is the salary that Milan must pay Donnarumma if they agree to sign the contract?"
  },
  {
    "role": "assistant",
    "content": f"The salary that Milan must pay Donnarumma if they agree to sign the contract is 24m/season + 20m in salary. {SAT_TOKEN}"
  },
  {
    "role": "user",
    "content": "Who is Donnarumma?"
  }
]

# Generate condensed tokens
condensed_output = client.create_condensed_tokens(
    messages=messages,
    tier="inference_0", 
)

# Check the shape of the condensed tokens
print(f"Condensed tokens shape: {condensed_output.condensed_tokens.shape}")
  2. Apply the condensed tokens in an LLM pipeline.
# Example: Using the condensed tokens in an LLM pipeline
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the target language model (Hugging Face transformers)
model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")

# Pass the condensed embeddings to generate(); the high-level pipeline() API
# accepts only text, but generate() supports inputs_embeds for decoder-only models.
inputs_embeds = torch.as_tensor(condensed_output.inputs_embeds, dtype=torch.float32)
output_ids = model.generate(inputs_embeds=inputs_embeds, max_new_tokens=100)

print(tokenizer.batch_decode(output_ids, skip_special_tokens=True))

Asynchronous Usage 🌐

For asynchronous contexts, use AsyncCondenseClient to handle requests without blocking execution.

from neural_condense import AsyncCondenseClient
import asyncio

async def main():
    client = AsyncCondenseClient(api_key="your_api_key")
    condensed_output = await client.create_condensed_tokens(
        messages=messages,
        tier="inference_0", 
        target_model="mistralai/Mistral-7B-Instruct-v0.2"
    )
    print(f"Condensed tokens shape: {condensed_output.inputs_embeds.shape}")

asyncio.run(main())
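
Because each request is awaitable, several conversations can also be condensed concurrently, for example with asyncio.gather. Below is a minimal sketch assuming the same AsyncCondenseClient API shown above; the condense_many helper is illustrative, not part of the library.

import asyncio
from neural_condense import AsyncCondenseClient

async def condense_many(conversations):
    # Illustrative helper: issue all requests, then await them together
    client = AsyncCondenseClient(api_key="your_api_key")
    tasks = [
        client.create_condensed_tokens(
            messages=msgs,
            tier="inference_0",
            target_model="mistralai/Mistral-7B-Instruct-v0.2",
        )
        for msgs in conversations
    ]
    # Run all condensation requests concurrently instead of one by one
    return await asyncio.gather(*tasks)

# Condense two copies of the Quick Start conversation concurrently
outputs = asyncio.run(condense_many([messages, messages]))
print(len(outputs))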

🔍 Additional Information

Supported Models

The library supports a variety of pre-trained models available through Hugging Face's model hub. Ensure that the model you choose is compatible with the Neural Condense Subnet’s framework.

SAT_TOKEN

The SAT_TOKEN acts as a delimiter within your message templates, separating the long context from the focused prompt. It guides the API in recognizing which sections of the input to compress.
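
For illustration, here is a minimal template mirroring the Quick Start above; the message contents are placeholders. As in that example, SAT_TOKEN sits at the end of the assistant turn that carries the long context.

from neural_condense import SAT_TOKEN

# Placeholder template: SAT_TOKEN ends the context-bearing assistant turn,
# separating the compressible context from the focused prompt that follows.
messages = [
    {"role": "user", "content": "<long document or conversation history>"},
    {"role": "assistant", "content": f"<answer grounded in that context> {SAT_TOKEN}"},
    {"role": "user", "content": "<focused follow-up question>"},
]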

API Parameters

  • tier: Specify the inference tier, which affects the quality and speed of token condensation.
  • target_model: Set the target model to shape the condensed output according to the requirements of the chosen language model.
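
As a combined sketch, the call below sets both parameters explicitly, reusing the client and messages from the Quick Start; the values shown are the ones used in the examples above.

# Both parameters in one call (values reused from the examples above)
condensed_output = client.create_condensed_tokens(
    messages=messages,
    tier="inference_0",  # inference tier: trades condensation quality against speed
    target_model="mistralai/Mistral-7B-Instruct-v0.2",  # LLM the output is shaped for
)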
