Generalist and Lightweight Model for Text Classification

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

ingvarstep

These details have not been verified by PyPI

Project description

⭐ GLiClass: Generalist and Lightweight Model for Sequence Classification

GLiClass is an efficient, zero-shot sequence classification model inspired by the GLiNER framework. It achieves comparable performance to traditional cross-encoder models while being significantly more computationally efficient, offering classification results approximately 10 times faster by performing classification in a single forward pass.

📄 Blog • 📢 Discord • 📺 Demo • 🤗 Available models •

🚀 Quick Start

Install GLiClass easily using pip:

pip install gliclass

Install from Source

Clone and install directly from GitHub:

git clone https://github.com/Knowledgator/GLiClass
cd GLiClass

python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

pip install -r requirements.txt
pip install .

Verify your installation:

import gliclass
print(gliclass.__version__)

🧑‍💻 Usage Example

from gliclass import GLiClassModel, ZeroShotClassificationPipeline
from transformers import AutoTokenizer

model = GLiClassModel.from_pretrained("knowledgator/gliclass-small-v1.0")
tokenizer = AutoTokenizer.from_pretrained("knowledgator/gliclass-small-v1.0")

pipeline = ZeroShotClassificationPipeline(
    model, tokenizer, classification_type='multi-label', device='cuda:0'
)

text = "One day I will see the world!"
labels = ["travel", "dreams", "sport", "science", "politics"]
results = pipeline(text, labels, threshold=0.5)[0]

for result in results:
    print(f"{result['label']} => {result['score']:.3f}")

🔥 New Features

Hierarchical Labels

GLiClass now supports hierarchical label structures using dot notation:

hierarchical_labels = {
    "sentiment": ["positive", "negative", "neutral"],
    "topic": ["product", "service", "shipping"]
}

text = "The product quality is amazing but delivery was slow"
results = pipeline(text, hierarchical_labels, threshold=0.5)[0]

for result in results:
    print(f"{result['label']} => {result['score']:.3f}")
# Output:
# sentiment.positive => 0.892
# topic.product => 0.921
# topic.shipping => 0.763

Get hierarchical output matching your input structure:

results = pipeline(text, hierarchical_labels, return_hierarchical=True)[0]
print(results)
# Output:
# {
#     "sentiment": {"positive": 0.892, "negative": 0.051, "neutral": 0.124},
#     "topic": {"product": 0.921, "service": 0.153, "shipping": 0.763}
# }

Few-Shot Examples

Improve classification accuracy with in-context examples using the <<EXAMPLE>> token:

examples = [
    {
        "text": "Love this item, great quality!",
        "labels": ["positive", "product"]
    },
    {
        "text": "Customer support was unhelpful",
        "labels": ["negative", "service"]
    }
]

text = "Fast delivery and the item works perfectly!"
labels = ["positive", "negative", "product", "service", "shipping"]

results = pipeline(text, labels, examples=examples, threshold=0.5)[0]

for result in results:
    print(f"{result['label']} => {result['score']:.3f}")

Task Description Prompts

Add custom prompts to guide the classification task:

text = "The battery life on this phone is incredible"
labels = ["positive", "negative", "neutral"]

results = pipeline(
    text,
    labels,
    prompt="Classify the sentiment of this product review:",
    threshold=0.5
)[0]

Use per-text prompts for batch processing:

texts = ["Review about electronics", "Review about clothing"]
prompts = [
    "Analyze this electronics review:",
    "Analyze this clothing review:"
]

results = pipeline(texts, labels, prompt=prompts)

Long Document Classification

Process long documents with automatic text chunking:

from gliclass import ZeroShotClassificationWithChunkingPipeline

chunking_pipeline = ZeroShotClassificationWithChunkingPipeline(
    model,
    tokenizer,
    text_chunk_size=8192,
    text_chunk_overlap=256,
    labels_chunk_size=8
)

long_document = "..." # Very long text
labels = ["category1", "category2", "category3"]

results = chunking_pipeline(long_document, labels, threshold=0.5)

🌟 Retrieval-Augmented Classification (RAC)

With new models trained with retrieval-agumented classification, such as this model you can specify examples to improve classification accuracy:

example = {
    "text": "A new machine learning platform automates complex data workflows but faces integration issues.",
    "all_labels": ["AI", "automation", "data_analysis", "usability", "integration"],
    "true_labels": ["AI", "integration", "automation"]
}

text = "The new AI-powered tool streamlines data analysis but has limited integration capabilities."
labels = ["AI", "automation", "data_analysis", "usability", "integration"]

results = pipeline(text, labels, threshold=0.1, rac_examples=[example])[0]

for predict in results:
    print(f"{predict['label']} => {predict['score']:.3f}")

🚀 Production Serving

Deploy GLiClass with Ray Serve for production workloads with dynamic batching and memory-aware processing.

Installation

pip install gliclass[serve]

Quick Start

# Default model
python -m gliclass.serve

# Specify model and port
python -m gliclass.serve --model knowledgator/gliclass-edge-v3.0 --port 8000

# With config file
python -m gliclass.serve --config serve_configs/serve_config.yaml

Python Client

from gliclass.serve import GLiClassClient

client = GLiClassClient(url="http://localhost:8000/gliclass")

result = client.classify(
    text="This is a great product!",
    labels=["positive", "negative", "neutral"],
    threshold=0.3,
)
print(result)  # [{"label": "positive", "score": 0.95}, ...]

HTTP API

The HTTP endpoint processes one text per request.

curl -X POST http://localhost:8000/gliclass \
  -H "Content-Type: application/json" \
  -d '{
    "texts": "This is a great product!",
    "labels": ["positive", "negative", "neutral"],
    "threshold": 0.3
  }'

# Response: [{"label": "positive", "score": 0.95}, ...]

Note: For batch processing multiple texts, use the ZeroShotClassificationPipeline directly instead of the serving API.

See serve_configs/serve_config.yaml for full configuration options.

🎯 Key Use Cases

Sentiment Analysis: Rapidly classify texts as positive, negative, or neutral.
Document Classification: Efficiently organize and categorize large document collections.
Search Results Re-ranking: Improve relevance and precision by reranking search outputs.
News Categorization: Automatically tag and organize news articles into predefined categories.
Fact Checking: Quickly validate and categorize statements based on factual accuracy.

🛠️ How to Train

Prepare your training data as follows:

[
  {"text": "Sample text.", "all_labels": ["sports", "science", "business"], "true_labels": ["sports"]},
  ...
]

Optionally, specify confidence scores explicitly:

[
  {"text": "Sample text.", "all_labels": ["sports", "science"], "true_labels": {"sports": 0.9}},
  ...
]

Please, refer to the train.py script to set up your training from scratch or fine-tune existing models.

⚙️ Advanced Configuration

Architecture Types

GLiClass supports multiple architecture types:

uni-encoder: Single encoder for both text and labels (default, most efficient)
bi-encoder: Separate encoders for text and labels
bi-encoder-fused: Bi-encoder with label embeddings fused into text encoding
encoder-decoder: Encoder-decoder architecture for sequence-to-sequence tasks

from gliclass import GLiClassBiEncoder

# Load a bi-encoder model
model = GLiClassBiEncoder.from_pretrained("knowledgator/gliclass-biencoder-v1.0")

Pooling Strategies

Configure how token embeddings are pooled:

first: First token (CLS token)
avg: Average pooling
max: Max pooling
last: Last token
sum: Sum pooling
rms: Root mean square pooling
abs_max: Max of absolute values
abs_avg: Average of absolute values

from gliclass import GLiClassModelConfig

config = GLiClassModelConfig(
    pooling_strategy='avg',
    class_token_pooling='average'  # or 'first'
)

Scoring Mechanisms

Choose different scoring mechanisms for classification:

simple: Dot product (fastest)
weighted-dot: Weighted dot product with learned projections
mlp: Multi-layer perceptron scorer
hopfield: Hopfield network-based scorer

config = GLiClassModelConfig(
    scorer_type='mlp'
)

Streaming Classification

GLiClass supports incremental, multi-session text classification over a decoder-KV model. Instead of re-encoding the full document on every update, it maintains a persistent KV cache per session and runs the scorer only when a pluggable strategy decides classification should fire.

pip install gliclass[streaming]

from gliclass.streaming import StreamingPipeline, SessionInput, EveryNTokensStrategy

pipeline = StreamingPipeline(model, tokenizer, device="cuda", max_cache_len=1024)
strategy = EveryNTokensStrategy(n=50)

for chunk in text_chunks:
    outputs = pipeline([SessionInput(
        session_id="doc_001",
        text=chunk,
        labels=["science", "politics", "finance"],
        strategy=strategy,
        classification_type="multi-label",
    )])
    if outputs[0].triggered:
        print(outputs[0].predictions)

Built-in strategies: EveryChunkStrategy, EveryNTokensStrategy, OnDelimiterStrategy, SlidingWindowStrategy, ComposedStrategy, NeverStrategy.

For full documentation on session management, KV cache internals, batching, CPU offloading, and custom strategies, see docs/streaming.md.

Flash Attention Backends

GLiClass supports optional flash attention backends for faster inference.

Install

pip install flashdeberta   # DeBERTa v2
pip install turbot5        # T5 / mT5

FlashDeBERTa (DeBERTa v2)

Enable via environment variable:

export USE_FLASHDEBERTA=1

If flashdeberta is installed, DeBERTa v2 models will use FlashDebertaV2Model. Otherwise, GLiClass falls back to DebertaV2Model.

TurboT5 (T5 / mT5)

Enable via environment variable:

export TURBOT5_ATTN_TYPE=triton-basic

If turbot5 is installed, T5 / mT5 models will use FlashT5EncoderModel. Otherwise, GLiClass falls back to T5EncoderModel.

Notes:

Flash backends are optional
Enabled automatically when available
No code changes required

Want it even tighter (single block), or is this the sweet spot?

📚 Citations

If you find GLiClass useful in your research or project, please cite our papers:

@misc{stepanov2025gliclassgeneralistlightweightmodel,
      title={GLiClass: Generalist Lightweight Model for Sequence Classification Tasks}, 
      author={Ihor Stepanov and Mykhailo Shtopko and Dmytro Vodianytskyi and Oleksandr Lukashov and Alexander Yavorskyi and Mykyta Yaroshenko},
      year={2025},
      eprint={2508.07662},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2508.07662}, 
}

Project details

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

ingvarstep

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.20

Jul 21, 2026

0.1.19

Jul 3, 2026

0.1.18

May 9, 2026

0.1.17

Apr 30, 2026

0.1.16

Feb 17, 2026

0.1.15

Feb 17, 2026

0.1.14

Jan 26, 2026

0.1.13

Jan 23, 2026

0.1.12

Dec 4, 2025

0.1.11

Apr 11, 2025

0.1.10

Mar 19, 2025

0.1.9

Mar 7, 2025

0.1.8

Feb 25, 2025

0.1.7

Jan 2, 2025

0.1.6

Nov 14, 2024

0.1.5

Sep 6, 2024

0.1.4

Aug 14, 2024

0.1.3

Jul 2, 2024

0.1.1

Jun 3, 2024

0.1.0

Jun 2, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gliclass-0.1.20.tar.gz (88.8 kB view details)

Uploaded Jul 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gliclass-0.1.20-py3-none-any.whl (80.3 kB view details)

Uploaded Jul 21, 2026 Python 3

File details

Details for the file gliclass-0.1.20.tar.gz.

File metadata

Download URL: gliclass-0.1.20.tar.gz
Upload date: Jul 21, 2026
Size: 88.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.14

File hashes

Hashes for gliclass-0.1.20.tar.gz
Algorithm	Hash digest
SHA256	`749fcf6e4323f475ead281b9d2ec3f5ab2ec2e7c246e541840f36685bafc05dc`
MD5	`63b231debfd08ca86816054d4048abdf`
BLAKE2b-256	`877e50ec9ff331823078a2fcb6fce2c41e0cea7988b67ad9fa96e3446aade94f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for gliclass-0.1.20.tar.gz:

Publisher: release.yaml on Knowledgator/GLiClass

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gliclass-0.1.20.tar.gz
- Subject digest: 749fcf6e4323f475ead281b9d2ec3f5ab2ec2e7c246e541840f36685bafc05dc
- Sigstore transparency entry: 2213138439
- Sigstore integration time: Jul 21, 2026
Source repository:
- Permalink: Knowledgator/GLiClass@40baa67cdea577449bc3f6a251646377b2cb0a6c
- Branch / Tag: refs/tags/v0.1.20
- Owner: https://github.com/Knowledgator
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@40baa67cdea577449bc3f6a251646377b2cb0a6c
- Trigger Event: push

File details

Details for the file gliclass-0.1.20-py3-none-any.whl.

File metadata

Download URL: gliclass-0.1.20-py3-none-any.whl
Upload date: Jul 21, 2026
Size: 80.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.14

File hashes

Hashes for gliclass-0.1.20-py3-none-any.whl
Algorithm	Hash digest
SHA256	`554005810fc18155bd4dd506f08785196005d1414ba5beddd2533121b6dd98ba`
MD5	`85c6801c98870319654098816baa8257`
BLAKE2b-256	`4eabb37dd1e5365dd738e3398ca5c73a49b229dc03b1f187260b5933c6877a95`

See more details on using hashes here.

Provenance

The following attestation bundles were made for gliclass-0.1.20-py3-none-any.whl:

Publisher: release.yaml on Knowledgator/GLiClass

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gliclass-0.1.20-py3-none-any.whl
- Subject digest: 554005810fc18155bd4dd506f08785196005d1414ba5beddd2533121b6dd98ba
- Sigstore transparency entry: 2213138792
- Sigstore integration time: Jul 21, 2026
Source repository:
- Permalink: Knowledgator/GLiClass@40baa67cdea577449bc3f6a251646377b2cb0a6c
- Branch / Tag: refs/tags/v0.1.20
- Owner: https://github.com/Knowledgator
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@40baa67cdea577449bc3f6a251646377b2cb0a6c
- Trigger Event: push

gliclass 0.1.20

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

⭐ GLiClass: Generalist and Lightweight Model for Sequence Classification

🚀 Quick Start

Install from Source

🧑‍💻 Usage Example

🔥 New Features

Hierarchical Labels

Few-Shot Examples

Task Description Prompts

Long Document Classification

🌟 Retrieval-Augmented Classification (RAC)

🚀 Production Serving

Installation

Quick Start

Python Client

HTTP API

🎯 Key Use Cases

🛠️ How to Train

⚙️ Advanced Configuration

Architecture Types

Pooling Strategies

Scoring Mechanisms

Streaming Classification

Flash Attention Backends

Install

FlashDeBERTa (DeBERTa v2)

TurboT5 (T5 / mT5)

📚 Citations

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance