A tool for categorizing text data and images using LLMs and vision models

These details have not been verified by PyPI

Project links

Project description

catllm Logo

cat-llm

CatLLM: A Reproducible LLM Pipeline for Coding Open-Ended Survey Responses

The Problem

If you work with open-ended survey data, you know the pain: hundreds or thousands of free-text responses that need to be categorized before you can do any quantitative analysis. The traditional approach is manual coding—either doing it yourself or hiring research assistants. It's slow, expensive, and doesn't scale.

The Solution

CatLLM is a Python package designed specifically for survey research that uses LLMs to automate the categorization of open-ended responses. It handles both:

Category Assignment: Classify responses into your predefined categories (multi-label supported)
Category Extraction: Automatically discover and extract categories from your data when you don't have a predefined scheme

With leading models like GPT-5, Gemini, and Qwen 3, CatLLM achieves 98% accuracy compared to human consensus on classification tasks.

Try the web app: https://huggingface.co/spaces/CatLLM/survey-classifier

Installation
Quick Start
Configuration
Supported Models
API Reference
- classify() - Unified function for text, image, and PDF (auto-detects input type)
- extract() - Unified function for category extraction
- image_score_drawing()
- image_features()
- cerad_drawn_score()
Deprecated Functions
Related Projects
Academic Research
Contact
License

Installation

pip install cat-llm

For PDF support:

pip install cat-llm[pdf]

Quick Start

This package is designed for building datasets at scale, not one-off queries. While you can categorize individual responses, its primary purpose is batch processing entire survey columns or image collections into structured research datasets.

Simply provide your survey responses and category list—the package handles the rest and outputs clean data ready for statistical analysis. It works with single or multiple categories per response and automatically skips missing data to save API costs.

Also supports image and PDF classification using the same methodology: extract features, count objects, identify categories, or determine presence of elements based on your research questions.

All outputs are formatted for immediate statistical analysis and can be exported directly to CSV.

Not to be confused with CAT-LLM for Chinese article‐style transfer (Tao et al. 2024).

Configuration

Get Your API Key

Get an API key from your preferred provider:

OpenAI: platform.openai.com
Anthropic: console.anthropic.com
Google: aistudio.google.com
Huggingface: huggingface.co/settings/tokens
xAI: console.x.ai
Mistral: console.mistral.ai
Perplexity: perplexity.ai/settings/api

Most providers require adding a payment method and purchasing credits. Store your key securely and never share it publicly.

Supported Models

OpenAI: GPT-4o, GPT-4, GPT-5, etc.
Anthropic: Claude Sonnet 4, Claude 3.5 Sonnet, Claude Haiku, etc.
Google: Gemini 2.5 Flash, Gemini 2.5 Pro, etc.
Huggingface: Qwen, Llama 4, DeepSeek, and thousands of community models
xAI: Grok models
Mistral: Mistral Large, Pixtral, etc.
Perplexity: Sonar Large, Sonar Small, etc.

Fully Tested:

OpenAI (GPT-4, GPT-4o, GPT-5, etc.)
Anthropic (Claude Sonnet 4, Claude 3.5 Sonnet, Haiku)
Perplexity (Sonar models)
Google Gemini - Free tier has severe rate limits (5 RPM). Requires Google AI Studio billing account for large-scale use.
Huggingface - Access to Qwen, Llama 4, DeepSeek, and thousands of user-trained models for specific tasks. API routing can occasionally be unstable.
xAI (Grok models)
Mistral (Mistral Large, Pixtral, etc.)

Note: For best results, I recommend starting with OpenAI or Anthropic.

API Reference

`classify()`

Unified classification function for text, image, and PDF inputs. Input type is auto-detected from your data—no need to specify whether you're classifying text, images, or PDFs.

Supports both single-model and multi-model ensemble classification for improved accuracy through consensus voting.

Parameters:

input_data: The data to classify. Can be:
- Text: list of strings or pandas Series
- Images: directory path, single file, or list of image paths
- PDFs: directory path, single file, or list of PDF paths
categories (list): List of category names for classification
api_key (str): API key for the LLM service (single-model mode)
description (str): Description of the input data context
user_model (str, default="gpt-4o"): Model to use
mode (str, default="image"): PDF processing mode - "image", "text", or "both"
creativity (float, optional): Temperature setting (0.0-1.0)
chain_of_thought (bool, default=True): Enable step-by-step reasoning
filename (str, optional): Output filename for CSV
save_directory (str, optional): Directory to save results
model_source (str, default="auto"): Provider - "auto", "openai", "anthropic", "google", "mistral", "perplexity", "huggingface", "xai"
models (list, optional): For multi-model ensemble, list of (model, provider, api_key) tuples
consensus_threshold (float, default=0.5): Agreement threshold for ensemble mode (0-1)

Returns:

pandas.DataFrame: Classification results with category columns

Examples:

import catllm as cat

# Text classification (auto-detected)
results = cat.classify(
    input_data=df['responses'],
    categories=["Positive feedback", "Negative feedback", "Neutral"],
    description="Customer satisfaction survey",
    api_key=api_key
)

# Image classification (auto-detected from file paths)
results = cat.classify(
    input_data="/path/to/images/",
    categories=["Contains person", "Outdoor scene", "Has text"],
    description="Product photos",
    api_key=api_key
)

# PDF classification (auto-detected, processes each page separately)
results = cat.classify(
    input_data="/path/to/reports/",
    categories=["Contains table", "Has chart", "Is summary page"],
    description="Financial reports",
    mode="both",  # Use both image and extracted text
    api_key=api_key
)

# Multi-model ensemble for higher accuracy
results = cat.classify(
    input_data=df['responses'],
    categories=["Positive", "Negative", "Neutral"],
    models=[
        ("gpt-4o", "openai", "sk-..."),
        ("claude-sonnet-4-5-20250929", "anthropic", "sk-ant-..."),
        ("gemini-2.5-flash", "google", "AIza..."),
    ],
    consensus_threshold=0.5,  # Majority vote
)

Multi-Model Ensemble:

When you provide the models parameter, CatLLM runs classification across multiple models in parallel and combines results using majority voting. This can significantly improve accuracy by reducing individual model biases.

The output includes:

Individual model predictions (e.g., category_1_gpt_4o, category_1_claude)
Consensus columns (e.g., category_1_consensus)
Agreement scores showing how many models agreed

`extract()`

Unified category extraction function for text, image, and PDF inputs. Automatically discovers categories in your data when you don't have a predefined scheme.

Parameters:

input_data: The data to explore (text list, image paths, or PDF paths)
api_key (str): API key for the LLM service
input_type (str, default="text"): Type of input - "text", "image", or "pdf"
description (str): Description of the input data
max_categories (int, default=12): Maximum number of categories to return
categories_per_chunk (int, default=10): Categories to extract per chunk
divisions (int, default=5): Number of chunks to divide data into
user_model (str, default="gpt-4o"): Model to use
specificity (str, default="broad"): "broad" or "specific" category granularity
research_question (str, optional): Research context to guide extraction
focus (str, optional): Focus instruction for category extraction (e.g., "emotional responses")
filename (str, optional): Output filename for CSV

Returns:

dict with keys:
- counts_df: DataFrame of categories with counts
- top_categories: List of top category names
- raw_top_text: Raw model output

Example:

import catllm as cat

# Extract categories from survey responses
results = cat.extract(
    input_data=df['responses'],
    description="Why did you move?",
    api_key=api_key,
    max_categories=10,
    focus="decisions to relocate"  # Optional focus
)

print(results['top_categories'])
# ['Employment opportunity', 'Family reasons', 'Cost of living', ...]

`image_score_drawing()`

Performs quality scoring of images against a reference description and optional reference image, returning structured results with optional CSV export.

Methodology: Processes each image individually, assigning a drawing quality score on a 5-point scale based on similarity to the expected description:

1: No meaningful similarity (fundamentally different)
2: Barely recognizable similarity (25% match)
3: Partial match (50% key features)
4: Strong alignment (75% features)
5: Near-perfect match (90%+ similarity)

Parameters:

reference_image_description (str): A description of what the model should expect to see
image_input (list): List of image file paths or folder path containing images
reference_image (str): A file path to the reference image
api_key (str): API key for the LLM service
user_model (str, default="gpt-4o"): Specific vision model to use
creativity (float, default=0): Temperature/randomness setting (0.0-1.0)
safety (bool, default=False): Enable safety checks and save results at each API call step
filename (str, default="image_scores.csv"): Filename for CSV output
save_directory (str, optional): Directory path to save the CSV file
model_source (str, default="OpenAI"): Model provider

Returns:

pandas.DataFrame: DataFrame with image paths, quality scores, and analysis details

Example:

import catllm as cat

image_scores = cat.image_score_drawing(
    reference_image_description='A hand-drawn circle',
    image_input=['image1.jpg', 'image2.jpg', 'image3.jpg'],
    user_model="gpt-4o",
    api_key="OPENAI_API_KEY"
)

`image_features()`

Extracts specific features and attributes from images, returning exact answers to user-defined questions (e.g., counts, colors, presence of objects).

Methodology: Processes each image individually using vision models to extract precise information about specified features. Unlike scoring and classification functions, this returns factual data such as object counts, color identification, or presence/absence of specific elements.

Parameters:

image_description (str): A description of what the model should expect to see
image_input (list): List of image file paths or folder path containing images
features_to_extract (list): List of specific features to extract (e.g., ["number of people", "primary color", "contains text"])
api_key (str): API key for the LLM service
user_model (str, default="gpt-4o"): Specific vision model to use
creativity (float, default=0): Temperature/randomness setting (0.0-1.0)
to_csv (bool, default=False): Whether to save the output to a CSV file
safety (bool, default=False): Enable safety checks and save results at each API call step
filename (str, default="categorized_data.csv"): Filename for CSV output
save_directory (str, optional): Directory path to save the CSV file
model_source (str, default="OpenAI"): Model provider

Returns:

pandas.DataFrame: DataFrame with image paths and extracted feature values

Example:

import catllm as cat

features = cat.image_features(
    image_description='Product photos from e-commerce site',
    features_to_extract=['number of items', 'primary color', 'has price tag'],
    image_input='/path/to/images/',
    user_model="gpt-4o",
    api_key="OPENAI_API_KEY"
)

`cerad_drawn_score()`

Automatically scores drawings of circles, diamonds, overlapping rectangles, and cubes according to the official Consortium to Establish a Registry for Alzheimer's Disease (CERAD) scoring system.

Methodology: Processes each image individually, evaluating the drawn shapes based on CERAD criteria. Works even with images that contain other drawings or writing.

Parameters:

shape (str): The type of shape to score ("circle", "diamond", "rectangles", "cube")
image_input (list): List of image file paths or folder path containing images
api_key (str): API key for the LLM service
user_model (str, default="gpt-4o"): Specific model to use
creativity (float, default=0): Temperature/randomness setting (0.0-1.0)
safety (bool, default=False): Enable safety checks and save results at each API call step
filename (str, optional): Filename for CSV output
model_source (str, default="auto"): Model provider

Returns:

pandas.DataFrame: DataFrame with image paths, CERAD scores, and analysis details

Example:

import catllm as cat

diamond_scores = cat.cerad_drawn_score(
    shape="diamond",
    image_input=df['diamond_pic_path'],
    api_key=api_key,
    safety=True,
    filename="diamond_scores.csv",
)

Deprecated Functions

The following functions are deprecated and will be removed in a future version. Please use classify() instead, which auto-detects input type and supports all the same features.

Deprecated Function	Replacement
`multi_class()`	`classify(input_data=texts, ...)`
`image_multi_class()`	`classify(input_data=images, ...)`
`pdf_multi_class()`	`classify(input_data=pdfs, ...)`
`explore_corpus()`	`extract(input_data=texts, ...)`
`explore_common_categories()`	`extract(input_data=texts, ...)`

These functions still work but will show deprecation warnings. Migration is straightforward—simply use classify() with your data and it will automatically detect whether you're passing text, images, or PDFs.

Related Projects

Looking for web research capabilities? Check out llm-web-research - a precision-focused LLM-powered web research tool that uses a novel Funnel of Verification (FoVe) methodology to reduce false positives. It's designed for use cases where accuracy matters more than completeness.

pip install llm-web-research

Academic Research

This package implements methodology from research on LLM performance in social science applications, including the UC Berkeley Social Networks Study. The package addresses reproducibility challenges in LLM-assisted research by providing standardized interfaces and consistent output formatting.

If you use this package for research, please cite:

Soria, C. (2025). CatLLM (0.1.0). Zenodo. https://doi.org/10.5281/zenodo.15532317

Contact

Interested in research collaboration? Email: ChrisSoria@Berkeley.edu

License

cat-llm is distributed under the terms of the GNU license.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.0.0

Mar 19, 2026

2.7.0

Mar 7, 2026

2.5.1

Mar 1, 2026

2.5.0

Feb 26, 2026

2.4.0

Feb 11, 2026

2.3.4

Feb 11, 2026

2.3.3

Feb 11, 2026

2.3.2

Feb 11, 2026

2.3.1

Feb 10, 2026

2.3.0

Feb 9, 2026

2.2.0

Feb 8, 2026

2.1.0

Jan 30, 2026

2.0.0

Jan 17, 2026

This version

0.1.15

Jan 11, 2026

0.1.14

Jan 11, 2026

0.1.13

Jan 9, 2026

0.1.12

Jan 9, 2026

0.1.11

Jan 9, 2026

0.1.10

Jan 9, 2026

0.1.9

Jan 9, 2026

0.1.8

Jan 7, 2026

0.1.7

Jan 7, 2026

0.1.6

Jan 5, 2026

0.1.4

Jan 3, 2026

0.1.3

Jan 3, 2026

0.1.2

Jan 3, 2026

0.1.1

Dec 30, 2025

0.0.103

Dec 10, 2025

0.0.102

Dec 10, 2025

0.0.101

Nov 5, 2025

0.0.100

Nov 4, 2025

0.0.99

Nov 4, 2025

0.0.98

Oct 29, 2025

0.0.97

Oct 26, 2025

0.0.96

Oct 26, 2025

0.0.95

Oct 25, 2025

0.0.94

Oct 25, 2025

0.0.93

Oct 25, 2025

0.0.92

Oct 25, 2025

0.0.91

Oct 25, 2025

0.0.90

Oct 25, 2025

0.0.89

Oct 25, 2025

0.0.88

Oct 25, 2025

0.0.87

Oct 25, 2025

0.0.85

Oct 24, 2025

0.0.84

Oct 24, 2025

0.0.83

Oct 24, 2025

0.0.82

Oct 23, 2025

0.0.81

Oct 23, 2025

0.0.80

Oct 23, 2025

0.0.79

Oct 23, 2025

0.0.78

Oct 23, 2025

0.0.77

Oct 23, 2025

0.0.76

Oct 23, 2025

0.0.75

Oct 23, 2025

0.0.74

Oct 21, 2025

0.0.73

Oct 21, 2025

0.0.72

Oct 21, 2025

0.0.71

Oct 21, 2025

0.0.70

Oct 21, 2025

0.0.69

Oct 21, 2025

0.0.68

Oct 21, 2025

0.0.67

Oct 20, 2025

0.0.66

Oct 20, 2025

0.0.65

Oct 13, 2025

0.0.64

Oct 13, 2025

0.0.63

Oct 8, 2025

0.0.62

Oct 8, 2025

0.0.61

Oct 7, 2025

0.0.60

Sep 29, 2025

0.0.59

Sep 19, 2025

0.0.58

Sep 19, 2025

0.0.57

Sep 19, 2025

0.0.56

Sep 19, 2025

0.0.55

Sep 19, 2025

0.0.54

Sep 19, 2025

0.0.53

Sep 18, 2025

0.0.52

Sep 18, 2025

0.0.51

Sep 18, 2025

0.0.50

Sep 18, 2025

0.0.43

Aug 8, 2025

0.0.42

Aug 8, 2025

0.0.41

Aug 8, 2025

0.0.40

Aug 8, 2025

0.0.39

Jul 23, 2025

0.0.38

Jun 7, 2025

0.0.37

Jun 7, 2025

0.0.36

Jun 7, 2025

0.0.35

Jun 7, 2025

0.0.34

Jun 7, 2025

0.0.33

Jun 5, 2025

0.0.32

Jun 5, 2025

0.0.31

Jun 5, 2025

0.0.30

Jun 5, 2025

0.0.29

Jun 5, 2025

0.0.28

Jun 5, 2025

0.0.27

Jun 5, 2025

0.0.26

Jun 4, 2025

0.0.25

Jun 1, 2025

0.0.24

Jun 1, 2025

0.0.23

Jun 1, 2025

0.0.22

Jun 1, 2025

0.0.21

Jun 1, 2025

0.0.20

Jun 1, 2025

0.0.19

May 30, 2025

0.0.18

May 30, 2025

0.0.17

May 30, 2025

0.0.16

May 30, 2025

0.0.15

May 30, 2025

0.0.14

May 30, 2025

0.0.13

May 29, 2025

0.0.12

May 29, 2025

0.0.11

May 28, 2025

0.0.10

May 28, 2025

0.0.9

May 28, 2025

0.0.8

May 28, 2025

0.0.7

May 28, 2025

0.0.6

May 27, 2025

0.0.5

May 27, 2025

0.0.4

May 21, 2025

0.0.3

May 21, 2025

0.0.2

May 12, 2025

0.0.1

May 12, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cat_llm-0.1.15.tar.gz (407.1 kB view details)

Uploaded Jan 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cat_llm-0.1.15-py3-none-any.whl (422.7 kB view details)

Uploaded Jan 11, 2026 Python 3

File details

Details for the file cat_llm-0.1.15.tar.gz.

File metadata

Download URL: cat_llm-0.1.15.tar.gz
Upload date: Jan 11, 2026
Size: 407.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.14

File hashes

Hashes for cat_llm-0.1.15.tar.gz
Algorithm	Hash digest
SHA256	`19a70623ca8c17b2791d1fc97e985e381aa2b10d9847b52419f2a445090828de`
MD5	`f1ee00e1b74520bd215a5bb995681cb8`
BLAKE2b-256	`4466b4d12827e03948101a59b98ba4bbee463f4c4e8a12e01b0ddb4aea6e6f9b`

See more details on using hashes here.

File details

Details for the file cat_llm-0.1.15-py3-none-any.whl.

File metadata

Download URL: cat_llm-0.1.15-py3-none-any.whl
Upload date: Jan 11, 2026
Size: 422.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.14

File hashes

Hashes for cat_llm-0.1.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ed8281ccd6a9cf65e12b6b6def771978287cc140d1b75f01effe73c4c9125eb3`
MD5	`478291e7734af07b40df8f1e6210fde2`
BLAKE2b-256	`5fe3068fd399f35ec71d2ca2fe5408443e96e1c9c956bf162db575f766d86609`

See more details on using hashes here.

cat-llm 0.1.15

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

cat-llm

The Problem

The Solution

Table of Contents

Installation

Quick Start

Configuration

Get Your API Key

Supported Models

API Reference

classify()

extract()

image_score_drawing()

image_features()

cerad_drawn_score()

Deprecated Functions

Related Projects

Academic Research

Contact

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`classify()`

`extract()`

`image_score_drawing()`

`image_features()`

`cerad_drawn_score()`