A Python library for classifying free-text poll responses using large language models (LLMs).

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

cognitum

Tests

Overview

A Python library for classifying free-text poll responses using large language models (LLMs). The system takes CSV input where each row contains a response and outputs coded classifications according to a provided codebook.

Features

Flexible classification using LLMs (currently supports Llama and OpenAI models)
Support for single and multi-label classification
Confidence scores for predictions
Evaluation against ground truth data
Random sampling capabilities for testing
Support for reproducibility

Installation

Install Using PyPI

pip install cognitum

Build From Source

This is currently tested using Apple Silicon M1 Max. Support for other systems is planned.

Requires Python >= 3.10

Clone the repository
Install dependencies:

# Install PyTorch
$ pip install torch torchvision

# Install llama-cpp-python with GPU support (for Apple Silicon)
# Review the installation instructions on the llama-cpp-python repo for your specific system. https://github.com/abetlen/llama-cpp-python?tab=readme-ov-file#installation
$ CMAKE_ARGS="-DGGML_METAL=on" pip install -U llama-cpp-python --no-cache-dir
$ pip install 'llama-cpp-python[server]'

# Install LMQL
$ pip install "lmql[hf]"

Download the model:

$ pip install -U "huggingface_hub[cli]"
$ huggingface-cli download bartowski/Llama-3.2-3B-Instruct-GGUF --include "Llama-3.2-3B-Instruct-Q4_0.gguf" --local-dir ./models

Usage

Basic Classification

Dataset

# Prepare your data
# data needs to be a list of tuples with first element being an identifier key and second element being a string of the text to be classified.
data = [
    ("id1", "text1"),
    ("id2", "text2"),
    ("id3", "text3"),
]
ds = Dataset(data)

Dataset objects have several methods.

hash method returns a unique hash for the dataset.

ds.hash()
# Returns: "a1b2c3d4e5f6g7h8i9j0"

sample method returns a random sample of the dataset where n is the number of samples to return and seed is the random seed to use for the sample.

ds.sample(n=3, seed=42)
# Returns: [("id2", "text2"), ("id3", "text3"), ("id1", "text1")]

Model

Model objects are configured as a predictor. You can pass prompts, valid labels, language model objects, and other parameters to the constructor.

# Configure and run model
# If using a local model refer to [lmql#344](https://github.com/eth-sri/lmql/issues/344) for how to structure the path.
model = Model(
    prompt="Review: {review}",
    valid_labels=["A", "B", "C"],
    model=lmql.model("llama.cpp:path/to/model.gguf")
)

Model objects have a predict method that takes a dataset as input and returns a list of predictions. Some models may return return a list of predictions per item in the dataset.

# Get predictions
predictions = model.predict(ds)
# Returns: [("id1", "A"), ("id2", "B"), ("id3", ["A", "C"])]

# Get predictions with confidence scores
predictions = model.predict(ds, return_confidences=True)
# Returns: [("id1", "A", 0.9), ("id2", "B", 0.8), ("id3", ["A", "C"], [0.7, 0.3])]

Evaluation

You can also use the evaluate method to test the model against ground truth data. This returns an overall score for exact matches, partial matches, and false positives.

scores = model.evaluate(ds, ground_truth)
# Returns: {"exact": 0.5, "partial": 0.5, "false_positives": 0.0}

Server Configuration

For optimal performance, run the LMQL server with GPU acceleration (for Apple Silicon):

lmql serve-model "llama.cpp:path/to/model.gguf" --n_ctx 1024 --n_gpu_layers -1

References & Further Reading

Future Improvements

Implementation of Chain of Thought reasoning
RAG (Retrieval Augmented Generation) support for historical response context
Vector-based classification methods
Support for additional classification tasks (policy comments, sentiment analysis, etc.)

Notes

Ok, now I know that this works, but has a looping error/warning message I don't understand.

Server code. Running with --n_gpu_layers -1 enables GPU acceleration and is faster than CPU. Its not as fast as llama.cpp.

lmql serve-model "llama.cpp:../../../../models/Llama-3.2-3B-Instruct-Q4_0.gguf" --n_ctx 1024 --n_gpu_layers -1

Client code. Note it must be run from within a function:

@lmql.query(
    model=lmql.model(
        "llama.cpp:../../../../models/Llama-3.2-3B-Instruct-Q4_0.gguf", 
        tokenizer="meta-llama/Llama-3.2-3B-Instruct",
    )
)

References used while installing and troubleshooting, note all have varrying degrees of correctness and usefulness.

https://lmql.ai/docs/lib/generations.html https://lmql.ai/docs/models/llama.cpp.html https://github.com/eth-sri/lmql/blob/3db7201403da4aebf092052d2e19ad7454158dd7/src/lmql/models/lmtp/backends/llama_cpp_model.py https://github.com/eth-sri/lmql/blob/main/src/lmql/api/llm.py#L68 https://github.com/eth-sri/lmql/blob/main/src/lmql/models/lmtp/README.md https://github.com/eth-sri/lmql/blob/3db7201403da4aebf092052d2e19ad7454158dd7/src/lmql/models/lmtp/lmtp_serve.py#L100 https://llama-cpp-python.readthedocs.io/en/latest/install/macos/ https://github.com/abetlen/llama-cpp-python?tab=readme-ov-file#installation

Prompt could include the codebook.

The codebook ought to be modifed to include a better description of each catetory.

Get fully functioning system working with just those steps before adding RAG. Get the success rate first.

Could also do RAG on the 2022 responses and provide context. Do this by getting embeddings for each response and then using a vector database to query for similar responses. Add the the whole row to the context under "Similar responses". Indicate that examples use an older version of the codebook, so if uncertian follow the description in the codebook.

Instruct to label 2 if not clear what the correct responce is supposed to be.

for codebook 3.0, should send to llm with code names masked but descriptions included and examples, and ask to create new simple code names. the 1.0 names are not good or clear. using code names instead of numbers may have performance benefits.

After reading this, we might want to use text codes instead of numeric codes.

Followed this course for prompting using proper chat tokens.

Could also add another method for classification of dataset that uses pure vector-based classification.

Can be used for other classification tasks. Like: Request for comment on policy Get valence (+ or -) and category of concern

Review example articles that do AI classification and validate against human coders.

Example:

Appendix B.2 provides examples of the resulting annotation. To validate this method, we compare the answers provided by GPT-engine to those provided by two independent research assistants for a random sample of 300 articles. Figure B2 shows that the agreement between Chat-GPT and a given human annotator is very similar to the agreement between two human annotators. We measure agreement by an accuracy score, i.e. the ratio of answers that are classified identically by GPT-engine and by the human annotator over the number of total answers. This lends confidence in the reliability of the method for this specific annotation task.23 https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4680430#page=47.86

Take a look at expected parrot. Possible alternative / inspiration: https://www.linkedin.com/pulse/adding-ai-your-r-data-analysis-pipeline-jeff-clement-czusc/

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.0.1.dev2 pre-release

Dec 12, 2024

0.0.1.dev1 pre-release

Dec 12, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cognitum-0.0.1.dev2.tar.gz (14.5 kB view details)

Uploaded Dec 12, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cognitum-0.0.1.dev2-py3-none-any.whl (10.3 kB view details)

Uploaded Dec 12, 2024 Python 3

File details

Details for the file cognitum-0.0.1.dev2.tar.gz.

File metadata

Download URL: cognitum-0.0.1.dev2.tar.gz
Upload date: Dec 12, 2024
Size: 14.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.10.15

File hashes

Hashes for cognitum-0.0.1.dev2.tar.gz
Algorithm	Hash digest
SHA256	`cca2bc49f47fa149f21c1c41108ce34994a8ba4dc08f2c2d48465902471238e1`
MD5	`2fa2291addbaa3f95b9903bf75ec1da2`
BLAKE2b-256	`a455104df0326f08f5292f7b0e2da79b0ff26b61df5fe4e597732601117c6026`

See more details on using hashes here.

File details

Details for the file cognitum-0.0.1.dev2-py3-none-any.whl.

File metadata

Download URL: cognitum-0.0.1.dev2-py3-none-any.whl
Upload date: Dec 12, 2024
Size: 10.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.10.15

File hashes

Hashes for cognitum-0.0.1.dev2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bec6530d847b18345b05f4441e10f2ab86ddd38b49fd13173ddf0fdf40d0f8e3`
MD5	`fa7f4232ddd012ce801643a5441696b0`
BLAKE2b-256	`5fe7dd2546e751bbe64e0f69b95b593bf4e86879adf49534dacc2e4474115c5d`

See more details on using hashes here.

cognitum 0.0.1.dev2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

cognitum

Overview

Features

Installation

Install Using PyPI

Build From Source

Usage

Basic Classification

Dataset

Model

Evaluation

Server Configuration

References & Further Reading

Future Improvements

Notes

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes