LLM-powered estimators for scikit-learn pipelines

promptlearn

promptlearn brings large language models into your scikit-learn workflow. It replaces traditional estimators with language-native reasoning systems that learn, adapt, and describe patterns using natural language as the model substrate. The result of training is portable Python code, which runs in a safe sandbox environment during predict() calls.

📊 Outperforming Traditional Models with Built-In Knowledge

promptlearn allows LLMs to internalize both structure and semantics during training. As a result, the models often exceed the capabilities of classical estimators when the task requires reasoning, real-world knowledge, or symbolic understanding.

Consider a simple binary classification task: predicting whether an animal is a mammal based on its name, weight, and lifespan.

Traditional models depend solely on the input features. promptlearn models can also draw on their internal understanding of zoology to form highly accurate rules. Even when an animal such as "Whale" never appears in the training data, the model knows it belongs to the mammal class.

                    model  accuracy  fit_time_sec  predict_time_sec
      promptlearn_o3-mini      0.94     49.114336          0.002808
      promptlearn_o4-mini      0.86     60.961045          0.002417
promptlearn_gpt-3.5-turbo      0.66     20.246616          0.002738
       promptlearn_gpt-4o      0.66     43.930959          0.002250
      logistic_regression      0.60      0.016565          0.000962
            decision_tree      0.53      0.001409          0.000529
        gradient_boosting      0.53      0.020737          0.001094
        promptlearn_gpt-4      0.40     12.494963          0.002196
                    dummy      0.34      0.000554          0.000120
            random_forest      0.28      0.010656          0.001659

This type of semantic generalization is a powerful advantage for LLM-backed models.

Now compare performance on a regression task where the data contains samples of objects falling from different heights, under different gravity. This is a classic physics problem, with a well-known equation:

fall_time_s = sqrt((2 * height_m) / gravity_mps2)

promptlearn estimators backed by recent models (gpt-4o, o3-mini, o4-mini) recover this exact formula and use it to generate near-perfect predictions:

                    model      mse  fit_time_sec  predict_time_sec
       promptlearn_gpt-4o    0.000         2.924             0.001
      promptlearn_o3-mini    0.000        10.801             0.001
      promptlearn_o4-mini    0.000         7.959             0.001
            random_forest    0.028         0.013             0.002
        gradient_boosting    0.035         0.011             0.001
            decision_tree    0.067         0.001             0.000
        linear_regression    0.498         0.001             0.000
                    dummy    5.273         0.001             0.000
promptlearn_gpt-3.5-turbo   18.193         3.009             0.002
        promptlearn_gpt-4  855.445         2.428             0.001

No feature engineering was performed. No physics constants were added. The model discovered the rule and applied it directly. Classical regressors, by contrast, approximated a curve but missed the exact structure.
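For reference, the fall-time formula above is short enough to write by hand. The sketch below is only an illustration of the rule itself; the function name, the input check, and the sample values are assumptions and are not taken from the promptlearn benchmark or from any generated heuristic.

```python
import math


def fall_time_s(height_m: float, gravity_mps2: float) -> float:
    """Seconds for an object to fall height_m from rest, ignoring air resistance."""
    if height_m < 0 or gravity_mps2 <= 0:
        raise ValueError("height_m must be >= 0 and gravity_mps2 must be > 0")
    return math.sqrt((2 * height_m) / gravity_mps2)


# Same drop height under two different gravities (illustrative values)
for gravity in (9.81, 1.62):
    print(gravity, round(fall_time_s(20.0, gravity), 3))
```

A model that recovers this expression has nothing left to approximate, which is consistent with the near-zero errors in the table.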

These results highlight the practical benefit of reasoning models: they learn compact, expressive heuristics and can outperform traditional systems when symbolic insight or background knowledge is essential.

🤖 Estimators Powered by Language

promptlearn provides scikit-learn-compatible estimators that use LLMs as the modeling engine:

PromptClassifier – for predicting classes through generalized reasoning
PromptRegressor – for modeling numeric relationships in data

These estimators follow the same API as other scikit-learn models (fit, predict, score) but operate via dynamic prompt construction and few-shot abstraction.
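Because the interface is limited to fit, predict, and score, evaluation code written against those three methods does not need to know which model sits behind them. The sketch below illustrates that idea with a stand-in classifier; `Estimator`, `MajorityClassifier`, and `evaluate` are hypothetical names for this example and are not part of promptlearn or scikit-learn.

```python
from typing import Protocol, Sequence


class Estimator(Protocol):
    """The three methods named above; any object that has them fits."""

    def fit(self, X: Sequence, y: Sequence) -> "Estimator": ...
    def predict(self, X: Sequence) -> list: ...
    def score(self, X: Sequence, y: Sequence) -> float: ...


class MajorityClassifier:
    """Stand-in model: always predicts the most common training label."""

    def fit(self, X, y):
        self.majority_ = max(set(y), key=list(y).count)
        return self

    def predict(self, X):
        return [self.majority_ for _ in X]

    def score(self, X, y):
        predictions = self.predict(X)
        return sum(p == t for p, t in zip(predictions, y)) / len(y)


def evaluate(model: Estimator, X_train, y_train, X_test, y_test) -> float:
    """Fit on the training split and return accuracy on the test split."""
    return model.fit(X_train, y_train).score(X_test, y_test)


X_train, y_train = [["Dog"], ["Cat"], ["Trout"]], [1, 1, 0]
X_test, y_test = [["Whale"], ["Salmon"]], [1, 0]
print(evaluate(MajorityClassifier(), X_train, y_train, X_test, y_test))  # 0.5
```

Under the assumption that PromptClassifier follows the same contract, it could be passed to `evaluate` in place of the stand-in.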

📘 What it Learns: The Heuristic

When you call .fit(), the LLM reviews your data and generates executable Python code that encodes the relationships it finds.

The result is a plain-text piece of Python code that is readable, portable, and expressive. It is stored in .heuristic_, and it powers all predictions.
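To make the idea of a text heuristic concrete, here is a minimal sketch of how code stored as a string could be turned into a prediction function. Everything in it is an assumption for illustration: the heuristic text, the `predict(row)` signature, and the `load_heuristic` helper are hypothetical, and emptying the builtins is not a real security sandbox and does not describe how promptlearn isolates generated code.

```python
# Hypothetical heuristic text, similar in spirit to what .heuristic_ might hold.
heuristic_source = '''
def predict(row):
    mammals = {"dog", "cat", "whale", "bat"}
    return 1 if row["name"].lower() in mammals else 0
'''


def load_heuristic(source: str):
    """Execute heuristic text and return the predict function it defines."""
    namespace = {"__builtins__": {}}  # strips builtins; NOT a real sandbox
    exec(source, namespace)
    return namespace["predict"]


predict = load_heuristic(heuristic_source)
rows = [{"name": "Whale"}, {"name": "Trout"}]
print([predict(row) for row in rows])  # [1, 0]
```

Since the heuristic is ordinary text, it can be read, diffed, and reviewed like any other source file.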

🧠 Language-Aware Reasoning

Because the models are backed by LLMs, they can reason across both structure and semantics:

Names of columns matter
Missing data can be explained or inferred
World knowledge is available by default

A trained model might use context like:

“Bachelors” typically correlates with medium income
“Private” workclass often means lower capital gain
Rows with missing native-country likely default to “United States”

This allows reasoning across incomplete, skewed, or lightly structured data without hand-tuning features.

🧬 Background Knowledge Included

The LLM brings its internal knowledge graph to the modeling task. For instance:

Input: country = "Norway"
Output: has_blue_in_flag = 1

Even if there is no signal in the data, the model may still predict correctly by referencing background information. This creates a kind of ambient “web join” during training. The join is materialized in the generated code as an explicit list or dictionary that extends the categorical values encountered during training with further entries, so that unseen cases are covered. This can include countries, flags, animals, and more.
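A materialized lookup of this kind might resemble the sketch below. The dictionary contents, the function name, and the fallback value are assumptions made for illustration; an actual generated heuristic would contain whatever entries the LLM chose to write out.

```python
# Hypothetical lookup table of the kind a generated heuristic might contain.
# Entries beyond those seen in training would come from the LLM's background knowledge.
HAS_BLUE_IN_FLAG = {
    "Norway": 1,
    "France": 1,
    "Sweden": 1,
    "Japan": 0,
    "Canada": 0,
}


def predict_has_blue(country: str, default: int = 0) -> int:
    """Look up a country after normalizing whitespace and case; fall back to default."""
    return HAS_BLUE_IN_FLAG.get(country.strip().title(), default)


print(predict_has_blue("Norway"))   # 1
print(predict_has_blue(" japan "))  # 0
```

The fallback matters because any value missing from the table still needs a prediction at predict() time.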

🕳 Zero-Example Learning

If you call .fit() with no rows, only column names, promptlearn will still return a working model.

This is possible because the LLM can hallucinate a plausible mapping based on:

Column names
Prior knowledge
Type hints or value patterns

This makes rapid prototyping and conceptual modeling trivial.

🧪 Native `.sample()` Support

You can generate synthetic rows directly from any trained model using .sample(n):

>>> model.sample(3)
fruit    is_citrus
Lime     1
Banana   0
Orange   1

This is useful for:

Understanding what the model believes
Creating test sets or bootstrapped data
Building readable examples from internal logic

💾 Save and Reload with `joblib`

Like any scikit-learn model, promptlearn estimators can be serialized:

import joblib

# Write the fitted estimator, including its heuristic, to disk
joblib.dump(model, "model.joblib")
# Load it back for later use
model = joblib.load("model.joblib")

The LLM client is excluded from the saved file and re-initialized on load. The heuristic remains intact, interpretable, and ready to use.
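One common way to get this behavior in Python is to customize pickling with `__getstate__` and `__setstate__`. The sketch below shows the pattern with the standard pickle module; `HeuristicModel`, `_client`, and `_make_client` are hypothetical names, and this is not a description of promptlearn's actual implementation.

```python
import pickle


class HeuristicModel:
    """Hypothetical stand-in for an estimator that holds an unpicklable client."""

    def __init__(self, heuristic: str):
        self.heuristic_ = heuristic
        self._client = self._make_client()

    @staticmethod
    def _make_client():
        # Placeholder for a live LLM client. A lambda is used because the
        # standard pickle module cannot serialize it.
        return lambda prompt: f"response to {prompt!r}"

    def __getstate__(self):
        state = self.__dict__.copy()
        state.pop("_client", None)  # leave the client out of the saved state
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)
        self._client = self._make_client()  # rebuild the client on load


model = HeuristicModel("def predict(row): return 1")
restored = pickle.loads(pickle.dumps(model))
print(restored.heuristic_ == model.heuristic_)  # True
print(callable(restored._client))               # True
```

With this pattern the saved file contains only the heuristic and other plain attributes, so it stays small and carries no connection state.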

📚 Related Work

Scikit-LLM

Scikit-LLM provides zero- and few-shot classification through template-based prompting.
It is lightweight and NLP-focused.

promptlearn offers a broader modeling philosophy:

Capability	Scikit-LLM	promptlearn
Produces runnable Python code	❌ No	✅ Yes
Regression support	❌ No	✅ Yes

📁 License

Release history

0.5.0 (Jun 24, 2026)
0.4.1 (Jun 24, 2026)
0.3.0 (Jul 13, 2025)
0.2.3 (Jul 12, 2025)
0.2.2 (Jul 10, 2025)
0.2.1 (Jul 10, 2025)
0.2.0 (Jul 9, 2025), this version
0.1.3 (Jul 8, 2025)
0.1.2 (Jul 8, 2025)
0.1.1 (Jul 6, 2025)
0.1.0 (Jul 4, 2025)

Download files

Source Distribution

promptlearn-0.2.0.tar.gz (13.9 kB view details)

Uploaded Jul 9, 2025 Source

Built Distribution

promptlearn-0.2.0-py3-none-any.whl (13.9 kB view details)

Uploaded Jul 9, 2025 Python 3

File details

Details for the file promptlearn-0.2.0.tar.gz.

File metadata

Download URL: promptlearn-0.2.0.tar.gz
Upload date: Jul 9, 2025
Size: 13.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for promptlearn-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`d6e0e2a3dc2078bd07f8dd006a1a1b2afbff50d0d78976b3081c6cff65f6a1a7`
MD5	`33358ed520df2e5a2cac3ba807d102a8`
BLAKE2b-256	`8d2dbe9f9a2066afbbfcfd7003f3a177f19f479a9593b872ece4df2a1cdf884d`

File details

Details for the file promptlearn-0.2.0-py3-none-any.whl.

File metadata

Download URL: promptlearn-0.2.0-py3-none-any.whl
Upload date: Jul 9, 2025
Size: 13.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for promptlearn-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da89665ba8f45db3a2b7845b02ddf85c69f4e5db8e364234c110fb7e66491129`
MD5	`bae3a380328a8f16ac692fe3eca5b601`
BLAKE2b-256	`823f67e288b32b3316153fab68ffb2eaeab49fc27443be9c1b8ab62af263b924`
