A framework for building ML models from natural language

These details have not been verified by PyPI

Project links

Project description

smolmodels ✨

Build machine learning models using natural language and minimal code

Quickstart | Features | Installation & Setup | Documentation | Benchmarks

Create machine learning models with minimal code by describing what you want them to do in plain words. You explain the task, and the library builds a model for you, including data generation, feature engineering, training, and packaging.

[!NOTE] This library is in early development, and we're actively working on new features and improvements! Please report any bugs or share your feature requests on GitHub or Discord 💛

1. Quickstart

Installation:

pip install smolmodels

Define, train and save a Model:

import smolmodels as sm

# Step 1: define the model
model = sm.Model(
    intent="Predict sentiment on a news article such that [...]",
    input_schema={"headline": str, "content": str},         # [optional - can be pydantic or dict]
    output_schema={"sentiment": str}                        # [optional - can be pydantic or dict]
)

# Step 2: build and train the model on data
model.build(
   datasets=[dataset, auxiliary_dataset],
   provider="openai/gpt-4o-mini",
   timeout=3600
)

# Step 3: use the model to get predictions on new data
sentiment = model.predict({
   "headline": "600B wiped off NVIDIA market cap",
   "content": "NVIDIA shares fell 38% after [...]",
})

# Step 4: save the model, can be loaded later for reuse
sm.save_model(model, "news-sentiment-predictor")

# Step 5: load a saved model and use it
loaded_model = sm.load_model("news-sentiment-predictor.tar.gz")

2. Features

smolmodels combines graph search, LLM code/data generation and code execution to produce a machine learning model that meets the criteria of the task description. When you call model.build(), the library generates a graph of possible model solutions, evaluates them, and selects the one that maximises the performance metric for this task.

2.1. 💬 Define Models using Natural Language

A model is defined as a transformation from an input schema to an output schema, which behaves according to an intent. The schemas can be defined either using pydantic models, or plain dictionaries that are convertible to pydantic models.

# This defines the model's identity
model = sm.Model(
    intent="Predict sentiment on a news article such that [...]",
    input_schema={"headline": str, "content": str},                 # supported: pydantic or dict
    output_schema={"sentiment": str}                                # supported: pydantic or dict
)

You describe the model's expected behaviour in plain English. The library will select a metric to optimise for, and produce logic for feature engineering, model training, evaluation, and so on.

2.2. 🎯 Model Building

The model is built by calling model.build(). This method takes one or more datasets and generates a set of possible model solutions, training and evaluating them to select the best one. The model with the highest performance metric becomes the "implementation" of the predictor.

You can specify the model building cutoff in terms of a timeout, a maximum number of solutions to explore, or both.

model.build(
    datasets=[dataset_a, dataset_b],
    provider="openai/gpt-4o-mini",
    timeout=3600,                       # [optional] max time in seconds
    max_iterations=10                   # [optional] max number of model solutions to explore
)

The model can now be used to make predictions, and can be saved or loaded using sm.save_model() or sm.load_model().

sentiment = model.predict({"headline": "600B wiped off NVIDIA market cap", ...})

2.3. 🎲 Data Generation and Schema Inference

The library can generate synthetic data for training and testing. This is useful if you have no data available, or want to augment existing data. You can do this with the sm.DatasetGenerator class:

dataset = sm.DatasetGenerator(
    schema={"headline": str, "content": str, "sentiment": str},  # supported: pydantic or dict
    data=existing_data
)
dataset.generate(1000)

model.build(
    datasets=[dataset],
    ...
)

[!CAUTION] Data generation can consume a lot of tokens. Start with a conservative generate_samples value and increase it if needed.

The library can also infer the input and/or output schema of your predictor, if required. This is based either on the dataset you provide, or on the model's intent. This can be useful when you don't know what the model should look like. As with the models, you can specify the schema using pydantic models or plain dictionaries.

# In this case, the library will infer a schema from the intent and generate data for you
model = sm.Model(intent="Predict sentiment on a news article such that [...]")
model.build(provider="openai/gpt-4o-mini")

[!TIP] If you know how the model will be used, you will get better results by specifying the schema explicitly. Schema inference is primarily intended to be used if you don't know what the input/output schema at prediction time should be.

2.4. 🌐 Multi-Provider Support

You can use multiple LLM providers for model generation. Specify the provider and model in the format provider/model:

model.build(provider="openai/gpt-4o-mini", ...)

See the section on installation and setup for more details on supported providers and how to configure API keys.

3. Installation & Setup

Install the library in the usual manner:

pip install smolmodels

Set your API key as an environment variable based on which provider you want to use. For example:

# For OpenAI
export OPENAI_API_KEY=<your-API-key>
# For Anthropic
export ANTHROPIC_API_KEY=<your-API-key>
# For Gemini
export GEMINI_API_KEY=<your-API-key>

[!TIP] The library uses LiteLLM as its provider abstraction layer. For other supported providers and models, check the LiteLLM documentation.

4. Documentation

For full documentation, visit docs.plexe.ai.

5. Benchmarks

Performance evaluated on 20 OpenML benchmark datasets and 12 Kaggle competitions. Higher performance observed on 12/20 OpenML datasets, with remaining datasets showing performance within 0.005 of baseline. Experiments conducted on standard infrastructure (8 vCPUs, 30GB RAM) with 1-hour runtime limit per dataset.

Complete code and results are available at plexe-ai/plexe-results.

6. Contributing

We love contributions! You can get started with issues, submitting a PR with improvements, or joining the Discord to chat with the team. See CONTRIBUTING.md for detailed guidelines.

7. License

Apache-2.0 License - see LICENSE for details.

8. Docker Deployment

Run smolmodels as a platform with a RESTful API and web UI using Docker:

git clone https://github.com/plexe-ai/smolmodels.git
cd smolmodels/docker
cp .env.example .env  # Edit with your LLM provider API key
docker-compose up -d

Access your deployment:

API: http://localhost:8000
Web UI: http://localhost:8501

The web interface provides an easy way to create models, view their status, and make predictions without writing code. See the Docker README for more details.

9. Product Roadmap

Fine-tuning and transfer learning for small pre-trained models
Use Pydantic for schemas and split data generation into a separate module
Smolmodels self-hosted platform ⭐ (More details coming soon!)
Support for non-tabular data types in model generation
File upload to docker containers

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.15.0

Apr 15, 2025

0.14.0

Apr 10, 2025

0.13.0

Apr 4, 2025

0.12.6

Apr 4, 2025

0.12.5

Apr 3, 2025

0.12.4

Mar 27, 2025

0.12.3

Mar 27, 2025

0.12.1

Mar 26, 2025

0.12.0

Mar 19, 2025

0.11.1

Mar 15, 2025

0.11.0

Mar 10, 2025

This version

0.10.0

Mar 9, 2025

0.9.3

Mar 4, 2025

0.9.2

Feb 25, 2025

0.9.1

Feb 25, 2025

0.9.0

Feb 21, 2025

0.8.2

Feb 21, 2025

0.8.1

Feb 21, 2025

0.8.0

Feb 20, 2025

0.7.1

Feb 20, 2025

0.7.0

Feb 15, 2025

0.6.0

Feb 12, 2025

0.5.3

Feb 11, 2025

0.5.2

Feb 9, 2025

0.5.1

Feb 8, 2025

0.5.0

Feb 4, 2025

0.4.0

Feb 4, 2025

0.3.2

Feb 1, 2025

0.3.1

Feb 1, 2025

0.3.0

Feb 1, 2025

0.2.0

Jan 31, 2025

0.1.2

Jan 30, 2025

0.1.1

Jan 29, 2025

0.1.0

Jan 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smolmodels-0.10.0.tar.gz (58.5 kB view details)

Uploaded Mar 9, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

smolmodels-0.10.0-py3-none-any.whl (81.6 kB view details)

Uploaded Mar 9, 2025 Python 3

File details

Details for the file smolmodels-0.10.0.tar.gz.

File metadata

Download URL: smolmodels-0.10.0.tar.gz
Upload date: Mar 9, 2025
Size: 58.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.12.9 Linux/6.8.0-1021-azure

File hashes

Hashes for smolmodels-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`e3d45cec48606d38875e94551d70dc55d2d96e52c9beccd32dc8739dbd8cd071`
MD5	`9525eab0946ecdb2e27b6717cc298913`
BLAKE2b-256	`b3b182e3e9de790cacff4d8f9720434965f1bdb19b9d0a5a0a36d95576635af1`

See more details on using hashes here.

File details

Details for the file smolmodels-0.10.0-py3-none-any.whl.

File metadata

Download URL: smolmodels-0.10.0-py3-none-any.whl
Upload date: Mar 9, 2025
Size: 81.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.12.9 Linux/6.8.0-1021-azure

File hashes

Hashes for smolmodels-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`95276e5cca53709350592c4f8e3630bc9ab2e15c02edcbc3d3806caec1dd96b2`
MD5	`c4c1791e6cf7f20d94d3583ef35fb1a7`
BLAKE2b-256	`87fff5d9413e98628a524735bc1300fae1a1d9ae3cd41629c9e545d4bcd290ce`

See more details on using hashes here.

smolmodels 0.10.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

smolmodels ✨

1. Quickstart

2. Features

2.1. 💬 Define Models using Natural Language

2.2. 🎯 Model Building

2.3. 🎲 Data Generation and Schema Inference

2.4. 🌐 Multi-Provider Support

3. Installation & Setup

4. Documentation

5. Benchmarks

6. Contributing

7. License

8. Docker Deployment

9. Product Roadmap

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes