
LLMs on Apple silicon with MLX and the Hugging Face Hub

Project description

Generate Text with LLMs and MLX

The easiest way to get started is to install the mlx-lm package:

pip install mlx-lm

Python API

You can use mlx-lm as a module:

from mlx_lm import load, generate

# Download the model and tokenizer from the Hugging Face Hub (cached locally).
model, tokenizer = load("mistralai/Mistral-7B-v0.1")

# Generate a completion; verbose=True prints the generated text and timing stats.
response = generate(model, tokenizer, prompt="hello", verbose=True)
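
The generate call also accepts sampling controls. The keyword names below (max_tokens, temp) are assumptions about this version's signature rather than documented API, so confirm them with help(generate):

# A minimal sketch; max_tokens and temp are assumed keyword arguments.
# Check help(generate) for the exact signature of your installed version.
response = generate(
    model,
    tokenizer,
    prompt="Write a haiku about the ocean.",
    max_tokens=100,  # assumed cap on the number of generated tokens
    temp=0.7,        # assumed sampling temperature (0 would be greedy)
    verbose=True,
)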

To see a description of all the arguments, run:

>>> help(generate)

The mlx-lm package also includes functionality to quantize models and optionally upload them to the Hugging Face Hub.

You can convert models from the Python API with:

from mlx_lm import convert

# Quantize to 4 bits and push the converted weights to this Hub repo.
upload_repo = "mlx-community/My-Mistral-7B-v0.1-4bit"

convert("mistralai/Mistral-7B-v0.1", quantize=True, upload_repo=upload_repo)

This will generate a 4-bit quantized Mistral-7B and upload it to the repo mlx-community/My-Mistral-7B-v0.1-4bit. It will also save the converted model in the path mlx_model by default.
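
Once the conversion finishes, you can point load at the local output directory instead of a Hub repo. A minimal sketch, assuming the default mlx_model output path mentioned above:

from mlx_lm import load, generate

# Load the 4-bit weights from the local conversion output rather than the Hub.
model, tokenizer = load("mlx_model")
response = generate(model, tokenizer, prompt="hello", verbose=True)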

To see a description of all the arguments, run:

>>> help(convert)

Command Line

You can also use mlx-lm from the command line with:

python -m mlx_lm.generate --model mistralai/Mistral-7B-v0.1 --prompt "hello"

This will download a Mistral 7B model from the Hugging Face Hub and generate text using the given prompt.
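
Generation can be tuned with extra flags; the --max-tokens flag below is an assumption about this version's CLI, so check the --help output that follows to confirm it:

python -m mlx_lm.generate \
    --model mistralai/Mistral-7B-v0.1 \
    --prompt "Write a haiku about the ocean." \
    --max-tokens 100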

For a full list of options run:

python -m mlx_lm.generate --help

To quantize a model from the command line run:

python -m mlx_lm.convert --hf-path mistralai/Mistral-7B-v0.1 -q 

For more options run:

python -m mlx_lm.convert --help

You can upload new models to the Hugging Face Hub by passing --upload-repo to convert. For example, to upload a quantized Mistral-7B model to the MLX Hugging Face community, run:

python -m mlx_lm.convert \
    --hf-path mistralai/Mistral-7B-v0.1 \
    -q \
    --upload-repo mlx-community/my-4bit-mistral
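
Uploading requires a Hugging Face account with write access to the target repo or organization, so authenticate first with the Hugging Face CLI:

huggingface-cli login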

Supported Models

mlx-lm supports Hugging Face format Mistral, Llama, and Phi-2 style models. If the model you want to run is not supported, file an issue or, better yet, submit a pull request.

Most Mistral, Llama, Phi-2, and Mixtral style models on the Hugging Face Hub should work out of the box.
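
For example, since Phi-2 is one of the supported families, a small checkpoint such as microsoft/phi-2 should run with the same commands (whether this particular repo works in this exact version is an assumption; file an issue if it does not):

python -m mlx_lm.generate --model microsoft/phi-2 --prompt "hello"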

Download files

Download the file for your platform. If you're not sure which to choose, see the Python Packaging User Guide's tutorial on installing packages.

Source Distribution

mlx-lm-0.0.3.tar.gz (11.9 kB)

Built Distribution

mlx_lm-0.0.3-py3-none-any.whl (14.2 kB)

File details

Details for the file mlx-lm-0.0.3.tar.gz.

File metadata

  • Download URL: mlx-lm-0.0.3.tar.gz
  • Size: 11.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for mlx-lm-0.0.3.tar.gz:

  • SHA256: 57e11b6e359ebb496e0fa302e96b63af65855da53ea15ce43d05b08ef7f109d9
  • MD5: 284434381e8d93d4da25443fb80036bb
  • BLAKE2b-256: 3beda363caf42ad94238f04c0ded359a664a19f3bf5d309cb201d8f16c98c583

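To verify a download against the digests above, Python's hashlib is enough. A minimal sketch, assuming the archive sits in the current working directory:

import hashlib

# SHA256 digest published above for mlx-lm-0.0.3.tar.gz
expected = "57e11b6e359ebb496e0fa302e96b63af65855da53ea15ce43d05b08ef7f109d9"

with open("mlx-lm-0.0.3.tar.gz", "rb") as f:
    digest = hashlib.sha256(f.read()).hexdigest()

print("OK" if digest == expected else "hash mismatch")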

File details

Details for the file mlx_lm-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: mlx_lm-0.0.3-py3-none-any.whl
  • Size: 14.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for mlx_lm-0.0.3-py3-none-any.whl:

  • SHA256: 5981276d73b891562ac5dffe8318f8cba4429535f69b56509f41ac9ec294eeb7
  • MD5: 051700d70ab16f33d63ea815ec396b31
  • BLAKE2b-256: 64fe1ca7f2eda4bdb7d0ee5382b0ef06c400505ab56bab20084f7b92b53e94cc

