Fast and easy LLM serving.

Project description

mistral.rs

mistralrs is a Python package that provides an easy-to-use API for mistral.rs.

Example

More examples can be found here!

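from mistralrs import Runner, Which, ChatCompletionRequest

# Load the model by its Hugging Face model ID and apply
# in-situ quantization (Q4K) as the weights are loaded.
runner = Runner(
    which=Which.Plain(
        model_id="microsoft/Phi-3.5-mini-instruct",
    ),
    in_situ_quant="Q4K",
)

# Send a single-turn chat request with low-temperature sampling.
res = runner.send_chat_completion_request(
    ChatCompletionRequest(
        model="mistral",
        messages=[
            {"role": "user", "content": "Tell me a story about the Rust type system."}
        ],
        max_tokens=256,
        presence_penalty=1.0,
        top_p=0.1,
        temperature=0.1,
    )
)
# Print the generated text, then the token usage statistics.
print(res.choices[0].message.content)
print(res.usage)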
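
For multi-turn use, one plausible approach is to resend the accumulated message list with each request. The following is a minimal sketch, not part of the package: `append_turn` and `ask` are hypothetical helpers, the "assistant" role is assumed to follow the usual OpenAI-style chat convention, and the only mistralrs calls used are the ones that already appear in the example (`send_chat_completion_request`, `ChatCompletionRequest`, `res.choices[0].message.content`).

```python
from typing import Dict, List

Message = Dict[str, str]


def append_turn(history: List[Message], role: str, content: str) -> List[Message]:
    """Return a new message list with one more turn; the input list is not mutated."""
    if role not in ("user", "assistant"):
        raise ValueError(f"unexpected role: {role!r}")
    return history + [{"role": role, "content": content}]


def ask(runner, history: List[Message], prompt: str, max_tokens: int = 256):
    """Send the history plus a new user prompt; return (reply, updated history).

    `runner` is assumed to be a mistralrs Runner constructed as in the example.
    """
    # Imported lazily so append_turn stays usable without the package installed.
    from mistralrs import ChatCompletionRequest

    messages = append_turn(history, "user", prompt)
    res = runner.send_chat_completion_request(
        ChatCompletionRequest(model="mistral", messages=messages, max_tokens=max_tokens)
    )
    reply = res.choices[0].message.content
    # Record the model's answer so the next call sees the full conversation.
    return reply, append_turn(messages, "assistant", reply)
```

A conversation would then start with `reply, history = ask(runner, [], "Hello")` and continue by passing the returned `history` into the next `ask` call.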

The API docs are here and the type stubs are here; the stubs are another useful form of documentation.

We also provide a cookbook here!

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files are available for this release. See the tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mistralrs_cuda-0.5.0-cp312-none-win_amd64.whl (19.0 MB)

Uploaded for: CPython 3.12, Windows x86-64

mistralrs_cuda-0.5.0-cp312-cp312-manylinux_2_39_x86_64.whl (24.7 MB)

Uploaded for: CPython 3.12, manylinux (glibc 2.39+), x86-64

mistralrs_cuda-0.5.0-cp311-none-win_amd64.whl (19.0 MB)

Uploaded for: CPython 3.11, Windows x86-64

mistralrs_cuda-0.5.0-cp311-cp311-manylinux_2_39_x86_64.whl (24.8 MB)

Uploaded for: CPython 3.11, manylinux (glibc 2.39+), x86-64

mistralrs_cuda-0.5.0-cp310-none-win_amd64.whl (19.0 MB)

Uploaded for: CPython 3.10, Windows x86-64

mistralrs_cuda-0.5.0-cp310-cp310-manylinux_2_39_x86_64.whl (24.8 MB)

Uploaded for: CPython 3.10, manylinux (glibc 2.39+), x86-64
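
To make the file name format concrete, here is a small sketch that splits a wheel file name into its components, following the standard wheel naming scheme (distribution, version, optional build tag, Python tag, ABI tag, platform tag). `WheelName` and `parse_wheel_name` are hypothetical names introduced for illustration only; they are not part of mistralrs or pip.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of components in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


info = parse_wheel_name("mistralrs_cuda-0.5.0-cp312-cp312-manylinux_2_39_x86_64.whl")
print(info.python_tag, info.abi_tag, info.platform_tag)
# prints: cp312 cp312 manylinux_2_39_x86_64
```

For the Windows wheels in the list, the same function reports an ABI tag of `none` and a platform tag of `win_amd64`.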

File details

Details for the file mistralrs_cuda-0.5.0-cp312-none-win_amd64.whl.

File hashes

Hashes for mistralrs_cuda-0.5.0-cp312-none-win_amd64.whl
Algorithm Hash digest
SHA256 415f6fe858bd8f6d83e4156653dcf607215ff8b62c8ebf1fd2c39a0ae63e61fc
MD5 171980513002a2dcbab59ab7e89922e8
BLAKE2b-256 812b3e3ac9cec4e627a193c9e91673f6dcd13e3599b89f2033ad119aa470b37a

See more details on using hashes here.
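
As an illustration of how a published digest can be used, the sketch below computes the SHA256 of a downloaded file with Python's standard `hashlib` and compares it with the expected value. The helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local file path is whatever location the wheel was saved to.

```python
import hashlib
import hmac


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA256 with a published hex digest (case-insensitive)."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())
```

For the Windows CPython 3.12 wheel, the check would be `matches_published_hash("mistralrs_cuda-0.5.0-cp312-none-win_amd64.whl", "415f6fe858bd8f6d83e4156653dcf607215ff8b62c8ebf1fd2c39a0ae63e61fc")`, assuming the file sits in the current directory.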

File details

Details for the file mistralrs_cuda-0.5.0-cp312-cp312-manylinux_2_39_x86_64.whl.

File hashes

Hashes for mistralrs_cuda-0.5.0-cp312-cp312-manylinux_2_39_x86_64.whl
Algorithm Hash digest
SHA256 a815fe8dd962a19ed7baa42e918bae93817275ae46df02c435f611699a7cde63
MD5 6fb3d4dad0289b2755f0f7b5ed133051
BLAKE2b-256 3f7e656bc858a4156cd51ddd9ca68b780672087c7d39707baea4ff31f9ab180b

File details

Details for the file mistralrs_cuda-0.5.0-cp311-none-win_amd64.whl.

File hashes

Hashes for mistralrs_cuda-0.5.0-cp311-none-win_amd64.whl
Algorithm Hash digest
SHA256 5fa92f93664f1236a4fc2cce357e7ba0e9ece27c0118c08b2e1f660d91911813
MD5 9f9df918befb69ec52cd7097da36331b
BLAKE2b-256 2a0bca8d77371dda71991d4c643aa62e5f630e042fc5307dd0019520b5390f9c

File details

Details for the file mistralrs_cuda-0.5.0-cp311-cp311-manylinux_2_39_x86_64.whl.

File hashes

Hashes for mistralrs_cuda-0.5.0-cp311-cp311-manylinux_2_39_x86_64.whl
Algorithm Hash digest
SHA256 daaa2de97ce4a9a7214f6786c278415a12bc5a85ace0c27bc121ce541150984c
MD5 0ad9a96bd6e72d43745c3d7104376714
BLAKE2b-256 12c053a2c13218f022dfdc631ae2571094c71e3e44f4025153fe1170b6d6d22d

File details

Details for the file mistralrs_cuda-0.5.0-cp310-none-win_amd64.whl.

File hashes

Hashes for mistralrs_cuda-0.5.0-cp310-none-win_amd64.whl
Algorithm Hash digest
SHA256 71430eaa984cba16f624e8e01c41dc41d98469685a13f3504df68e1a6cc123b5
MD5 0fe9ada9317f985f1ca100b26ba26a04
BLAKE2b-256 7d62581750214cfb9136b9a16430f7973c1679529239e5205808c04cdbc6c15f

File details

Details for the file mistralrs_cuda-0.5.0-cp310-cp310-manylinux_2_39_x86_64.whl.

File hashes

Hashes for mistralrs_cuda-0.5.0-cp310-cp310-manylinux_2_39_x86_64.whl
Algorithm Hash digest
SHA256 8562d72fe15081e2a5e7b21671f4e356c1124c29668e5b55c4de8b8c6f16add8
MD5 fa94ff12854c3b2536fff9b6b85ef57a
BLAKE2b-256 d58ee753fac407be18c8d5b2ec6e4704e127f718ef3d35ff3a54aba7959b67e7
