Simple inference for large language models

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Language Models

Python building blocks to explore large language models in as little as 512MB of RAM

This package makes using large language models from Python as simple as possible. All inference is performed locally to keep your data private by default.

Installation and Getting Started

This package can be installed using the following command:

pip install languagemodels

Once installed, you should be able to interact with the package in Python as follows:

>>> import languagemodels as lm
>>> lm.do("What color is the sky?")
'The color of the sky is blue.'

This will require downloading a significant amount of data (~250MB) on the first run. Models will be cached for later use and subsequent calls should be quick.

Example Usage

Here are some usage examples shown as Python REPL sessions. They should work in the REPL, in notebooks, or in traditional scripts and applications.

Instruction Following

>>> import languagemodels as lm

>>> lm.do("Translate to English: Hola, mundo!")
'Hello, world!'

>>> lm.do("What is the capital of France?")
'Paris.'

Outputs can be restricted to a list of choices if desired:

>>> lm.do("Is Mars larger than Saturn?", choices=["Yes", "No"])
'No'
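
The `choices` argument shown above is enough to build a small text classifier. The following is a minimal sketch, not part of the package: `classify` and its prompt wording are hypothetical, and the `do` parameter exists only so the helper can be tried with a stand-in function before any model is downloaded.

```python
from typing import Callable, Optional, Sequence


def classify(text: str, labels: Sequence[str],
             do: Optional[Callable[..., str]] = None) -> str:
    """Return one of `labels` for `text` by restricting the model's output."""
    if do is None:
        # Imported lazily so the helper can be exercised without the package.
        import languagemodels as lm
        do = lm.do
    # The prompt wording is an assumption; adjust it for your task.
    prompt = f"Classify as {', '.join(labels)}: {text}"
    return do(prompt, choices=list(labels))
```

Calling `classify("I loved this film", ["positive", "negative"])` would pass both labels as `choices`, so the result is always one of the supplied labels.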

Adjusting Model Performance

The base model should run quickly on any system with 512MB of memory, but this memory limit can be increased to select more powerful models that will consume more resources. In the example below, the base model gets the arithmetic wrong, while the larger model selected by raising the limit answers correctly:

>>> import languagemodels as lm
>>> lm.do("If I have 7 apples then eat 5, how many apples do I have?")
'You have 8 apples.'
>>> lm.config["max_ram"] = "4gb"
>>> lm.config["max_ram"]
4.0
>>> lm.do("If I have 7 apples then eat 5, how many apples do I have?")
'I have 2 apples left.'
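
The session above shows the string "4gb" being reported as 4.0, which suggests the setting is normalized to a number of gigabytes. The sketch below illustrates one way such a conversion could work. `parse_ram_gb` is a hypothetical helper, and the set of accepted suffixes ("gb", "mb", or a bare number) is an assumption rather than the package's documented behavior.

```python
import re


def parse_ram_gb(value) -> float:
    """Convert values such as "4gb", "512mb", or 4 to gigabytes as a float."""
    if isinstance(value, (int, float)):
        return float(value)
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*(gb|mb)?\s*", value.lower())
    if match is None:
        raise ValueError(f"Unrecognized memory size: {value!r}")
    number = float(match.group(1))
    unit = match.group(2) or "gb"  # assumption: bare numbers mean gigabytes
    return number / 1024 if unit == "mb" else number


print(parse_ram_gb("4gb"))    # 4.0
print(parse_ram_gb("512mb"))  # 0.5
```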

GPU Acceleration

If you have an NVIDIA GPU with CUDA available, you can opt in to using the GPU for inference:

>>> import languagemodels as lm
>>> lm.config["device"] = "auto"

Text Completions

>>> import languagemodels as lm

>>> lm.complete("She hid in her room until")
'she was sure she was safe'

External Retrieval

Helper functions are provided to retrieve text from external sources that can be used to augment prompt context.

>>> import languagemodels as lm

>>> lm.get_wiki('Chemistry')
'Chemistry is the scientific study...

>>> lm.get_weather(41.8, -87.6)
'Partly cloudy with a chance of rain...

>>> lm.get_date()
'Friday, May 12, 2023 at 09:27AM'

Here's an example showing how this can be used to give the model information it would not otherwise have:

>>> lm.do(f"It is {lm.get_date()}. What time is it?")
'The time is 12:53PM.'
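
To experiment with this kind of prompt without calling the model, the date string can be built with the standard library. This is a sketch only: `format_date` and `time_prompt` are hypothetical names, and the format string is inferred from the single `lm.get_date()` output shown above, so details such as day padding are assumptions.

```python
from datetime import datetime


def format_date(now: datetime) -> str:
    """Format a datetime in the style of the get_date() output shown above."""
    return now.strftime("%A, %B %d, %Y at %I:%M%p")


def time_prompt(question: str, now: datetime) -> str:
    """Prefix a question with the current date and time as context."""
    return f"It is {format_date(now)}. {question}"


# With Python's default locale this prints:
# It is Friday, May 12, 2023 at 09:27AM. What time is it?
print(time_prompt("What time is it?", datetime(2023, 5, 12, 9, 27)))
```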

Semantic Search

Semantic search is provided to retrieve documents that may provide helpful context from a document store.

>>> import languagemodels as lm
>>> lm.store_doc(lm.get_wiki("Python"), "Python")
>>> lm.store_doc(lm.get_wiki("C language"), "C")
>>> lm.store_doc(lm.get_wiki("Javascript"), "Javascript")
>>> lm.get_doc_context("What does it mean for batteries to be included in a language?")
'From Python document: It is often described as a "batteries included" language due to its comprehensive standard library.Guido van Rossum began working on Python in the late 1980s as a successor to the ABC programming language and first released it in 1991 as Python 0.9.

From C document: It was designed to be compiled to provide low-level access to memory and language constructs that map efficiently to machine instructions, all with minimal runtime support.'
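
Retrieved context like this can be placed in front of a question to build a simple document question answering step. The following is a minimal sketch under stated assumptions: `answer_from_docs` and its prompt template are hypothetical, and the two callables are passed in explicitly so the flow can be tested with stand-ins.

```python
from typing import Callable


def answer_from_docs(question: str,
                     get_context: Callable[[str], str],
                     do: Callable[[str], str]) -> str:
    """Retrieve context for `question`, then ask the model to answer with it."""
    context = get_context(question)
    # The prompt template is an assumption, not a format required by the package.
    prompt = (
        "Answer using the context.\n\n"
        f"Context: {context}\n\n"
        f"Question: {question}"
    )
    return do(prompt)
```

With the package installed and documents stored as above, a call might look like `answer_from_docs("Who created Python?", lm.get_doc_context, lm.do)`.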

Full documentation

Speed

This package currently outperforms Hugging Face transformers for CPU inference thanks to int8 quantization and the CTranslate2 backend. The following table compares CPU inference performance on identical models, each using the best available quantization, on a 20-question test set.

Backend	Inference Time	Memory Used
Hugging Face transformers	22s	1.77GB
This package	11s	0.34GB

Note that quantization does technically harm output quality slightly, but it should be negligible at this level.
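
The relative improvement implied by the table can be worked out directly. The snippet below uses only the four figures from the table; the variable names are illustrative.

```python
# Figures copied from the comparison table above.
hf_seconds, hf_gb = 22, 1.77
pkg_seconds, pkg_gb = 11, 0.34

speedup = hf_seconds / pkg_seconds   # ratio of inference times
memory_ratio = hf_gb / pkg_gb        # ratio of memory used

# Prints: 2.0x faster, 5.2x less memory
print(f"{speedup:.1f}x faster, {memory_ratio:.1f}x less memory")
```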

Models

Sensible default models are provided. The package should improve over time as stronger models become available. The basic models used are 1000x smaller than the largest models in use today. They are useful as learning tools, but perform far below the current state of the art.

Here are the current default models used by the package for a supplied max_ram value:

max_ram	Model Name	Parameters (B)
0.5	LaMini-Flan-T5-248M	0.248
1.0	LaMini-Flan-T5-783M	0.783
2.0	LaMini-Flan-T5-783M	0.783
4.0	flan-alpaca-gpt4-xl	3.0
8.0	openchat-3.5-0106	7.0
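
One way to read this table is as a lookup from a memory budget to the largest listed tier that fits. The sketch below encodes that reading. `default_model_for` is a hypothetical helper, and how the package treats values between or below the listed tiers is an assumption.

```python
# Tiers copied from the table above: (max_ram in GB, model name).
DEFAULT_MODELS = [
    (0.5, "LaMini-Flan-T5-248M"),
    (1.0, "LaMini-Flan-T5-783M"),
    (2.0, "LaMini-Flan-T5-783M"),
    (4.0, "flan-alpaca-gpt4-xl"),
    (8.0, "openchat-3.5-0106"),
]


def default_model_for(max_ram_gb: float) -> str:
    """Pick the model from the highest tier that does not exceed the budget."""
    eligible = [name for tier, name in DEFAULT_MODELS if tier <= max_ram_gb]
    if not eligible:
        raise ValueError("max_ram is below the smallest listed tier (0.5)")
    return eligible[-1]


print(default_model_for(4.0))  # flan-alpaca-gpt4-xl
print(default_model_for(3.0))  # LaMini-Flan-T5-783M
```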

For code completions, the CodeT5+ series of models is used.

Commercial Use

This package itself is licensed for commercial use, but the models used may not be compatible with commercial use. In order to use this package commercially, you can filter models by license type using the require_model_license function.

>>> import languagemodels as lm
>>> lm.config['instruct_model']
'LaMini-Flan-T5-248M-ct2-int8'
>>> lm.require_model_license("apache|bsd|mit")
>>> lm.config['instruct_model']
'flan-t5-base-ct2-int8'

It is recommended to confirm that the models used meet the licensing requirements for your software.
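
The pattern passed to require_model_license above looks like a regular expression with alternatives. The sketch below shows how such a pattern could filter a list of models. It is illustrative only: the catalogue entries are placeholders rather than the package's real model list, and the case-insensitive substring matching is an assumption about how the filter behaves.

```python
import re

# Hypothetical catalogue: names and license strings are placeholders.
CATALOGUE = [
    {"name": "example-model-a", "license": "apache-2.0"},
    {"name": "example-model-b", "license": "cc-by-nc-4.0"},
    {"name": "example-model-c", "license": "mit"},
]


def filter_by_license(models, pattern):
    """Keep models whose license string matches the regular expression."""
    regex = re.compile(pattern, re.IGNORECASE)
    return [model for model in models if regex.search(model["license"])]


allowed = filter_by_license(CATALOGUE, "apache|bsd|mit")
print([model["name"] for model in allowed])
# ['example-model-a', 'example-model-c']
```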

Project Ideas

One of the goals for this package is to be a straightforward tool for learners and educators exploring how large language models intersect with modern software development. It can be used to do the heavy lifting for a number of learning projects:

CLI Chatbot (see examples/chat.py)
Streamlit chatbot (see examples/streamlitchat.py)
Chatbot with information retrieval
Chatbot with access to real-time information
Tool use
Text classification
Extractive question answering
Semantic search over documents
Document question answering

Several example programs and notebooks are included in the examples directory.
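
As a starting point for the first idea in the list, a command-line chatbot can be reduced to a read, respond, print loop. This sketch is not the code in examples/chat.py: `chat` is a hypothetical function, each message is sent to the model on its own with no conversation history, and the `read` and `write` parameters are there so the loop can be tested without a terminal.

```python
def chat(do, read=input, write=print):
    """Minimal chat loop; stops on an empty line or end of input."""
    while True:
        try:
            message = read("> ")
        except EOFError:
            break
        if not message.strip():
            break
        # Each message is answered independently; no history is kept.
        write(do(message))


# Usage, assuming the package is installed:
#   import languagemodels as lm
#   chat(lm.do)
```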

Release history

Version	Release Date	Notes
0.24.0	Feb 14, 2025	This version
0.23.0	Dec 17, 2024	
0.22.0	Nov 2, 2024	
0.21.0	Sep 26, 2024	
0.20.0	Apr 25, 2024	
0.19.0	Apr 18, 2024	
0.18.0	Feb 24, 2024	
0.17.0	Feb 15, 2024	
0.16.0	Feb 5, 2024	
0.15.0	Feb 4, 2024	
0.14.0	Jan 6, 2024	
0.13.0	Jan 5, 2024	
0.12.0	Dec 2, 2023	
0.11.0	Dec 2, 2023	
0.10.0	Oct 29, 2023	
0.9.0	Oct 7, 2023	
0.8.0	Aug 4, 2023	
0.8.0rc1	Aug 4, 2023	pre-release
0.7.0	Jul 27, 2023	
0.6.0	Jun 30, 2023	
0.5.0	Jun 17, 2023	
0.4.0	Jun 17, 2023	
0.3.2	Jun 8, 2023	
0.3.1	Jun 7, 2023	
0.3.0	Jun 7, 2023	
0.2.2	Jun 6, 2023	
0.2.1	Jun 6, 2023	
0.1.2	Jun 5, 2023	
0.1.1	Jun 5, 2023	
0.1.0	Jun 5, 2023	
0.0.4	May 15, 2023	
0.0.3	May 12, 2023	
0.0.2	May 11, 2023	
0.0.1	May 7, 2023	

Download files

Source Distribution

languagemodels-0.24.0.tar.gz (24.2 kB)

Uploaded Feb 14, 2025 Source

Built Distribution

languagemodels-0.24.0-py3-none-any.whl (23.5 kB)

Uploaded Feb 14, 2025 Python 3

File details

Details for the file languagemodels-0.24.0.tar.gz.

File metadata

Download URL: languagemodels-0.24.0.tar.gz
Upload date: Feb 14, 2025
Size: 24.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for languagemodels-0.24.0.tar.gz
Algorithm	Hash digest
SHA256	`e4f474fea7827ae27784155548840c259e0907a308052500a40c82fd50481434`
MD5	`9a8f3585e8a2ccf1e40ad56caddcc785`
BLAKE2b-256	`8cf97a938857a665cd0e479cc89fdf01b68689717e145ca5690a752279981f73`

File details

Details for the file languagemodels-0.24.0-py3-none-any.whl.

File metadata

Download URL: languagemodels-0.24.0-py3-none-any.whl
Upload date: Feb 14, 2025
Size: 23.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for languagemodels-0.24.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2f3bd1ea2e08feb2c53f62de5b34a298f2e48e1aef39f6f368a55965521e557d`
MD5	`1986b17ad497d0978a3c728feb0b12e0`
BLAKE2b-256	`b13580d14430c114fdf0c126e4fa359901c966df370625f1fa57a775a6d43af1`
