Skip to main content

A user toolkit for analyzing and interfacing with Large Language Models (LLMs)

Project description

Lingua-SDK

A user toolkit for analyzing and interfacing with Large Language Models (LLMs)

Overview

lingua-sdk is a Python module used to interact with large language models hosted via the Lingua service (available at https://github.com/VectorInstitute/lingua). It provides a simple interface launch LLMs on an HPC cluster, ask them to perform basic features like text generation, but also retrieve intermediate information from inside the model such as log probabilities and activations. These features are exposed via a few high-level APIs, namely:

  • generate - Returns an LLM text generation based on prompt input
  • module_names - Returns all modules in the LLM neural network
  • model_instances - Returns all active LLMs instantiated and avaiable to use
  • get_activations - Retrieves all activations for a set of modules

Full documentation and API reference are available at http://lingua-sdk.readthedocs.io.

Getting Started

Install

python3 -m pip install pylingua

or install from source:

pip install git+https://github.com/VectorInstitute/lingua-sdk.git

Authentication

In order to submit generation jobs, a designated Vector Institute cluster account is required. Please contact the AI Engineering Team in charge of Lingua for more information.

Sample Workflow

The following workflow shows how to load and interact with an OPT-175B model on the Vector Institute Vaughan cluster.

import lingua
import time
# Establish a client connection to the Lingua service
# If you have not previously authenticated with the service, you will be prompted to now
client = lingua.Client(gateway_host="llm.cluster.local", gateway_port=3001)

# See which models are available to use
print(client.models)

# Get a handle to a model. If this model is not actively running, it will get launched in the background.
# In this example we want to use the OPT-175B model
opt_model = client.load_model("OPT-175B")

# If not the was not actively running, this it could take several minutes to load. Wait for it come online.
while opt_model.state != "ACTIVE":
    time.sleep(1)

# Sample text generation w/ input parameters
text_gen = opt_model.generate("What is the answer to life, the universe, and everything?", {'max_tokens': 5, 'top_k': 4, 'top_p': 3, 'rep_penalty': 1, 'temperature': 0.5})
dir(text_gen) # display methods associated with generated text object
text_gen.generation['text'] # display only text
text_gen.generation['logprobs'] # display logprobs
text_gen.generation['tokens'] # display tokens

# Now let's retrieve some activations from the model
# First, show a list of modules in the neural network
print(opt_model.module_names)

# Setup a request for module acivations for a certain module layer
requested_activations = ['_fsdp_wrapped_module._fpw_module.decoder.layers.0._fsdp_wrapped_module._fpw_module']
activations = opt_model.get_activations("What are activations?", requested_activations)
print(activations)

Documentation

More information can be found on the Lingua documentation site.

Contributing

Contributing to lingua is welcomed. See Contributing for guidelines.

License

MIT

Citation

Reference to cite when you use Lingua in a project or a research paper:

Sivaloganathan, J., Coatsworth, M., Willes, J., Choi, M., & Shen, G. (2022). Lingua. http://VectorInstitute.github.io/lingua. computer software, Vector Institute for Artificial Intelligence. Retrieved from https://github.com/VectorInstitute/lingua-sdk.git.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pylingua-0.3.0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pylingua-0.3.0-py3-none-any.whl (7.6 kB view details)

Uploaded Python 3

File details

Details for the file pylingua-0.3.0.tar.gz.

File metadata

  • Download URL: pylingua-0.3.0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.0

File hashes

Hashes for pylingua-0.3.0.tar.gz
Algorithm Hash digest
SHA256 4ec2dbd85df3f9fc4a65388892fd7d84570d89138c1b88f5ded328be37bb9e92
MD5 e1226ba9d67871b5d992aeb247638d42
BLAKE2b-256 5e8e0854f3f8d6bbe9588b9109bcf833d4a16897ba4212514f43622ba6885ebb

See more details on using hashes here.

File details

Details for the file pylingua-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: pylingua-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 7.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.0

File hashes

Hashes for pylingua-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1058a1ca4fe3853a88282350904dfb3d9f86723ab904db242e49534519f3fe1f
MD5 9b01fe0815fa4413038c3c9e1a660afb
BLAKE2b-256 af404356cb61ad4fb31c740f2bdc6be71ffff4eedc257e8e2f87ea3dc75a278f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page