Skip to main content

LLM Tools for looking at separability of LLM Capabilities

Project description

Separability of Language Models

separability

My basic library for studying LLMs, with support for Multi-GPU inference and editing, as well as quantilised inference (not editing yet). This includes functions for analysing the activations of the models for different inputs, and for pruning different parts of the model based on those activations.

The currently tested list of models is:

  • EleutherAI's Pythia
  • Meta Opt
  • Meta Galactica
  • GPT2
  • BERT

For check out the examples folder to see in more detail how the library can be used.

Pruning based on Capabilities

For a full example, see src/examples/prune_30.py.

The simple example is:

from separability.data_classes import PruningConfig
from separability.parser import cli_parser
from separability.prune import run_pruning

# Configure initial model and tests
c = PruningConfig(
    wandb_project = "testing",
    model_repo   = "facebook/opt-125m",
    token_limit  = 1000,
    run_pre_test = True,

    # Removals parameters
    ff_scoring = "abs"
    ff_frac   = 0.02,
    ff_eps    = 0.001,
    attn_scoring = "abs",
    attn_frac = 0.00,
    attn_eps  = 1e-4,

    # Eval
    focus     = "pile_codeless",
    cripple   = "code",
    additional_datasets=tuple(),
)

# optionally, use parser to get CLI arguments.
# c, args = cli_parser(c)

# Run the iterated pruning
model, history = run_pruning(c)

model.py

This defines a wrapper function that encapsulates the HuggingFace implementation of Meta OPT. To get the model, simply run:

from separability import Model

m = Model("facebook/opt-125m", limit=1000)

Where you can provide any of the model sizes that are pre-trained for OPT, and the token limit must be smaller than the max token length that the model is able to handle.

Next, you can run the model to do 2 tokens of predictions, by, for example, running:

text = 'Hello, my name is'
inpt, output = opt.predict( text, num=2 )

We can look at the residual stream of how the output changes over time.

residual_stream = opt.get_residual_stream( text )

This will return a tensor of size 2 + 2*n_layers. i.e:

  • the input (w/ positional encoding)
  • n attention layer outputs
  • n feed forward layer outputs
  • the final output

If we want just the output of the attention / feed forward layers, we can instead look at the activations:

inpt, attn_out, ff_out, output = opt.get_text_activations( text )

or alternatively:

inpt, attn_out, ff_out, output = opt.get_text_activations( residual_stream=residual_stream )

To get the activations for the input text at all of the MLP mid layers, we can look at: opt.get_ff_key_activations( text ) or opt.get_ff_key_activations( residual_stream=residual_stream ).

texts.py

Has some basic tools for loading the text datasets I am using:

  • 'pile', ( EleutherAI's 'The Pile' dataset)
  • 'pile-codeless' (Pile without GitHub)
  • 'code' (CodeParrot's 'github-code' dataset)
  • 'python', (Subset of only python)
  • 'wiki', (WikiText)
  • 'civil', (Civil comments with toxicity < 0.2)
  • 'toxic', (Civil comments with toxicity > 0.8)
  • ...

activations.py

Has code specific to the datasets I am using to analyze and attempt to remove capabilities from the models.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

separability-0.9.2.tar.gz (57.3 kB view details)

Uploaded Source

Built Distribution

separability-0.9.2-py3-none-any.whl (60.0 kB view details)

Uploaded Python 3

File details

Details for the file separability-0.9.2.tar.gz.

File metadata

  • Download URL: separability-0.9.2.tar.gz
  • Upload date:
  • Size: 57.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.11.2 Darwin/22.1.0

File hashes

Hashes for separability-0.9.2.tar.gz
Algorithm Hash digest
SHA256 64f6e57e6da9c0c16dfeca0517d081dc3201b70224533f2b38a0b91e94dccbcc
MD5 2cadd6e55b6783102027ade40092bfb5
BLAKE2b-256 8cff4fe6a248bd655f8c01ca3a4b93e7fecfef91c16a7e267f01249bd3219671

See more details on using hashes here.

File details

Details for the file separability-0.9.2-py3-none-any.whl.

File metadata

  • Download URL: separability-0.9.2-py3-none-any.whl
  • Upload date:
  • Size: 60.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.11.2 Darwin/22.1.0

File hashes

Hashes for separability-0.9.2-py3-none-any.whl
Algorithm Hash digest
SHA256 da7410af64a93216d11887e1e7ab54c4b795c4564d16ed31d87f7e1f1c8386f2
MD5 5b12eaf4045c98ba328ab428e0ba8f23
BLAKE2b-256 22dec55b7de4b93b76f34da91fbbcf36533a1a03755d8230757ac34927a33828

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page