Skip to main content

Experiments regarding LLM components

Project description

PAL: Predictive Analysis & Laws for Neural Networks

Dismantling large language models parts to understand them better, with the hope to build better models.

Requirements

The code back-end is made with:

numpy
torch

The visualization are made with:

matplotlib
networkx

Research papers

  • Vivien Cabannes, Charles Arnal, Wassim Bouaziz, Alice Yang, Francois Charton, Julia Kempe. Iteration Head: A Mechanistic Study of Chain-of-Thought, 2024. The codebase is in the folder projects/cot.

  • Vivien Cabannes, Elvis Dohmatob, Alberto Bietti. Scaling laws for associative memories, in International Conference on Learning Representations (ICLR), 2024. The codebase is in the folder projects/scaling_laws.

  • Vivien Cabannes, Berfin Simsek, Alberto Bietti. Learning Associative Memories with Gradient Descent in International Conference on Machine Learning (ICML), 2024. The codebase is in the folder projects/gradient_descent.

  • In preparation. Codebase in pruning. Show that learning appear by pruning circuits.

  • In preparation. Codebase in factorization. Empirical study of memorization capacity of MLPs and their abilities to leverage hidden factorization.

Organization

The main resuable code is in the src folder. The code for our different research streams is in the projects folder. Other folders may include:

  • data: contains data used in the experiments.
  • models: saves models' weights.
  • launchers: contains bash scripts to launch experiments.
  • notebooks: used for exploration and visualization.
  • scripts: contains python scripts to run experiments.
  • tests: contains tests for the code.
  • tutorial: contains tutorial notebooks to get started with LLMs' training.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metapal-0.0.0.tar.gz (23.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

metapal-0.0.0-py3-none-any.whl (28.0 kB view details)

Uploaded Python 3

File details

Details for the file metapal-0.0.0.tar.gz.

File metadata

  • Download URL: metapal-0.0.0.tar.gz
  • Upload date:
  • Size: 23.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for metapal-0.0.0.tar.gz
Algorithm Hash digest
SHA256 c5763d5203de72969a710b630d715c8a265d14aed73ed60d14be884bec63b8d4
MD5 ebd88d258f7041676989b0b0e0314320
BLAKE2b-256 08717ca2daa7e04efe61d073b4081ca7bfeced139ec637807bb534ce10a66cc1

See more details on using hashes here.

File details

Details for the file metapal-0.0.0-py3-none-any.whl.

File metadata

  • Download URL: metapal-0.0.0-py3-none-any.whl
  • Upload date:
  • Size: 28.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for metapal-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f7ccfeaddaa58b0cf4e1e044b1c130eeb3224e4465e1454701ff4f9315dac0ea
MD5 57712e9d16a6fc12907ba8d543ebd27b
BLAKE2b-256 46e63f317c932bd2740e18da34ed686790ca622314b48fbc6a4f0099dc2fce08

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page