Experiments regarding LLM components
Project description
PAL: Predictive Analysis & Laws for Neural Networks
Dismantling large language models parts to understand them better, with the hope to build better models.
Requirements
The code back-end is made with:
numpy
torch
The visualization are made with:
matplotlib
networkx
Research papers
-
Vivien Cabannes, Charles Arnal, Wassim Bouaziz, Alice Yang, Francois Charton, Julia Kempe. Iteration Head: A Mechanistic Study of Chain-of-Thought, 2024. The codebase is in the folder
projects/cot. -
Vivien Cabannes, Elvis Dohmatob, Alberto Bietti. Scaling laws for associative memories, in International Conference on Learning Representations (ICLR), 2024. The codebase is in the folder
projects/scaling_laws. -
Vivien Cabannes, Berfin Simsek, Alberto Bietti. Learning Associative Memories with Gradient Descent in International Conference on Machine Learning (ICML), 2024. The codebase is in the folder
projects/gradient_descent. -
In preparation. Codebase in
pruning. Show that learning appear by pruning circuits. -
In preparation. Codebase in
factorization. Empirical study of memorization capacity of MLPs and their abilities to leverage hidden factorization.
Organization
The main resuable code is in the src folder.
The code for our different research streams is in the projects folder.
Other folders may include:
data: contains data used in the experiments.models: saves models' weights.launchers: contains bash scripts to launch experiments.notebooks: used for exploration and visualization.scripts: contains python scripts to run experiments.tests: contains tests for the code.tutorial: contains tutorial notebooks to get started with LLMs' training.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file metapal-0.0.0.tar.gz.
File metadata
- Download URL: metapal-0.0.0.tar.gz
- Upload date:
- Size: 23.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c5763d5203de72969a710b630d715c8a265d14aed73ed60d14be884bec63b8d4
|
|
| MD5 |
ebd88d258f7041676989b0b0e0314320
|
|
| BLAKE2b-256 |
08717ca2daa7e04efe61d073b4081ca7bfeced139ec637807bb534ce10a66cc1
|
File details
Details for the file metapal-0.0.0-py3-none-any.whl.
File metadata
- Download URL: metapal-0.0.0-py3-none-any.whl
- Upload date:
- Size: 28.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f7ccfeaddaa58b0cf4e1e044b1c130eeb3224e4465e1454701ff4f9315dac0ea
|
|
| MD5 |
57712e9d16a6fc12907ba8d543ebd27b
|
|
| BLAKE2b-256 |
46e63f317c932bd2740e18da34ed686790ca622314b48fbc6a4f0099dc2fce08
|