(Oral @ ICML 2025) CollabLLM: From Passive Responders to Active Collaborators
Project description
CollabLLM: From Passive Responders to Active Collaborators
📢 Oral @ ICML 2025 (1% out of all submissions)
Overview
CollabLLM transforms traditional language models from passive responders to active collaborators in multi-turn conversations. This repository provides the complete framework for computing multiturn-aware rewards and training collaborative language models.
Installation
To get started, create a new environment and install collabllm via pip:
conda create -n collabllm python=3.10
conda activate collabllm
pip install collabllm
Optional: You may install additional packages (e.g., bigcodebench) for task-specific metrics or evaluation.
Quick Start
- Lightweight usage: Compute Multiturn-aware Rewards (MRs) for any model responses and construct datasets following
notebook_tutorials/. - Synthetic data generation: Generating high-quality synthetic conversational data following
scripts/engine/build_dataset.py. - Train CollabLLM: Conduct SFT/DPO/PPO models training to maximize MRs following examples under
scripts/train/*.py.
Add Your Own Task
To apply CollabLLM to a new task:
-
Add a Dataset:
Place your single-turn dataset inexamples/single_turn_ds/and register it in__init__.py. -
(Optional) Add Metrics:
Add new metrics toexamples/metrics/and register them in__init__.py.
You can now run data generation, reward computation, and model training using your customized setup.
Citation
If you find our work useful in your research, please cite the following:
@inproceedings{collabllm2025,
title={CollabLLM: From Passive Responders to Active Collaborators},
author={Shirley Wu and Michel Galley and Baolin Peng and Hao Cheng and
Gavin Li and Yao Dou and Weixin Cai and James Zou and
Jure Leskovec and Jianfeng Gao},
booktitle={International Conference on Machine Learning (ICML)},
year={2025}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file collabllm-0.0.6.tar.gz.
File metadata
- Download URL: collabllm-0.0.6.tar.gz
- Upload date:
- Size: 29.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
95dd5ad432f072620f8b393cef8c9daaa027e3d2d0bcc9b5771ccfa772f49bb1
|
|
| MD5 |
7224b4a8204a4ceea918b2cd23e6bd6b
|
|
| BLAKE2b-256 |
4dfc6489033904ff0852da8b6046e0af7a93a6e292099695a2fcec2b71f1efd9
|
File details
Details for the file collabllm-0.0.6-py3-none-any.whl.
File metadata
- Download URL: collabllm-0.0.6-py3-none-any.whl
- Upload date:
- Size: 34.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
da8b193eb2a51d1a478a8af6ae7bbfc005a3d697804f1fc1f3827ba1174fff4f
|
|
| MD5 |
ac089d3f8944c7bb33c357f8964bd2e7
|
|
| BLAKE2b-256 |
6e22d582ea43170de38071fc19f8f121e0b3383da0886df572be54e948361f3e
|