Efficiently fine-tune large neural networks by intelligent active data selection
Project description
Active Fine-Tuning
A library for automatic data selection in active fine-tuning of large neural networks.
Please cite our work if you use this library in your research (bibtex below):
- Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
- Transductive Active Learning: Theory and Applications (Section 4)
Installation
pip install activeft
Usage Example
from activeft.sift import Retriever
# Load embeddings
embeddings = np.random.rand(1000, 512)
query_embeddings = np.random.rand(1, 512)
index = faiss.IndexFlatIP(d)
index.add(embeddings)
retriever = Retriever(index)
indices = retriever.search(query_embeddings, N=10)
Development
CI checks
- The code is auto-formatted using
black .. - Static type checks can be run using
pyright. - Tests can be run using
pytest test.
Documentation
To start a local server hosting the documentation run pdoc ./activeft --math.
Publishing
- update version number in
pyproject.tomlandactiveft/__init__.py - build:
poetry build - publish:
poetry publish - push version update to GitHub
- create new release on GitHub
Citation
@article{hubotter2024efficiently,
title = {Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs},
author = {H{\"u}botter, Jonas and Bongni, Sascha and Hakimi, Ido and Krause, Andreas},
year = 2024,
journal = {arXiv Preprint}
}
@inproceedings{hubotter2024transductive,
title = {Transductive Active Learning: Theory and Applications},
author = {H{\"u}botter, Jonas and Sukhija, Bhavya and Treven, Lenart and As, Yarden and Krause, Andreas},
year = 2024,
booktitle = {Advances in Neural Information Processing Systems}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file activeft-0.1.1.tar.gz.
File metadata
- Download URL: activeft-0.1.1.tar.gz
- Upload date:
- Size: 33.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.6 Darwin/24.0.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
49cd79e0c64eebb983516c3ff37d11e05de476d3a38ed84ffe18960753bbe58d
|
|
| MD5 |
1033d6681cc3e318e33e8d989974710e
|
|
| BLAKE2b-256 |
9b5d2d4491118d328e18dcabb6b102506ad9f343c7e25d984d1788f1035ea2d2
|
File details
Details for the file activeft-0.1.1-py3-none-any.whl.
File metadata
- Download URL: activeft-0.1.1-py3-none-any.whl
- Upload date:
- Size: 52.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.6 Darwin/24.0.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
708d8fbb12b3fd6140e9f57ba79ee121e74f3f79780245c81acb01ad1d461748
|
|
| MD5 |
1d797edc409722c32d52d78209ad6e98
|
|
| BLAKE2b-256 |
5bade6c280958e10027a4aafc90c202982e6bdb4e2502dd3c3eec2a0289c0f49
|