Efficiently fine-tune large neural networks by intelligent active data selection
Project description
Active Fine-Tuning
A library for automatic data selection in active fine-tuning of large neural networks.
Please cite our work if you use this library in your research (bibtex below):
- Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
- Transductive Active Learning: Theory and Applications (Section 4)
Installation
pip install activeft
Usage Example
from activeft.sift import Retriever
# Load embeddings
embeddings = np.random.rand(1000, 512)
query_embeddings = np.random.rand(1, 512)
index = faiss.IndexFlatIP(d)
index.add(embeddings)
retriever = Retriever(index)
indices = retriever.search(query_embeddings, N=10)
Development
CI checks
- The code is auto-formatted using
black .
. - Static type checks can be run using
pyright
. - Tests can be run using
pytest test
.
Documentation
To start a local server hosting the documentation run pdoc ./activeft --math
.
Publishing
- update version number in
pyproject.toml
andactiveft/__init__.py
- build:
poetry build
- publish:
poetry publish
- push version update to GitHub
- create new release on GitHub
Citation
@article{hubotter2024efficiently,
title = {Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs},
author = {H{\"u}botter, Jonas and Bongni, Sascha and Hakimi, Ido and Krause, Andreas},
year = 2024,
journal = {arXiv Preprint}
}
@inproceedings{hubotter2024transductive,
title = {Transductive Active Learning: Theory and Applications},
author = {H{\"u}botter, Jonas and Sukhija, Bhavya and Treven, Lenart and As, Yarden and Krause, Andreas},
year = 2024,
booktitle = {Advances in Neural Information Processing Systems}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
activeft-0.1.1.tar.gz
(33.7 kB
view details)
Built Distribution
activeft-0.1.1-py3-none-any.whl
(52.7 kB
view details)
File details
Details for the file activeft-0.1.1.tar.gz
.
File metadata
- Download URL: activeft-0.1.1.tar.gz
- Upload date:
- Size: 33.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.6 Darwin/24.0.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 49cd79e0c64eebb983516c3ff37d11e05de476d3a38ed84ffe18960753bbe58d |
|
MD5 | 1033d6681cc3e318e33e8d989974710e |
|
BLAKE2b-256 | 9b5d2d4491118d328e18dcabb6b102506ad9f343c7e25d984d1788f1035ea2d2 |
File details
Details for the file activeft-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: activeft-0.1.1-py3-none-any.whl
- Upload date:
- Size: 52.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.6 Darwin/24.0.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 708d8fbb12b3fd6140e9f57ba79ee121e74f3f79780245c81acb01ad1d461748 |
|
MD5 | 1d797edc409722c32d52d78209ad6e98 |
|
BLAKE2b-256 | 5bade6c280958e10027a4aafc90c202982e6bdb4e2502dd3c3eec2a0289c0f49 |