Skip to main content

Efficiently fine-tune large neural networks by intelligent active data selection

Project description

Active Fine-Tuning

A library for automatic data selection in active fine-tuning of large neural networks.

Website | Documentation

Please cite our work if you use this library in your research (bibtex below):

Installation

pip install activeft

Usage Example

from activeft.sift import Retriever

# Load embeddings
embeddings = np.random.rand(1000, 512)
query_embeddings = np.random.rand(1, 512)

index = faiss.IndexFlatIP(d)
index.add(embeddings)
retriever = Retriever(index)
indices = retriever.search(query_embeddings, N=10)

Development

CI checks

  • The code is auto-formatted using black ..
  • Static type checks can be run using pyright.
  • Tests can be run using pytest test.

Documentation

To start a local server hosting the documentation run pdoc ./activeft --math.

Publishing

  1. update version number in pyproject.toml and activeft/__init__.py
  2. build: poetry build
  3. publish: poetry publish
  4. push version update to GitHub
  5. create new release on GitHub

Citation

@article{hubotter2024efficiently,
	title        = {Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs},
	author       = {H{\"u}botter, Jonas and Bongni, Sascha and Hakimi, Ido and Krause, Andreas},
	year         = 2024,
	journal      = {arXiv Preprint}
}

@inproceedings{hubotter2024transductive,
	title        = {Transductive Active Learning: Theory and Applications},
	author       = {H{\"u}botter, Jonas and Sukhija, Bhavya and Treven, Lenart and As, Yarden and Krause, Andreas},
	year         = 2024,
	booktitle    = {Advances in Neural Information Processing Systems}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

activeft-0.1.1.tar.gz (33.7 kB view hashes)

Uploaded Source

Built Distribution

activeft-0.1.1-py3-none-any.whl (52.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page