Skip to main content

Efficiently fine-tune large neural networks by intelligent active data selection

Project description

Active Fine-Tuning

A library for automatic data selection in active fine-tuning of large neural networks.

Website | Documentation

Please cite our work if you use this library in your research (bibtex below):

Installation

pip install activeft

Usage Example

from activeft.sift import Retriever

# Load embeddings
embeddings = np.random.rand(1000, 512)
query_embeddings = np.random.rand(1, 512)

index = faiss.IndexFlatIP(d)
index.add(embeddings)
retriever = Retriever(index)
indices = retriever.search(query_embeddings, N=10)

Development

CI checks

  • The code is auto-formatted using black ..
  • Static type checks can be run using pyright.
  • Tests can be run using pytest test.

Documentation

To start a local server hosting the documentation run pdoc ./activeft --math.

Publishing

  1. update version number in pyproject.toml and activeft/__init__.py
  2. build: poetry build
  3. publish: poetry publish
  4. push version update to GitHub
  5. create new release on GitHub

Citation

@article{hubotter2024efficiently,
	title        = {Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs},
	author       = {H{\"u}botter, Jonas and Bongni, Sascha and Hakimi, Ido and Krause, Andreas},
	year         = 2024,
	journal      = {arXiv Preprint}
}

@inproceedings{hubotter2024transductive,
	title        = {Transductive Active Learning: Theory and Applications},
	author       = {H{\"u}botter, Jonas and Sukhija, Bhavya and Treven, Lenart and As, Yarden and Krause, Andreas},
	year         = 2024,
	booktitle    = {Advances in Neural Information Processing Systems}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

activeft-0.1.1.tar.gz (33.7 kB view details)

Uploaded Source

Built Distribution

activeft-0.1.1-py3-none-any.whl (52.7 kB view details)

Uploaded Python 3

File details

Details for the file activeft-0.1.1.tar.gz.

File metadata

  • Download URL: activeft-0.1.1.tar.gz
  • Upload date:
  • Size: 33.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.11.6 Darwin/24.0.0

File hashes

Hashes for activeft-0.1.1.tar.gz
Algorithm Hash digest
SHA256 49cd79e0c64eebb983516c3ff37d11e05de476d3a38ed84ffe18960753bbe58d
MD5 1033d6681cc3e318e33e8d989974710e
BLAKE2b-256 9b5d2d4491118d328e18dcabb6b102506ad9f343c7e25d984d1788f1035ea2d2

See more details on using hashes here.

File details

Details for the file activeft-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: activeft-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 52.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.11.6 Darwin/24.0.0

File hashes

Hashes for activeft-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 708d8fbb12b3fd6140e9f57ba79ee121e74f3f79780245c81acb01ad1d461748
MD5 1d797edc409722c32d52d78209ad6e98
BLAKE2b-256 5bade6c280958e10027a4aafc90c202982e6bdb4e2502dd3c3eec2a0289c0f49

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page