Skip to main content

A vision transformer for training on MNIST

Project description

Python package mnistvit

A PyTorch-only implementation of a vision transformer (ViT) for training on MNIST, achieving 99.65% test accuracy with default parameters and without pre-training. The ViT architecture and learning parameters can be configured easily. Code for hyperparameter optimization is provided as well.

The code is intended to be used for learning about vision transformers. With MNIST as a simple and well understood dataset, the importance of various hyperparameters can be explored.

Requirements

The package requires Python 3.10 or greater and additionally requires the torch and torchvision packages. For hyperparameter optimization, additionally ray[tune] and optuna are required. The ViT itself requires torch only.

Installation

To install the mnistvit package, run the following command in the parent directory of the repository:

pip install mnistvit

To install the package with hyperparameter optimization support:

pip install mnistvit[tune]

Usage

To train a model with default parameters:

python -m mnistvit.train

The script will produce a file config.json with the model configuration and file model.pt containing the trained model. Use the -h argument for a list of options.

To evaluate the test set accuracy of the model stored in model.pt with the configuration in config.json:

python -m mnistvit.predict --use-accuracy

To predict the class of the digit stored in the file sample.jpg:

python -m mnistvit.predict --image-file sample.jpg

For hyperparameter optimization with default search parameters:

python -m mnistvit.tune

A trained model is available on Hugging Face.

License

mnistvit is released under the GPLv3 license, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mnistvit-1.4.2.tar.gz (25.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mnistvit-1.4.2-py3-none-any.whl (27.4 kB view details)

Uploaded Python 3

File details

Details for the file mnistvit-1.4.2.tar.gz.

File metadata

  • Download URL: mnistvit-1.4.2.tar.gz
  • Upload date:
  • Size: 25.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for mnistvit-1.4.2.tar.gz
Algorithm Hash digest
SHA256 3d2bb81c53fa29950a234b6d373361fc7f4ac5723e9183836fad99a67ab8a446
MD5 bcf062fcc24d900deaf8b3696e358ce1
BLAKE2b-256 f26be50abbd3dab132549207bd6278ef955e8a03bcd07b0815aa663ce379c114

See more details on using hashes here.

File details

Details for the file mnistvit-1.4.2-py3-none-any.whl.

File metadata

  • Download URL: mnistvit-1.4.2-py3-none-any.whl
  • Upload date:
  • Size: 27.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for mnistvit-1.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b231b2413af7fb33c4551132ce78e8ee8cafaf5b537c55f666c97fd7d860ec90
MD5 8480afb565ffad65b7d103695b9d8ad0
BLAKE2b-256 e5a045ea5f4d451e66d6a5275aa0fad06ff60518fddd9022152969d584942bb3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page