Skip to main content

A vision transformer for training on MNIST

Project description

Python package mnistvit

A PyTorch-only implementation of a vision transformer (ViT) for training on MNIST, achieving 99.65% test accuracy with default parameters and without pre-training. The ViT architecture and learning parameters can be configured easily. Code for hyperparameter optimization is provided as well.

The code is intended to be used for learning about vision transformers. With MNIST as a simple and well understood dataset, the importance of various hyperparameters can be explored.

Requirements

The package requires Python 3.10 or greater and additionally requires the torch and torchvision packages. For hyperparameter optimization, additionally ray[tune] and optuna are required. The ViT itself requires torch only.

Installation

To install the mnistvit package, run the following command in the parent directory of the repository:

pip install mnistvit

To install the package with hyperparameter optimization support:

pip install mnistvit[tune]

Usage

To train a model with default parameters:

python -m mnistvit.train

The script will produce a file config.json with the model configuration and file model.pt containing the trained model. Use the -h argument for a list of options.

To evaluate the test set accuracy of the model stored in model.pt with the configuration in config.json:

python -m mnistvit.predict --use-accuracy

To predict the class of the digit stored in the file sample.jpg:

python -m mnistvit.predict --image-file sample.jpg

For hyperparameter optimization with default search parameters:

python -m mnistvit.tune

License

mnistvit is released under the GPLv3 license, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mnistvit-1.4.0.tar.gz (25.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mnistvit-1.4.0-py3-none-any.whl (27.3 kB view details)

Uploaded Python 3

File details

Details for the file mnistvit-1.4.0.tar.gz.

File metadata

  • Download URL: mnistvit-1.4.0.tar.gz
  • Upload date:
  • Size: 25.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for mnistvit-1.4.0.tar.gz
Algorithm Hash digest
SHA256 042e5abf6d992a09cf544ffe5e7c45a6d5d3ea474f709432eee59b92eb2ea2dd
MD5 c6a5a10ea545d5168c9b1582ea51f7e3
BLAKE2b-256 657b3b01f741addc642dd404ed8331cc70be0b9ff8e51e94b702c34a0bce47d1

See more details on using hashes here.

File details

Details for the file mnistvit-1.4.0-py3-none-any.whl.

File metadata

  • Download URL: mnistvit-1.4.0-py3-none-any.whl
  • Upload date:
  • Size: 27.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for mnistvit-1.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dfe1cc0cb7aaa98caeec38f965b632ffebd4c212a7c076a85134aef9ee1425f4
MD5 a62b75ee30cd4e1d42b36d53a860a769
BLAKE2b-256 5322ddad6b54ef2b5bdf32f78ef879966cb80363f3fb7b3292759190f97741ae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page