General compute framework for Tenstorrent devices

These details have been verified by PyPI

Project links

Owner

Tenstorrent

GitHub Statistics

Maintainers

blozano-tt

These details have not been verified by PyPI

Project links

Project description

Install | Buy Hardware | Bounty $ | Join Us | Discord

TT-NN is a Python & C++ Neural Network OP library.

Latest Releases

Release	Release Date
0.62.0	ETA Aug 13, 2025
0.61.0	Skipped
0.60.1	Jul 22, 2025
0.59.0	Jun 18, 2025
0.58.0	May 13, 2025
0.57.0	Apr 15, 2025
0.56.0	Mar 7, 2025

LLMs

Model	Batch	Hardware	ttft (ms)	t/s/u	Target t/s/u	t/s	TT-Metalium Release	vLLM Tenstorrent Repo Release
Qwen 3 32B (TP=8)	32	QuietBox (Wormhole)	109	22.1	30	707.2	v0.59.0-rc52	f028da1
QwQ 32B (TP=8)	32	QuietBox (Wormhole)	133	25.2	30	806.4	v0.56.0-rc51	e2e0002
DeepSeek R1 Distill Llama 3.3 70B (TP=8)	32	QuietBox (Wormhole)	159	15.9	20	508.8	v0.59.0-rc53	f028da1
Llama 3.1 70B (TP=32)	32	Galaxy	68	66.7	80	2134.4	v0.60.0-rc20	5cbc982
Llama 3.1 70B (TP=8)	32	QuietBox (Wormhole)	159	15.9	20	508.8	v0.59.0-rc53	f028da1
Llama 3.1 70B (TP=4)	32	QuietBox (Blackhole)	195*	14.9*		476.5*	v0.59.0-rc53	f028da1
Llama 3.2 11B Vision (TP=2)	16	n300	2550	15.8	17	252.8	v0.56.0-rc6	e2e0002
Qwen 2.5 7B (TP=2)	32	n300	126	32.5	38	1040.0	v0.56.0-rc33	e2e0002
Qwen 2.5 72B (TP=8)	32	QuietBox (Wormhole)	319	14.6	20	467.2	v0.59.0-rc52	f028da1
Falcon 7B	32	n150	70	18.5	26	592.0	v0.60.0-rc20
Falcon 7B (DP=8)	256	QuietBox (Wormhole)	87	15.9	26	4070.4	v0.60.0-rc20
Falcon 7B (DP=32)	1024	Galaxy	121	13.2	26	13516.8	v0.60.0-rc20
Falcon 40B (TP=8)	32	QuietBox (Wormhole)		11.9	36	380.8	v0.59.0-rc38
Llama 3.1 8B	32	p100	87*	26.5*		848.0*	v0.59.0-rc3	739dcaa
Llama 3.1 8B	32	p150	69*	29.1*		931.2*	v0.59.0-rc3	739dcaa
Llama 3.1 8B (DP=2)	64	2 x p150	64*	18.6*		1190.4*	v0.59.0-rc3	739dcaa
Llama 3.1 8B	32	n150	104	24.8	23	793.6	v0.59.0-rc52	f028da1
Llama 3.2 1B	32	n150	23	72.6	160	2323.2	v0.59.0-rc52	f028da1
Llama 3.2 3B	32	n150	53	43.5	60	1392.0	v0.59.0-rc52	f028da1
Mamba 2.8B	32	n150	35	14.1	41	451.2	v0.59.0-rc38
Mistral 7B	32	n150	101	28.3	23	905.6	v0.59.0-rc52	f028da1
Mixtral 8x7B (TP=8)	32	QuietBox (Wormhole)	207	16.6	33	531.2	v0.59.0-rc53

Last Update: July 21, 2025

Notes:

ttft = time to first token | t/s/u = tokens/second/user | t/s = tokens/second; where t/s = t/s/u * batch.

TP = Tensor Parallel, DP = Data Parallel; Defines parallelization factors across multiple devices.

The reported LLM performance is for an input sequence length (number of rows filled in the KV cache) of 128 for all models except Mamba (which can accept any sequence length).

The t/s/u reported is the throughput of the first token generated after prefill, i.e. 1 / inter token latency.

Performance numbers were collected using the tt-metal model demos (accessible via the model links). If running with a vLLM inference server, performance may be different.

* Blackhole software optimization is under active development. Please join us in shaping the future of open source AI!
[Discord] [Developer Hub]

For more information regarding vLLM installation and environment creation visit the Tenstorrent vLLM repository.

Speech-to-Text

Model	Batch	Hardware	ttft (ms)	t/s/u	Target t/s/u	t/s	TT-Metalium Release
Whisper (distil-large-v3)	1	n150	232	58.1	45	58.1	v0.59.0-rc52

Diffusion Models

Model	Batch	Hardware	Sec/Image	Target Sec/Image	Release
Stable Diffusion 1.4 (512x512)	1	n150	6.25	3
Stable Diffusion 3.5 Medium (512x512)	1	n150	16	10

Notes:

Stable Diffusion sec/image is based on the time elapsed from submitting the input prompt to receiving the image from the VAE decoder.

CNNs and Vision Transformers

Classification models

Model	Batch	Hardware	Image/sec	Target Image/sec	Release
ResNet-50 (224x224)	16	n150	4,700	7,000	v0.59.0
ResNet-50 (224x224) (DP=2)	32	n300	9,200	14,000	v0.59.0
ResNet-50 (224x224) (DP=8)	128	QuietBox (Wormhole)	35,800	56,000	v0.59.0
ResNet-50 (224x224) (DP=32)	512	Galaxy	96,800	224,000	v0.59.0
ViT-base (224x224)	8	n150	1,370	1,600	v0.60.0-rc4
ViT-base (224x224) (DP=2)	16	n300	1,900	3,200	v0.60.0-rc4
ViT-base (224x224) (DP=8)	64	QuietBox (Wormhole)	7,700	12,800	v0.60.0-rc4
MobileNet-v2 (224x224)	10	n150	2,808	3,500

Object Detection

Model	Batch	Hardware	Frame/sec (FPS)	Target FPS
YOLOv4 (320x320)	1	n150	120	320
YOLOv4 (640x640)	1	n150	50	180
YOLOv8x (640x640)	1	n150	45	100
YOLOv8s (640x640)	1	n150	175	320
YOLOv8s_world (640x640)	1	n150	57	200
YOLOv9c (640x640)	1	n150	55	320
YOLOv10x (640x640)	1	n150	26	200

Segmentation

Model	Batch	Hardware	Frame/sec (FPS)	Target FPS
UNet - VGG19 (256x256)	1	n150	77	150
SegFormer Semantic Segmentation (512x512)	1	n150	84	300
YOLOv9c (640x640)	1	n150	40	240
UFLD - v2 (320x800)	1	n150	255	2000

NLPs

Model	Batch	Hardware	Sentence/sec	Target sentence/sec
BERT-Large	8	n150	270	400
Sentence-Bert (backbone: bert-base)	8	n150	403	550
Sentence-Bert (backbone: bert-base)	64	QuietBox	2961	4400

Model Updates

For the latest model updates and features, please see MODEL_UPDATES.md

Model Bring-Up and Testing

For information on initial model procedures, please see Model Bring-Up and Testing

TT-NN Tech Reports

Advanced Performance Optimizations for Models (updated March 4th, 2025)
Programming Mesh of Devices (updated Sept 9th, 2024)
ViT Implementation in TT-NN on GS (updated Sept 22nd, 2024)
LLMs Bring up in TT-NN (updated Oct 29th, 2024)
YOLOv4 Implementation in TT-NN on WH (updated November 8th, 2024)
CNN Bring up & Optimization in TT-NN (updated Jan 22nd, 2025)

Benchmarks

Matrix Multiply FLOPS on Wormhole and Blackhole (updated June 17th, 2025)

TT-Metalium is our low-level programming model, enabling kernel development for Tenstorrent hardware.

Programming Guide | API Reference

Getting started

Get started with simple kernels.

TT-Metalium Tech Reports

Matrix Engine (updated Sept 6th, 2024)
Data Formats (updated Sept 7th, 2024)
Reconfiguring Data Formats (updated Oct 17th, 2024)
Handling special floating-point numbers (updated Oct 5th, 2024)
Allocator (Updated Dec 19th, 2024)
Tensor Layouts (updated Sept 6th, 2024)
Saturating DRAM Bandwidth (updated Sept 6th, 2024)
Flash Attention on Wormhole (updated Sept 6th, 2024)
CNNs on TT Architectures (updated Sept 6th, 2024)
Ethernet and Multichip Basics (Updated Sept 20th, 2024)
Collective Communication Library (CCL) (Updated Sept 20th, 2024)
Blackhole Bring-Up Programming Guide (Updated Dec 18th, 2024)
Sub-Devices (Updated Jan 7th, 2025)

TT-Metalium Programming Examples

Hello World

Add Integers

Simple Tensor Manipulation

DRAM Data Movement

Dram Loopback Data Movement

Eltwise

Matmul

Tools and Instruments

A comprehensive tool for visualizing and analyzing model execution, offering interactive graphs, memory plots, tensor details, buffer overviews, operation flow graphs, and multi-instance support with file or SSH-based report loading. Install via pip or build from source:

pip install ttnn-visualizer

Tenstorrent Bounty Program Terms and Conditions

This repo is a part of Tenstorrent’s bounty program. If you are interested in helping to improve tt-metal, please make sure to read the Tenstorrent Bounty Program Terms and Conditions before heading to the issues tab. Look for the issues that are tagged with both “bounty” and difficulty level!

License

TT-Metalium and TTNN are licensed under the Apache 2.0 License, as detailed in LICENSE and LICENSE_understanding.txt.

Some distributable forms of this project—such as manylinux-compliant wheels—may need to bundle additional libraries beyond the standard Linux system libraries. For example:

libnuma
libhwloc
openmpi (when built with multihost support)
libevent (when built with multihost support)

These libraries are bound by their own license terms.

Project details

These details have been verified by PyPI

Project links

Owner

Tenstorrent

GitHub Statistics

Maintainers

blozano-tt

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.70.1

May 15, 2026

0.70.1rc1 pre-release

May 15, 2026

0.70.0rc6 pre-release

May 14, 2026

0.69.0

May 5, 2026

0.69.0rc8 pre-release

May 4, 2026

0.68.0

Apr 13, 2026

0.68.0rc5 pre-release

Apr 10, 2026

0.67.4

Mar 29, 2026

0.67.0

Mar 24, 2026

0.67.0rc14 pre-release

Mar 21, 2026

0.66.0

Feb 16, 2026

0.66.0rc15 pre-release

Feb 15, 2026

0.65.1

Jan 10, 2026

0.65.1rc14 pre-release

Jan 9, 2026

0.65.1rc12 pre-release

Jan 1, 2026

0.65.1rc5 pre-release

Dec 19, 2025

0.65.0

Dec 15, 2025

0.65.0rc14 pre-release

Dec 15, 2025

0.64.5

Nov 29, 2025

0.64.5rc5 pre-release

Nov 29, 2025

0.64.4

Nov 21, 2025

0.64.4rc1 pre-release

Nov 21, 2025

0.64.3

Nov 13, 2025

0.64.3rc1 pre-release

Nov 13, 2025

0.64.1

Nov 13, 2025

0.64.1rc4 pre-release

Nov 13, 2025

0.64.1rc3 pre-release

Nov 13, 2025

0.64.0

Oct 29, 2025

0.64.0rc9 pre-release

Oct 28, 2025

0.63.0

Sep 22, 2025

0.62.2

Aug 20, 2025

0.62.2rc1 pre-release

Aug 20, 2025

0.62.0

Aug 11, 2025

0.62.0rc32 pre-release

Aug 19, 2025

0.62.0rc31 pre-release

Aug 19, 2025

0.62.0rc29 pre-release

Aug 18, 2025

0.62.0rc26 pre-release

Aug 15, 2025

0.62.0rc24 pre-release

Aug 14, 2025

This version

0.62.0rc20 pre-release

Aug 14, 2025

0.62.0rc19 pre-release

Aug 13, 2025

0.62.0rc14 pre-release

Aug 12, 2025

0.62.0rc11 pre-release

Aug 9, 2025

0.62.0rc10 pre-release

Aug 7, 2025

0.60.1

Jul 23, 2025

0.60.0rc25 pre-release

Jul 17, 2025

0.60.0rc25.dev1 pre-release

Jul 21, 2025

0.60.0rc24 pre-release

Jul 15, 2025

0.60.0rc23 pre-release

Jul 14, 2025

0.60.0rc22 pre-release

Jul 11, 2025

0.60.0rc21 pre-release

Jul 11, 2025

0.60.0rc20 pre-release

Jul 10, 2025

0.60.0rc4 pre-release

Jun 26, 2025

0.60.0rc1 pre-release

Jun 24, 2025

0.59.0rc61 pre-release

Jun 26, 2025

0.59.0rc60 pre-release

Jun 25, 2025

0.59.0rc58 pre-release

Jun 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ttnn-0.62.0rc20-cp310-cp310-manylinux_2_34_x86_64.whl (26.4 MB view details)

Uploaded Aug 14, 2025 CPython 3.10manylinux: glibc 2.34+ x86-64

File details

Details for the file ttnn-0.62.0rc20-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

Download URL: ttnn-0.62.0rc20-cp310-cp310-manylinux_2_34_x86_64.whl
Upload date: Aug 14, 2025
Size: 26.4 MB
Tags: CPython 3.10, manylinux: glibc 2.34+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for ttnn-0.62.0rc20-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm	Hash digest
SHA256	`4c47761b3dc6b1752a405454dfa5775bf2d044a9ba22eafda3dcf7feaa472ecb`
MD5	`e7b78255baaff555b51c0bc170d71458`
BLAKE2b-256	`3040b2739f46147806cb456899ac4b141a882a3c6c265643cfab6e42f36e5f7c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ttnn-0.62.0rc20-cp310-cp310-manylinux_2_34_x86_64.whl:

Publisher: package-and-release.yaml on tenstorrent/tt-metal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ttnn-0.62.0rc20-cp310-cp310-manylinux_2_34_x86_64.whl
- Subject digest: 4c47761b3dc6b1752a405454dfa5775bf2d044a9ba22eafda3dcf7feaa472ecb
- Sigstore transparency entry: 393526020
- Sigstore integration time: Aug 14, 2025
Source repository:
- Permalink: tenstorrent/tt-metal@fbb9b4e3343e9d9b245011f3e408a7bb357911a4
- Branch / Tag: refs/heads/releases/v0.62.0
- Owner: https://github.com/tenstorrent
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: package-and-release.yaml@fbb9b4e3343e9d9b245011f3e408a7bb357911a4
- Trigger Event: workflow_dispatch

ttnn 0.62.0rc20

Navigation

Verified details

Project links

Owner

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Project description

Install | Buy Hardware | Bounty $ | Join Us | Discord

API Reference | Model Demos

Latest Releases

LLMs

Speech-to-Text

Diffusion Models

CNNs and Vision Transformers

Classification models

Object Detection

Segmentation

NLPs

Model Updates

Model Bring-Up and Testing

TT-NN Tech Reports

Benchmarks

Programming Guide | API Reference

Getting started

TT-Metalium Tech Reports

TT-Metalium Programming Examples

Hello World

Add Integers

Simple Tensor Manipulation

DRAM Data Movement

Eltwise

Matmul

Tools and Instruments

TT_NN Visualizer

Tenstorrent Bounty Program Terms and Conditions

License

Project details

Verified details

Project links

Owner

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

Provenance