20 projects
tokenspeed-triton
A language and compiler for custom Deep Learning operations (vendor release for TokenSpeed)
tokenspeed-proton
A profiler for Triton (vendor release for TokenSpeed)
smg
High-performance Rust-based inference gateway for large-scale LLM deployments
tokenspeed-fa3
FlashAttention-3
tokenspeed-trtllm-common
Name reserved for the tokenspeed-trtllm-common project.
tokenspeed-trtllm-gemm
Name reserved for the tokenspeed-trtllm-gemm project.
tokenspeed-trtllm-moe
Name reserved for the tokenspeed-trtllm-moe project.
tokenspeed-trtllm-attn
Name reserved for the tokenspeed-trtllm-attn project.
tokenspeed-flashmla
None
tokenspeed-deepep
None
tokenspeed-fast-hadamard-transform
Fast Hadamard Transform in CUDA, with a PyTorch interface
tokenspeed-deepgemm
None
tokenspeed-fa4
Flash Attention CUTE (CUDA Template Engine) implementation
tokenspeed-trtllm-kernel
Name reserved for the tokenspeed-trtllm-kernel project.
tokenspeed-flash-attn
Name reserved for the tokenspeed-flash-attn project.
tokenspeed-scheduler
Name reserved for the tokenspeed-scheduler project.
modelgt
Name reserved for the modelgt project.
tokenspeed-kernel
Name reserved for the tokenspeed-kernel project.
tokenspeed
Name reserved for the tokenspeed project.
torchspec
TorchSpec (placeholder package name reservation).