Skip to main content
Avatar for LightSeek Foundation from gravatar.com

LightSeek Foundation

Username    lightseek
Date joined   Joined

26 projects

tokenspeed-smg

Last released

High-performance Rust-based inference gateway for large-scale LLM deployments

tokenspeed-smg-grpc-servicer

Last released

SMG gRPC servicer implementations for LLM inference engines (vLLM, SGLang, MLX, TokenSpeed)

tokenspeed-smg-grpc-proto

Last released

SMG gRPC proto definitions for SGLang, vLLM, TRT-LLM, and MLX

tokenspeed-kernel-nvidia

Last released

Placeholder package for TokenSpeed NVIDIA kernel distribution.

tokenspeed-kernel-amd

Last released

Placeholder package for TokenSpeed AMD kernel distribution.

tokenspeed-triton

Last released

A language and compiler for custom Deep Learning operations (vendor release for TokenSpeed)

tokenspeed-proton

Last released

A profiler for Triton (vendor release for TokenSpeed)

tokenspeed-mooncake

Last released

Python binding of a Mooncake library using pybind11

tokenspeed-mla

Last released

Speed-of-light TokenSpeed MLA kernels for Blackwell SM100 and SM103.

tokenspeed-trie

Last released

A small harness for evaluating OpenAI-compatible inference endpoints with synthetic agentic workloads.

tokenspeed-iris

Last released

Triton-based framework for Remote Memory Access (RMA) operations with SHMEM-like APIs for multi-GPU programming.

tokenspeed-tritonblas

Last released

A Lightweight Triton-based BLAS Library

tokenspeed-triton-kernels

Last released

None

tokenspeed-fa4

Last released

Flash Attention CUTE (CUDA Template Engine) implementation

tokenspeed-trtllm-kernel

Last released

Standalone TensorRT-LLM CUDA kernels as PyTorch custom ops

tokenspeed-flashmla

Last released

None

tokenspeed-deepep

Last released

None

tokenspeed-deepgemm

Last released

None

smg

Last released

High-performance Rust-based inference gateway for large-scale LLM deployments

tokenspeed-fa3

Last released

FlashAttention-3

tokenspeed-fast-hadamard-transform

Last released

Fast Hadamard Transform in CUDA, with a PyTorch interface

tokenspeed-scheduler

Last released

Name reserved for the tokenspeed-scheduler project.

modelgt

Last released

Name reserved for the modelgt project.

tokenspeed-kernel

Last released

Name reserved for the tokenspeed-kernel project.

tokenspeed

Last released

Name reserved for the tokenspeed project.

torchspec

Last released

TorchSpec (placeholder package name reservation).

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page