Profile of michaelfeil

llm-runtime-metrics

Last released Jun 12, 2026

Rust-backed performance metrics and request tracing

baseten-performance-client

Last released Jun 4, 2026

An ultra-high-performance package for sending requests to Baseten Embedding Inference

truss-transfer

Last released Jun 3, 2026

Speed up file transfers with baseten.co and baseten_fs.

fastokens-b10

Last released Mar 20, 2026

None

mm-cache-client

Last released Mar 13, 2026

Tiny aiohttp client for the mm-cache distributed cache service

radix-mlp

Last released Jan 2, 2026

RadixMLP: Prefix-based computation sharing for transformer models

infinity-client

Last released Aug 22, 2025

A client library for accessing ♾️ Infinity - Embedding Inference Server

infinity-emb

Last released Aug 22, 2025

Infinity is a high-throughput, low-latency REST API for serving text embeddings, reranking models, and CLIP.

briton

Last released Apr 4, 2025

Python component for using Briton

embed

Last released Sep 24, 2024

A stable, fast and easy-to-use inference library with a focus on a sync-to-async API

gradientai

Last released Jun 28, 2024

Gradient AI API

hf-hub-ctranslate2

Last released Jun 26, 2024

Connecting Transformers on the Hugging Face Hub with CTranslate2.

rlskyjo

Last released Jan 31, 2022

Multi-agent reinforcement learning environment for the card game SkyJo, compatible with PettingZoo and RLlib

Michael Feil

13 projects