Skip to main content

SAGE Benchmark - SAGE framework-specific system-level benchmarks

Project description

benchmark_sage – SAGE System-Level Benchmarks and ICML Artifacts

benchmark_sage is a home for system-level benchmarks and artifacts that focus on SAGE as a complete ML systems platform.

Key points:

  • SAGE is more than an LLM control plane. The LLM/embedding control plane is one subsystem. SAGE also includes components such as sage.db, sage.flow, sage.tsdb, and others, all orchestrated via a common declarative dataflow model.
  • packages/sage-benchmark already contains multiple benchmark suites (agents, control-plane scheduling, DB, retrieval, memory, schedulers, refiner, libamm, etc.). benchmark_sage can aggregate cross-cutting experiments that involve several SAGE subsystems together.
  • This folder may also store ICML writing prompts and experiment templates for the SAGE system track papers, under docs/.

Suggested uses:

  • End-to-end experiments that span sage.flow pipelines, sage.db storage, sage.tsdb time-series monitoring, and the LLM/embedding control plane.
  • Configs (config/*.yaml) for system-track experiments described in an ICML paper.
  • Notebook or script entry points that reproduce figures/tables.

Q-style Workload Catalog (TPC-H/TPC-C inspired)

benchmark_sage adopts a fixed Q1..Q8 catalog where each Q denotes a workload family rather than a one-off script. This keeps paper claims, configs, and run outputs aligned.

Query Name Entry Workload Family
Q1 PipelineChain e2e_pipeline End-to-end RAG pipeline workloads
Q2 ControlMix control_plane Mixed LLM+embedding scheduling workloads
Q3 NoisyNeighbor isolation Multi-tenant interference and isolation workloads
Q4 ScaleFrontier scalability Scale-out throughput/latency workloads
Q5 HeteroResilience heterogeneity Heterogeneous deployment and recovery workloads
Q6 BurstTown burst_priority Bursty mixed-priority transactional workloads
Q7 ReconfigDrill reconfiguration Online reconfiguration drill workloads
Q8 RecoverySoak recovery Fault-recovery soak workloads

Examples:

python -m sage.benchmark.benchmark_sage --experiment Q1
python -m sage.benchmark.benchmark_sage --experiment Q3 --quick
python -m sage.benchmark.benchmark_sage --experiment Q7 --quick
python -m sage.benchmark.benchmark_sage --all

At the repo root, docs/icml-prompts/ contains reusable writing prompts. You can either reference them directly or copy customized versions into this folder when preparing a specific ICML submission.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

isage_sage_benchmark-0.1.0.2.tar.gz (968.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

isage_sage_benchmark-0.1.0.2-py2.py3-none-any.whl (1.1 MB view details)

Uploaded Python 2Python 3

File details

Details for the file isage_sage_benchmark-0.1.0.2.tar.gz.

File metadata

  • Download URL: isage_sage_benchmark-0.1.0.2.tar.gz
  • Upload date:
  • Size: 968.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for isage_sage_benchmark-0.1.0.2.tar.gz
Algorithm Hash digest
SHA256 53918eee191c8bd8f16410f7bdace2652de1b641ffa7b35f409d7d2880e04800
MD5 e1abb31080ff87fe261747ca6439c193
BLAKE2b-256 a379655bd8abbd5bc6f7a08499b0d4e7e21f585317408d33bbecdb99e01fcea4

See more details on using hashes here.

File details

Details for the file isage_sage_benchmark-0.1.0.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for isage_sage_benchmark-0.1.0.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 449776b8a6ad9b71ad3481ae0e140458c8e2b3f322f5b3cc2399dcd4983c097c
MD5 e6aac9a19cf6b6a710568179dab80ce8
BLAKE2b-256 cc5857afd72d9916c9d7bdcb19c39857d60fb2bd49d2fc28cad41e19961d64cf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page