Artifact-backed LLM serving performance lab for vLLM baselines, official metrics, GuideLLM checks, and SGLang/PD scaffolding
python performance-engineering modal prometheus artifact-evaluation llm llm-serving vllm llm-inference sglang llm-performance gpu-benchmarking guidellm inference-benchmarking serving-metrics
Updated May 21, 2026 - Python