Kthena v0.3.0 Released: Production-Ready Inference Orchestration
· 7 min read
Released: 2026-01-31
Summary
Release v0.3.0 establishes Kthena as a more robust and scalable platform for AI inference workloads. This release introduces significant enhancements in ModelServing, Router, and ModelBooster. Key highlights include seamless integration with LeaderWorkerSet, advanced network topology-aware scheduling for PD disaggregation, and a comprehensive Router Observability framework. Additionally, this version brings native ModelServing version control, support for vLLM data parallel deployment, and a complete E2E test suite for the router, ensuring high stability and reliability for production environments.


