INA Research Group

TopFull: Execution flow overview

Summary

Microservice has become a de facto standard for building large-scale cloud applications. Overload control is essential in preventing microservice failures and maintaining system performance under overloads. Although several approaches have been proposed, they are limited to mitigating the overload of individual microservices, lacking assessments of interdependent microservices and APIs. This paper presents TopFull, an adaptive overload control at entry for microservices that leverages global observations to maximize throughput that meets service level objectives (i.e., goodput). TopFull makes adaptive load control on a per-API basis, exercises parallel control on each independent subset of microservices, and applies RL-based rate controllers that adjust the admitted rates of the APIs at entry according to the severity of overload. Our experiments on various open-source benchmarks demonstrate that TopFull significantly increases goodput in overload scenarios, outperforming DAGOR by 1.82x and Breakwater by 2.26x. Furthermore, the Kubernetes autoscaler with TopFull serves up to 3.91x more requests under traffic surge and tolerates traffic spikes with up to 57% fewer resources than the standalone Kubernetes autoscaler.

Publications

SIGCOMM

TopFull: An Adaptive Top-Down Overload Control for SLO-Oriented Microservices

Jinwoo Park, Jaehyeong Park, Youngmok Jung, Hwijoon Lim, Hyunho Yeo, and Dongsu Han

In Proceedings of the ACM SIGCOMM 2024 conference on SIGCOMM Aug 2024

Paper Project Code

Members

Jinwoo Park

Ph.D. Student

Jaehyeong Park

Ph.D. Student

Youngmok Jung

Alumni

Hwijoon Lim

Alumni

Hyunho Yeo

Alumni

Dongsu Han

Principal Investigator

Media

Pre-recorded SIGCOMM talk.