Queueing Analysis of an Ensemble Machine Learning System

Keishin Tsutsumi, Tuan Phung-Duc, Linh Truong

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

77 Downloads (Pure)

Abstract

Recent advances in AI/ML technologies have accelerated the development of various ML applications. One of the major trends in AI/ML application development is the increasing use of multiple ML models to support high-accuracy inference in a complex end-to-end ML serving. However, testing the right configuration of multiple ML models is expensive, and the application requirements for ML inferences are highly dependent on various factors like the quality of ML models, computing resource performance, and data quality. In this context, techniques and methods that help to emulate and analyze ML inference characteristics using queueing theory can reduce the development effort and cost for ML services encapsulating ML models but also the entire ML system. In this paper, we modeled and analyzed a queueing model for an ML system that uses ensemble learning as an inference method with a new rule and clarified the impacts of model design in ensemble learning on the system’s performance. As a result, we demonstrate the usefulness of the analysis for understanding possible configurations and their efficiency in the ML system through queueing analysis and simulation.

Original languageEnglish
Title of host publicationAnalytical and Stochastic Modelling Techniques and Applications - 28th International Conference, ASMTA 2024, Proceedings
EditorsArnaud Devos, András Horváth, Sabina Rossi
PublisherSpringer
Pages97-111
Number of pages15
ISBN (Electronic)978-3-031-70753-7
ISBN (Print)978-3-031-70752-0
DOIs
Publication statusPublished - 13 Sept 2024
MoE publication typeA4 Conference publication
EventInternational Conference on Analytical and Stochastic Modelling Techniques and Applications - Venice, Italy
Duration: 14 Jun 202414 Jun 2024
Conference number: 28

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14826 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Analytical and Stochastic Modelling Techniques and Applications
Abbreviated titleASMTA
Country/TerritoryItaly
CityVenice
Period14/06/202414/06/2024

Fingerprint

Dive into the research topics of 'Queueing Analysis of an Ensemble Machine Learning System'. Together they form a unique fingerprint.

Cite this