Abstract
Recent advances in AI/ML technologies have accelerated the development of various ML applications. One major trend in AI/ML application development is the increasing use of multiple ML models to support high-accuracy inference in complex end-to-end ML serving. However, testing to find the right configuration of multiple ML models is expensive, and the application requirements for ML inference depend heavily on factors such as the quality of the ML models, computing resource performance, and data quality. In this context, techniques and methods that help to emulate and analyze ML inference characteristics using queueing theory can reduce development effort and cost, not only for ML services encapsulating ML models but also for the entire ML system. In this paper, we build and analyze a queueing model for an ML system that uses ensemble learning as an inference method with a new rule, and we clarify the impact of model design in ensemble learning on the system’s performance. Through queueing analysis and simulation, we demonstrate the usefulness of the analysis for understanding possible configurations of the ML system and their efficiency.
Original language | English |
---|---|
Title of host publication | Analytical and Stochastic Modelling Techniques and Applications - 28th International Conference, ASMTA 2024, Proceedings |
Editors | Arnaud Devos, András Horváth, Sabina Rossi |
Publisher | Springer |
Pages | 97-111 |
Number of pages | 15 |
ISBN (Electronic) | 978-3-031-70753-7 |
ISBN (Print) | 978-3-031-70752-0 |
DOIs | |
Publication status | Published - 13 Sept 2024 |
MoE publication type | A4 Conference publication |
Event | International Conference on Analytical and Stochastic Modelling Techniques and Applications - Venice, Italy; Duration: 14 Jun 2024 → 14 Jun 2024; Conference number: 28 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 14826 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | International Conference on Analytical and Stochastic Modelling Techniques and Applications |
---|---|
Abbreviated title | ASMTA |
Country/Territory | Italy |
City | Venice |
Period | 14/06/2024 → 14/06/2024 |