Supporting Opportunistic Data Operations for Data-Intensive Computational Applications

Minh Tri Nguyen, Anh Dung Nguyen, Jarno Rantaharju, Touko Puro, Matthias Rheinhardt, Maarit Korpi-Lagg, Hong Linh Truong*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

4 Downloads (Pure)

Abstract

A long running data-intensive computational application acquires costly computing resources. With the emerging new architectures, like computing systems with multiple nodes of many-core CPUs and accelerators, while domain-specific tools and libraries employed in such an application leverage high parallelism on accelerators for intensive computations, the remaining resources can potentially be utilized for other application-related data operations. Such data operations, called opportunistic data operations in this work, must usually be carried out for post-processing or follow-up analytics based on results produced during the runtime of the application. These operations are not easily backfilled or preempted under the guidance of the domain scientist or by common task scheduling systems due to their complex dependencies.In this paper, we introduce a framework for domain scientists to identify and execute opportunistic data operation tasks. With a minimal specification or modification of the main application, the scientists can specify, monitor, and execute opportunistic tasks independently from the main application and the framework will detect underutilized resources to execute these tasks, thereby, optimizing utilization efficiency within the allocated resources. We present experiments to demonstrate the applicability of our framework on a magnetic field modeling running on the LUMI computing system.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Big Data, BigData 2024
EditorsWei Ding, Chang-Tien Lu, Fusheng Wang, Liping Di, Kesheng Wu, Jun Huan, Raghu Nambiar, Jundong Li, Filip Ilievski, Ricardo Baeza-Yates, Xiaohua Hu
PublisherIEEE
Pages3735-3744
Number of pages10
ISBN (Electronic)979-8-3503-6248-0
DOIs
Publication statusPublished - 2024
MoE publication typeA4 Conference publication
EventIEEE International Conference on Big Data - Washington, United States
Duration: 15 Dec 202418 Dec 2024
https://www3.cs.stonybrook.edu/~ieeebigdata2024/

Publication series

NameIEEE International Conference on Big Data
ISSN (Electronic)2573-2978

Conference

ConferenceIEEE International Conference on Big Data
Abbreviated titleBigData
Country/TerritoryUnited States
CityWashington
Period15/12/202418/12/2024
Internet address

Keywords

  • computational applications
  • data operations
  • opportunistic tasks
  • performance optimization

Fingerprint

Dive into the research topics of 'Supporting Opportunistic Data Operations for Data-Intensive Computational Applications'. Together they form a unique fingerprint.

Cite this