RAPL in Action : Experiences in Using RAPL for Power Measurements

Tutkimustuotos: Lehtiartikkelivertaisarvioitu

Tutkijat

Organisaatiot

  • University of Helsinki
  • Aalto University
  • University of Lausanne
  • Beijing University of Posts and Telecommunications
  • Helsinki Institute of Physics
  • VTT Technical Research Centre of Finland

Kuvaus

To improve energy efficiency and comply with the power budgets, it is important to be able to measure the power consumption of cloud computing servers. Intel's Running Average Power Limit (RAPL) interface is a powerful tool for this purpose. RAPL provides power limiting features and accurate energy readings for CPUs and DRAM, which are easily accessible through different interfaces on large distributed computing systems. Since its introduction, RAPL has been used extensively in power measurement and modeling. However, the advantages and disadvantages of RAPL have not beenwell investigated yet. To fill this gap, we conduct a series of experiments to disclose the underlying strengths and weaknesses of the RAPL interface by using both customized microbenchmarks and three well-known application level benchmarks: Stream, Stress-ng, and ParFullCMS. Moreover, to make the analysis as realistic as possible, we leverage two production-level power measurement datasets from the Taito, a supercomputing cluster of the Finnish Center of Scientific Computing and also replicate our experiments on Amazon EC2. Our results illustrate different aspects of RAPL and document the findings through comprehensive analysis. Our observations reveal that RAPL readings are highly correlated with plug power, promisingly accurate enough, and have negligible performance overhead. Experimental results suggest RAPL can be a very useful tool tomeasure and monitor the energy consumption of servers without deploying any complex power meters. We also show that there are still some open issues, such as driver support, non-atomicity of register updates, and unpredictable timings that might weaken the usability of RAPL in certain scenarios. For such scenarios, we pinpoint solutions and workarounds.

Yksityiskohdat

AlkuperäiskieliEnglanti
Artikkeli9
Sivumäärä26
JulkaisuACM Transactions on Modeling and Performance Evaluation of Computing Systems
Vuosikerta3
Numero2
TilaJulkaistu - huhtikuuta 2018
OKM-julkaisutyyppiA1 Julkaistu artikkeli, soviteltu

ID: 31378878