EAR 4.3
Reference Manual
Changelog

EAR 4.3

  • MPI stats collection now is guided by sampling to minimize the overhead.
  • EARL-EARD communication optimized.
  • EARL: Periodic actions optimization.
  • EARL: Reduce time consumption of loop signature computation.
  • erun: Provide support for multiple batch schedulers.
  • eardbd: Verbosity quality improved.
  • Improved metrics computation in AMD Zen2/Zen3.
  • Improved robustness in metrics computation to support hardware failures.

EAR 4.2

  • Improved support for node sharing : save/restore configurations
  • AMD(Zen3) CPUs
  • Intel(r) SST support ondemand
  • Improved Phases classification
  • GPU idle optimization in all the application phases
  • MPI load balance for energy optimization integrated on EAR policies
  • On demand COUNTDOWN support for MPI calls energy optimization
  • Energy savings estimates reported to the DB (available with eacct)
  • Application phases reported to the DB (available with eacct)
  • MPI statistics reports: CSV file with MPI statistics
  • New Intel Node Manager powercap node plugin
  • Improvements in the Meta-EARGM and node powercap
  • Improvements in the Soft cluster powercap
  • New report plugins for non-relational DB: EXAMON, Cassandra, DCDB
  • Improvements in the ear.conf parsing
  • Improved metrics and management API
  • Changes in the environment variables have been done for homogeneity

EAR4.1.1

  • Select replaced by poll to support bigger nodes
  • Minor changes in edb_create and FP exceptions fixes

EAR 4.1

  • Meta EARGM.
  • Support for N jobs in a node.
  • CPU power models for N jobs.
  • Python apps loaded automatically.
  • Support for MPI-Python through environment variable.
  • Report plug-ins in EARL, EARD and EARDBD.
  • PostgreSQL support.
  • Soft cluster powercap.
  • New AMD virtual P-states support using max frequency and different P-states.
  • New RPC system in EARL-EARD communication (including locks).
  • Partial support for different schedulers (PBS).
  • New task messages between EARPlug and EARD.
  • New DCMI and INM-Freeipmi based energy plug-ins.
  • IceLake support.
  • Likwid support for IceLake memory bandwidth computation.
  • msr_safe
  • HEROES plug-in.

EAR 4.0

  • AMD virtual p-states support and DF frequency management included
  • AMD optimization based on min_energy and min_time
  • GPU optimization in low GPU utilization phases
  • Application phases IO/MPI/Computation detection included
  • Node powercap and cluster powercap implemented: Intel CPU and NVIDIA GPUS tested. Meta EAR-GM not released
  • IO, Percentage of MPI and Uncore frequency reported to DB and included in eacct
  • econtrol extensions for EAR health-check

EAR 3.4

  • Automatic loading of EAR library for MPI applications (already in 3.3), OpenMP, MKL and CUDA applications. Programming model detection is based on dynamic symbols so it could not work if symbols are statically included.
  • AMD monitoring support.
  • TAGS support included in policies.
  • Request dynamic in eard_rapi.
  • GPU monitoring support in EAR library for NVIDIA devices.
  • Node powercap and cluster power cap under development.
  • papi dependency removed.

EAR 3.3

  • eacct loop signature reported.
  • EAR loader included.
  • GPU support migrated to nvml API.
  • GPU support in configure.
  • TAGS supported in ear.conf.
  • Heterogeneous clusters specification supported.
  • EARGM energy capping management improved.
  • Internal messaging protocol improved.
  • Average CPU frequency and Average IMC frequency computation improved.

EAR 3.2

  • GPU monitoring based on nvidia-smi command.
  • GPU power reported to the DB using NVIDIA commands.
  • Postgresql support.
  • freeipmi dependence removed.