The Flux Power Monitor is a lightweight telemetry module for the Flux resource manager designed to provide job-level power statistics. It aggregates power data over time for each node and its respective hardware components based on the provided jobId
.
Flux Power Monitor depends on:
-
Flux-core: Must be installed on the system.
-
Variorum: For power data collection. Installation steps are available here.
-
Flux Python Bindings: Install with the command:
pip install flux-python
The module is built using Autotools
.
To build the Flux Power Monitor, follow these steps:
./autogen.sh
- Configure the build specifying the Variorum and Flux paths:
./configure --with-variorum=<path_to_variorum> --prefix=<path_to_flux>
make -j32
To utilize the Flux Power Monitor:
- Load the module:
./scripts/load_power_monitor.sh -s <buffer_size> -r <sampling_rate>
-s
: Buffer size (amount of power data each node stores).-r
: Sampling rate for data collection.
- Confirm the module is loaded with:
flux module list
- Retrieve power data for a specific job using:
python power_query.py -j {jobId}
- To unload the module:
./scripts/unload_power_monitor.sh
SPDX-License-Identifier: LGPL-3.0
LLNL-CODE-764420