[SD-119] Implement layer execution latency measurements for Pytorch #48
base: develop
Conversation
@@ -7,4 +7,6 @@ requires-python = ">=3.11"
 dependencies = [
     "dvc-s3>=3.2.0",
     "pandas>=2.2.3",
     "pillow>=11.1.0",
Is this package being used anywhere?
It's an implicit requirement of PyTorch to run resnet18.
        module: the module to register hook.
        input: tuple containing the input arguments to module's forward method.
    """
    layer_time_dict[layer_name] = (time.time(), datetime.datetime.now().strftime("%Y-%m-%d %H:%M:%S"))
I am not sure time.time is a reliable way to measure this, as it depends on the system clock. I think time.perf_counter or time.perf_counter_ns() makes more sense: these layers are going to execute quickly, and we need more precise estimates (these functions are monotonic and offer a higher resolution).
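For reference, a quick way to compare the two clocks (this snippet is just an illustration, not part of the PR):

import time

# time.time() follows the system (wall) clock and can jump if the clock is
# adjusted; time.perf_counter() is monotonic and uses the highest-resolution
# timer available, which matters for layers that run in roughly a millisecond.
print(time.get_clock_info("time").resolution)          # e.g. 0.015625 s on Windows
print(time.get_clock_info("perf_counter").resolution)  # e.g. 1e-07 s or finer

start = time.perf_counter()
_ = sum(i * i for i in range(10_000))
elapsed_ms = (time.perf_counter() - start) * 1e3
print(f"elapsed: {elapsed_ms:.3f} ms")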
I think we don't need to worry about the CPU timing implementation too much, as the hardware we're using has CUDA, and CUDA events will be used in the next ticket.
layer_time_dict = {}

for layer_name, layer in get_layers(model):
    layer.register_forward_pre_hook(partial(layer_time_pre_hook, layer_time_dict, layer_name))
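For context, here is a minimal, self-contained sketch of the hook-based approach the diff takes. get_layers, layer_time_pre_hook and the post-hook body are reconstructed from the quoted snippets (using time.perf_counter per the suggestion above), so details may differ from the actual PR:

import time
import datetime
from functools import partial

import torch
import torchvision


def get_layers(model):
    """Yield (qualified_name, module) pairs for leaf modules only."""
    for name, module in model.named_modules():
        if len(list(module.children())) == 0:
            yield name, module


def layer_time_pre_hook(layer_time_dict, layer_name, module, input):
    """Store the start time just before the layer's forward pass runs."""
    layer_time_dict[layer_name] = (time.perf_counter(), datetime.datetime.now().strftime("%Y-%m-%d %H:%M:%S"))


def layer_time_hook(layer_time_dict, results, layer_name, module, input, output):
    """Compute the elapsed time once the layer's forward pass returns."""
    start, timestamp = layer_time_dict[layer_name]
    elapsed_ms = (time.perf_counter() - start) * 1e3
    results.append({"layer": layer_name, "timestamp": timestamp, "latency_ms": elapsed_ms})


model = torchvision.models.resnet18().eval()
layer_time_dict, results = {}, []

for layer_name, layer in get_layers(model):
    layer.register_forward_pre_hook(partial(layer_time_pre_hook, layer_time_dict, layer_name))
    layer.register_forward_hook(partial(layer_time_hook, layer_time_dict, results, layer_name))

with torch.no_grad():
    model(torch.randn(1, 3, 224, 224))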
Why are hooks being used? Does the profiler API not work? I think it would provide much better results on CPU and GPU.
As far as I understand, the autograd profiler gives us the wrong granularity (it reports latencies by operation type, not by layer). But @osw282 can give more context, I guess.
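To illustrate the granularity point, a standalone snippet (not from the PR; the exact profiler invocation is my own):

import torch
import torchvision

model = torchvision.models.resnet18().eval()
x = torch.randn(1, 3, 224, 224)

with torch.profiler.profile(activities=[torch.profiler.ProfilerActivity.CPU]) as prof:
    with torch.no_grad():
        model(x)

# Rows are keyed by operator name (e.g. "aten::convolution"), so timings from
# different conv layers are aggregated into a single row rather than reported
# per named module.
print(prof.key_averages().table(sort_by="cpu_time_total", row_limit=10))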
A small nit about milliseconds; otherwise good to go, I think.
        module: the module to register hook.
        input: tuple containing the input arguments to module's forward method.
    """
    layer_time_dict[layer_name] = (time.time(), datetime.datetime.now().strftime("%Y-%m-%d %H:%M:%S"))
N.B. these functions will need to use CUDA events from SD-118 in the actual benchmark script.
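A rough sketch of what that could look like, assuming SD-118 exposes torch.cuda.Event timing; the hook and variable names here are hypothetical and the code requires a CUDA device:

import torch


def cuda_layer_time_pre_hook(layer_time_dict, layer_name, module, input):
    """Record a CUDA event on the current stream just before the layer runs."""
    start_event = torch.cuda.Event(enable_timing=True)
    start_event.record()
    layer_time_dict[layer_name] = start_event


def cuda_layer_time_hook(layer_time_dict, results, layer_name, module, input, output):
    """Record an end event and store the GPU-side elapsed time in milliseconds."""
    end_event = torch.cuda.Event(enable_timing=True)
    end_event.record()
    end_event.synchronize()  # elapsed_time() needs both events to have completed
    start_event = layer_time_dict[layer_name]
    results.setdefault(layer_name, []).append(start_event.elapsed_time(end_event))

These would be registered with register_forward_pre_hook / register_forward_hook via functools.partial, exactly like the time-based hooks in this PR.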
        module: the module to register hook.
        input: tuple containing the input arguments to module's forward method.
    """
    layer_time_dict[layer_name] = (time.time(), datetime.datetime.now().strftime("%Y-%m-%d %H:%M:%S"))
Not enough precision for the layer start time; we will need milliseconds too (layers in resnet18 take about 1.5 ms to execute). Worth fixing here, so we don't forget to fix it in the next ticket with CUDA events.
This PR implements a script that records the layer execution time of a PyTorch model during inference using CPU only.
The script outputs a JSON file containing the execution time, timestamp, and layer name for every inference cycle.
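The output schema itself isn't shown in this conversation; assuming the per-layer records collected by the hooks, serialization might look like this (field names are illustrative only, not the PR's actual schema):

import json

# `results` stands in for the records collected by the forward hooks.
results = [
    {"layer": "layer1.0.conv1", "timestamp": "2024-01-01 12:00:00", "latency_ms": 1.5},
]

with open("layer_latencies.json", "w") as f:
    json.dump(results, f, indent=2)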