Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FLOPs and Bandwidth report for GPUs #50

Open
amir-raoofy opened this issue Jul 31, 2023 · 7 comments
Open

FLOPs and Bandwidth report for GPUs #50

amir-raoofy opened this issue Jul 31, 2023 · 7 comments
Assignees

Comments

@amir-raoofy
Copy link

amir-raoofy commented Jul 31, 2023

I noticed that running FIRESTARTER on GPUs (i.e., Nvidia GPUs to be precise), with "-r" flag only reports estimated the CPU FLOPs and Bandwidth . Is there a way to get estimated FLOPs and Bandwidth on GPU? if not is there a plan to support this?

Another point is that the log does not explicitly mention that the reported flops and bandwidth are related to CPU and I find it a bit confusing to get this statistic without associating them with, e.g., CPU or GPU. Especially once one would build FIRESTARTER for GPU.

@amir-raoofy amir-raoofy changed the title FLOP and Bandwidth report for GPUs FLOPs and Bandwidth report for GPUs Aug 5, 2023
@rschoene
Copy link
Member

I would check for the FLOPS values on the GPUs. However, we cannot really make an assumption on the bandwidth for GPUs, since the implementation of gemm allows for a lot of optimizations.
Would that be sufficient or would that lead to more irritation (FLOPS for both, but bw only for host)?

@amir-raoofy
Copy link
Author

Thanks. My main point is that when someone builds FIRESTARTER with GPU support (e.g., with CUDA), there is a good chance they are after the numbers associated with the GPU. However, the benchmark only reports values without mentioning that the numbers are associated with the CPU and not the GPU, which could be misleading.

GPU estimates (FLOPS and/or BW rates) would be good to have, but I think, even if getting estimates is not possible for any reason, it is still fine as long as the confusion about what these numbers are associated with is addressed.

@rschoene
Copy link
Member

I'm at it in branch gpu_flops

@rschoene
Copy link
Member

(comments and remarks are welcome)

@rschoene
Copy link
Member

@amir-raoofy should work now, please provide feedback

@rschoene rschoene self-assigned this Sep 20, 2024
@amir-raoofy
Copy link
Author

This looks good from my point of view. Many thanks!

(apologies for the late response/feedback)

@DiDu0815
Copy link

I just want to second that feature, because the CPU/GPU FLOPs also could be quantitative indirect indicators for (e.g. thermal) throttling.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

3 participants