Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch Runic CI #552

Merged
merged 2 commits into from
Jan 9, 2025
Merged

Switch Runic CI #552

merged 2 commits into from
Jan 9, 2025

Conversation

vchuravy
Copy link
Member

@vchuravy vchuravy commented Jan 8, 2025

No description provided.

Copy link
Contributor

github-actions bot commented Jan 8, 2025

Benchmark Results

main 4675536... main/467553638f074a...
saxpy/default/Float16/1024 0.733 ± 0.0089 μs 0.742 ± 0.0064 μs 0.988
saxpy/default/Float16/1048576 0.173 ± 0.0028 ms 0.172 ± 0.0027 ms 1
saxpy/default/Float16/16384 3.34 ± 0.046 μs 3.33 ± 0.034 μs 1
saxpy/default/Float16/2048 0.916 ± 0.011 μs 0.921 ± 0.012 μs 0.994
saxpy/default/Float16/256 0.584 ± 0.0074 μs 0.593 ± 0.0056 μs 0.985
saxpy/default/Float16/262144 0.0439 ± 0.00044 ms 0.0441 ± 0.00071 ms 0.997
saxpy/default/Float16/32768 6.02 ± 0.092 μs 6.01 ± 0.082 μs 1
saxpy/default/Float16/4096 1.31 ± 0.02 μs 1.32 ± 0.02 μs 0.993
saxpy/default/Float16/512 0.645 ± 0.009 μs 0.655 ± 0.0068 μs 0.985
saxpy/default/Float16/64 0.552 ± 0.0064 μs 0.56 ± 0.0049 μs 0.986
saxpy/default/Float16/65536 11.7 ± 0.19 μs 11.6 ± 0.14 μs 1.01
saxpy/default/Float32/1024 0.637 ± 0.012 μs 0.639 ± 0.0094 μs 0.996
saxpy/default/Float32/1048576 0.191 ± 0.027 ms 0.191 ± 0.027 ms 0.996
saxpy/default/Float32/16384 2.73 ± 0.14 μs 2.81 ± 0.89 μs 0.972
saxpy/default/Float32/2048 0.748 ± 0.054 μs 0.764 ± 0.028 μs 0.978
saxpy/default/Float32/256 0.566 ± 0.0083 μs 0.562 ± 0.0057 μs 1.01
saxpy/default/Float32/262144 0.0449 ± 0.0042 ms 0.0451 ± 0.0045 ms 0.996
saxpy/default/Float32/32768 5.28 ± 0.31 μs 5.61 ± 1.6 μs 0.943
saxpy/default/Float32/4096 1.13 ± 0.084 μs 1.12 ± 0.072 μs 1.01
saxpy/default/Float32/512 0.606 ± 0.01 μs 0.601 ± 0.0077 μs 1.01
saxpy/default/Float32/64 0.551 ± 0.0067 μs 0.55 ± 0.0052 μs 1
saxpy/default/Float32/65536 12.1 ± 1.4 μs 12.4 ± 1.3 μs 0.981
saxpy/default/Float64/1024 0.767 ± 0.022 μs 0.786 ± 0.051 μs 0.975
saxpy/default/Float64/1048576 0.486 ± 0.041 ms 0.475 ± 0.047 ms 1.02
saxpy/default/Float64/16384 5.25 ± 0.47 μs 5.49 ± 1.5 μs 0.956
saxpy/default/Float64/2048 1.14 ± 0.088 μs 1.14 ± 0.079 μs 0.997
saxpy/default/Float64/256 0.603 ± 0.009 μs 0.59 ± 0.0078 μs 1.02
saxpy/default/Float64/262144 0.0891 ± 0.0078 ms 0.0905 ± 0.0081 ms 0.985
saxpy/default/Float64/32768 12.5 ± 1.6 μs 12.2 ± 1.3 μs 1.02
saxpy/default/Float64/4096 1.75 ± 0.3 μs 1.74 ± 0.3 μs 1
saxpy/default/Float64/512 0.658 ± 0.011 μs 0.648 ± 0.012 μs 1.02
saxpy/default/Float64/64 0.577 ± 0.0075 μs 0.568 ± 0.0064 μs 1.02
saxpy/default/Float64/65536 23.9 ± 2.1 μs 24 ± 2.1 μs 0.997
saxpy/static workgroup=(1024,)/Float16/1024 2.18 ± 0.028 μs 2.17 ± 0.028 μs 1
saxpy/static workgroup=(1024,)/Float16/1048576 0.162 ± 0.011 ms 0.16 ± 0.0091 ms 1.01
saxpy/static workgroup=(1024,)/Float16/16384 4.42 ± 0.097 μs 4.4 ± 0.08 μs 1
saxpy/static workgroup=(1024,)/Float16/2048 2.35 ± 0.029 μs 2.34 ± 0.031 μs 1
saxpy/static workgroup=(1024,)/Float16/256 2.82 ± 0.037 μs 2.81 ± 0.032 μs 1.01
saxpy/static workgroup=(1024,)/Float16/262144 0.0417 ± 0.0012 ms 0.0422 ± 0.0011 ms 0.989
saxpy/static workgroup=(1024,)/Float16/32768 6.83 ± 0.18 μs 6.84 ± 0.17 μs 0.998
saxpy/static workgroup=(1024,)/Float16/4096 2.68 ± 0.037 μs 2.66 ± 0.036 μs 1
saxpy/static workgroup=(1024,)/Float16/512 3.27 ± 0.036 μs 3.25 ± 0.035 μs 1
saxpy/static workgroup=(1024,)/Float16/64 2.52 ± 0.21 μs 2.5 ± 0.21 μs 1.01
saxpy/static workgroup=(1024,)/Float16/65536 12.4 ± 0.28 μs 12.4 ± 0.28 μs 1.01
saxpy/static workgroup=(1024,)/Float32/1024 2.24 ± 0.035 μs 2.22 ± 0.033 μs 1.01
saxpy/static workgroup=(1024,)/Float32/1048576 0.196 ± 0.023 ms 0.195 ± 0.019 ms 1.01
saxpy/static workgroup=(1024,)/Float32/16384 4.4 ± 0.29 μs 4.35 ± 0.27 μs 1.01
saxpy/static workgroup=(1024,)/Float32/2048 2.41 ± 0.048 μs 2.38 ± 0.043 μs 1.01
saxpy/static workgroup=(1024,)/Float32/256 2.67 ± 0.041 μs 2.64 ± 0.039 μs 1.01
saxpy/static workgroup=(1024,)/Float32/262144 0.0482 ± 0.0038 ms 0.048 ± 0.0038 ms 1
saxpy/static workgroup=(1024,)/Float32/32768 7.48 ± 0.56 μs 7.38 ± 0.44 μs 1.01
saxpy/static workgroup=(1024,)/Float32/4096 2.67 ± 0.063 μs 2.65 ± 0.062 μs 1.01
saxpy/static workgroup=(1024,)/Float32/512 2.71 ± 0.06 μs 2.69 ± 0.096 μs 1.01
saxpy/static workgroup=(1024,)/Float32/64 2.71 ± 5.1 μs 2.67 ± 5.1 μs 1.02
saxpy/static workgroup=(1024,)/Float32/65536 14.7 ± 1.4 μs 14.7 ± 1.5 μs 1
saxpy/static workgroup=(1024,)/Float64/1024 2.34 ± 0.079 μs 2.31 ± 0.053 μs 1.01
saxpy/static workgroup=(1024,)/Float64/1048576 0.499 ± 0.052 ms 0.486 ± 0.049 ms 1.03
saxpy/static workgroup=(1024,)/Float64/16384 7.28 ± 0.57 μs 7.2 ± 0.33 μs 1.01
saxpy/static workgroup=(1024,)/Float64/2048 2.62 ± 0.08 μs 2.59 ± 0.058 μs 1.01
saxpy/static workgroup=(1024,)/Float64/256 2.67 ± 0.067 μs 2.67 ± 0.074 μs 1
saxpy/static workgroup=(1024,)/Float64/262144 0.0951 ± 0.0096 ms 0.0935 ± 0.0085 ms 1.02
saxpy/static workgroup=(1024,)/Float64/32768 14.7 ± 1.6 μs 14.6 ± 1.1 μs 1.01
saxpy/static workgroup=(1024,)/Float64/4096 3.16 ± 0.15 μs 3.12 ± 0.13 μs 1.01
saxpy/static workgroup=(1024,)/Float64/512 2.67 ± 0.072 μs 2.65 ± 0.064 μs 1.01
saxpy/static workgroup=(1024,)/Float64/64 2.61 ± 0.053 μs 2.59 ± 0.051 μs 1.01
saxpy/static workgroup=(1024,)/Float64/65536 26.6 ± 3.1 μs 26.4 ± 2.9 μs 1.01
time_to_load 0.321 ± 0.0074 s 0.318 ± 0.0014 s 1.01

Benchmark Plots

A plot of the benchmark results have been uploaded as an artifact to the workflow run for this PR.
Go to "Actions"->"Benchmark a pull request"->[the most recent run]->"Artifacts" (at the bottom).

@vchuravy vchuravy merged commit a58aab6 into main Jan 9, 2025
25 of 33 checks passed
@vchuravy vchuravy deleted the vc/runic2 branch January 9, 2025 11:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant