Put RNG in shared memory where beneficial #229

bernhardmgruber · 2022-10-07T12:21:56Z

The shared memory hardware handles the RNG's access pattern better than local memory (in case of register spilling) or global memory. For most kernels, it is therefore beneficial to move the RNG into shared memory at the beginning of the kernel and to store it back at the end if the track survives.
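A minimal sketch of that load/use/store-back pattern, for illustration only. `ToyRng`, `Track`, `TransportSketch`, `LaunchTransportSketch` and `kThreadsPerBlock` are hypothetical names, the xorshift generator merely stands in for the real (larger) RNG state, and the "physics" loop is a placeholder; none of this is the code in this PR.

```cuda
#include <cstdint>
#include <cuda_runtime.h>

// Hypothetical stand-in for the per-track RNG state (assumption: the real
// RNG state is larger, which is what makes its placement matter).
struct ToyRng {
  uint32_t state;
  __host__ __device__ uint32_t next() {
    // One xorshift32 step.
    state ^= state << 13;
    state ^= state >> 17;
    state ^= state << 5;
    return state;
  }
};

// Hypothetical track record with only the fields needed here.
struct Track {
  ToyRng rng;
  bool alive;
};

constexpr int kThreadsPerBlock = 128;

__global__ void TransportSketch(Track* tracks, int numTracks) {
  // One RNG slot per thread of the block, kept in shared memory.
  __shared__ ToyRng rngs[kThreadsPerBlock];

  const int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i >= numTracks) return;

  // Beginning of the kernel: copy the RNG from global to shared memory.
  // Each thread only touches its own slot, so no __syncthreads() is needed.
  ToyRng& rng = rngs[threadIdx.x];
  rng = tracks[i].rng;

  // Placeholder for the transport loop: draw a few numbers and "kill" the
  // track when a draw has its low byte equal to zero.
  bool survives = true;
  for (int step = 0; step < 4; ++step) {
    if ((rng.next() & 0xFFu) == 0u) {
      survives = false;
      break;
    }
  }

  // End of the kernel: store the RNG back only if the track survives.
  tracks[i].alive = survives;
  if (survives) tracks[i].rng = rng;
}

// Host helper. The block size must equal kThreadsPerBlock because the
// shared array above is sized for exactly that many threads.
void LaunchTransportSketch(Track* devTracks, int numTracks) {
  if (numTracks <= 0) return;
  const int blocks = (numTracks + kThreadsPerBlock - 1) / kThreadsPerBlock;
  TransportSketch<<<blocks, kThreadsPerBlock>>>(devTracks, numTracks);
}
```

The shared array costs `sizeof(ToyRng) * kThreadsPerBlock` bytes per block, so with a larger RNG state this competes with any other shared-memory use in the same kernel.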

Results of running example19 -particles 10000 -batch 5000 -gdml_file ./examples/Example14/macros/testEm3.gdml on the V100:

master (V100):
Mean: 4.00011
Stddev: 0.00030375

rng_sm (V100, only TransportElectrons):
Mean: 3.70078
Stddev: 0.00141616

rng_sm (V100, TransportElectrons and TransportGammas):
Mean: 3.94435
Stddev: 0.00170546

So, of the two transport kernels, it’s only beneficial to put the RNG in shared memory (SM) in TransportElectrons.

rng_sm (V100, TransportElectrons and electron interaction kernels):
Mean: 3.68766
Stddev: 0.000562445

rng_sm (V100, TransportElectrons and electron/gamma interaction kernels):
Mean: 3.65415
Stddev: 0.00167166

The benefit in the interaction kernels is significantly smaller. Furthermore, contention for the shared memory is increased, since SM is also needed to compact active tracks at the beginning of the interaction kernels.
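For a side-by-side view, the relative change of each mean against master can be worked out as below. This assumes that "Mean" is a mean run time and that lower is better; the unit is not stated above.

```latex
\Delta_{\text{rel}} = \frac{\bar{t}_{\text{rng\_sm}} - \bar{t}_{\text{master}}}{\bar{t}_{\text{master}}},
\qquad \bar{t}_{\text{master}} = 4.00011

\begin{aligned}
\text{TransportElectrons only:} \quad & \frac{3.70078 - 4.00011}{4.00011} \approx -7.5\,\% \\
\text{TransportElectrons and TransportGammas:} \quad & \frac{3.94435 - 4.00011}{4.00011} \approx -1.4\,\% \\
\text{TransportElectrons and electron interactions:} \quad & \frac{3.68766 - 4.00011}{4.00011} \approx -7.8\,\% \\
\text{TransportElectrons and electron/gamma interactions:} \quad & \frac{3.65415 - 4.00011}{4.00011} \approx -8.6\,\%
\end{aligned}
```

Under that reading, TransportElectrons alone accounts for about 7.5 of the roughly 8.6 percentage points, and the interaction kernels add about one more.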

Opinions on the SM usage for the interaction kernels?

Depends on:

Mark a track's survival more explicit #228

phsft-bot · 2022-10-07T12:26:01Z

Can one of the admins verify this patch?

hahnjo · 2022-10-10T09:17:46Z

> The benefit in the interaction kernels is significantly smaller. Furthermore, contention for the shared memory is increased, since SM is also needed to compact active tracks at the beginning of the interaction kernels.

As mentioned in #216 (comment), we can get rid of this super complex "compaction in shared memory" code and just use queues... Let me know if you want me to push my WIP code somewhere.
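A rough sketch of what a queue-based alternative to compaction could look like. This is not the WIP code mentioned above: `SlotQueue`, `Push`, `SelectInteracting`, `ProcessInteracting` and `RunQueueDemo` are hypothetical names, and the flags array stands in for whatever decides that a track needs an interaction.

```cuda
#include <cuda_runtime.h>

// Hypothetical fixed-capacity queue of track slot indices in global memory.
struct SlotQueue {
  int* slots;    // device array with `capacity` entries
  int* count;    // device counter of pushed entries
  int capacity;
};

// Append one slot index; returns false if the queue is already full.
__device__ bool Push(SlotQueue q, int slot) {
  const int pos = atomicAdd(q.count, 1);
  if (pos >= q.capacity) return false;
  q.slots[pos] = slot;
  return true;
}

// Transport-like step: tracks that need an interaction enqueue themselves.
__global__ void SelectInteracting(const int* needsInteraction, int numTracks, SlotQueue q) {
  const int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < numTracks && needsInteraction[i]) Push(q, i);
}

// Interaction-like step: one thread per queue entry, so the active threads
// are dense without compacting tracks in shared memory first.
__global__ void ProcessInteracting(SlotQueue q, int* processed) {
  const int k = blockIdx.x * blockDim.x + threadIdx.x;
  const int pushed = *q.count;
  const int n = pushed < q.capacity ? pushed : q.capacity;
  if (k < n) processed[q.slots[k]] = 1;
}

// Host driver: fills hostProcessed and returns the number of queued tracks.
int RunQueueDemo(const int* hostFlags, int numTracks, int* hostProcessed) {
  if (numTracks <= 0) return 0;
  const size_t bytes = static_cast<size_t>(numTracks) * sizeof(int);
  int *flags, *slots, *count, *processed;
  cudaMalloc(&flags, bytes);
  cudaMalloc(&slots, bytes);
  cudaMalloc(&count, sizeof(int));
  cudaMalloc(&processed, bytes);
  cudaMemcpy(flags, hostFlags, bytes, cudaMemcpyHostToDevice);
  cudaMemset(count, 0, sizeof(int));
  cudaMemset(processed, 0, bytes);

  const SlotQueue q{slots, count, numTracks};
  const int threads = 128;
  const int blocks = (numTracks + threads - 1) / threads;
  // Both kernels run on the default stream, so the second one sees the
  // complete queue written by the first.
  SelectInteracting<<<blocks, threads>>>(flags, numTracks, q);
  ProcessInteracting<<<blocks, threads>>>(q, processed);

  int hostCount = 0;
  cudaMemcpy(&hostCount, count, sizeof(int), cudaMemcpyDeviceToHost);
  cudaMemcpy(hostProcessed, processed, bytes, cudaMemcpyDeviceToHost);
  cudaFree(flags);
  cudaFree(slots);
  cudaFree(count);
  cudaFree(processed);
  return hostCount;
}
```

The order of entries in the queue is not deterministic, since it depends on which thread wins each `atomicAdd`; the sketch only relies on every queued slot being visited once.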

bernhardmgruber · 2022-10-10T20:49:45Z

> > The benefit in the interaction kernels is significantly smaller. Furthermore, contention for the shared memory is increased, since SM is also needed to compact active tracks at the beginning of the interaction kernels.
>
> As mentioned in #216 (comment), we can get rid of this super complex "compaction in shared memory" code and just use queues... Let me know if you want me to push my WIP code somewhere.

Sure, please push your changes somewhere and share them with us!

agheata · 2023-10-17T08:24:20Z

@bernhardmgruber is this PR still relevant? Is it worth using shared memory for other examples besides example19?

bernhardmgruber · 2023-11-14T15:24:48Z

@agheata it was totally worth it for example19 IIRC. It may be useful for other examples as well. I don't have the time to look into this anymore now that I am in the middle of my PhD writeup. So I leave this PR as a suggestion to you. Continue with it as you please.

bernhardmgruber mentioned this pull request Oct 7, 2022

Mark a track's survival more explicit #228

Merged

bernhardmgruber force-pushed the rng_sm branch from 99ab67a to 69beb8b October 13, 2022 20:37

bernhardmgruber added 3 commits October 31, 2022 20:18

Put RNG in shared memory in TransportElectrons

2c643b8

Put RNG in SM for electron interactions

51ab9db

Put RNG in SM for gamma interactions

232d6db

bernhardmgruber force-pushed the rng_sm branch from 69beb8b to 232d6db October 31, 2022 19:18

hahnjo mentioned this pull request Nov 1, 2022

Add new Example20 with interaction queues instead of SOAData #234

Merged
