CPU integration of G4HepEM using the tracking manager #316

JuanGonzalezCaminero · 2024-10-15T14:39:04Z

This PR shows how to use G4HepEM through the TrackingManager instead of the G4 process. However, it can be used only to run on CPU, and will not work with AdePT since one TrackingManager will replace the other.

In order to be able to run G4HepEM with tracking manager on full CPU and CPU + AdePT modes we would need:

Two separate physics constructors, one registering the G4HepEM tracking manager for full CPU mode, and one registering the G4HepEM process and the AdePT tracking manager
- However there seems to be a discrepancy in the results from the G4HepEM tracking manager and G4 process
To implement a hybrid AdePT/G4HepEM tracking manager, removing the need to register the G4HepEM process
- The current AdePT tracking manager would remain in case the user wants to use something other than G4HepEM on CPU

phsft-bot · 2024-10-15T14:41:02Z

Can one of the admins verify this patch?

hahnjo · 2024-10-15T15:08:26Z

However there seems to be a discrepancy in the results from the G4HepEM tracking manager and G4 process

Yes, the tracking manager does not produce bit-identical results. That's because of some optimizations (internal MSC stepping, which can be turned off) and also because we do the navigation directly, with potentially different rounding than G4Transportation.

JuanGonzalezCaminero · 2024-10-16T08:21:43Z

I see, that's good to know. I have done a run of 32 ttbar events in CMS, the average difference in total edep is around 3%, though the highest was 7%. The speedup from using the tracking manager was 1.09.
Are these numbers what we expect?

hahnjo · 2024-10-16T16:05:48Z

I see, that's good to know. I have done a run of 32 ttbar events in CMS, the average difference in total edep is around 3%, though the highest was 7%. The speedup from using the tracking manager was 1.09. Are these numbers what we expect?

In principle we don't expect a difference in energy deposit... Are 32 ttbar events enough to get stable numbers for validation? 10% speedup from the tracking manager sounds in line with expectations.

This PR introduces the required changes needed to use the latest version of G4HepEm. In G4HepEm the representation for the cross sections changed in this [PR](mnovak42/g4hepem#115). Additionally, the e-+/gamma-nuclear reactions were added in G4HepEm. This PR 1. uses the new function `G4HepEmGammaManager::SampleInteraction` and fixes the numIALeft. **_Note_**: that since we don't register the gamma/lepton nuclear processes in the physics list they are currently ignored by AdePT and G4HepEm. 2. adopts the TrackingManager approach by using a derived class of the G4HepEmTrackingManager. This supersedes #316. The G4HepEmTrackingManagerSpecialized is instantiated as a member of the AdePTTrackingmanager and it takes care of all CPU propagation and cleans up the propagation loop. 3. adds a direct conversion of the Geant4 NavigationHistory to a VecGeom NavigationState. This cleans up many problems at boundaries that can become costly for edges of GPU regions, where particles are frequently moved between CPU and GPU, or very tangent particles could get stuck. This reduces the run time for 1e6 10 GeV electrons being shot at testEm3 where all layers are GPU regions (but the calorimeter is not) from Before: Run time: 1582.26 Now: Run time: 1281.32 4. cleans up the AdePT physics. Now `FTFP_BERT_AdePT` derives from `G4EmStandardPhysics`. Currently, the gamma/lepton nuclear processes are not registered in the PhysicsList. However, if they are registered, G4HepEm treats them correctly, while AdePT just ignores them. For electrons, they are suppressed by an infinite interaction length, for gamma (since they are handled internally in G4HepEm), we simply do a empty step. There is a implementation commented out that re-draws in case a gamma-nuclear reaction is drawn. It re-draws up to 3 times and in the (unlikely, but possible) case that we still end up with gamma-nuclear, we do a redundant step limited by gamma-nuclear but not invoking any process. However, using this changes the physics by ~ 1 % at the tail, leading to worse agreement, therefore we for now accept the performance penalty of empty steps. **Testing:** For the testing, different variations with testEM3 where tested: 1. making everything a GPU region via `/adept/setTrackInAllRegions true`. 2. making all layers a GPU region (but not the encompassing `Calorimeter` region. 3. using 3 different regions that include various layers, without specific order 4. using no regions at all. These settings were tested against running with the `--noadept` option, which uses the `FTFP_BERT_HepEm` physics list. All 4 configurations were tested with this PR and with the `master` branch. **Outcome:** This PR shows excellent agreement between AdePT and G4 with G4HepEm: (the plot is for making everything a GPU region) <img width="577" alt="Screenshot 2025-01-01 at 16 58 52" src="https://github.com/user-attachments/assets/13688705-c99c-4714-b523-cb24128d097b" /> It is slightly improved to the `master` branch: <img width="575" alt="Screenshot 2025-01-01 at 17 00 27" src="https://github.com/user-attachments/assets/265b61f6-df42-479a-8a56-1430b06a548a" /> No major differences are observed using 2. all layers being regions, 3. using only certain regions, and 4. using no regions. 2. All layers GPU: <img width="576" alt="Screenshot 2025-01-01 at 18 53 07" src="https://github.com/user-attachments/assets/8359ad1e-276d-43d6-8271-87691c128df1" /> 3. Various GPU regions: <img width="568" alt="Screenshot 2025-01-01 at 18 53 48" src="https://github.com/user-attachments/assets/6029fb71-8804-43b7-abcb-31088016db54" /> 4. No GPU regions: <img width="568" alt="Screenshot 2025-01-01 at 18 54 21" src="https://github.com/user-attachments/assets/7854d0e4-b048-4026-bd5c-82ba41bb88d0" /> In the `master`, 3. gets stuck and never finishes. It could still be off in the plot above, to be investigated with higher statistics. Since it didn't work at all in the `master` this is still a significant improvement. The overall run time for G4 + HepEm in our example is improved using the tracking manager approach: From `Run time: 2842.99 s` (master) to `Run time: 2255.76 s` using this PR (16 threads, 1e6 10 GeV electrons being shot at testEm3) Additionally the energy deposition of CMS was checked and found to be at good agreement. The run time for CMS is similar, maybe slightly improved using AdePT everywhere. To Do: - [x] Proper validation after #326 is merged. - [x] Validate run with CMS. - [x] After [this PR](mnovak42/g4hepem#117) is merged in G4HepEm, tag that version of G4HepEm and update CI - [x] implement changes also in split kernels - [x] validate with split kernels (validate kernels give roughly the same result but not to machine precision anymore, however, before this PR it was not compiling anymore). NOTE: These are breaking changes, as older versions of G4HepEm cannot be used anymore.

JuanGonzalezCaminero added 3 commits December 9, 2024 11:50

Using the hepem tracking manager on CPU

beaa6b1

Temp

ee621f6

Activate G4HepEM Tracking manager

cc2e232

JuanGonzalezCaminero force-pushed the hepem_trackingmanager branch from fdb01f3 to cc2e232 Compare December 9, 2024 10:50

SeverinDiederichs mentioned this pull request Dec 29, 2024

upgrade to latest G4hepem #324

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CPU integration of G4HepEM using the tracking manager #316

CPU integration of G4HepEM using the tracking manager #316

JuanGonzalezCaminero commented Oct 15, 2024 •

edited

Loading

phsft-bot commented Oct 15, 2024

hahnjo commented Oct 15, 2024

JuanGonzalezCaminero commented Oct 16, 2024

hahnjo commented Oct 16, 2024

CPU integration of G4HepEM using the tracking manager #316

Are you sure you want to change the base?

CPU integration of G4HepEM using the tracking manager #316

Conversation

JuanGonzalezCaminero commented Oct 15, 2024 • edited Loading

phsft-bot commented Oct 15, 2024

hahnjo commented Oct 15, 2024

JuanGonzalezCaminero commented Oct 16, 2024

hahnjo commented Oct 16, 2024

JuanGonzalezCaminero commented Oct 15, 2024 •

edited

Loading