-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KalmanFitterError:1 results in segmentation violation v37.4.0 and above #4028
Comments
Which workflow did you run here @plariono
But the segfault should definitely not happen. Looks like we do an out of bounds array access |
Hi @andiwand, I'm running the full chain with high multiplicity events, Pythia + Fatras + CKF + Greedy Solver for the ALICE 3 geometry as usual. |
The SEGFAULT indeed seem to come from inside the Track EDM from an out-of-bounds vector access. I'm wondering if we produce incomplete track candidates or track states that are not handled correctly later on. |
@andiwand It seems like in the CKF we access the tip index from the branch state: acts/Core/include/Acts/TrackFinding/CombinatorialKalmanFilter.hpp Lines 772 to 773 in 0932329
The I'm not sure if the SEGFAULT occurs in the same call to the track finding that exhibits the magnetic field error, or if the track container gets corrupted somehow and then fails on a subsequent run. |
But that means the precondition for the |
I guess so. I'm wondering: could it be that the first CKF invokation in the track finding algo fails, and then the second invokation produces the SEGFAULT? |
Do we carry on with error'ed branches in the CKF? EDIT: Seems like the CKF actor consumes the errors and only sets a "last error" variable but, if I read it right, does not terminate the propagation. Even given that, it's unclear to me how this corrupts the track container, in the sense that the back of |
If the we encounter an error in the propagation or in the CKF the propagation should be stopped. In case of CKF errors here
If the first pass of the CKF does error we should not continue in the track finding because of
I suspect that this is something that only happens with branching because we have not encountered a segfault in Athena or Acts standalone where we do not branch. I had problems in the past with the branching logic because it is quite involved. |
@plariono Could you run the same crashing job with the track finding verbosity set to |
I would recommend only running the offending seed. otherwise the logfile will be quite overwhelming |
@paulgessinger @andiwand thanks for the suggestions. How can I run only the problematic seed? |
Also, I did run the setup without the field map and the crash did happen.
|
@plariono I think the fact that it also crashes without magnetic field is just a consequence of the fact that the underlying issue is a navigation failure, which ultimately leads to the SEGFAULT. I think the exact seed is a bit tricky, but you should probably at least be able to restrict it to the failing event in question by using the |
This is to report that at least in v37.4.0 and above the
KalmanFitterError:1
error crashes the reconstruction.This happens for the following systems:
source CI/setup_cvmfs_lcg.sh
: LCG_105/x86_64-el9-gcc13-optCrash log.
This was not happening in v36 from 18th of September '24:
The text was updated successfully, but these errors were encountered: