-
Notifications
You must be signed in to change notification settings - Fork 395
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
prov/verbs: Fix data race vrb_open_ep function #10571
Conversation
e1b235a
to
06eeddb
Compare
Please change the commit message header to: |
Resolved a data race in the vrb_open_ep function of the verbs provider caused by concurrent modifications to the global variable vrb_ep_ops. This issue violated the FI_THREAD_SAFE threading model, leading to unpredictable behavior when creating endpoints from multiple threads. Signed-off-by: Piotr Chmiel <[email protected]>
Done. @shefty Could one of the maintainers kindly review this? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks - The current code is incorrect, and this looks like the right fix.
@shefty Thanks for the review! I appreciate your feedback, and I’m glad the change looks correct now. |
bot:aws:retest |
@piotrchmiel - This PR should eventually be merged. I restarted the AWS CI, but I don't believe the previous failure was related. |
we disabled verbs in aws ci, please ignore then |
@shijin-aws - Is there a way for it to ignore or skip the testing, but report success? I believe other CI's do this. |
@shefty That is a good call, we will evaluate it. |
How does other CI handle it as scripts? is there an example? |
See: libfabric/contrib/intel/jenkins/Jenkinsfile Line 323 in d2f7028
|
Race Fix: Resolved a data race in the vrb_open_ep function of the verbs provider caused by concurrent modifications to the global variable vrb_ep_ops. This issue violated the FI_THREAD_SAFE threading model, leading to unpredictable behavior when creating endpoints from multiple threads.
Detailed race description: #10569
Signed-off-by: Piotr Chmiel [email protected]