Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

btl/uct: reduce number of messages sent when establishing connections #13018

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

hjelmn
Copy link
Member

@hjelmn hjelmn commented Jan 5, 2025

The btl/uct code can be quite aggressive at sends connection messages over the connection endpoint. This could lead to a large number of unnecessary messages in some cases. This commit adds code to restrict the retry rate to 2ms. This timing is controlled by a new MCA variable: btl_uct_connection_retry_timeout.

The btl/uct code can be quite aggressive at sends connection messages over the
connection endpoint. This could lead to a large number of unnecessary messages
in some cases. This commit adds code to restrict the retry rate to 2ms. This
timing is controlled by a new MCA variable: btl_uct_connection_retry_timeout.

Signed-off-by: Nathan Hjelm <[email protected]>
@hppritcha
Copy link
Member

is there a simple test which demonstrates the problem that this PR is addressing?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants