Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prov/efa: Create efa_ep interface #10705

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

shijin-aws
Copy link
Contributor

This PR contains two changes

  1. Migrate dgram/efa_dgram_ep.c to efa_ep.c as a common ep interface for both dgram and rdm ep type. dgram repo now has nothing and is removed.
  2. Extend efa ep interface to make it cover all the applied features that efa-rdm ep interface supports today. It also refactors and moves several internal efa_rdm_ep functions to efa_base_ep.c to cover both efa_direct and efa_rdm ep.

Refactor/changes are backed by corresponding unit tests.

@shijin-aws shijin-aws requested a review from a team January 16, 2025 00:11
prov/efa/src/efa_cntr.c Outdated Show resolved Hide resolved
prov/efa/src/efa_base_ep.c Outdated Show resolved Hide resolved
prov/efa/src/efa_base_ep.c Show resolved Hide resolved
prov/efa/src/efa_ep.c Show resolved Hide resolved
jiaxiyan
jiaxiyan previously approved these changes Jan 16, 2025
Migrate dgram/efa_dgram_ep.c to  efa_ep.c
as a common ep interface for both dgram and
rdm ep type. dgram repo now has nothing and
is removed.

Signed-off-by: Shi Jin <[email protected]>
Extend efa ep interface to make it cover all the
applied features that efa-rdm ep interface supports
today.

It also refactors and moves several internal efa_rdm_ep functions
to efa_base_ep.c to cover both efa_direct and efa_rdm ep.

Signed-off-by: Shi Jin <[email protected]>
Currently, efa_base_ep's default rnr_retry is 3 which only
does a few retry in the firmware level for RNR. This is
due to the efa_rdm_ep supports libfabric level RNR retry.
However, the efa-direct ep doesn't support libfabric
level RNR retry. Then we should make it do infinite
RNR retry (7), which is also the default behavior of
SRD QP.

Signed-off-by: Shi Jin <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants