Skip to content

v4.3.0rc1

Pre-release
Pre-release
Compare
Choose a tag to compare
@raffenet raffenet released this 17 Dec 18:24
· 49 commits to main since this release
f763d57

Changes in 4.3

  • Support MPI memory allocation kinds side document.
  • Support MPI ABI Proposal. Configure with --enable-mpi-abi and build with
    mpicc_abi. By default, mpicc still builds and links with MPICH ABI.
  • Experimental API MPIX_Op_create_x. It supports user callback function with
    extra_state context and op destructor callback. It supports language bindings
    to use proxy function for language-specific user callbacks.
  • Experimental API MPIX_{Comm,File,Session,Win}_create_errhandler_x. They allow
    user error handlers to have extra_state context and corresponding destructor.
    This allows language bindings to implement user error handlers via proxy.
  • Experimental API MPIX_Request_is_complete. This is a pure request state query
    function that will not invoke progress, nor will free the request. This should
    help applications that want separate task dependency checking from progress
    engine to avoid progress contentions, especially in multi-threaded context.
    It is also useful for tools to profile non-deterministic calls such as
    MPI_Test.
  • Experimental API MPIX_Async_start. This function let applications to inject
    progress hooks to MPI progress. It allows application to implement custom
    asynchronous operations that will be progressed by MPI. It avoids having to
    implement separate progress mechanisms that may either take additional
    resource or contend with MPI progress and negatively impact performance. It
    also allows applications to create custom MPI operations, such as MPI
    nonblocking collectives, and achieve near native performance.
  • Added benchmark tests test/mpi/bench/p2p_{latency,bw}.
  • Added CMA support in CH4 IPC.
  • Added IPC read algorithm for intranode Allgather and Allgatherv.
  • Added CVAR MPIR_CVAR_CH4_SHM_POSIX_TOPO_ENABLE to enable non-temporal memcpy
    for inter-numa shm communication.
  • Added CVAR MPIR_CVAR_DEBUG_PROGRESS_TIMEOUT for debugging MPI deadlock issues.
  • ch4:ucx now supports dynamic processes. MPI_Comm_spawn{_multiple} will work.
    MPI_Open_port will fail due to ucx port name exceeds current MPI_MAX_PORT_NAME
    of 256. One can work around by use an info hint "port_name_size" and use a
    larger port name buffer.
  • PMI-1 defines PMI_MAX_PORT_NAME, which may be different from MPI_MAX_PORT_NAME.
    This is used by "PMI_Lookup_name". Consequently, MPI_Lookup_name accepts info
    hint "port_name_size" that may be larger than MPI_MAX_PORT_NAME. If the port
    name does not fit in "port_name_size", it will return a truncation error.
  • Autogen default to use -yaksa-depth=2.
  • Default MPIR_CVAR_CH4_ROOTS_ONLY_PMI to on.
  • Added ch4 netmod API am_tag_send and am_tag_recv.
  • Added MPIR_CVAR_CH4_OFI_EAGER_THRESHOLD to force RNDV send mode.
  • Make check target will run ROMIO tests.