Commit

HPL-AI Public release of Version 2.3c
WuK authored Apr 18, 2021
1 parent 25dcf35 commit 36dbfc5
Showing 1 changed file with 10 additions and 3 deletions.
13 changes: 10 additions & 3 deletions NEWS
@@ -5,6 +5,8 @@
History

- 03/14/2021 HPL-AI Public release of Version 2.3a

(deprecated when n > 65536)

Junkang Huang added the parallel GMRES and matrix construction
part (<https://github.com/schuangs/hpl-ai-with-IR>) for HPL-AI.
@@ -23,7 +25,9 @@
-- [M] testing/Makefile.am

- 03/24/2021 HPL-AI Public release of Version 2.3b


(deprecated when n > 65536)

Kan Wu optimized the device blaspp routines through the
strategy of buffering memory: the program will check whether
the buffer size is sufficient at runtime, free and reallocate
@@ -33,7 +37,10 @@
-- [M] src/blas/HPLAI_blas.cc
-- [M] testing/ptest/HPLAI_pdinfo.cc

- 04/18/2021 HPL-AI Public release of Version 2.3c

Kan Wu fixed the bug when n > 65536.
-- [M] src/pgesv/HPLAI_pdgesv.cc

Kan Wu added an option to use the half-precision computeType
in cublasGemmEx (cuda@11: is required):
@@ -46,4 +53,4 @@

Kan Wu released the software at
<https://github.com/SYSU-SCC/sysu-scc-spack-repo>:
-- [M] testing/ptest/HPLAI_pdinfo.cc
