From 36dbfc5dc91e56a485bee8fabe26bd044b5a6c64 Mon Sep 17 00:00:00 2001 From: WuK Date: Mon, 19 Apr 2021 03:48:51 +0800 Subject: [PATCH] HPL-AI Public release of Version 2.3c --- NEWS | 13 ++++++++++--- 1 file changed, 10 insertions(+), 3 deletions(-) diff --git a/NEWS b/NEWS index fd622c1..d8a3983 100644 --- a/NEWS +++ b/NEWS @@ -5,6 +5,8 @@ History - 03/14/2021 HPL-AI Public release of Version 2.3a + + (deperated when n > 65536) Junkang Huang added the parallel GMRES and matrix construction part () for HPL-AI. @@ -23,7 +25,9 @@ -- [M] testing/Makefile.am - 03/24/2021 HPL-AI Public release of Version 2.3b - + + (deperated when n > 65536) + Kan Wu optimized the device blaspp routines through the strategy of buffering memory: the program will check whether the buffer size is sufficient at runtime, free and reallocate @@ -33,7 +37,10 @@ -- [M] src/blas/HPLAI_blas.cc -- [M] testing/ptest/HPLAI_pdinfo.cc - - 04/18/2021 HPL-AI Public release of Version 2.3c + - 04/18/2021 HPL-AI Public release of Version 2.3c + + Kan Wu fixed the bug when n > 65536. + -- [M] src/pgesv/HPLAI_pdgesv.cc Kan Wu added an option to use the half-precision computeType in cublasGemmEx (cuda@11: is required): @@ -46,4 +53,4 @@ Kan Wu release the software at : - -- [M] testing/ptest/HPLAI_pdinfo.cc \ No newline at end of file + -- [M] testing/ptest/HPLAI_pdinfo.cc