Releases
v6.1.142
Algorithms
New features
Base implementation of class SynetDeconvolution16bGemm.
Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetDeconvolution16bNhwcGemm.
AMX-BF16 (AVX-512VBMI) optimizations of function DeinterleaveUv.
AMX-BF16 (AVX-512VBMI) optimizations of function DeinterleaveBgr.
AMX-BF16 (AVX-512VBMI) optimizations of function DeinterleaveBgra.
Improving
AVX-512BW optimizations of function ConvolutionDirectNhwcConvolutionBiasActivationDepthwise.
Removing
Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetConvolution32fBf16NhwcGemm.
Base implementation of class SynetConvolution32fBf16Gemm.
Parameter 'compatibility' from function SynetConvolution32fInit.
Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cdc.
Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cd.
Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Dc.
Base implementation of class SynetMergedConvolution32fBf16.
Parameter 'compatibility' from function SynetMergedConvolution32fInit.
Test framework
New features
Tests for verifying functionality of SynetDeconvolution16b framework.
You can’t perform that action at this time.