Simd v4.2.73

ermig1979 released this 02 Jan 06:19
· 2251 commits to master since this release

Algorithms

New features
  • Base implementation, SSE, AVX and AVX-512F optimizations of function FusedLayerForward3.
  • Base implementation, SSE, AVX and AVX-512F optimizations of function ConvolutionBiasAndActivation (NHWC mode).
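
For readers unfamiliar with the NHWC layout mentioned above: channels are the innermost (fastest-varying) dimension, so a per-channel bias is applied along contiguous memory. The following is a minimal scalar sketch of that idea, not code from the library. The names NhwcIndex and AddBiasReluNhwc are hypothetical, and ReLU is assumed as the activation purely for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of the Simd API):
// linear offset of element (n, h, w, c) in a tensor of shape [N, H, W, C].
inline std::size_t NhwcIndex(std::size_t n, std::size_t h, std::size_t w, std::size_t c,
                             std::size_t H, std::size_t W, std::size_t C)
{
    return ((n * H + h) * W + w) * C + c;
}

// Hypothetical scalar reference (not part of the Simd API):
// adds a per-channel bias and applies ReLU in place on an NHWC tensor.
inline void AddBiasReluNhwc(float* data, const float* bias,
                            std::size_t N, std::size_t H, std::size_t W, std::size_t C)
{
    const std::size_t pixels = N * H * W;
    for (std::size_t p = 0; p < pixels; ++p)
    {
        float* row = data + p * C; // the C channels of one pixel are contiguous
        for (std::size_t c = 0; c < C; ++c)
            row[c] = std::max(0.0f, row[c] + bias[c]);
    }
}
```

Because the inner loop runs over contiguous channels, this layout lends itself to loading several channels and the matching bias values with a single vector instruction.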
Improvements
  • SSE, AVX, AVX2 and AVX-512F optimizations of function Gemm32fNN.
  • Added output parameter 'internal' to function ConvolutionSetWeight.
Bug fixes
  • Wrong assert condition in AVX-512F optimization of function NeuralRelu.
  • Visual Studio 2017 compiler error (intrinsic _mm512_maskz_loadu_epi8 in Release mode).
  • Crash caused by reading unaligned memory in AVX-512BW optimization of function HogLiteFilterFeatures.
  • Performance bug in functions SynetAddBias, SynetFusedLayerForwardX, SynetPreluLayerForward and SynetScaleLayerForward when count = 1 and trans = 1.

Test framework

New features
  • Tests for verifying functionality of function FusedLayerForward3.