v6.1.2: Avoid AVX-512BW on Skylake-X
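The previous AVX-512 implementation of complex products used an extra ZMM register for swap_adjacent_vec. Moreover, it used the vpshufb instruction, which is available only with the Ice Lake capability and newer. The replacement uses _mm512_permute_ps and its double-precision variant, _mm512_permute_pd. Both require only AVX-512F and take the shuffle pattern as an immediate, so no extra register is needed for a control vector.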
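
As an illustration of how an adjacent-element swap can be written with these intrinsics, here is a minimal sketch. It is not the library's actual code: the helper names swap_adjacent_f32 and swap_adjacent_f64 are hypothetical, and the scalar fallback is there only so the snippet also builds without AVX-512F.

```c
#if defined(__AVX512F__)
#include <immintrin.h>
#endif

/* Hypothetical helper (not the library's API): swaps the real and imaginary
 * parts of 8 interleaved single-precision complex numbers (16 floats). */
static void swap_adjacent_f32(const float *in, float *out) {
#if defined(__AVX512F__)
    __m512 v = _mm512_loadu_ps(in);
    /* 0xB1 == _MM_SHUFFLE(2, 3, 0, 1): within each 128-bit lane the output
     * takes elements 1, 0, 3, 2. The pattern is an immediate, so no control
     * vector occupies a ZMM register. */
    _mm512_storeu_ps(out, _mm512_permute_ps(v, 0xB1));
#else
    /* Scalar fallback with the same result, for builds without AVX-512F. */
    for (int i = 0; i < 16; i += 2) {
        out[i] = in[i + 1];
        out[i + 1] = in[i];
    }
#endif
}

/* Hypothetical helper: the same swap for 4 interleaved double-precision
 * complex numbers (8 doubles). */
static void swap_adjacent_f64(const double *in, double *out) {
#if defined(__AVX512F__)
    __m512d v = _mm512_loadu_pd(in);
    /* 0x55: bit j of the immediate picks the high (1) or low (0) element of
     * the 128-bit lane for output j, so 01 01 01 01 swaps every pair. */
    _mm512_storeu_pd(out, _mm512_permute_pd(v, 0x55));
#else
    for (int i = 0; i < 8; i += 2) {
        out[i] = in[i + 1];
        out[i + 1] = in[i];
    }
#endif
}
```

Both permutes operate within 128-bit lanes, which is sufficient here because each swapped pair always sits inside a single lane.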