x86emul/test: encourage compiler to use more embedded broadcast
authorJan Beulich <jbeulich@suse.com>
Fri, 10 Jun 2022 08:24:21 +0000 (10:24 +0200)
committerJan Beulich <jbeulich@suse.com>
Fri, 10 Jun 2022 08:24:21 +0000 (10:24 +0200)
commitb8bc4588b32e8a40354defac29ceb9c90e570af8
tree969408f216113593e8f3f6254980ff9b2b897485
parent2730650c9cd65c1cb2412b480ee1f51748bc1445
x86emul/test: encourage compiler to use more embedded broadcast

For one it was an oversight to leave dup_{hi,lo}() undefined for 512-bit
vector size. And then in FMA testing we can also arrange for the
compiler to (hopefully) recognize broadcasting potential. Plus we can
replace the broadcast(1) use in the addsub() surrogate with inline
assembly explicitly using embedded broadcast (even gcc12 still doesn't
support broadcast for any of the addsub/subadd builtins).

Signed-off-by: Jan Beulich <jbeulich@suse.com>
Acked-by: Andrew Cooper <andrew.cooper3@citrix.com>
tools/tests/x86_emulator/simd-fma.c
tools/tests/x86_emulator/simd.c
tools/tests/x86_emulator/simd.h