openssl/crypto/modes/asm
Andy Polyakov 64d92d7498 x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results.
"Optimize" is in quotes because it's rather a "salvage operation"
for now. Idea is to identify processor capability flags that
drive Knights Landing to suboptimial code paths and mask them.
Two flags were identified, XSAVE and ADCX/ADOX. Former affects
choice of AES-NI code path specific for Silvermont (Knights Landing
is of Silvermont "ancestry"). And 64-bit ADCX/ADOX instructions are
effectively mishandled at decode time. In both cases we are looking
at ~2x improvement.

AVX-512 results cover even Skylake-X :-)

Hardware used for benchmarking courtesy of Atos, experiments run by
Romain Dolbeau <romain.dolbeau@atos.net>. Kudos!

Reviewed-by: Rich Salz <rsalz@openssl.org>
2017-07-21 14:07:32 +02:00
..
aesni-gcm-x86_64.pl x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results. 2017-07-21 14:07:32 +02:00
ghash-alpha.pl
ghash-armv4.pl Remove trailing whitespace from some files. 2016-10-10 23:36:21 +01:00
ghash-c64xplus.pl
ghash-ia64.pl
ghash-parisc.pl
ghash-s390x.pl Remove trailing whitespace from some files. 2016-10-10 23:36:21 +01:00
ghash-sparcv9.pl SPARC assembly pack: enforce V8+ ABI constraints. 2016-07-01 14:25:08 +02:00
ghash-x86.pl Remove filename argument to x86 asm_init. 2017-05-11 17:00:23 -04:00
ghash-x86_64.pl x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results. 2017-07-21 14:07:32 +02:00
ghashp8-ppc.pl modes/asm/ghashp8-ppc.pl: improve performance by 2.7x. 2016-06-14 23:28:39 +02:00
ghashv8-armx.pl ARMv4 assembly pack: harmonize Thumb-ification of iOS build. 2017-02-15 23:16:01 +01:00