64d92d7498
"Optimize" is in quotes because it's rather a "salvage operation" for now. Idea is to identify processor capability flags that drive Knights Landing to suboptimial code paths and mask them. Two flags were identified, XSAVE and ADCX/ADOX. Former affects choice of AES-NI code path specific for Silvermont (Knights Landing is of Silvermont "ancestry"). And 64-bit ADCX/ADOX instructions are effectively mishandled at decode time. In both cases we are looking at ~2x improvement. AVX-512 results cover even Skylake-X :-) Hardware used for benchmarking courtesy of Atos, experiments run by Romain Dolbeau <romain.dolbeau@atos.net>. Kudos! Reviewed-by: Rich Salz <rsalz@openssl.org> |
||
---|---|---|
.. | ||
poly1305-armv4.pl | ||
poly1305-armv8.pl | ||
poly1305-c64xplus.pl | ||
poly1305-mips.pl | ||
poly1305-ppc.pl | ||
poly1305-ppcfp.pl | ||
poly1305-s390x.pl | ||
poly1305-sparcv9.pl | ||
poly1305-x86.pl | ||
poly1305-x86_64.pl |