openssl

Author	SHA1	Message	Date
Andy Polyakov	d5487a454c	chacha/asm/chacha-x86_64.pl: add dedicated path for 128-byte inputs. The 128-byte vectors are extensively used in chacha20_poly1305_tls_cipher and dedicated code path is ~30-50% faster on most platforms. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/6626)	2018-07-03 19:02:02 +02:00
Andy Polyakov	cded951378	chacha/asm/chacha-x86_64.pl: add AVX512VL code path. 256-bit AVX512VL was estimated to deliver ~50% improvement over AVX2 and it did live up to the expectations. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4838)	2017-12-08 12:57:49 +01:00
Andy Polyakov	47c9926a92	chacha/asm/chacha-x86_64.pl: fix sporadic crash in AVX512 code path. Only chacha_internal_test is affected, since this path is not used from EVP. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4758)	2017-11-25 22:08:17 +01:00
Andy Polyakov	64d92d7498	x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results. "Optimize" is in quotes because it's rather a "salvage operation" for now. Idea is to identify processor capability flags that drive Knights Landing to suboptimal code paths and mask them. Two flags were identified, XSAVE and ADCX/ADOX. Former affects choice of AES-NI code path specific for Silvermont (Knights Landing is of Silvermont "ancestry"). And 64-bit ADCX/ADOX instructions are effectively mishandled at decode time. In both cases we are looking at ~2x improvement. AVX-512 results cover even Skylake-X :-) Hardware used for benchmarking courtesy of Atos, experiments run by Romain Dolbeau <romain.dolbeau@atos.net>. Kudos! Reviewed-by: Rich Salz <rsalz@openssl.org>	2017-07-21 14:07:32 +02:00
Andy Polyakov	54f8f9a1ed	x86_64 assembly pack: fill some blanks in Ryzen results. Reviewed-by: Bernd Edlinger <bernd.edlinger@hotmail.de>	2017-07-03 18:17:00 +02:00
Andy Polyakov	6cbfd94d08	x86_64 assembly pack: add some Ryzen performance results. Reviewed-by: Tim Hudson <tjh@openssl.org>	2017-03-22 10:58:01 +01:00
Andy Polyakov	f17652e5f9	chacha/asm/chacha-x86_64.pl: add CFI annotations. Reviewed-by: Rich Salz <rsalz@openssl.org>	2017-02-26 21:26:06 +01:00
Andy Polyakov	384e6de4c7	x86_64 assembly pack: Win64 SEH face-lift. - harmonize handlers with guidelines and themselves; - fix some bugs in handlers; - add missing handlers in chacha and ecp_nistz256 modules; Reviewed-by: Rich Salz <rsalz@openssl.org>	2017-02-06 08:21:42 +01:00
Andy Polyakov	3c274a6e20	chacha/asm/chacha-x86_64.pl: add AVX512 path optimized for shorter inputs. Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-12-25 16:31:40 +01:00
Andy Polyakov	a30b0522cb	x86 assembly pack: update performance results. Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-12-19 16:18:25 +01:00
Andy Polyakov	1ea01427c5	poly1305/asm/poly1305-x86_64.pl: allow nasm to assemble AVX512 code. chacha/asm/chacha-x86_64.pl: refine nasm version detection logic. Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-12-15 17:57:50 +01:00
Andy Polyakov	abb8c44fba	x86_64 assembly pack: add AVX512 ChaCha20 and Poly1305 code paths. Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-12-12 10:58:04 +01:00
Andy Polyakov	ace05265d2	x86_64 assembly pack: add Goldmont performance results. Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-10-24 13:01:13 +02:00
Andy Polyakov	cfe1d9929e	x86_64 assembly pack: tolerate spaces in source directory name. [as it is now quoting $output is not required, but done just in case] Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-05-29 14:12:51 +02:00
Rich Salz	6aa36e8e5a	Add OpenSSL copyright to .pl files Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-05-21 08:23:39 -04:00
Andy Polyakov	f218822871	chacha/asm/chacha-*.pl: fix typos in tail processing. RT#4323 Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-02-27 21:09:02 +01:00
Andy Polyakov	622a531c18	chacha/asm/chacha*: ensure that zero length is handled (without crash). RT#4305 Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-02-14 21:22:42 +01:00
Andy Polyakov	29880e9710	chacha/asm/chacha-x86[_64].pl: fix typos and logical errors. Thanks to: David Benjamin of Chromium. RT#4305 Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-02-14 21:03:10 +01:00
Andy Polyakov	a98c648e40	x86[_64] assembly pack: add ChaCha20 and Poly1305 modules. Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-02-10 10:31:14 +01:00

19 commits