openssl

Author	SHA1	Message	Date
David Benjamin	2086edb799	Fix some CFI issues in x86_64 assembly The add/double shortcut in ecp_nistz256-x86_64.pl left one instruction point that did not unwind, and the "slow" path in AES_cbc_encrypt was not annotated correctly. For the latter, add .cfi_{remember,restore}_state support to perlasm. Next, fill in a bunch of functions that are missing no-op .cfi_startproc and .cfi_endproc blocks. libunwind cannot unwind those stack frames otherwise. Finally, work around a bug in libunwind by not encoding rflags. (rflags isn't a callee-saved register, so there's not much need to annotate it anyway.) These were found as part of ABI testing work in BoringSSL. Reviewed-by: Richard Levitte <levitte@openssl.org> GH: #8109 (cherry picked from commit `c0e8e5007b`)	2019-02-17 23:41:11 +01:00
Matt Caswell	1212818eb0	Update copyright year Reviewed-by: Richard Levitte <levitte@openssl.org> (Merged from https://github.com/openssl/openssl/pull/7176)	2018-09-11 13:45:17 +01:00
Andy Polyakov	ce5eb5e814	modes/asm/ghash-armv4.pl: address "infixes are deprecated" warnings. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/6615)	2018-07-01 11:51:44 +02:00
Andy Polyakov	1753d12374	PA-RISC assembly pack: make it work with GNU assembler for HP-UX. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/6583)	2018-06-25 16:45:48 +02:00
Andy Polyakov	41013cd63c	PPC assembly pack: correct POWER9 results. As it turns out originally published results were skewed by "turbo" mode. VM apparently remains oblivious to dynamic frequency scaling, and reports that processor operates at "base" frequency at all times. While actual frequency gets increased under load. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/6406)	2018-06-03 21:20:06 +02:00
Matt Caswell	83cf7abf8e	Update copyright year Reviewed-by: Richard Levitte <levitte@openssl.org> (Merged from https://github.com/openssl/openssl/pull/6371)	2018-05-29 13:16:04 +01:00
Andy Polyakov	13f6857db1	PPC assembly pack: add POWER9 results. Reviewed-by: Rich Salz <rsalz@openssl.org>	2018-05-10 11:44:21 +02:00
Matt Caswell	6ec5fce25e	Update copyright year Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/6145)	2018-05-01 13:34:30 +01:00
Andy Polyakov	198a2ed791	ARM assembly pack: make it work with older assembler. Reviewed-by: Richard Levitte <levitte@openssl.org> Reviewed-by: Paul Dale <paul.dale@oracle.com> (Merged from https://github.com/openssl/openssl/pull/6043)	2018-04-23 17:29:59 +02:00
Andy Polyakov	603ebe0352	modes/asm/ghashv8-armx.pl: handle lengths not divisible by 4x. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4830)	2017-12-04 17:21:23 +01:00
Andy Polyakov	aa7bf31698	modes/asm/ghashv8-armx.pl: optimize modulo-scheduled loop. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4830)	2017-12-04 17:21:20 +01:00
Andy Polyakov	9ee020f8dc	modes/asm/ghashv8-armx.pl: modulo-schedule loop. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4830)	2017-12-04 17:21:15 +01:00
Andy Polyakov	7ff2fa4b92	modes/asm/ghashv8-armx.pl: implement 4x aggregate factor. This initial commit is unoptimized reference version that handles input lengths divisible by 4 blocks. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4830)	2017-12-04 17:20:25 +01:00
Andy Polyakov	7533162322	ARMv8 assembly pack: add Qualcomm Kryo results. [skip ci] Reviewed-by: Tim Hudson <tjh@openssl.org>	2017-11-13 11:13:00 +01:00
Josh Soref	46f4e1bec5	Many spelling fixes/typo's corrected. Around 138 distinct errors found and fixed; thanks! Reviewed-by: Kurt Roeckx <kurt@roeckx.be> Reviewed-by: Tim Hudson <tjh@openssl.org> Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/3459)	2017-11-11 19:03:10 -05:00
Patrick Steuer	bc4e831ccd	s390x assembly pack: extend s390x capability vector. Extend the s390x capability vector to store the longer facility list available from z13 onwards. The bits indicating the vector extensions are set to zero, if the kernel does not enable the vector facility. Also add capability bits returned by the crypto instructions' query functions. Signed-off-by: Patrick Steuer <patrick.steuer@de.ibm.com> Reviewed-by: Andy Polyakov <appro@openssl.org> Reviewed-by: Tim Hudson <tjh@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4542)	2017-10-30 14:31:32 +01:00
Patrick Steuer	af1d638730	s390x assembly pack: remove capability double-checking. An instruction's QUERY function is executed at initialization, iff the required MSA level is installed. Therefore, it is sufficient to check the bits returned by the QUERY functions. The MSA level does not have to be checked at every function call. crypto/aes/asm/aes-s390x.pl: The AES key schedule must be computed if the required KM or KMC function codes are not available. Formally, the availability of a KMC function code does not imply the availability of the corresponding KM function code. Signed-off-by: Patrick Steuer <patrick.steuer@de.ibm.com> Reviewed-by: Andy Polyakov <appro@openssl.org> Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/4501)	2017-10-17 21:55:33 +02:00
Rich Salz	e3713c365c	Remove email addresses from source code. Names were not removed. Some comments were updated. Replace Andy's address with openssl.org Reviewed-by: Andy Polyakov <appro@openssl.org> Reviewed-by: Paul Dale <paul.dale@oracle.com> (Merged from https://github.com/openssl/openssl/pull/4516)	2017-10-13 10:06:59 -04:00
Andy Polyakov	64d92d7498	x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results. "Optimize" is in quotes because it's rather a "salvage operation" for now. Idea is to identify processor capability flags that drive Knights Landing to suboptimial code paths and mask them. Two flags were identified, XSAVE and ADCX/ADOX. Former affects choice of AES-NI code path specific for Silvermont (Knights Landing is of Silvermont "ancestry"). And 64-bit ADCX/ADOX instructions are effectively mishandled at decode time. In both cases we are looking at ~2x improvement. AVX-512 results cover even Skylake-X :-) Hardware used for benchmarking courtesy of Atos, experiments run by Romain Dolbeau <romain.dolbeau@atos.net>. Kudos! Reviewed-by: Rich Salz <rsalz@openssl.org>	2017-07-21 14:07:32 +02:00
Rich Salz	28f298e70a	Undo commit `cd359b2` Original text: Clarify use of \|$end0\| in stitched x86-64 AES-GCM code. There was some uncertainty about what the code is doing with \|$end0\| and whether it was necessary for \|$len\| to be a multiple of 16 or 96. Hopefully these added comments make it clear that the code is correct except for the caveat regarding low memory addresses. Change-Id: Iea546a59dc7aeb400f50ac5d2d7b9cb88ace9027 Reviewed-on: https://boringssl-review.googlesource.com/7194 Reviewed-by: Adam Langley <agl@google.com> Reviewed-by: Richard Levitte <levitte@openssl.org> Reviewed-by: Tim Hudson <tjh@openssl.org> (Merged from https://github.com/openssl/openssl/pull/3700)	2017-07-05 17:06:57 -04:00
David Benjamin	e195c8a256	Remove filename argument to x86 asm_init. The assembler already knows the actual path to the generated file and, in other perlasm architectures, is left to manage debug symbols itself. Notably, in OpenSSL 1.1.x's new build system, which allows a separate build directory, converting .pl to .s as the scripts currently do result in the wrong paths. This also avoids inconsistencies from some of the files using $0 and some passing in the filename. Reviewed-by: Richard Levitte <levitte@openssl.org> Reviewed-by: Andy Polyakov <appro@openssl.org> (Merged from https://github.com/openssl/openssl/pull/3431)	2017-05-11 17:00:23 -04:00
Andy Polyakov	c93f06c12f	ARMv4 assembly pack: harmonize Thumb-ification of iOS build. Three modules were left behind in `a285992763`. Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/2617)	2017-02-15 23:16:01 +01:00
Andy Polyakov	5c72e5ea7a	modes/asm/*-x86_64.pl: add CFI annotations. Reviewed-by: Rich Salz <rsalz@openssl.org>	2017-02-13 14:14:24 +01:00
Andy Polyakov	384e6de4c7	x86_64 assembly pack: Win64 SEH face-lift. - harmonize handlers with guidelines and themselves; - fix some bugs in handlers; - add missing handlers in chacha and ecp_nistz256 modules; Reviewed-by: Rich Salz <rsalz@openssl.org>	2017-02-06 08:21:42 +01:00
Andy Polyakov	ace05265d2	x86_64 assembly pack: add Goldmont performance results. Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-10-24 13:01:13 +02:00
David Benjamin	609b0852e4	Remove trailing whitespace from some files. The prevailing style seems to not have trailing whitespace, but a few lines do. This is mostly in the perlasm files, but a few C files got them after the reformat. This is the result of: find . -name '.pl' \| xargs sed -E -i '' -e 's/( \|'$'\t'')$//' find . -name '.c' \| xargs sed -E -i '' -e 's/( \|'$'\t'')$//' find . -name '.h' \| xargs sed -E -i '' -e 's/( \|'$'\t'')$//' Then bn_prime.h was excluded since this is a generated file. Note mkerr.pl has some changes in a heredoc for some help output, but other lines there lack trailing whitespace too. Reviewed-by: Kurt Roeckx <kurt@openssl.org> Reviewed-by: Matt Caswell <matt@openssl.org>	2016-10-10 23:36:21 +01:00
Andy Polyakov	6cf412c473	modes/asm/ghash-armv4.pl: improve interoperability with Android NDK. Reviewed-by: Tim Hudson <tjh@openssl.org>	2016-09-03 10:41:52 +02:00
Andy Polyakov	05ef4d1980	ARMv8 assembly pack: add Samsung Mongoose results. Reviewed-by: Tim Hudson <tjh@openssl.org>	2016-08-16 12:47:49 +02:00
klemens	6025001707	spelling fixes, just comments and readme. Reviewed-by: Matt Caswell <matt@openssl.org> Reviewed-by: Rich Salz <rsalz@openssl.org> (Merged from https://github.com/openssl/openssl/pull/1413)	2016-08-05 19:07:30 -04:00
Andy Polyakov	f198cc43a0	SPARC assembly pack: enforce V8+ ABI constraints. Even though it's hard to imagine, it turned out that upper half of arguments passed to V8+ subroutine can be non-zero. ["n" pseudo-instructions, such as srln being srl in 32-bit case and srlx in 64-bit one, were implemented in binutils 2.10. It's assumed that Solaris assembler implemented it around same time, i.e. 2000.] Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-07-01 14:25:08 +02:00
Brian Smith	cd359b2564	Clarify use of \|$end0\| in stitched x86-64 AES-GCM code. There was some uncertainty about what the code is doing with \|$end0\| and whether it was necessary for \|$len\| to be a multiple of 16 or 96. Hopefully these added comments make it clear that the code is correct except for the caveat regarding low memory addresses. Change-Id: Iea546a59dc7aeb400f50ac5d2d7b9cb88ace9027 Reviewed-on: https://boringssl-review.googlesource.com/7194 Reviewed-by: Adam Langley <agl@google.com> Signed-off-by: Andy Polyakov <appro@openssl.org> Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-06-27 10:15:05 +02:00
Andy Polyakov	cc77d0d84a	modes/asm/ghashp8-ppc.pl: improve performance by 2.7x. Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-06-14 23:28:39 +02:00
Andy Polyakov	cfe1d9929e	x86_64 assembly pack: tolerate spaces in source directory name. [as it is now quoting $output is not required, but done just in case] Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-05-29 14:12:51 +02:00
Rich Salz	6aa36e8e5a	Add OpenSSL copyright to .pl files Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-05-21 08:23:39 -04:00
Andy Polyakov	670ad0fbf6	s390x assembly pack: cache capability query results. IBM argues that in certain scenarios capability query is really expensive. At the same time it's asserted that query results can be safely cached, because disabling CPACF is incompatible with reboot-free operation. Reviewed-by: Tim Hudson <tjh@openssl.org>	2016-04-25 11:53:45 +02:00
Richard Levitte	a5aa63a456	Fix some assembler generating scripts for better unification Some of these scripts would recognise an output parameter if it looks like a file path. That works both in both the classic and new build schemes. Some fo these scripts would only recognise it if it's a basename (i.e. no directory component). Those need to be corrected, as the output parameter in the new build scheme is more likely to contain a directory component than not. Reviewed-by: Andy Polyakov <appro@openssl.org>	2016-03-11 00:54:31 +01:00
Richard Levitte	4f0d5f1849	Unified - adapt the generation of modes assembler to use GENERATE This gets rid of the BEGINRAW..ENDRAW sections in crypto/modes/build.info. This also moves the assembler generating perl scripts to take the output file name as last command line argument, where necessary. Reviewed-by: Andy Polyakov <appro@openssl.org>	2016-03-09 11:09:26 +01:00
Andy Polyakov	eb77e8886d	SPARCv9 assembly pack: unify build rules and argument handling. Make all scripts produce .S, make interpretation of $(CFLAGS) pre-processor's responsibility, start accepting $(PERLASM_SCHEME). [$(PERLASM_SCHEME) is redundant in this case, because there are no deviataions between Solaris and Linux assemblers. This is purely to unify .pl->.S handling across all targets.] Reviewed-by: Richard Levitte <levitte@openssl.org>	2016-03-08 15:51:06 +01:00
Andy Polyakov	d3cdab1736	modes/asm/ghash-x86_64.pl: refine GNU assembler version detection. Even though AVX support was added in GAS 2.19 vpclmulqdq was apparently added in 2.20. Reviewed-by: Rich Salz <rsalz@openssl.org>	2016-02-27 21:14:18 +01:00
Kurt Roeckx	df057ea6c8	Restore xmm7 from the correct address on win64 Reviewed-by: Richard Levitte <levitte@openssl.org> RT: #4288, MR: #1831	2016-02-04 15:42:13 +01:00
Andy Polyakov	b974943234	x86_64 assembly pack: tune clang version detection even further. RT#4171 Reviewed-by: Kurt Roeckx <kurt@openssl.org>	2015-12-13 22:18:18 +01:00
Andy Polyakov	a285992763	ARMv4 assembly pack: allow Thumb2 even in iOS build, and engage it in most modules. Reviewed-by: Tim Hudson <tjh@openssl.org>	2015-12-07 12:06:06 +01:00
Andy Polyakov	76eba0d94b	x86_64 assembly pack: tune clang version detection. RT#4142 Reviewed-by: Richard Levitte <levitte@openssl.org>	2015-11-23 16:00:06 +01:00
Andy Polyakov	fbab8badde	modes/asm/ghash-armv4.pl: extend Apple fix to all clang cases. Triggered by RT#3989. Reviewed-by: Matt Caswell <matt@openssl.org>	2015-11-11 22:09:18 +01:00
Andy Polyakov	b7f5503fa6	Skylake performance results. Reviewed-by: Matt Caswell <matt@openssl.org>	2015-09-26 19:50:11 +02:00
Andy Polyakov	11208dcfb9	ARMv4 assembly pack: implement support for Thumb2. As some of ARM processors, more specifically Cortex-Mx series, are Thumb2-only, we need to support Thumb2-only builds even in assembly. Reviewed-by: Tim Hudson <tjh@openssl.org>	2015-09-25 13:34:02 +02:00
Richard Levitte	053fa39af6	Conversion to UTF-8 where needed This leaves behind files with names ending with '.iso-8859-1'. These should be safe to remove. If something went wrong when re-encoding, there will be some files with names ending with '.utf8' left behind. Reviewed-by: Rich Salz <rsalz@openssl.org>	2015-07-14 01:10:01 +02:00
Andy Polyakov	9b6b470afe	modes/asm/ghashv8-armx.pl: additional performance data. Reviewed-by: Rich Salz <rsalz@openssl.org>	2015-04-21 09:17:53 +02:00
Andy Polyakov	313e6ec11f	Add assembly support for 32-bit iOS. Reviewed-by: Matt Caswell <matt@openssl.org> Reviewed-by: Richard Levitte <levitte@openssl.org>	2015-04-20 15:06:22 +02:00
Andy Polyakov	7eeeb49e11	modes/asm/ghashv8-armx.pl: up to 90% performance improvement. Reviewed-by: Matt Caswell <matt@openssl.org>	2015-04-02 10:03:09 +02:00

1 2 3

127 commits