boringssl

Author	SHA1	Message	Date
David Benjamin	23e1a1f2d3	Test and fix an ABI issue with small parameters. Calling conventions must specify how to handle arguments smaller than a machine word. Should the caller pad them up to a machine word size with predictable values (zero/sign-extended), or should the callee tolerate an arbitrary bit pattern? Annoyingly, I found no text in either SysV or Win64 ABI documentation describing any of this and resorted to experiment. The short answer is that callees must tolerate an arbitrary bit pattern on x86_64, which means we must test this. See the comment in abi_test::internal::ToWord for the long answer. CHECK_ABI now, if the type of the parameter is smaller than crypto_word_t, fills the remaining bytes with 0xaa. This is so the number is out of bounds for code expecting either zero or sign extension. (Not that crypto assembly has any business seeing negative numbers.) Doing so reveals a bug in ecp_nistz256_ord_sqr_mont. The rep parameter is typed int, but the code expected uint64_t. In practice, the compiler will always compile this correctly because: - On both Win64 and SysV, rep is a register parameter. - The rep parameter is always a constant, so the compiler has no reason to leave garbage in the upper half. However, I was indeed able to get a bug out of GCC via: uint64_t foo = (1ull << 63) \| 2; // Some global the compiler can't // prove constant. ecp_nistz256_ord_sqr_mont(res, a, foo >> 1); Were ecp_nistz256_ord_sqr_mont a true int-taking function, this would act like ecp_nistz256_ord_sqr_mont(res, a, 1). Instead, it hung. Fix this by having it take a full-width word. This mess has several consequences: - ABI testing now ideally needs a functional testing component to fully cover this case. A bad input might merely produce the wrong answer. Still, this is fairly effective as it will cause most code to either segfault or loop forever. (Not the enc parameter to AES however...) - We cannot freely change the type of assembly function prototypes. If the prototype says int or unsigned, it must be ignoring the upper half and thus "fixing" it to size_t cannot have handled the full range. (Unless it was simply wrong of the parameter is already bounded.) If the prototype says size_t, switching to int or unsigned will hit this type of bug. The former is a safer failure mode though. - The simplest path out of this mess: new assembly code should only ever take word-sized parameters. This is not a tall order as the bad parameters are usually ints that should have been size_t. Calling conventions are hard. Change-Id: If8254aff8953844679fbce4bd3e345e5e2fa5213 Reviewed-on: https://boringssl-review.googlesource.com/c/34627 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2019-01-28 21:09:40 +00:00
David Benjamin	54efa1afc0	Add an ABI testing framework. Dear reader, I must apologize in advance. This CL contains the following: - A new 256-line perlasm file with non-trivial perl bits and a dual-ABI variadic function caller. - C preprocessor gymnastics, with variadic macros and fun facts about __VA_ARGS__'s behavior on empty argument lists. - C++ template gymnastics, including variadic arguments, template specialization, std::enable_if, and machinery to control template argument deduction. Enjoy. This tests that our assembly functions correctly honor platform ABI conventions. Right now this only tests callee-saved registers, but it should be extendable to SEH/CFI unwind testing with single-step debugging APIs. Register-checking does not involve anything funny and should be compatible with SDE. (The future unwind testing is unlikely to be compatible.) This CL adds support for x86_64 SysV and Win64 ABIs. ARM, AArch64, and x86 can be added in the future. The testing is injected in two places. First, all the assembly tests in p256-x86_64-test.cc are now instrumented. This is the intended workflow and should capture all registers. However, we currently do not unit-test our assembly much directly. We should do that as follow-up work[0] but, in the meantime, I've also wrapped all of the GTest main function in an ABI test. This is imperfect as ABI failures may be masked by other stack frames, but it costs nothing[1] and is pretty reliable at catching Win64 xmm register failures. [0] An alternate strategy would be, in debug builds, unconditionally instrument every assembly call in libcrypto. But the CHECK_ABI macro would be difficult to replicate in pure C, and unwind testing may be too invasive for this. Still, something to consider when we C++ libcrypto. [1] When single-stepped unwind testing exists, it won't cost nothing. The gtest_main.cc call will turn unwind testing off. Change-Id: I6643b26445891fd46abfacac52bc024024c8d7f6 Reviewed-on: https://boringssl-review.googlesource.com/c/33764 Reviewed-by: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <alangley@gmail.com> Commit-Queue: David Benjamin <davidben@google.com>	2018-12-21 16:09:32 +00:00
Adam Langley	9edbc7ff9f	Revert "Revert "Speed up ECDSA verify on x86-64."" This reverts commit `e907ed4c4b`. CPUID checks have been added so hopefully this time sticks. Change-Id: I5e0e5b87427c1230132681f936b3c70bac8263b8 Reviewed-on: https://boringssl-review.googlesource.com/c/32924 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2018-11-07 23:57:22 +00:00
Adam Langley	e907ed4c4b	Revert "Speed up ECDSA verify on x86-64." This reverts commit `3d450d2844`. It fails SDE, looks like a missing CPUID check before using vector instructions. Change-Id: I6b7dd71d9e5b1f509d2e018bd8be38c973476b4e Reviewed-on: https://boringssl-review.googlesource.com/c/32864 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com>	2018-11-06 00:29:15 +00:00
Nir Drucker	3d450d2844	Speed up ECDSA verify on x86-64. This commit improves the performance of ECDSA signature verification (over NIST P-256 curve) for x86 platforms. The speedup is by a factor of 1.15x. It does so by: 1) Leveraging the fact that the verification does not need to run in constant time. To this end, we implemented: a) the function ecp_nistz256_points_mul_public in a similar way to the current ecp_nistz256_points_mul function by removing its constant time features. b) the Binary Extended Euclidean Algorithm (BEEU) in x86 assembly to replace the current modular inverse function used for the inversion. 2) The last step in the ECDSA_verify function compares the (x) affine coordinate with the signature (r) value. Converting x from the Jacobian's representation to the affine coordinate requires to perform one inversions (x_affine = x * z^(-2)). We save this inversion and speed up the computations by instead bringing r to x (r_jacobian = rz^2) which is faster. The measured results are: Before (on a Kaby Lake desktop with gcc-5): Did 26000 ECDSA P-224 signing operations in 1002372us (25938.5 ops/sec) Did 11000 ECDSA P-224 verify operations in 1043821us (10538.2 ops/sec) Did 55000 ECDSA P-256 signing operations in 1017560us (54050.9 ops/sec) Did 17000 ECDSA P-256 verify operations in 1051280us (16170.8 ops/sec) After (on a Kaby Lake desktop with gcc-5): Did 27000 ECDSA P-224 signing operations in 1011287us (26698.7 ops/sec) Did 11640 ECDSA P-224 verify operations in 1076698us (10810.8 ops/sec) Did 55000 ECDSA P-256 signing operations in 1016880us (54087.0 ops/sec) Did 20000 ECDSA P-256 verify operations in 1038736us (19254.2 ops/sec) Before (on a Skylake server platform with gcc-5): Did 25000 ECDSA P-224 signing operations in 1021651us (24470.2 ops/sec) Did 10373 ECDSA P-224 verify operations in 1046563us (9911.5 ops/sec) Did 50000 ECDSA P-256 signing operations in 1002774us (49861.7 ops/sec) Did 15000 ECDSA P-256 verify operations in 1006471us (14903.6 ops/sec) After (on a Skylake server platform with gcc-5): Did 25000 ECDSA P-224 signing operations in 1020958us (24486.8 ops/sec) Did 10373 ECDSA P-224 verify operations in 1046359us (9913.4 ops/sec) Did 50000 ECDSA P-256 signing operations in 1003996us (49801.0 ops/sec) Did 18000 ECDSA P-256 verify operations in 1021604us (17619.4 ops/sec) Developers and authors: ************************************************************************ Nir Drucker (1,2), Shay Gueron (1,2) (1) Amazon Web Services Inc. (2) University of Haifa, Israel ************************************************************************* Change-Id: Idd42a7bc40626bce974ea000b61fdb5bad33851c Reviewed-on: https://boringssl-review.googlesource.com/c/31304 Commit-Queue: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> Reviewed-by: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2018-11-05 23:48:07 +00:00
David Benjamin	b27b579fdd	Add some tests for scalar operations. Largely random data, but make it easy to add things in the future. Change-Id: I30bee790bd9671b4d0327c2244fe5cd1a8954f90 Reviewed-on: https://boringssl-review.googlesource.com/27591 Reviewed-by: Adam Langley <alangley@gmail.com>	2018-04-24 16:12:34 +00:00
David Benjamin	6e4ff114fc	Merge Intel copyright notice into standard This was done by OpenSSL with the kind permission of Intel. This change is imported from upstream's commit dcf6e50f48e6bab92dcd2dacb27fc17c0de34199. Change-Id: Ie8d3b700cd527a6e8cf66e0728051b2acd8cc6b9 Reviewed-on: https://boringssl-review.googlesource.com/25588 Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> Reviewed-by: Adam Langley <agl@google.com>	2018-02-12 21:44:27 +00:00
David Benjamin	808f832917	Run the comment converter on libcrypto. crypto/{asn1,x509,x509v3,pem} were skipped as they are still OpenSSL style. Change-Id: I3cd9a60e1cb483a981aca325041f3fbce294247c Reviewed-on: https://boringssl-review.googlesource.com/19504 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-08-18 21:49:04 +00:00
Adam Langley	aacb72c1b7	Move ec/ and ecdsa/ into fipsmodule/ The names in the P-224 code collided with the P-256 code and thus many of the functions and constants in the P-224 code have been prefixed. Change-Id: I6bcd304640c539d0483d129d5eaf1702894929a8 Reviewed-on: https://boringssl-review.googlesource.com/15847 Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-05-04 20:27:23 +00:00

9 Commits