boringssl

Author	SHA1	Message	Date
David Benjamin	ea52ec98a5	Perform the RSA CRT reductions with Montgomery reduction. The first step of RSA with the CRT optimization is to reduce our input modulo p and q. We can do this in constant-time[*] with Montgomery reduction. When p and q are the same size, Montgomery reduction's bounds hold. We need two rounds of it because the first round gives us an unwanted R^-1. This does not appear to have a measurable impact on performance. Also add a long TODO describing how to make the rest of the function constant-time[*] which hopefully we'll get to later. RSA blinding should protect us from it all, but make this constant-time anyway. Since this and the follow-up work will special-case weird keys, add a test that we don't break those unintentionally. (Though I am not above breaking them intentionally someday...) Thanks to Andres Erbsen for discussions on how to do this bit properly. [*] Ignoring the pervasive bn_correct_top problem for the moment. Change-Id: Ide099a9db8249cb6549be99c5f8791a39692ea81 Reviewed-on: https://boringssl-review.googlesource.com/24204 Reviewed-by: Adam Langley <agl@google.com>	2017-12-18 18:59:18 +00:00
David Benjamin	f88242d1c1	SSL_export_keying_material should work in half-RTT. QUIC will need to derive keys at this point. This also smooths over a part of the server 0-RTT abstraction. Like with False Start, the SSL object is largely in a functional state at this point. Bug: 221 Change-Id: I4207d8cb1273a1156e728a7bff3943cc2c69e288 Reviewed-on: https://boringssl-review.googlesource.com/24224 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: Steven Valdez <svaldez@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-18 16:53:13 +00:00
David Benjamin	875095aa7c	Silence ARMv8 deprecated IT instruction warnings. ARMv8 kindly deprecated most of its IT instructions in Thumb mode. These files are taken from upstream and are used on both ARMv7 and ARMv8 processors. Accordingly, silence the warnings by marking the file as targeting ARMv7. In other files, they were accidentally silenced anyway by way of the existing .arch lines. This can be reproduced by building with the new NDK and passing -DCMAKE_ASM_FLAGS=-march=armv8-a. Some of our downstream code ends up passing that to the assembly. Note this change does not attempt to arrange for ARMv8-A/T32 to get code which honors the constraints. It only silences the warnings and continues to give it the same ARMv7-A/Thumb-2 code that backwards compatibility dictates it continue to run. Bug: chromium:575886, b/63131949 Change-Id: I24ce0b695942eaac799347922b243353b43ad7df Reviewed-on: https://boringssl-review.googlesource.com/24166 Reviewed-by: Adam Langley <agl@google.com>	2017-12-14 01:56:22 +00:00
David Benjamin	4358f104cf	Remove clang assembler .arch workaround. This makes it difficult to build against the NDK's toolchain file. The problem is __clang__ just means Clang is the frontend and implies nothing about which assembler. When using as, it is fine. When using clang-as on Linux, one needs a clang-as from this year. The only places where we care about clang's integrated assembler are iOS (where perlasm strips out .arch anyway) and build environments like Chromium which have a regularly-updated clang. Thus we can remove this now. Bug: 39 Update-Note: Holler if this breaks the build. If it doesn't break the build, you can probably remove any BORINGSSL_CLANG_SUPPORTS_DOT_ARCH or explicit -march armv8-a+crypto lines in your BoringSSL build. Change-Id: I21ce54b14c659830520c2f1d51c7bd13e0980c68 Reviewed-on: https://boringssl-review.googlesource.com/24124 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-13 22:22:41 +00:00
David Benjamin	6fe960d174	Enable __asm__ and uint128_t code in clang-cl. It actually works fine. I just forgot one of the typedefs last time. This gives a roughly 2x improvement on P-256 in clang-cl + OPENSSL_SMALL, the configuration used by Chrome. Before: Did 1302 ECDH P-256 operations in 1015000us (1282.8 ops/sec) Did 4250 ECDSA P-256 signing operations in 1047000us (4059.2 ops/sec) Did 1750 ECDSA P-256 verify operations in 1094000us (1599.6 ops/sec) After: Did 3250 ECDH P-256 operations in 1078000us (3014.8 ops/sec) Did 8250 ECDSA P-256 signing operations in 1016000us (8120.1 ops/sec) Did 3250 ECDSA P-256 verify operations in 1063000us (3057.4 ops/sec) (These were taken on a VM, so the measurements are extremely noisy, but this sort of improvement is visible regardless.) Alas, we do need a little extra bit of fiddling because division does not work (crbug.com/787617). Bug: chromium:787617 Update-Note: This removes the MSan uint128_t workaround which does not appear to be necessary anymore. Change-Id: I8361314608521e5bdaf0e7eeae7a02c33f55c69f Reviewed-on: https://boringssl-review.googlesource.com/23984 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-11 22:46:26 +00:00
David Benjamin	650d8c393e	Implement TLS 1.3 early exporters. Bug: 222 Change-Id: I33ee56358a62afcd9c3921026d55efcc543a5c11 Reviewed-on: https://boringssl-review.googlesource.com/23945 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-11 21:33:26 +00:00
Andres Erbsen	46304abf7d	ec/p256.c: fiat-crypto field arithmetic (64, 32) The fiat-crypto-generated code uses the Montgomery form implementation strategy, for both 32-bit and 64-bit code. 64-bit throughput seems slower, but the difference is smaller than noise between repetitions (-2%?) 32-bit throughput has decreased significantly for ECDH (-40%). I am attributing this to the change from variable-time scalar multiplication to constant-time scalar multiplication. Due to the same bottleneck, ECDSA verification still uses the old code (otherwise there would have been a 60% throughput decrease). On the other hand, ECDSA signing throughput has increased slightly (+10%), perhaps due to the use of a precomputed table of multiples of the base point. 64-bit benchmarks (Google Cloud Haswell): with this change: Did 9126 ECDH P-256 operations in 1009572us (9039.5 ops/sec) Did 23000 ECDSA P-256 signing operations in 1039832us (22119.0 ops/sec) Did 8820 ECDSA P-256 verify operations in 1024242us (8611.2 ops/sec) master (`40e8c921ca`): Did 9340 ECDH P-256 operations in 1017975us (9175.1 ops/sec) Did 23000 ECDSA P-256 signing operations in 1039820us (22119.2 ops/sec) Did 8688 ECDSA P-256 verify operations in 1021108us (8508.4 ops/sec) benchmarks on ARMv7 (LG Nexus 4): with this change: Did 150 ECDH P-256 operations in 1029726us (145.7 ops/sec) Did 506 ECDSA P-256 signing operations in 1065192us (475.0 ops/sec) Did 363 ECDSA P-256 verify operations in 1033298us (351.3 ops/sec) master (`2fce1beda0`): Did 245 ECDH P-256 operations in 1017518us (240.8 ops/sec) Did 473 ECDSA P-256 signing operations in 1086281us (435.4 ops/sec) Did 360 ECDSA P-256 verify operations in 1003846us (358.6 ops/sec) 64-bit tables converted as follows: import re, sys, math p = 2**256 - 2**224 + 2**192 + 2**96 - 1 R = 2**256 def convert(t): x0, s1, x1, s2, x2, s3, x3 = t.groups() v = int(x0, 0) + 2**64 * (int(x1, 0) + 2**64*(int(x2,0) + 2**64*(int(x3, 0)) )) w = v*R%p y0 = hex(w%(2**64)) y1 = hex((w>>64)%(2**64)) y2 = hex((w>>(2*64))%(2**64)) y3 = hex((w>>(3*64))%(2**64)) ww = int(y0, 0) + 2**64 * (int(y1, 0) + 2**64*(int(y2,0) + 2**64*(int(y3, 0)) )) if ww != v*R%p: print(x0,x1,x2,x3) print(hex(v)) print(y0,y1,y2,y3) print(hex(w)) print(hex(ww)) assert 0 return '{'+y0+s1+y1+s2+y2+s3+y3+'}' fe_re = re.compile('{'+r'(\s*,\s*)'.join(r'(\d+|0x[abcdefABCDEF0123456789]+)' for i in range(4)) + '}') print (re.sub(fe_re, convert, sys.stdin.read()).rstrip('\n')) 32-bit tables converted from 64-bit tables Change-Id: I52d6e5504fcb6ca2e8b0ee13727f4500c80c1799 Reviewed-on: https://boringssl-review.googlesource.com/23244 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-11 17:55:46 +00:00
David Benjamin	eb9232f06f	Fully reduce scalars in EC_POINT_mul. Along the way, this allows us to tidy up the invariants associated with EC_SCALAR. They were fuzzy around ec_point_mul_scalar and some computations starting from the digest in ECDSA. The latter I've put into the type system with EC_LOOSE_SCALAR. As for the former, Andres points out that particular EC implementations are only good for scalars within a certain range, otherwise you may need extra work to avoid the doubling case. To simplify curve implementations, we reduce them fully rather than do the looser bit size check, so they can have the stronger precondition to work with. Change-Id: Iff9a0404f89adf8f7f914f8e8246c9f3136453f1 Reviewed-on: https://boringssl-review.googlesource.com/23664 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-08 17:55:54 +00:00
David Benjamin	2b63addf6a	Use uint32_t for unicode code points. The newer clang-cl is unhappy about the tautological comparison on Windows, but the comparison itself is unnecessary anyway, since the values will never exceed uint32_t. I think the reason it's not firing elsewhere is because on other 64-bit platforms, it is not tautological because long is 64-bit. On other 32-bit platforms, I'm not sure we actually have a standalone trunk clang builder right now. Update-Note: UTF8_getc and UTF8_putc were unexported. No one appears to be calling them. (We're a crypto library, not a Unicode library.) Change-Id: I0949ddea3131dca5f55d04e672c3ccf2915c41ab Reviewed-on: https://boringssl-review.googlesource.com/23844 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-08 17:51:34 +00:00
David Benjamin	296a61d600	bn/asm/rsaz-avx2.pl: fix digit correction bug in rsaz_1024_mul_avx2. Credit to OSS-Fuzz for finding this. CVE-2017-3738 (Imported from upstream's 5630661aecbea5fe3c4740f5fea744a1f07a6253 and 77d75993651b63e872244a3256e37967bb3c3e9e.) Confirmed with Intel SDE that the fix makes the test vector pass and that, without the fix, the test vector does not. (Well, we knew the latter already, since it was our test vector.) Change-Id: I167aa3407ddab3b434bacbd18e099c55aa40ac4c Reviewed-on: https://boringssl-review.googlesource.com/23884 Reviewed-by: Adam Langley <agl@google.com>	2017-12-07 16:54:32 +00:00
David Benjamin	2bc937068d	Add X509_NAME_get0_der from OpenSSL 1.1.0. Change-Id: Iaa616a09f944ce720c11236b031d0fa9deb47db3 Reviewed-on: https://boringssl-review.googlesource.com/23864 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-06 17:49:04 +00:00
David Benjamin	d8dbde79f9	Don't allow negative EC_KEY private keys. We check that the private key is less than the order, but we forgot the other end. Update-Note: It's possible some caller was relying on this, but since that function already checked the other half of the range, I'm expecting this to be a no-op change. Change-Id: I4a53357d7737735b3cfbe97d379c8ca4eca5d5ac Reviewed-on: https://boringssl-review.googlesource.com/23665 Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> Reviewed-by: Adam Langley <agl@google.com>	2017-12-05 19:46:27 +00:00
Michał Janiszewski	d3ec6f1adb	Add missing errno.h include to bio_test.cc This fixes compilation on aarch64 and other architectures for Android. Change-Id: I0b09ab06858c92d07e2376e244a4626a6af5037b Reviewed-on: https://boringssl-review.googlesource.com/23764 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-04 01:32:37 +00:00
Adam Langley	bc37ad91fe	Fix alignment-violating cast. Change-Id: Id8b69bb6103dd938f4c6d0d2ec24f3d50ba5513c Update-Note: fixes b/70034392 Reviewed-on: https://boringssl-review.googlesource.com/23744 Commit-Queue: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-12-01 22:32:17 +00:00
David Benjamin	48eaa28a12	Make EC_POINT_mul work with arbitrary BIGNUMs again. Rejecting values where we'd previously called BN_nnmod may have been overly ambitious. In the long run, all the supported ECC APIs (ECDSA*, ECDH_compute_key, and probably some additional new ECDH API) will be using the EC_SCALAR version anyway, so this doesn't really matter. Change-Id: I79cd4015f2d6daf213e4413caa2a497608976f93 Reviewed-on: https://boringssl-review.googlesource.com/23584 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-30 21:58:17 +00:00
David Benjamin	2fc4f362cd	Revert "Support high tag numbers in CBS/CBB." This reverts commit `66801feb17`. This turned out to break a lot more than expected. Hopefully we can reland it soon, but we need to fix up some consumers first. Note due to work that went in later, this is not a trivial revert and should be re-reviewed. Change-Id: I6474b67cce9a8aa03f722f37ad45914b76466bea Reviewed-on: https://boringssl-review.googlesource.com/23644 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-30 21:57:17 +00:00
David Benjamin	095b6c9baa	Also add a decoupled OBJ_obj2txt. We need it in both directions. Also I missed that in OBJ_obj2txt we allowed uint64_t components, but in my new OBJ_txt2obj we only allowed uint32_t. For consistency, upgrade that to uint64_t. Bug: chromium:706445 Change-Id: I38cfeea8ff64b9acf7998e552727c6c3b2cc600f Reviewed-on: https://boringssl-review.googlesource.com/23544 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: Steven Valdez <svaldez@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-30 18:21:48 +00:00
David Benjamin	47b8f00fdc	Reimplement OBJ_txt2obj and add a lower-level function. OBJ_txt2obj is currently implemented using BIGNUMs which is absurd. It also depends on the giant OID table, which is undesirable. Write a new one and expose the low-level function so Chromium can use it without the OID table. Bug: chromium:706445 Change-Id: I61ff750a914194f8776cb8d81ba5d3eb5eaa3c3d Reviewed-on: https://boringssl-review.googlesource.com/23364 Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> Reviewed-by: Steven Valdez <svaldez@google.com>	2017-11-27 21:29:00 +00:00
David Benjamin	56aaf164ac	Pretty-print large INTEGERs and ENUMERATEDs in hex. This avoids taking quadratic time to pretty-print certificates with excessively large integer fields. Very large integers aren't any more readable in decimal than hexadecimal anyway, and the i2s_* functions will parse either form. Found by libFuzzer. Change-Id: Id586cd1b0eef8936d38ff50433ae7c819f0054f3 Reviewed-on: https://boringssl-review.googlesource.com/23424 Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> Reviewed-by: Adam Langley <agl@google.com>	2017-11-27 18:38:50 +00:00
David Benjamin	e3b2a5d30d	Const-correct X509_ALGOR_get0. Matches the OpenSSL 1.1.0 spelling, which is what we advertise in OPENSSL_VERSION_NUMBER now. Otherwise third-party code which uses it will, in the long term, need ifdefs. Note this will require updates to any existing callers (there appear to only be a couple of them), but it should be straightforward. Change-Id: I9dd1013609abca547152728a293529055dacc239 Reviewed-on: https://boringssl-review.googlesource.com/23325 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 22:52:38 +00:00
David Benjamin	61e9245543	Use some of the word-based functions for ECDSA verification. This is only a hair faster than the signing change, but still something. I kept the call to BN_mod_inverse_odd as that appears to be faster (constant time is not a concern for verification). Before: Did 22855 ECDSA P-224 verify operations in 3015099us (7580.2 ops/sec) Did 21276 ECDSA P-256 verify operations in 3083284us (6900.4 ops/sec) Did 2635 ECDSA P-384 verify operations in 3032582us (868.9 ops/sec) Did 1240 ECDSA P-521 verify operations in 3068631us (404.1 ops/sec) After: Did 23310 ECDSA P-224 verify operations in 3056226us (7627.1 ops/sec) Did 21210 ECDSA P-256 verify operations in 3035765us (6986.7 ops/sec) Did 2666 ECDSA P-384 verify operations in 3023592us (881.7 ops/sec) Did 1209 ECDSA P-521 verify operations in 3054040us (395.9 ops/sec) Change-Id: Iec995b1a959dbc83049d0f05bdc525c14a95c28e Reviewed-on: https://boringssl-review.googlesource.com/23077 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 22:52:04 +00:00
David Benjamin	86c2b854b0	Don't use BN_nnmod to convert from field element to scalar. Hasse's theorem implies at most one subtraction is necessary. This is still using BIGNUM for now because field elements (EC_POINT_get_affine_coordinates_GFp) are BIGNUMs. This gives an additional 2% speedup for signing. Before: Did 16000 ECDSA P-224 signing operations in 1064799us (15026.3 ops/sec) Did 19000 ECDSA P-256 signing operations in 1007839us (18852.2 ops/sec) Did 1078 ECDSA P-384 signing operations in 1079413us (998.7 ops/sec) Did 484 ECDSA P-521 signing operations in 1083616us (446.7 ops/sec) After: Did 16000 ECDSA P-224 signing operations in 1054918us (15167.1 ops/sec) Did 20000 ECDSA P-256 signing operations in 1037338us (19280.1 ops/sec) Did 1045 ECDSA P-384 signing operations in 1049073us (996.1 ops/sec) Did 484 ECDSA P-521 signing operations in 1085492us (445.9 ops/sec) Change-Id: I2bfe214f968eca7a8e317928c0f3daf1a14bca90 Reviewed-on: https://boringssl-review.googlesource.com/23076 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 22:51:53 +00:00
David Benjamin	a838f9dc7e	Make ECDSA signing 10% faster and plug some timing leaks. None of the asymmetric crypto we inherited from OpenSSL is constant-time because of BIGNUM. BIGNUM chops leading zeros off the front of everything, so we end up leaking information about the first word, in theory. BIGNUM functions additionally tend to take the full range of inputs and then call into BN_nnmod at various points. All our secret values should be acted on in constant-time, but k in ECDSA is a particularly sensitive value. So, ecdsa_sign_setup, in an attempt to mitigate the BIGNUM leaks, would add a couple copies of the order. This does not work at all. k is used to compute two values: k^-1 and kG. The first operation when computing k^-1 is to call BN_nnmod if k is out of range. The entry point to our tuned constant-time curve implementations is to call BN_nnmod if the scalar has too many bits, which this causes. The result is both corrections are immediately undone but cause us to do more variable-time work in the meantime. Replace all these computations around k with the word-based functions added in the various preceding CLs. In doing so, replace the BN_mod_mul calls (which internally call BN_nnmod) with Montgomery reduction. We can avoid taking k^-1 out of Montgomery form, which combines nicely with Brian Smith's trick in `3426d10119`. Along the way, we avoid some unnecessary mallocs. BIGNUM still affects the private key itself, as well as the EC_POINTs. But this should hopefully be much better now. 
Also it's 10% faster: Before: Did 15000 ECDSA P-224 signing operations in 1069117us (14030.3 ops/sec) Did 18000 ECDSA P-256 signing operations in 1053908us (17079.3 ops/sec) Did 1078 ECDSA P-384 signing operations in 1087853us (990.9 ops/sec) Did 473 ECDSA P-521 signing operations in 1069835us (442.1 ops/sec) After: Did 16000 ECDSA P-224 signing operations in 1064799us (15026.3 ops/sec) Did 19000 ECDSA P-256 signing operations in 1007839us (18852.2 ops/sec) Did 1078 ECDSA P-384 signing operations in 1079413us (998.7 ops/sec) Did 484 ECDSA P-521 signing operations in 1083616us (446.7 ops/sec) Change-Id: I2a25e90fc99dac13c0616d0ea45e125a4bd8cca1 Reviewed-on: https://boringssl-review.googlesource.com/23075 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 22:51:40 +00:00
David Benjamin	66801feb17	Support high tag numbers in CBS/CBB. Android's attestation format uses some ludicrously large tag numbers: https://developer.android.com/training/articles/security-key-attestation.html#certificate_schema Add support for these in CBS/CBB. The public API does not change for callers who were using the CBS_ASN1_* constants, but it is no longer the case that tag representations match their DER encodings for small tag numbers. Chromium needs https://chromium-review.googlesource.com/#/c/chromium/src/+/783254, but otherwise I don't expect this to break things. Bug: 214 Change-Id: I9b5dc27ae3ea020e9edaabec4d665fd73da7d31e Reviewed-on: https://boringssl-review.googlesource.com/23304 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-22 22:34:05 +00:00
David Benjamin	02514002fd	Use dec/jnz instead of loop in bn_add_words and bn_sub_words. Imported from upstream's a78324d95bd4568ce2c3b34bfa1d6f14cddf92ef. I think the "regression" part of that change is some tweak to BN_usub and I guess the bn_*_words was to compensate for it, but we may as well import it. Apparently the loop instruction is terrible. Before: Did 39871000 bn_add_words operations in 1000002us (39870920.3 ops/sec) Did 38621750 bn_sub_words operations in 1000001us (38621711.4 ops/sec) After: Did 64012000 bn_add_words operations in 1000007us (64011551.9 ops/sec) Did 81792250 bn_sub_words operations in 1000002us (81792086.4 ops/sec) loop sets no flags (even doing the comparison to zero without ZF) while dec sets all flags but CF, so Andres and I are assuming that because this prevents Intel from microcoding it to dec/jnz, they otherwise can't be bothered to add more circuitry since every compiler has internalized by now to never use loop. Change-Id: I3927cd1c7b707841bbe9963e3d4afd7ba9bd9b36 Reviewed-on: https://boringssl-review.googlesource.com/23344 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 21:56:05 +00:00
David Benjamin	2056d7290a	Remove DSA_sign_setup too. Change-Id: Ib406e7d1653fa57a863dbd5d4eb04401caf5de0a Reviewed-on: https://boringssl-review.googlesource.com/23284 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 21:01:11 +00:00
David Benjamin	42a8cbe37c	Remove ECDSA_sign_setup and friends. These allow precomputation of k, but bypass our nonce hardening and also make it harder to excise BIGNUM. As a bonus, ECDSATest.SignTestVectors is now actually covering the k^-1 and r computations. Change-Id: I4c71dae162874a88a182387ac43999be9559ddd7 Reviewed-on: https://boringssl-review.googlesource.com/23074 Reviewed-by: Adam Langley <agl@google.com>	2017-11-22 20:23:40 +00:00
David Benjamin	8dc226ca8f	Add some missing OpenSSL 1.1.0 accessors. wpa_supplicant appears to be using these. Change-Id: I1f220cae69162901bcd9452e8daf67379c5e276c Reviewed-on: https://boringssl-review.googlesource.com/23324 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-22 18:43:38 +00:00
David Benjamin	67623735e0	Fix memory leak on sk_X509_EXTENSION_push failure. (Imported from upstream's c29f83c05f3a3c5641c5ddf054789a29d2163bf3.) ext was being leaked. Upstream also did some stuff around *x which wasn't strictly necessary (usually OpenSSL only provides basic exception safety, not strong exception safety), but ah well. Change-Id: I52d230990b05501b4cee6deee8dcacba4a926c18 Reviewed-on: https://boringssl-review.googlesource.com/23204 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-21 17:48:00 +00:00
Adam Langley	8c565fa86c	Include a couple of missing header files. mem.h for |OPENSSL_cleanse| and bn/internal.h for things like |bn_less_than_words| and |bn_correct_top|. Change-Id: I3c447a565dd9e4f18fb2ff5d59f80564b4df8cea Reviewed-on: https://boringssl-review.googlesource.com/23164 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 20:36:38 +00:00
David Benjamin	6d218d6d7a	Remove unused function. Change-Id: Id12ab478b6ba441fb1b6f4c2f9479384fc3fbdb6 Reviewed-on: https://boringssl-review.googlesource.com/23144 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 18:32:44 +00:00
David Benjamin	0a5f006736	Test that EC_POINT_mul works with the order. |EC_POINT_mul| is almost exclusively used with reduced scalars, with this exception. This comes from consumers following NIST SP 800-56A section 5.6.2.3.2. (Though all our curves have cofactor one, so this check isn't useful.) Add a test for this so we don't accidentally break it. Change-Id: I42492db38a1ea03acec4febdd7945c8a3933530a Reviewed-on: https://boringssl-review.googlesource.com/23084 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 18:32:30 +00:00
David Benjamin	e7c95d91f8	Run TLS 1.3 tests at all variants and fix bugs. We were only running a random subset of TLS 1.3 tests with variants and let a lot of bugs through as a result. - HelloRetryRequest-EmptyCookie wasn't actually testing what we were trying to test. - The second HelloRetryRequest detection needs tweaks in draft-22. - The empty HelloRetryRequest logic can't be based on non-empty extensions in draft-22. - We weren't sending ChangeCipherSpec correctly in HRR or testing it right. - Rework how runner reads ChangeCipherSpec by setting a flag which affects the next readRecord. This cuts down a lot of cases and works correctly if the client didn't send early data. (In that case, we don't flush CCS until EndOfEarlyData and runner deadlocks waiting for the ChangeCipherSpec to arrive.) Change-Id: I559c96ea3a8b350067e391941231713c6edb2f78 Reviewed-on: https://boringssl-review.googlesource.com/23125 Reviewed-by: Steven Valdez <svaldez@chromium.org> Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org>	2017-11-20 18:19:18 +00:00
David Benjamin	b8d677bfd0	Deduplicate built-in curves and give custom curves an order_mont. I still need to revive the original CL, but right now I'm interested in giving every EC_GROUP an order_mont and having different ownership of that field between built-in and custom groups is kind of a nuisance. If I'm going to do that anyway, better to avoid computing the entire EC_GROUP in one go. I'm using some manual locking rather than CRYPTO_once here so that it behaves well in the face of malloc errors. Not that we especially care, but it was easy to do. This speeds up our ECDH benchmark a bit which otherwise must construct the EC_GROUP each time (matching real world usage). Before: Did 7619 ECDH P-224 operations in 1003190us (7594.8 ops/sec) Did 7518 ECDH P-256 operations in 1060844us (7086.8 ops/sec) Did 572 ECDH P-384 operations in 1055878us (541.7 ops/sec) Did 264 ECDH P-521 operations in 1062375us (248.5 ops/sec) After: Did 8415 ECDH P-224 operations in 1066695us (7888.9 ops/sec) Did 7952 ECDH P-256 operations in 1022819us (7774.6 ops/sec) Did 572 ECDH P-384 operations in 1055817us (541.8 ops/sec) Did 264 ECDH P-521 operations in 1060008us (249.1 ops/sec) Bug: 20 Change-Id: I7446cd0a69a840551dcc2dfabadde8ee1e3ff3e2 Reviewed-on: https://boringssl-review.googlesource.com/23073 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:52:03 +00:00
David Benjamin	66f8235510	Enforce some bounds and invariants on custom curves. Later code will take advantage of these invariants. Enforcing them on custom curves avoids making them go through a custom codepath. Change-Id: I23cee72a90c2e4846b41e03e6be26bc3abeb4a45 Reviewed-on: https://boringssl-review.googlesource.com/23072 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:27:51 +00:00
David Benjamin	a08bba51a5	Add bn_mod_exp_mont_small and bn_mod_inverse_prime_mont_small. These can be used to invert values in ECDSA. Unlike their BIGNUM counterparts, the caller is responsible for taking values in and out of Montgomery domain. This will save some work later on in the ECDSA computation. Change-Id: Ib7292900a0fdeedce6cb3e9a9123c94863659043 Reviewed-on: https://boringssl-review.googlesource.com/23071 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:23:48 +00:00
David Benjamin	40e4ecb793	Add "small" variants of Montgomery logic. These use the square and multiply functions added earlier. Change-Id: I723834f9a227a9983b752504a2d7ce0223c43d24 Reviewed-on: https://boringssl-review.googlesource.com/23070 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:23:01 +00:00
David Benjamin	a01aa9aa9f	Split BN_from_montgomery_word into a non-BIGNUM core. bn_from_montgomery_in_place is actually constant-time. It is, of course, only used by non-constant-time BIGNUM callers, but that will soon be fixed. Change-Id: I2b2c9943dc3b8d6a4b5b19a5bc4fa9ebad532bac Reviewed-on: https://boringssl-review.googlesource.com/23069 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:22:43 +00:00
David Benjamin	6bc18a3bd4	Add bn_mul_small and bn_sqr_small. As part of excising BIGNUM from EC scalars, we will need a "words" version of BN_mod_mul_montgomery. That, in turn, requires BN_sqr and BN_mul for cases where we don't have bn_mul_mont. BN_sqr and BN_mul have a lot of logic in there, with the most complex cases being not even remotely constant time. Fortunately, those only apply to RSA-sized numbers, not EC-sized numbers. (With the exception, I believe, of 32-bit P-521 which just barely exceeds the cutoff.) Imposing a limit also makes it easier to stack-allocate temporaries (BN_CTX serves a similar purpose in BIGNUM). Extract bn_mul_small and bn_sqr_small and test them as part of bn_tests.txt. Later changes will build on these. If we end up reusing these functions for RSA in the future (though that would require tending to the egregiously non-constant-time code in the no-asm build), we probably want to extract a version where there is an explicit tmp parameter as in bn_sqr_normal rather than the stack bits. Change-Id: If414981eefe12d6664ab2f5e991a359534aa7532 Reviewed-on: https://boringssl-review.googlesource.com/23068 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:22:30 +00:00
David Benjamin	64619deaa3	Const-correct some of the low-level BIGNUM functions. Change-Id: I8c6257e336f54a3a1786df9c4103fcf29177030a Reviewed-on: https://boringssl-review.googlesource.com/23067 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:20:40 +00:00
David Benjamin	bd275702d2	size_t a bunch of bn words bits. Also replace a pointless call to bn_mul_words with a memset. Change-Id: Ief30ddab0e84864561b73fe2776bd0477931cf7f Reviewed-on: https://boringssl-review.googlesource.com/23066 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:20:28 +00:00
David Benjamin	73df153be8	Make BN_generate_dsa_nonce internally constant-time. This rewrites the internals with a "words" variant that can avoid bn_correct_top. It still ultimately calls bn_correct_top as the calling convention is sadly still BIGNUM, but we can lift that calling convention out incrementally. Performance seems to be comparable, if not faster. Before: Did 85000 ECDSA P-256 signing operations in 5030401us (16897.3 ops/sec) Did 34278 ECDSA P-256 verify operations in 5048029us (6790.4 ops/sec) After: Did 85000 ECDSA P-256 signing operations in 5021057us (16928.7 ops/sec) Did 34086 ECDSA P-256 verify operations in 5010416us (6803.0 ops/sec) Change-Id: I1159746dfcc00726dc3f28396076a354556e6e7d Reviewed-on: https://boringssl-review.googlesource.com/23065 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:18:30 +00:00
David Benjamin	b25140c7b6	Fix timing leak in BN_from_montgomery_word. BN_from_montgomery_word doesn't have a constant memory access pattern. Replace the pointer trick with constant_time_select_w. There is, of course, still the bn_correct_top leak pervasive in BIGNUM itself. I wasn't able to measure a performance difference on RSA operations before or after this change, but the benchmarks would vary wildly run to run. But one would assume the logic here is nothing compared to the actual reduction. Change-Id: Ide761fde3a091a93679f0a803a287aa5d0d4600d Reviewed-on: https://boringssl-review.googlesource.com/22904 Reviewed-by: Adam Langley <agl@google.com>	2017-11-20 16:18:09 +00:00
David Benjamin	8db94be1d6	Add ECDSA tests for custom curves. We don't currently have test coverage for the order_mont bits (or lack thereof) for custom curves. Change-Id: I865d547c783226a5a3d3d203e10b0e59bad36984 Reviewed-on: https://boringssl-review.googlesource.com/23064 Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> Reviewed-by: Adam Langley <agl@google.com>	2017-11-17 12:18:16 +00:00
David Benjamin	a00fd08c2c	Use consistent notation in ECDSA_do_verify comments. Change-Id: Ia0cec71b5f8a6b7f03681b92cfacee13b2a74621 Reviewed-on: https://boringssl-review.googlesource.com/22890 Reviewed-by: Adam Langley <agl@google.com>	2017-11-10 22:44:01 +00:00
David Benjamin	d66bbf3413	Tidy up BN_mod_exp_mont. This was primarily for my own understanding, but this should hopefully also be clearer and more amenable to using unsigned indices later. Change-Id: I09cc3d55de0f7d9284d3b3168d8b0446274b2ab7 Reviewed-on: https://boringssl-review.googlesource.com/22889 Reviewed-by: Adam Langley <agl@google.com>	2017-11-10 22:43:54 +00:00
David Benjamin	607f9807e5	Remove BN_TBIT. Normal shifts do the trick just fine and are less likely to tempt the compiler into inserting a jump. Change-Id: Iaa1da1b6f986fd447694fcde8f3525efb9eeaf11 Reviewed-on: https://boringssl-review.googlesource.com/22888 Reviewed-by: Adam Langley <agl@google.com>	2017-11-10 22:43:37 +00:00
David Benjamin	bf3f6caaf3	Document some BIGNUM internals. Change-Id: I8f044febf16afe04da8b176c638111a9574c4d02 Reviewed-on: https://boringssl-review.googlesource.com/22887 Reviewed-by: Adam Langley <agl@google.com>	2017-11-10 22:43:13 +00:00
David Benjamin	0a9222b824	Fix comment typo. Change-Id: I482093000ee2e4ba371c78b4f7f8e8b121e71640 Reviewed-on: https://boringssl-review.googlesource.com/22886 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2017-11-10 22:22:42 +00:00
David Benjamin	238c274054	Capitalization nit. We capitalize things Go-style. Change-Id: Id002efb8a85e4e1886164421bba059d9ca425964 Reviewed-on: https://boringssl-review.googlesource.com/22885 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2017-11-10 22:22:35 +00:00
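
Commit ea52ec98a5 at the top of this log says the RSA CRT input can be reduced modulo p with two rounds of Montgomery reduction, the second one cancelling the unwanted R^-1 left by the first. A minimal Python sketch of that arithmetic follows. The function names `mont_reduce` and `reduce_mod_prime` are made up for illustration and are not BoringSSL APIs, and the branch on `t >= p` stands in for what would need to be a constant-time select in real code.

```python
def mont_reduce(x, p, r_bits):
    """Return x * R^-1 mod p, where R = 2**r_bits.

    Requires p odd and 0 <= x < p * R (the usual Montgomery bound).
    """
    R = 1 << r_bits
    mask = R - 1
    # n_prime satisfies p * n_prime == -1 (mod R).
    # pow(p, -1, R) needs Python 3.8 or later.
    n_prime = (-pow(p, -1, R)) & mask
    m = ((x & mask) * n_prime) & mask
    t = (x + m * p) >> r_bits  # exact: x + m*p is a multiple of R
    # t < 2p, so a single conditional subtraction finishes the job.
    # Illustration only: this branch is not constant-time.
    return t - p if t >= p else t


def reduce_mod_prime(x, p, r_bits):
    """Compute x mod p using two Montgomery reductions."""
    R = 1 << r_bits
    rr = (R * R) % p               # R^2 mod p, assumed precomputed per key
    t = mont_reduce(x, p, r_bits)  # round 1 yields x * R^-1 mod p
    # Round 2: (x * R^-1) * R^2 * R^-1 == x (mod p).
    return mont_reduce(t * rr, p, r_bits)


if __name__ == "__main__":
    # Toy primes of equal bit length, so any x < p*q is also < p*R and < q*R.
    p, q, r_bits = 61, 53, 6
    x = 2790
    print(reduce_mod_prime(x, p, r_bits), reduce_mod_prime(x, q, r_bits))
```

The equal-size condition from the commit message shows up in the bound: with x < p*q and q < R, the input satisfies x < p*R, which is what `mont_reduce` requires.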


2171 Commits