7e2a8a34ba
The first non-zero window (which we can condition on for public exponents) always multiplies by one. This means we can cut out one Montgomery multiplication. It also means we never actually need to initialize r to one, saving another Montgomery multiplication for P-521. This, in turn, means we don't need the bn_one_to_montgomery optimization for the public-exponent exponentations, so we can delete bn_one_to_montgomery_small. (The function does currently promise to handle p = 0, but this is not actually reachable, so it can just do a reduction on RR.) For RSA, where we're not doing many multiplications to begin with, saving one is noticeable. Before: Did 92000 RSA 2048 verify (same key) operations in 3002557us (30640.6 ops/sec) Did 25165 RSA 4096 verify (same key) operations in 3045046us (8264.2 ops/sec) After: Did 100000 RSA 2048 verify (same key) operations in 3002483us (33305.8 ops/sec) Did 26603 RSA 4096 verify (same key) operations in 3010942us (8835.4 ops/sec) (Not looking at the fresh key number yet as that still needs to be fixed.) Change-Id: I81a025a68d9b0f8eb0f9c6c04ec4eedf0995a345 Reviewed-on: https://boringssl-review.googlesource.com/27286 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com> CQ-Verified: CQ bot account: commit-bot@chromium.org <commit-bot@chromium.org> |
||
---|---|---|
.. | ||
asm | ||
add.c | ||
bn_test_to_fuzzer.go | ||
bn_test.cc | ||
bn_tests.txt | ||
bn.c | ||
bytes.c | ||
check_bn_tests.go | ||
cmp.c | ||
ctx.c | ||
div.c | ||
exponentiation.c | ||
gcd.c | ||
generic.c | ||
internal.h | ||
jacobi.c | ||
montgomery_inv.c | ||
montgomery.c | ||
mul.c | ||
prime.c | ||
random.c | ||
rsaz_exp.c | ||
rsaz_exp.h | ||
shift.c | ||
sqrt.c |