boringssl/crypto/poly1305/asm
David Benjamin bd30f480c5 poly1305/asm/poly1305-*.pl: flip horizontal add and reduction.
Only the 32-bit AVX2 code path needs this, but upstream choose to harmonize all
vector code paths.

RT#4346

(Imported from 1ea8ae5090f557fea2e5b4d5758b10566825d74b.)

Tested the new code manually on arm and aarch64, NEON and non-NEON. Steven
reports that all variants pass on x86 and x86-64 too.

I've left the 32-bit x86 AVX2 code disabled since valgrind can't measure the
code coverage, but this avoids diff with upstream. We can enable it if we ever
end up caring.

Change-Id: Id9becc2adfbe44b84764f8e9c1fb5e8349c4d5a8
Reviewed-on: https://boringssl-review.googlesource.com/7295
Reviewed-by: Adam Langley <agl@google.com>
2016-03-04 19:06:20 +00:00
..
poly1305-armv4.pl poly1305/asm/poly1305-*.pl: flip horizontal add and reduction. 2016-03-04 19:06:20 +00:00
poly1305-armv8.pl poly1305/asm/poly1305-*.pl: flip horizontal add and reduction. 2016-03-04 19:06:20 +00:00
poly1305-x86_64.pl poly1305/asm/poly1305-*.pl: flip horizontal add and reduction. 2016-03-04 19:06:20 +00:00
poly1305-x86.pl poly1305/asm/poly1305-*.pl: flip horizontal add and reduction. 2016-03-04 19:06:20 +00:00