Go to file

David Benjamin 55db667c62 Enable vpaes for aarch64, with CTR optimizations. This patches vpaes-armv8.pl to add vpaes_ctr32_encrypt_blocks. CTR mode is by far the most important mode these days. It should have access to _vpaes_encrypt_2x, which gives a considerable speed boost. Also exclude vpaes_ecb_* as they're not even used. For iOS, this change is completely a no-op. iOS ARMv8 always has crypto extensions, and we already statically drop all other AES implementations. Android ARMv8 is not required to have crypto extensions, but every ARMv8 device I've seen has them. For those, it is a no-op performance-wise and a win on size. vpaes appears to be about 5.6KiB smaller than the tables. ARMv8 always makes SIMD (NEON) available, so we can statically drop aes_nohw. In theory, however, crypto-less Android ARMv8 is possible. Today such chips get a variable-time AES. This CL fixes this, but the performance story is complex. The Raspberry Pi 3 is not Android but has a Cortex-A53 chip without crypto extensions. (But the official images are 32-bit, so even this is slightly artificial...) There, vpaes is a performance win. Raspberry Pi 3, Model B+, Cortex-A53 Before: Did 265000 AES-128-GCM (16 bytes) seal operations in 1003312us (264125.2 ops/sec): 4.2 MB/s Did 44000 AES-128-GCM (256 bytes) seal operations in 1002141us (43906.0 ops/sec): 11.2 MB/s Did 9394 AES-128-GCM (1350 bytes) seal operations in 1032104us (9101.8 ops/sec): 12.3 MB/s Did 1562 AES-128-GCM (8192 bytes) seal operations in 1008982us (1548.1 ops/sec): 12.7 MB/s After: Did 277000 AES-128-GCM (16 bytes) seal operations in 1001884us (276479.1 ops/sec): 4.4 MB/s Did 52000 AES-128-GCM (256 bytes) seal operations in 1001480us (51923.2 ops/sec): 13.3 MB/s Did 11000 AES-128-GCM (1350 bytes) seal operations in 1007979us (10912.9 ops/sec): 14.7 MB/s Did 2013 AES-128-GCM (8192 bytes) seal operations in 1085545us (1854.4 ops/sec): 15.2 MB/s The Pixel 3 has a Cortex-A75 with crypto extensions, so it would never run this code. However, artificially ignoring them gives another data point (ARM documentation[] suggests the extensions are still optional on a Cortex-A75.) Sadly, vpaes no longer wins on perf over aes_nohw. But, it is constant-time: Pixel 3, AES/PMULL extensions ignored, Cortex-A75: Before: Did 2102000 AES-128-GCM (16 bytes) seal operations in 1000378us (2101205.7 ops/sec): 33.6 MB/s Did 358000 AES-128-GCM (256 bytes) seal operations in 1002658us (357051.0 ops/sec): 91.4 MB/s Did 75000 AES-128-GCM (1350 bytes) seal operations in 1012830us (74049.9 ops/sec): 100.0 MB/s Did 13000 AES-128-GCM (8192 bytes) seal operations in 1036524us (12541.9 ops/sec): 102.7 MB/s After: Did 1453000 AES-128-GCM (16 bytes) seal operations in 1000213us (1452690.6 ops/sec): 23.2 MB/s Did 285000 AES-128-GCM (256 bytes) seal operations in 1002227us (284366.7 ops/sec): 72.8 MB/s Did 60000 AES-128-GCM (1350 bytes) seal operations in 1016106us (59049.0 ops/sec): 79.7 MB/s Did 11000 AES-128-GCM (8192 bytes) seal operations in 1094184us (10053.2 ops/sec): 82.4 MB/s Note the numbers above run with PMULL off, so the slow GHASH is dampening the regression. If we test aes_nohw and vpaes paired with PMULL on, the 20% perf hit becomes a 31% hit. The PMULL-less variant is more likely to represent a real chip. This is consistent with upstream's note in the comment, though it is unclear if 20% is the right order of magnitude: "these results are worse than scalar compiler-generated code, but it's constant-time and therefore preferred". [] http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.100458_0301_00_en/lau1442495529696.html Bug: 246 Change-Id: If1dc87f5131fce742052498295476fbae4628dbf Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/35026 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>		2019-03-04 20:31:39 +00:00
.github	Add a PULL_REQUEST_TEMPLATE.	2016-03-08 15:23:52 +00:00
crypto	Enable vpaes for aarch64, with CTR optimizations.	2019-03-04 20:31:39 +00:00
decrepit	Remove non-STRICT_ALIGNMENT code from xts.c.	2019-02-14 17:32:11 +00:00
fipstools	Add a CFI tester to CHECK_ABI.	2019-01-03 22:01:55 +00:00
fuzz	Refresh fuzzer corpus.	2019-01-08 17:55:08 +00:00
include/openssl	silence unused variable warnings when using OPENSSL_clear_free	2019-03-04 19:55:29 +00:00
ssl	Clear out a bunch of -Wextra-semi warnings.	2019-02-21 19:12:39 +00:00
third_party	Clear out a bunch of -Wextra-semi warnings.	2019-02-21 19:12:39 +00:00
tool	Add an option to configure bssl speed chunk size.	2019-02-25 20:25:58 +00:00
util	Also include abi_test.cc in ssl_test_files.	2019-03-02 04:15:28 +00:00
.clang-format	Import `newhope' (post-quantum key exchange).	2016-04-26 22:53:59 +00:00
.gitignore	Add compiled python files to .gitignore.	2019-02-21 17:41:59 +00:00
API-CONVENTIONS.md	Clarify "reference" and fix typo.	2018-09-05 19:06:48 +00:00
BREAKING-CHANGES.md	Add some notes on how to handle breaking changes.	2018-04-28 00:04:41 +00:00
BUILDING.md	Add instructions for debugging on Android with gdb.	2019-02-01 02:51:11 +00:00
CMakeLists.txt	Add a RelWithAsserts build configuration.	2019-01-23 17:21:56 +00:00
codereview.settings	Comment change in codereview.settings	2018-07-26 00:23:04 +00:00
CONTRIBUTING.md	Add a CONTRIBUTING.md file.	2016-02-10 21:38:19 +00:00
FUZZING.md	Switch to Clang 6.0's fuzzer support.	2018-08-27 17:18:56 +00:00
go.mod	Add go 1.11 to go.mod.	2019-02-21 16:42:44 +00:00
INCORPORATING.md	Update URL for GN quick start guide.	2018-08-16 20:18:41 +00:00
LICENSE	Note licenses for support code in the top-level LICENSE file.	2018-03-27 17:03:47 +00:00
PORTING.md	Remove reference to SSL3 in PORTING.md.	2018-06-29 17:46:32 +00:00
README.md	Add some notes on how to handle breaking changes.	2018-04-28 00:04:41 +00:00
sources.cmake	Add new curve/hash ECDSA combinations from Wycheproof.	2018-08-10 18:26:06 +00:00
STYLE.md	Fix some style guide samples.	2017-08-31 14:24:45 +00:00

README.md

BoringSSL

BoringSSL is a fork of OpenSSL that is designed to meet Google's needs.

Although BoringSSL is an open source project, it is not intended for general use, as OpenSSL is. We don't recommend that third parties depend upon it. Doing so is likely to be frustrating because there are no guarantees of API or ABI stability.

Programs ship their own copies of BoringSSL when they use it and we update everything as needed when deciding to make API changes. This allows us to mostly avoid compromises in the name of compatibility. It works for us, but it may not work for you.

BoringSSL arose because Google used OpenSSL for many years in various ways and, over time, built up a large number of patches that were maintained while tracking upstream OpenSSL. As Google's product portfolio became more complex, more copies of OpenSSL sprung up and the effort involved in maintaining all these patches in multiple places was growing steadily.

Currently BoringSSL is the SSL library in Chrome/Chromium, Android (but it's not part of the NDK) and a number of other apps/programs.

There are other files in this directory which might be helpful:

PORTING.md: how to port OpenSSL-using code to BoringSSL.
BUILDING.md: how to build BoringSSL
INCORPORATING.md: how to incorporate BoringSSL into a project.
API-CONVENTIONS.md: general API conventions for BoringSSL consumers and developers.
STYLE.md: rules and guidelines for coding style.
include/openssl: public headers with API documentation in comments. Also available online.
FUZZING.md: information about fuzzing BoringSSL.
CONTRIBUTING.md: how to contribute to BoringSSL.
BREAKING-CHANGES.md: notes on potentially-breaking changes.