summaryrefslogtreecommitdiff
path: root/drivers/crypto/caam/Makefile
diff options
context:
space:
mode:
authorEric Biggers <ebiggers@google.com>2017-12-19 03:40:26 +0300
committerHerbert Xu <herbert@gondor.apana.org.au>2017-12-28 09:56:44 +0300
commitd8c7fe9f2a486a6e5f0d5229ca43807af5ab22c6 (patch)
tree9053eee76e14bbb3bff22d06f9306715d8c72bc2 /drivers/crypto/caam/Makefile
parent427988d981c4c20a7f0421873351ccd14a88e2b2 (diff)
downloadlinux-d8c7fe9f2a486a6e5f0d5229ca43807af5ab22c6.tar.xz
crypto: x86/twofish-3way - Fix %rbp usage
Using %rbp as a temporary register breaks frame pointer convention and breaks stack traces when unwinding from an interrupt in the crypto code. In twofish-3way, we can't simply replace %rbp with another register because there are none available. Instead, we use the stack to hold the values that %rbp, %r11, and %r12 were holding previously. Each of these values represents the half of the output from the previous Feistel round that is being passed on unchanged to the following round. They are only used once per round, when they are exchanged with %rax, %rbx, and %rcx. As a result, we free up 3 registers (one per block) and can reassign them so that %rbp is not used, and additionally %r14 and %r15 are not used so they do not need to be saved/restored. There may be a small overhead caused by replacing 'xchg REG, REG' with the needed sequence 'mov MEM, REG; mov REG, MEM; mov REG, REG' once per round. But, counterintuitively, when I tested "ctr-twofish-3way" on a Haswell processor, the new version was actually about 2% faster. (Perhaps 'xchg' is not as well optimized as plain moves.) Reported-by: syzbot <syzkaller@googlegroups.com> Signed-off-by: Eric Biggers <ebiggers@google.com> Reviewed-by: Josh Poimboeuf <jpoimboe@redhat.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
Diffstat (limited to 'drivers/crypto/caam/Makefile')
0 files changed, 0 insertions, 0 deletions