diff options
author | Chuck Lever <chuck.lever@oracle.com> | 2021-03-11 21:25:01 +0300 |
---|---|---|
committer | Chuck Lever <chuck.lever@oracle.com> | 2021-03-11 23:26:07 +0300 |
commit | bade4be69a6ea6f38c5894468ede10ee60b6f7a0 (patch) | |
tree | 2f612411b0860a12b31419b89076f36f635350a8 /fs | |
parent | b4250dd868d1b42c0a65de11ef3afbee67ba5d2f (diff) | |
download | linux-bade4be69a6ea6f38c5894468ede10ee60b6f7a0.tar.xz |
svcrdma: Revert "svcrdma: Reduce Receive doorbell rate"
I tested commit 43042b90cae1 ("svcrdma: Reduce Receive doorbell
rate") with mlx4 (IB) and software iWARP and didn't find any
issues. However, I recently got my hardware iWARP setup back on
line (FastLinQ) and it's crashing hard on this commit (confirmed
via bisect).
The failure mode is complex.
- After a connection is established, the first Receive completes
normally.
- But the second and third Receives have garbage in their Receive
buffers. The server responds with ERR_VERS as a result.
- When the client tears down the connection to retry, a couple
of posted Receives flush twice, and that corrupts the recv_ctxt
free list.
- __svc_rdma_free then faults or loops infinitely while destroying
the xprt's recv_ctxts.
Since 43042b90cae1 ("svcrdma: Reduce Receive doorbell rate") does
not fix a bug but is a scalability enhancement, it's safe and
appropriate to revert it while working on a replacement.
Fixes: 43042b90cae1 ("svcrdma: Reduce Receive doorbell rate")
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Diffstat (limited to 'fs')
0 files changed, 0 insertions, 0 deletions