dm kcopyd: always complete failed jobs - BMC/Intel-BMC/linux.git - Intel OpenBMC Linux kernel source tree (mirror)


author	Dmitry Fomichev <dmitry.fomichev@wdc.com>	2019-08-06 02:56:03 +0300
committer	Mike Snitzer <snitzer@redhat.com>	2019-08-15 22:57:39 +0300
commit	d1fef41465f0e8cae0693fb184caa6bfafb6cd16 (patch)
tree	67f33ce9c7b628a4ebf0676f2199dc79e4df1179 /drivers/md/dm-io.c
parent	cf3591ef832915892f2499b7e54b51d4c578b28c (diff)
download	linux-d1fef41465f0e8cae0693fb184caa6bfafb6cd16.tar.xz

dm kcopyd: always complete failed jobs

This patch fixes a problem in dm-kcopyd that may leave jobs in
complete queue indefinitely in the event of backing storage failure.

This behavior has been observed while running 100% write file fio
workload against an XFS volume created on top of a dm-zoned target
device. If the underlying storage of dm-zoned goes to offline state
under I/O, kcopyd sometimes never issues the end copy callback and
dm-zoned reclaim work hangs indefinitely waiting for that completion.

This behavior was traced down to the error handling code in
process_jobs() function that places the failed job to complete_jobs
queue, but doesn't wake up the job handler. In case of backing device
failure, all outstanding jobs may end up going to complete_jobs queue
via this code path and then stay there forever because there are no
more successful I/O jobs to wake up the job handler.

This patch adds a wake() call to always wake up kcopyd job wait queue
for all I/O jobs that fail before dm_io() gets called for that job.

The patch also sets the write error status in all sub jobs that are
failed because their master job has failed.

Fixes: b73c67c2cbb00 ("dm kcopyd: add sequential write feature")
Cc: stable@vger.kernel.org
Signed-off-by: Dmitry Fomichev <dmitry.fomichev@wdc.com>
Reviewed-by: Damien Le Moal <damien.lemoal@wdc.com>
Signed-off-by: Mike Snitzer <snitzer@redhat.com>
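The hang described above can be illustrated with a small userspace model. This is only a sketch, not the kernel code or the actual patch (the diff is not shown on this page). The names wake() and complete_jobs come from the commit message; every other identifier (struct client, worker_pass(), simulate_offline_device(), the wake_on_error flag) is hypothetical. The model assumes a worker that runs only when woken, drains complete_jobs first, then tries to submit io_jobs, and that every submission fails because the device is offline.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_JOBS 8

struct job {
	int write_err;      /* nonzero once the job has failed */
	bool callback_ran;  /* set when the end-of-copy callback fires */
};

struct queue {
	struct job *items[MAX_JOBS];
	size_t count;
};

/* Hypothetical stand-in for a kcopyd client. */
struct client {
	struct queue io_jobs;        /* jobs waiting to be submitted */
	struct queue complete_jobs;  /* jobs waiting for their callback */
	bool worker_pending;         /* models a queued worker wake-up */
	bool wake_on_error;          /* true models the fixed behavior */
};

static void push(struct queue *q, struct job *j)
{
	assert(q->count < MAX_JOBS);
	q->items[q->count++] = j;
}

/* Order does not matter for this model, so pop from the tail. */
static struct job *pop(struct queue *q)
{
	return q->count ? q->items[--q->count] : NULL;
}

static void wake(struct client *c)
{
	c->worker_pending = true;
}

/* One worker pass; returns false if nobody woke the worker. */
static bool worker_pass(struct client *c)
{
	struct job *j;

	if (!c->worker_pending)
		return false;
	c->worker_pending = false;

	/* Run the callback of every job already on complete_jobs. */
	while ((j = pop(&c->complete_jobs)) != NULL)
		j->callback_ran = true;

	/* Device is offline: each job fails before any I/O is issued. */
	while ((j = pop(&c->io_jobs)) != NULL) {
		j->write_err = 1;
		push(&c->complete_jobs, j);
		if (c->wake_on_error)
			wake(c);  /* without this, no later pass ever runs */
	}
	return true;
}

/* Returns how many of three jobs never got their callback. */
static int simulate_offline_device(bool wake_on_error)
{
	struct job jobs[3] = { { 0, false }, { 0, false }, { 0, false } };
	struct client c = { .wake_on_error = wake_on_error };
	int stuck = 0;

	for (size_t i = 0; i < 3; i++)
		push(&c.io_jobs, &jobs[i]);
	wake(&c);  /* job submission wakes the worker once */

	while (worker_pass(&c))
		;

	for (size_t i = 0; i < 3; i++)
		if (!jobs[i].callback_ran)
			stuck++;
	return stuck;
}
```

In this model, simulate_offline_device(false) leaves all three jobs parked on complete_jobs because no successful I/O ever wakes the worker again, while simulate_offline_device(true) lets a second pass run and complete them with write_err set.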

Diffstat (limited to 'drivers/md/dm-io.c')

0 files changed, 0 insertions, 0 deletions

