[v4] scsi: target: tcmu: Fix possible data corruption

When tcmu_vma_fault() gets one page successfully, before the current
context completes page fault procedure, find_free_blocks() may run in
and call unmap_mapping_range() to unmap this page. Assume when
find_free_blocks() completes its job firstly, previous page fault
procedure starts to run again and completes, then one truncated page has
beed mapped to use space, but note that tcmu_vma_fault() has gotten one
refcount for this page, so any other subsystem won't use this page,
unless later the use space addr is unmapped.

If another command runs in later and needs to extends dbi_thresh, it may
reuse the corresponding slot to previous page in data_bitmap, then though
we'll allocate new page for this slot in data_area, but no page fault will
happen again, because we have a valid map, real request's data will lose.

Filesystem implementations will also run into this issue, but they
usually lock page when vm_operations_struct->fault gets one page, and
unlock page after finish_fault() completes. In truncate sides, they
lock pages in truncate_inode_pages() to protect race with page fault.
We can also have similar codes like filesystem to fix this issue.

To fix this possible data corruption, we can apply similar method like
filesystem. For pages that are to be freed, tcmu_blocks_release() locks
and unlocks these pages, and make tcmu_vma_fault() also lock found page
under cmdr_lock. At the same time, since tcmu_vma_fault() gets one extra
page refcount, tcmu_blocks_release() won't free pages if pages are in
page fault procedure, which means it's safe to call tcmu_blocks_release()
before unmap_mapping_range().

With above action, for above race, tcmu_blocks_release()
will wait all page faults to be completed before calling
unmap_mapping_range(), and later if unmap_mapping_range() is called,
it will ensure stale mappings to be removed cleanly.

Signed-off-by: Xiaoguang Wang <xiaoguang.wang@linux.alibaba.com>
---
V4:
 Add comments to explain why it's safe to call tcmu_blocks_release()
before unmap_mapping_range().

V3:
 Just lock/unlock_page in tcmu_blocks_release(), and call
tcmu_blocks_release() before unmap_mapping_range().

V2:
  Wait all possible inflight page faults to be completed in
find_free_blocks() to fix possible stale map.
---
 drivers/target/target_core_user.c | 36 +++++++++++++++++++++++++++++++++---
 1 file changed, 33 insertions(+), 3 deletions(-)

Message ID	20220417052604.120942-1-xiaoguang.wang@linux.alibaba.com
State	Superseded
Headers	show Return-Path: <linux-scsi-owner@kernel.org> From: Xiaoguang Wang <xiaoguang.wang@linux.alibaba.com> To: linux-scsi@vger.kernel.org, target-devel@vger.kernel.org Cc: linux-block@vger.kernel.org, bostroesser@gmail.com Subject: [PATCH v4] scsi: target: tcmu: Fix possible data corruption Date: Sun, 17 Apr 2022 13:26:04 +0800 Message-Id: <20220417052604.120942-1-xiaoguang.wang@linux.alibaba.com> Precedence: bulk
Series	[v4] scsi: target: tcmu: Fix possible data corruption \| expand [v4] scsi: target: tcmu: Fix possible data corruption

[v4] scsi: target: tcmu: Fix possible data corruption

Commit Message

Comments

Patch