From patchwork Tue Dec 18 17:13:52 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Sebastian Andrzej Siewior X-Patchwork-Id: 154168 Delivered-To: patch@linaro.org Received: by 2002:a2e:299d:0:0:0:0:0 with SMTP id p29-v6csp3962868ljp; Tue, 18 Dec 2018 09:15:26 -0800 (PST) X-Google-Smtp-Source: AFSGD/VIOb31wsUHerfQDcDcFQklJZ02lLBHbq+Lnqgi89E1/yYbcmZIkhGN0bLw8aDQOs5/wHHt X-Received: by 2002:a63:4745:: with SMTP id w5mr3674840pgk.377.1545153326286; Tue, 18 Dec 2018 09:15:26 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1545153326; cv=none; d=google.com; s=arc-20160816; b=lx2031Mutnt8xHw+NyyA6hE+dstVH/vM3DtmkjY5rzJQqkF6Lmi1VZRwH2HVBV3NlT 1vnXv8S/sNVFfkXCfnBurqhBj/IVnF9RFiT9GmGfAH1EXosXunl9sE74TLWSNa9LZTXV Lr0uELcNIQmRJrxjCmXF/xEJUZvzZV+3IgOYpoGimWb8yKwLrbYHr1vn/Oo6sSqwhH30 EAK/aWOc7cSSx3K4W3coy9CUzahtsiX/6OxQJXQGZwY86RThhNlLorXQNRsabtsaLPkc BKpDFay5apDskcHW3dNxv22SC3ffFw9rj11wcIfFGcIt6RpRoijyQKUcJPctbDp0kzhH 52/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=eEjUtvANlr0D4DVZbLpU9HxjnuDrZHrl0Q+AyZA2X+o=; b=whDwURUmLAmRQj8jMm5QuPA/wErbZ+1gSOxcxxbdoXybEMJPhWoZI7t90RDrNqP9Rs NScqoxlO9wxvZ4jYk27XpymrqBtzRA1TE/SH4QCqFw+jv3nIveRG7MPfQQU99/mpIka7 I8j+Ps7xXjIFfbKza01R7H7xcffdRj5BLpSNMyVMqqp97kuepy6Sqz71DVOmhLsDkEqL nWsrhLBYOvQ9vZPzXj7M3515TnnX7TO0tc/PZ+59wwSNmyZ3kYoJ6tp7ija0pQIIwYam BBiU6i0/FHS4kpDlFigCfyj1QhKOrVlQCmEoE2mprEFHObb0+rKe8/8HMLQj74Ic8AdE s8Vg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of stable-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=stable-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k69si14181034pga.176.2018.12.18.09.15.25; Tue, 18 Dec 2018 09:15:26 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of stable-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of stable-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=stable-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727365AbeLRRO3 (ORCPT + 15 others); Tue, 18 Dec 2018 12:14:29 -0500 Received: from Galois.linutronix.de ([146.0.238.70]:56666 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727333AbeLRRO2 (ORCPT ); Tue, 18 Dec 2018 12:14:28 -0500 Received: from localhost ([127.0.0.1] helo=bazinga.breakpoint.cc) by Galois.linutronix.de with esmtp (Exim 4.80) (envelope-from ) id 1gZIwg-0006QU-G3; Tue, 18 Dec 2018 18:14:14 +0100 From: Sebastian Andrzej Siewior To: stable@vger.kernel.org Cc: Peter Zijlstra , Will Deacon , Thomas Gleixner , Daniel Wagner , bigeasy@linutronix.de, Linus Torvalds , Ingo Molnar Subject: [PATCH STABLE v4.14 02/10] locking/qspinlock: Ensure node is initialised before updating prev->next Date: Tue, 18 Dec 2018 18:13:52 +0100 Message-Id: <20181218171400.22711-3-bigeasy@linutronix.de> X-Mailer: git-send-email 2.20.1 In-Reply-To: <20181218171400.22711-1-bigeasy@linutronix.de> References: <20181218171400.22711-1-bigeasy@linutronix.de> MIME-Version: 1.0 Sender: stable-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: stable@vger.kernel.org From: Will Deacon commit 95bcade33a8af38755c9b0636e36a36ad3789fe6 upstream. When a locker ends up queuing on the qspinlock locking slowpath, we initialise the relevant mcs node and publish it indirectly by updating the tail portion of the lock word using xchg_tail. If we find that there was a pre-existing locker in the queue, we subsequently update their ->next field to point at our node so that we are notified when it's our turn to take the lock. This can be roughly illustrated as follows: /* Initialise the fields in node and encode a pointer to node in tail */ tail = initialise_node(node); /* * Exchange tail into the lockword using an atomic read-modify-write * operation with release semantics */ old = xchg_tail(lock, tail); /* If there was a pre-existing waiter ... */ if (old & _Q_TAIL_MASK) { prev = decode_tail(old); smp_read_barrier_depends(); /* ... then update their ->next field to point to node. WRITE_ONCE(prev->next, node); } The conditional update of prev->next therefore relies on the address dependency from the result of xchg_tail ensuring order against the prior initialisation of node. However, since the release semantics of the xchg_tail operation apply only to the write portion of the RmW, then this ordering is not guaranteed and it is possible for the CPU to return old before the writes to node have been published, consequently allowing us to point prev->next to an uninitialised node. This patch fixes the problem by making the update of prev->next a RELEASE operation, which also removes the reliance on dependency ordering. Signed-off-by: Will Deacon Acked-by: Peter Zijlstra (Intel) Cc: Linus Torvalds Cc: Thomas Gleixner Link: http://lkml.kernel.org/r/1518528177-19169-2-git-send-email-will.deacon@arm.com Signed-off-by: Ingo Molnar Signed-off-by: Sebastian Andrzej Siewior --- kernel/locking/qspinlock.c | 15 ++++++++------- 1 file changed, 8 insertions(+), 7 deletions(-) -- 2.20.1 diff --git a/kernel/locking/qspinlock.c b/kernel/locking/qspinlock.c index 5541acb79e152..d880296245c59 100644 --- a/kernel/locking/qspinlock.c +++ b/kernel/locking/qspinlock.c @@ -416,14 +416,15 @@ void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val) */ if (old & _Q_TAIL_MASK) { prev = decode_tail(old); - /* - * The above xchg_tail() is also a load of @lock which - * generates, through decode_tail(), a pointer. The address - * dependency matches the RELEASE of xchg_tail() such that - * the subsequent access to @prev happens after. - */ - WRITE_ONCE(prev->next, node); + /* + * We must ensure that the stores to @node are observed before + * the write to prev->next. The address dependency from + * xchg_tail is not sufficient to ensure this because the read + * component of xchg_tail is unordered with respect to the + * initialisation of @node. + */ + smp_store_release(&prev->next, node); pv_wait_node(node, prev); arch_mcs_spin_lock_contended(&node->locked);