From patchwork Tue Dec 18 22:10:41 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154212
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Linus Torvalds, Ingo Molnar
Subject: [PATCH STABLE v4.9 02/10] locking/qspinlock: Ensure node is
 initialised before updating prev->next
Date: Tue, 18 Dec 2018 23:10:41 +0100
Message-Id: <20181218221049.6816-3-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit 95bcade33a8af38755c9b0636e36a36ad3789fe6 upstream.

When a locker ends up queuing on the qspinlock locking slowpath, we
initialise the relevant mcs node and publish it indirectly by updating
the tail portion of the lock word using xchg_tail. If we find that there
was a pre-existing locker in the queue, we subsequently update their
->next field to point at our node so that we are notified when it's our
turn to take the lock. This can be roughly illustrated as follows:

  /* Initialise the fields in node and encode a pointer to node in tail */
  tail = initialise_node(node);

  /*
   * Exchange tail into the lockword using an atomic read-modify-write
   * operation with release semantics
   */
  old = xchg_tail(lock, tail);

  /* If there was a pre-existing waiter ... */
  if (old & _Q_TAIL_MASK) {
          prev = decode_tail(old);
          smp_read_barrier_depends();

          /* ... then update their ->next field to point to node. */
          WRITE_ONCE(prev->next, node);
  }

The conditional update of prev->next therefore relies on the address
dependency from the result of xchg_tail ensuring order against the
prior initialisation of node. However, since the release semantics of
the xchg_tail operation apply only to the write portion of the RmW,
this ordering is not guaranteed and it is possible for the CPU to
return old before the writes to node have been published, consequently
allowing us to point prev->next to an uninitialised node.

This patch fixes the problem by making the update of prev->next a
RELEASE operation, which also removes the reliance on dependency
ordering.
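[ Editor's illustration: the same hazard and fix can be sketched in
  stand-alone C11, outside the kernel. The MCS-style node and enqueue()
  below are hypothetical simplifications; only the move from a relaxed
  publication of the node to a release store mirrors the patch. ]

  #include <stdatomic.h>
  #include <stddef.h>

  struct node {
          int locked;                     /* initialised before publication */
          _Atomic(struct node *) next;
  };

  static _Atomic(struct node *) tail;

  static void enqueue(struct node *me)
  {
          me->locked = 1;
          atomic_store_explicit(&me->next, NULL, memory_order_relaxed);

          /*
           * The release below orders the stores to *me before the store
           * half of the exchange, but says nothing about the *load*
           * half, exactly as with xchg_tail() in the patch.
           */
          struct node *prev = atomic_exchange_explicit(&tail, me,
                                                       memory_order_release);
          if (prev) {
                  /*
                   * A relaxed store here could publish a node whose
                   * fields are not yet visible to the previous waiter,
                   * so publish with release semantics, as the patch
                   * does with smp_store_release().
                   */
                  atomic_store_explicit(&prev->next, me,
                                        memory_order_release);
          }
  }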
Signed-off-by: Will Deacon
Acked-by: Peter Zijlstra (Intel)
Cc: Linus Torvalds
Cc: Thomas Gleixner
Link: http://lkml.kernel.org/r/1518528177-19169-2-git-send-email-will.deacon@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 kernel/locking/qspinlock.c | 15 ++++++++-------
 1 file changed, 8 insertions(+), 7 deletions(-)

--
2.20.1

diff --git a/kernel/locking/qspinlock.c b/kernel/locking/qspinlock.c
index 8710fbe8d26c0..6fce84401dba1 100644
--- a/kernel/locking/qspinlock.c
+++ b/kernel/locking/qspinlock.c
@@ -532,14 +532,15 @@ void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val)
 	 */
 	if (old & _Q_TAIL_MASK) {
 		prev = decode_tail(old);
-		/*
-		 * The above xchg_tail() is also a load of @lock which
-		 * generates, through decode_tail(), a pointer. The address
-		 * dependency matches the RELEASE of xchg_tail() such that
-		 * the subsequent access to @prev happens after.
-		 */
-		WRITE_ONCE(prev->next, node);
+		/*
+		 * We must ensure that the stores to @node are observed before
+		 * the write to prev->next. The address dependency from
+		 * xchg_tail is not sufficient to ensure this because the read
+		 * component of xchg_tail is unordered with respect to the
+		 * initialisation of @node.
+		 */
+		smp_store_release(&prev->next, node);

 		pv_wait_node(node, prev);
 		arch_mcs_spin_lock_contended(&node->locked);

From patchwork Tue Dec 18 22:10:42 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154213
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Waiman Long, Linus Torvalds,
    boqun.feng@gmail.com, linux-arm-kernel@lists.infradead.org,
    paulmck@linux.vnet.ibm.com, Ingo Molnar
Subject: [PATCH STABLE v4.9 03/10] locking/qspinlock: Bound spinning on
 pending->locked transition in slowpath
Date: Tue, 18 Dec 2018 23:10:42 +0100
Message-Id: <20181218221049.6816-4-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit 6512276d97b160d90b53285bd06f7f201459a7e3 upstream.

If a locker taking the qspinlock slowpath reads a lock value indicating
that only the pending bit is set, then it will spin whilst the
concurrent pending->locked transition takes effect.

Unfortunately, there is no guarantee that such a transition will ever be
observed, since concurrent lockers could continuously set pending and
hand over the lock amongst themselves, leading to starvation. Whilst
this would probably resolve in practice, it means that it is not
possible to prove liveness properties about the lock and means that lock
acquisition time is unbounded.

Rather than removing the pending->locked spinning from the slowpath
altogether (which has been shown to heavily penalise a 2-threaded
locking stress test on x86), this patch replaces the explicit spinning
with a call to atomic_cond_read_relaxed and allows the architecture to
provide a bound on the number of spins. For architectures that can
respond to changes in cacheline state in their smp_cond_load
implementation, it should be sufficient to use the default bound of 1.
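[ Editor's sketch: the bounded wait transcribed into stand-alone C11,
  for illustration only. Q_PENDING_LOOPS and Q_PENDING_VAL are
  stand-ins for the kernel's definitions; the kernel itself uses
  smp_cond_load_acquire(), as the diff below shows. ]

  #include <stdatomic.h>
  #include <stdint.h>

  #ifndef Q_PENDING_LOOPS
  #define Q_PENDING_LOOPS 1          /* default bound; arch may raise it */
  #endif

  #define Q_PENDING_VAL 0x00000100u  /* stand-in for _Q_PENDING_VAL */

  static uint32_t wait_for_pending_handover(_Atomic uint32_t *lockword)
  {
          int cnt = Q_PENDING_LOOPS;
          uint32_t val;

          /*
           * Re-read the lockword until it leaves the pending-only state
           * or the spin budget runs out, mirroring the patch's
           * (VAL != _Q_PENDING_VAL) || !cnt-- condition.
           */
          do {
                  val = atomic_load_explicit(lockword, memory_order_acquire);
          } while (val == Q_PENDING_VAL && cnt-- > 0);

          return val;
  }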
Suggested-by: Waiman Long
Signed-off-by: Will Deacon
Acked-by: Peter Zijlstra (Intel)
Acked-by: Waiman Long
Cc: Linus Torvalds
Cc: Thomas Gleixner
Cc: boqun.feng@gmail.com
Cc: linux-arm-kernel@lists.infradead.org
Cc: paulmck@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/1524738868-31318-4-git-send-email-will.deacon@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 kernel/locking/qspinlock.c | 20 +++++++++++++++++---
 1 file changed, 17 insertions(+), 3 deletions(-)

--
2.20.1

diff --git a/kernel/locking/qspinlock.c b/kernel/locking/qspinlock.c
index 6fce84401dba1..a8da1fc5222eb 100644
--- a/kernel/locking/qspinlock.c
+++ b/kernel/locking/qspinlock.c
@@ -75,6 +75,18 @@
 #define MAX_NODES	4
 #endif

+/*
+ * The pending bit spinning loop count.
+ * This heuristic is used to limit the number of lockword accesses
+ * made by atomic_cond_read_relaxed when waiting for the lock to
+ * transition out of the "== _Q_PENDING_VAL" state. We don't spin
+ * indefinitely because there's no guarantee that we'll make forward
+ * progress.
+ */
+#ifndef _Q_PENDING_LOOPS
+#define _Q_PENDING_LOOPS	1
+#endif
+
 /*
  * Per-CPU queue node structures; we can never have more than 4 nested
  * contexts: task, softirq, hardirq, nmi.
@@ -422,13 +434,15 @@ void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val)
 		return;

 	/*
-	 * wait for in-progress pending->locked hand-overs
+	 * Wait for in-progress pending->locked hand-overs with a bounded
+	 * number of spins so that we guarantee forward progress.
 	 *
 	 * 0,1,0 -> 0,0,1
 	 */
 	if (val == _Q_PENDING_VAL) {
-		while ((val = atomic_read(&lock->val)) == _Q_PENDING_VAL)
-			cpu_relax();
+		int cnt = _Q_PENDING_LOOPS;
+		val = smp_cond_load_acquire(&lock->val.counter,
+					    (VAL != _Q_PENDING_VAL) || !cnt--);
 	}

 	/*

From patchwork Tue Dec 18 22:10:43 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154216
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Waiman Long, Boqun Feng, Linus Torvalds,
    linux-arm-kernel@lists.infradead.org, paulmck@linux.vnet.ibm.com,
    Ingo Molnar
Subject: [PATCH STABLE v4.9 04/10] locking/qspinlock: Merge 'struct
 __qspinlock' into 'struct qspinlock'
Date: Tue, 18 Dec 2018 23:10:43 +0100
Message-Id: <20181218221049.6816-5-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit 625e88be1f41b53cec55827c984e4a89ea8ee9f9 upstream.

'struct __qspinlock' provides a handy union of fields so that
subcomponents of the lockword can be accessed by name, without having
to manage shifts and masks explicitly and take endianness into account.

This is useful in qspinlock.h and also potentially in arch headers, so
move the 'struct __qspinlock' into 'struct qspinlock' and kill the
extra definition.
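[ Editor's illustration: a small userspace rendering of the merged
  layout, little-endian case only, with the kernel's atomic_t replaced
  by a plain u32. It shows why the union lets subfields be addressed
  by name instead of via shifts and masks. The struct name is
  hypothetical. ]

  #include <assert.h>
  #include <stdint.h>
  #include <stdio.h>

  struct qspinlock_demo {
          union {
                  uint32_t val;          /* atomic_t in the kernel */
                  struct {               /* assumes a little-endian host */
                          uint8_t locked;
                          uint8_t pending;
                  };
                  struct {
                          uint16_t locked_pending;
                          uint16_t tail;
                  };
          };
  };

  int main(void)
  {
          struct qspinlock_demo lock = { .val = 0 };

          lock.locked = 1;               /* same storage as bits 0-7 of val */
          lock.pending = 1;              /* bits 8-15 of val */

          assert(lock.val == 0x0101);            /* little-endian layout */
          assert(lock.locked_pending == 0x0101); /* low half-word */
          printf("val = 0x%08x\n", lock.val);
          return 0;
  }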
Signed-off-by: Will Deacon
Acked-by: Peter Zijlstra (Intel)
Acked-by: Waiman Long
Acked-by: Boqun Feng
Cc: Linus Torvalds
Cc: Thomas Gleixner
Cc: linux-arm-kernel@lists.infradead.org
Cc: paulmck@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/1524738868-31318-3-git-send-email-will.deacon@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 arch/x86/include/asm/qspinlock.h          |  2 +-
 arch/x86/include/asm/qspinlock_paravirt.h |  3 +-
 include/asm-generic/qspinlock_types.h     | 32 +++++++++++++++-
 kernel/locking/qspinlock.c                | 46 ++---------------------
 kernel/locking/qspinlock_paravirt.h       | 34 ++++++-----------
 5 files changed, 46 insertions(+), 71 deletions(-)

--
2.20.1

diff --git a/arch/x86/include/asm/qspinlock.h b/arch/x86/include/asm/qspinlock.h
index eaba080760300..e07cc206919d4 100644
--- a/arch/x86/include/asm/qspinlock.h
+++ b/arch/x86/include/asm/qspinlock.h
@@ -14,7 +14,7 @@
  */
 static inline void native_queued_spin_unlock(struct qspinlock *lock)
 {
-	smp_store_release((u8 *)lock, 0);
+	smp_store_release(&lock->locked, 0);
 }

 #ifdef CONFIG_PARAVIRT_SPINLOCKS
diff --git a/arch/x86/include/asm/qspinlock_paravirt.h b/arch/x86/include/asm/qspinlock_paravirt.h
index 9d55f9b6e167c..fc75415ae9719 100644
--- a/arch/x86/include/asm/qspinlock_paravirt.h
+++ b/arch/x86/include/asm/qspinlock_paravirt.h
@@ -21,8 +21,7 @@ PV_CALLEE_SAVE_REGS_THUNK(__pv_queued_spin_unlock_slowpath);
  *
  * void __pv_queued_spin_unlock(struct qspinlock *lock)
  * {
- *	struct __qspinlock *l = (void *)lock;
- *	u8 lockval = cmpxchg(&l->locked, _Q_LOCKED_VAL, 0);
+ *	u8 lockval = cmpxchg(&lock->locked, _Q_LOCKED_VAL, 0);
  *
  *	if (likely(lockval == _Q_LOCKED_VAL))
  *		return;
diff --git a/include/asm-generic/qspinlock_types.h b/include/asm-generic/qspinlock_types.h
index 034acd0c4956b..0763f065b975a 100644
--- a/include/asm-generic/qspinlock_types.h
+++ b/include/asm-generic/qspinlock_types.h
@@ -29,13 +29,41 @@
 #endif

 typedef struct qspinlock {
-	atomic_t	val;
+	union {
+		atomic_t val;
+
+		/*
+		 * By using the whole 2nd least significant byte for the
+		 * pending bit, we can allow better optimization of the lock
+		 * acquisition for the pending bit holder.
+		 */
+#ifdef __LITTLE_ENDIAN
+		struct {
+			u8	locked;
+			u8	pending;
+		};
+		struct {
+			u16	locked_pending;
+			u16	tail;
+		};
+#else
+		struct {
+			u16	tail;
+			u16	locked_pending;
+		};
+		struct {
+			u8	reserved[2];
+			u8	pending;
+			u8	locked;
+		};
+#endif
+	};
 } arch_spinlock_t;

 /*
  * Initializier
  */
-#define	__ARCH_SPIN_LOCK_UNLOCKED	{ ATOMIC_INIT(0) }
+#define	__ARCH_SPIN_LOCK_UNLOCKED	{ .val = ATOMIC_INIT(0) }

 /*
  * Bitfields in the atomic value:
diff --git a/kernel/locking/qspinlock.c b/kernel/locking/qspinlock.c
index a8da1fc5222eb..cc98050e8bec0 100644
--- a/kernel/locking/qspinlock.c
+++ b/kernel/locking/qspinlock.c
@@ -125,40 +125,6 @@ static inline __pure struct mcs_spinlock *decode_tail(u32 tail)

 #define _Q_LOCKED_PENDING_MASK (_Q_LOCKED_MASK | _Q_PENDING_MASK)

-/*
- * By using the whole 2nd least significant byte for the pending bit, we
- * can allow better optimization of the lock acquisition for the pending
- * bit holder.
- *
- * This internal structure is also used by the set_locked function which
- * is not restricted to _Q_PENDING_BITS == 8.
- */
-struct __qspinlock {
-	union {
-		atomic_t val;
-#ifdef __LITTLE_ENDIAN
-		struct {
-			u8	locked;
-			u8	pending;
-		};
-		struct {
-			u16	locked_pending;
-			u16	tail;
-		};
-#else
-		struct {
-			u16	tail;
-			u16	locked_pending;
-		};
-		struct {
-			u8	reserved[2];
-			u8	pending;
-			u8	locked;
-		};
-#endif
-	};
-};
-
 #if _Q_PENDING_BITS == 8
 /**
  * clear_pending_set_locked - take ownership and clear the pending bit.
@@ -170,9 +136,7 @@ struct __qspinlock {
  */
 static __always_inline void clear_pending_set_locked(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
-
-	WRITE_ONCE(l->locked_pending, _Q_LOCKED_VAL);
+	WRITE_ONCE(lock->locked_pending, _Q_LOCKED_VAL);
 }

 /*
@@ -187,13 +151,11 @@ static __always_inline void clear_pending_set_locked(struct qspinlock *lock)
  */
 static __always_inline u32 xchg_tail(struct qspinlock *lock, u32 tail)
 {
-	struct __qspinlock *l = (void *)lock;
-
 	/*
 	 * Use release semantics to make sure that the MCS node is properly
 	 * initialized before changing the tail code.
 	 */
-	return (u32)xchg_release(&l->tail,
+	return (u32)xchg_release(&lock->tail,
 				 tail >> _Q_TAIL_OFFSET) << _Q_TAIL_OFFSET;
 }

@@ -248,9 +210,7 @@ static __always_inline u32 xchg_tail(struct qspinlock *lock, u32 tail)
  */
 static __always_inline void set_locked(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
-
-	WRITE_ONCE(l->locked, _Q_LOCKED_VAL);
+	WRITE_ONCE(lock->locked, _Q_LOCKED_VAL);
 }

diff --git a/kernel/locking/qspinlock_paravirt.h b/kernel/locking/qspinlock_paravirt.h
index e3b5520005db7..3e33336911ffe 100644
--- a/kernel/locking/qspinlock_paravirt.h
+++ b/kernel/locking/qspinlock_paravirt.h
@@ -69,10 +69,8 @@ struct pv_node {
 #define queued_spin_trylock(l)	pv_queued_spin_steal_lock(l)
 static inline bool pv_queued_spin_steal_lock(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
-
 	if (!(atomic_read(&lock->val) & _Q_LOCKED_PENDING_MASK) &&
-	    (cmpxchg(&l->locked, 0, _Q_LOCKED_VAL) == 0)) {
+	    (cmpxchg(&lock->locked, 0, _Q_LOCKED_VAL) == 0)) {
 		qstat_inc(qstat_pv_lock_stealing, true);
 		return true;
 	}
@@ -87,16 +85,12 @@ static inline bool pv_queued_spin_steal_lock(struct qspinlock *lock)
 #if _Q_PENDING_BITS == 8
 static __always_inline void set_pending(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
-
-	WRITE_ONCE(l->pending, 1);
+	WRITE_ONCE(lock->pending, 1);
 }

 static __always_inline void clear_pending(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
-
-	WRITE_ONCE(l->pending, 0);
+	WRITE_ONCE(lock->pending, 0);
 }

 /*
@@ -106,10 +100,8 @@ static __always_inline void clear_pending(struct qspinlock *lock)
  */
 static __always_inline int trylock_clear_pending(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
-
-	return !READ_ONCE(l->locked) &&
-	       (cmpxchg(&l->locked_pending, _Q_PENDING_VAL, _Q_LOCKED_VAL)
+	return !READ_ONCE(lock->locked) &&
+	       (cmpxchg(&lock->locked_pending, _Q_PENDING_VAL, _Q_LOCKED_VAL)
 			== _Q_PENDING_VAL);
 }
 #else /* _Q_PENDING_BITS == 8 */
@@ -353,7 +345,6 @@ static void pv_wait_node(struct mcs_spinlock *node, struct mcs_spinlock *prev)
 static void pv_kick_node(struct qspinlock *lock, struct mcs_spinlock *node)
 {
 	struct pv_node *pn = (struct pv_node *)node;
-	struct __qspinlock *l = (void *)lock;

 	/*
 	 * If the vCPU is indeed halted, advance its state to match that of
@@ -372,7 +363,7 @@ static void pv_kick_node(struct qspinlock *lock, struct mcs_spinlock *node)
 	 * the hash table later on at unlock time, no atomic instruction is
 	 * needed.
 	 */
-	WRITE_ONCE(l->locked, _Q_SLOW_VAL);
+	WRITE_ONCE(lock->locked, _Q_SLOW_VAL);
 	(void)pv_hash(lock, pn);
 }

@@ -387,7 +378,6 @@ static u32
 pv_wait_head_or_lock(struct qspinlock *lock, struct mcs_spinlock *node)
 {
 	struct pv_node *pn = (struct pv_node *)node;
-	struct __qspinlock *l = (void *)lock;
 	struct qspinlock **lp = NULL;
 	int waitcnt = 0;
 	int loop;
@@ -438,13 +428,13 @@ pv_wait_head_or_lock(struct qspinlock *lock, struct mcs_spinlock *node)
 	 *
 	 * Matches the smp_rmb() in __pv_queued_spin_unlock().
 	 */
-	if (xchg(&l->locked, _Q_SLOW_VAL) == 0) {
+	if (xchg(&lock->locked, _Q_SLOW_VAL) == 0) {
 		/*
 		 * The lock was free and now we own the lock.
 		 * Change the lock value back to _Q_LOCKED_VAL
 		 * and unhash the table.
 		 */
-		WRITE_ONCE(l->locked, _Q_LOCKED_VAL);
+		WRITE_ONCE(lock->locked, _Q_LOCKED_VAL);
 		WRITE_ONCE(*lp, NULL);
 		goto gotlock;
 	}
@@ -452,7 +442,7 @@ pv_wait_head_or_lock(struct qspinlock *lock, struct mcs_spinlock *node)

 		WRITE_ONCE(pn->state, vcpu_hashed);
 		qstat_inc(qstat_pv_wait_head, true);
 		qstat_inc(qstat_pv_wait_again, waitcnt);
-		pv_wait(&l->locked, _Q_SLOW_VAL);
+		pv_wait(&lock->locked, _Q_SLOW_VAL);

 		/*
 		 * Because of lock stealing, the queue head vCPU may not be
@@ -477,7 +467,6 @@ pv_wait_head_or_lock(struct qspinlock *lock, struct mcs_spinlock *node)
 __visible void
 __pv_queued_spin_unlock_slowpath(struct qspinlock *lock, u8 locked)
 {
-	struct __qspinlock *l = (void *)lock;
 	struct pv_node *node;

 	if (unlikely(locked != _Q_SLOW_VAL)) {
@@ -506,7 +495,7 @@ __pv_queued_spin_unlock_slowpath(struct qspinlock *lock, u8 locked)
 	 * Now that we have a reference to the (likely) blocked pv_node,
 	 * release the lock.
 	 */
-	smp_store_release(&l->locked, 0);
+	smp_store_release(&lock->locked, 0);

 	/*
 	 * At this point the memory pointed at by lock can be freed/reused,
@@ -532,7 +521,6 @@ __pv_queued_spin_unlock_slowpath(struct qspinlock *lock, u8 locked)
 #ifndef __pv_queued_spin_unlock
 __visible void __pv_queued_spin_unlock(struct qspinlock *lock)
 {
-	struct __qspinlock *l = (void *)lock;
 	u8 locked;

 	/*
 	 * unhash. Otherwise it would be possible to have multiple @lock
 	 * entries, which would be BAD.
 	 */
-	locked = cmpxchg_release(&l->locked, _Q_LOCKED_VAL, 0);
+	locked = cmpxchg_release(&lock->locked, _Q_LOCKED_VAL, 0);
 	if (likely(locked == _Q_LOCKED_VAL))
 		return;

From patchwork Tue Dec 18 22:10:44 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154214
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Waiman Long, Linus Torvalds,
    boqun.feng@gmail.com, linux-arm-kernel@lists.infradead.org,
    paulmck@linux.vnet.ibm.com, Ingo Molnar
Subject: [PATCH STABLE v4.9 05/10] locking/qspinlock: Remove unbounded
 cmpxchg() loop from locking slowpath
Date: Tue, 18 Dec 2018 23:10:44 +0100
Message-Id: <20181218221049.6816-6-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit 59fb586b4a07b4e1a0ee577140ab4842ba451acd upstream.

The qspinlock locking slowpath utilises a "pending" bit as a simple form
of an embedded test-and-set lock that can avoid the overhead of explicit
queuing in cases where the lock is held but uncontended. This bit is
managed using a cmpxchg() loop which tries to transition the uncontended
lock word from (0,0,0) -> (0,0,1) or (0,0,1) -> (0,1,1).

Unfortunately, the cmpxchg() loop is unbounded and lockers can be
starved indefinitely if the lock word is seen to oscillate between
unlocked (0,0,0) and locked (0,0,1). This could happen if concurrent
lockers are able to take the lock in the cmpxchg() loop without queuing
and pass it around amongst themselves.

This patch fixes the problem by unconditionally setting _Q_PENDING_VAL
using atomic_fetch_or, and then inspecting the old value to see whether
we need to spin on the current lock owner, or whether we now effectively
hold the lock. The tricky scenario is when concurrent lockers end up
queuing on the lock and the lock becomes available, causing us to see a
lockword of (n,0,0). With pending now set, simply queuing could lead to
deadlock as the head of the queue may not have observed the pending flag
being cleared. Conversely, if the head of the queue did observe pending
being cleared, then it could transition the lock from (n,0,0) ->
(0,0,1), meaning that any attempt to "undo" our setting of the pending
bit could race with a concurrent locker trying to set it.

We handle this race by preserving the pending bit when taking the lock
after reaching the head of the queue and leaving the tail entry intact
if we saw pending set, because we know that the tail is going to be
updated shortly.
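[ Editor's sketch: the core of the new pending-bit acquisition in
  stand-alone C11, for illustration. The constants and helper name are
  stand-ins for the kernel's; clear_pending_set_locked() is only
  described in a comment, since it relies on the sub-word layout
  introduced by the previous patch. ]

  #include <stdatomic.h>
  #include <stdbool.h>
  #include <stdint.h>

  #define Q_LOCKED_MASK  0x000000ffu   /* stand-ins for the kernel masks */
  #define Q_PENDING_VAL  0x00000100u

  static bool try_pending(_Atomic uint32_t *lockword)
  {
          /*
           * 0,0,* -> 0,1,*: one unconditional, bounded read-modify-write
           * replaces the old unbounded cmpxchg() loop.
           */
          uint32_t val = atomic_fetch_or_explicit(lockword, Q_PENDING_VAL,
                                                  memory_order_acquire);

          if (val & ~Q_LOCKED_MASK)
                  return false;   /* tail and/or pending already set: the
                                   * caller queues, first undoing pending
                                   * if it observed pending clear */

          /* *,1,1 -> *,1,0: wait for the current owner to go away. */
          while (atomic_load_explicit(lockword, memory_order_acquire)
                 & Q_LOCKED_MASK)
                  ;

          /*
           * *,1,0 -> *,0,1: the kernel now calls
           * clear_pending_set_locked(), a half-word store that cannot
           * clobber concurrent tail updates; elided here.
           */
          return true;
  }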
Signed-off-by: Will Deacon
Acked-by: Peter Zijlstra (Intel)
Acked-by: Waiman Long
Cc: Linus Torvalds
Cc: Thomas Gleixner
Cc: boqun.feng@gmail.com
Cc: linux-arm-kernel@lists.infradead.org
Cc: paulmck@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/1524738868-31318-6-git-send-email-will.deacon@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 kernel/locking/qspinlock.c          | 102 ++++++++++++++++------------
 kernel/locking/qspinlock_paravirt.h |   5 --
 2 files changed, 58 insertions(+), 49 deletions(-)

--
2.20.1

diff --git a/kernel/locking/qspinlock.c b/kernel/locking/qspinlock.c
index cc98050e8bec0..e7ab99a1f4387 100644
--- a/kernel/locking/qspinlock.c
+++ b/kernel/locking/qspinlock.c
@@ -126,6 +126,17 @@ static inline __pure struct mcs_spinlock *decode_tail(u32 tail)
 #define _Q_LOCKED_PENDING_MASK (_Q_LOCKED_MASK | _Q_PENDING_MASK)

 #if _Q_PENDING_BITS == 8
+/**
+ * clear_pending - clear the pending bit.
+ * @lock: Pointer to queued spinlock structure
+ *
+ * *,1,* -> *,0,*
+ */
+static __always_inline void clear_pending(struct qspinlock *lock)
+{
+	WRITE_ONCE(lock->pending, 0);
+}
+
 /**
  * clear_pending_set_locked - take ownership and clear the pending bit.
  * @lock: Pointer to queued spinlock structure
@@ -161,6 +172,17 @@ static __always_inline u32 xchg_tail(struct qspinlock *lock, u32 tail)

 #else /* _Q_PENDING_BITS == 8 */

+/**
+ * clear_pending - clear the pending bit.
+ * @lock: Pointer to queued spinlock structure
+ *
+ * *,1,* -> *,0,*
+ */
+static __always_inline void clear_pending(struct qspinlock *lock)
+{
+	atomic_andnot(_Q_PENDING_VAL, &lock->val);
+}
+
 /**
  * clear_pending_set_locked - take ownership and clear the pending bit.
  * @lock: Pointer to queued spinlock structure
@@ -382,7 +404,7 @@ EXPORT_SYMBOL(queued_spin_unlock_wait);
 void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val)
 {
 	struct mcs_spinlock *prev, *next, *node;
-	u32 new, old, tail;
+	u32 old, tail;
 	int idx;

 	BUILD_BUG_ON(CONFIG_NR_CPUS >= (1U << _Q_TAIL_CPU_BITS));
@@ -405,59 +427,51 @@ void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val)
 					    (VAL != _Q_PENDING_VAL) || !cnt--);
 	}

+	/*
+	 * If we observe any contention; queue.
+	 */
+	if (val & ~_Q_LOCKED_MASK)
+		goto queue;
+
 	/*
 	 * trylock || pending
 	 *
 	 * 0,0,0 -> 0,0,1 ; trylock
 	 * 0,0,1 -> 0,1,1 ; pending
 	 */
-	for (;;) {
+	val = atomic_fetch_or_acquire(_Q_PENDING_VAL, &lock->val);
+	if (!(val & ~_Q_LOCKED_MASK)) {
 		/*
-		 * If we observe any contention; queue.
+		 * We're pending, wait for the owner to go away.
+		 *
+		 * *,1,1 -> *,1,0
+		 *
+		 * this wait loop must be a load-acquire such that we match the
+		 * store-release that clears the locked bit and create lock
+		 * sequentiality; this is because not all
+		 * clear_pending_set_locked() implementations imply full
+		 * barriers.
 		 */
-		if (val & ~_Q_LOCKED_MASK)
-			goto queue;
-
-		new = _Q_LOCKED_VAL;
-		if (val == new)
-			new |= _Q_PENDING_VAL;
+		if (val & _Q_LOCKED_MASK) {
+			smp_cond_load_acquire(&lock->val.counter,
+					      !(VAL & _Q_LOCKED_MASK));
+		}

 		/*
-		 * Acquire semantic is required here as the function may
-		 * return immediately if the lock was free.
+		 * take ownership and clear the pending bit.
+		 *
+		 * *,1,0 -> *,0,1
 		 */
-		old = atomic_cmpxchg_acquire(&lock->val, val, new);
-		if (old == val)
-			break;
-
-		val = old;
+		clear_pending_set_locked(lock);
+		return;
 	}

 	/*
-	 * we won the trylock
+	 * If pending was clear but there are waiters in the queue, then
+	 * we need to undo our setting of pending before we queue ourselves.
 	 */
-	if (new == _Q_LOCKED_VAL)
-		return;
-
-	/*
-	 * we're pending, wait for the owner to go away.
-	 *
-	 * *,1,1 -> *,1,0
-	 *
-	 * this wait loop must be a load-acquire such that we match the
-	 * store-release that clears the locked bit and create lock
-	 * sequentiality; this is because not all clear_pending_set_locked()
-	 * implementations imply full barriers.
-	 */
-	smp_cond_load_acquire(&lock->val.counter, !(VAL & _Q_LOCKED_MASK));
-
-	/*
-	 * take ownership and clear the pending bit.
-	 *
-	 * *,1,0 -> *,0,1
-	 */
-	clear_pending_set_locked(lock);
-	return;
+	if (!(val & _Q_PENDING_MASK))
+		clear_pending(lock);

 	/*
 	 * End of pending bit optimistic spinning and beginning of MCS
@@ -561,15 +575,15 @@ void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val)
 	 * claim the lock:
 	 *
 	 * n,0,0 -> 0,0,1 : lock, uncontended
-	 * *,0,0 -> *,0,1 : lock, contended
+	 * *,*,0 -> *,*,1 : lock, contended
 	 *
-	 * If the queue head is the only one in the queue (lock value == tail),
-	 * clear the tail code and grab the lock. Otherwise, we only need
-	 * to grab the lock.
+	 * If the queue head is the only one in the queue (lock value == tail)
+	 * and nobody is pending, clear the tail code and grab the lock.
+	 * Otherwise, we only need to grab the lock.
 	 */
 	for (;;) {
 		/* In the PV case we might already have _Q_LOCKED_VAL set */
-		if ((val & _Q_TAIL_MASK) != tail) {
+		if ((val & _Q_TAIL_MASK) != tail || (val & _Q_PENDING_MASK)) {
 			set_locked(lock);
 			break;
 		}
diff --git a/kernel/locking/qspinlock_paravirt.h b/kernel/locking/qspinlock_paravirt.h
index 3e33336911ffe..9c07c72fb10e9 100644
--- a/kernel/locking/qspinlock_paravirt.h
+++ b/kernel/locking/qspinlock_paravirt.h
@@ -88,11 +88,6 @@ static __always_inline void set_pending(struct qspinlock *lock)
 	WRITE_ONCE(lock->pending, 1);
 }

-static __always_inline void clear_pending(struct qspinlock *lock)
-{
-	WRITE_ONCE(lock->pending, 0);
-}
-
 /*
  * The pending bit check in pv_queued_spin_steal_lock() isn't a memory
  * barrier. Therefore, an atomic cmpxchg() is used to acquire the lock

From patchwork Tue Dec 18 22:10:45 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154215
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Waiman Long, Linus Torvalds,
    boqun.feng@gmail.com, linux-arm-kernel@lists.infradead.org,
    paulmck@linux.vnet.ibm.com, Ingo Molnar
Subject: [PATCH STABLE v4.9 06/10] locking/qspinlock: Remove duplicate
 clear_pending() function from PV code
Date: Tue, 18 Dec 2018 23:10:45 +0100
Message-Id: <20181218221049.6816-7-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit 3bea9adc96842b8a7345c7fb202c16ae9c8d5b25 upstream.

The native clear_pending() function is identical to the PV version, so
the latter can simply be removed.

This fixes the build for systems with >= 16K CPUs using the PV lock
implementation.
Reported-by: Waiman Long
Signed-off-by: Will Deacon
Cc: Linus Torvalds
Cc: Peter Zijlstra
Cc: Thomas Gleixner
Cc: boqun.feng@gmail.com
Cc: linux-arm-kernel@lists.infradead.org
Cc: paulmck@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20180427101619.GB21705@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 kernel/locking/qspinlock_paravirt.h | 5 -----
 1 file changed, 5 deletions(-)

--
2.20.1

diff --git a/kernel/locking/qspinlock_paravirt.h b/kernel/locking/qspinlock_paravirt.h
index 9c07c72fb10e9..af2a24d484aab 100644
--- a/kernel/locking/qspinlock_paravirt.h
+++ b/kernel/locking/qspinlock_paravirt.h
@@ -105,11 +105,6 @@ static __always_inline void set_pending(struct qspinlock *lock)
 	atomic_or(_Q_PENDING_VAL, &lock->val);
 }

-static __always_inline void clear_pending(struct qspinlock *lock)
-{
-	atomic_andnot(_Q_PENDING_VAL, &lock->val);
-}
-
 static __always_inline int trylock_clear_pending(struct qspinlock *lock)
 {
 	int val = atomic_read(&lock->val);

From patchwork Tue Dec 18 22:10:46 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154217
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Waiman Long, Linus Torvalds,
    boqun.feng@gmail.com, linux-arm-kernel@lists.infradead.org,
    paulmck@linux.vnet.ibm.com, Ingo Molnar
Subject: [PATCH STABLE v4.9 07/10] locking/qspinlock: Kill cmpxchg() loop
 when claiming lock from head of queue
Date: Tue, 18 Dec 2018 23:10:46 +0100
Message-Id: <20181218221049.6816-8-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit c61da58d8a9ba9238250a548f00826eaf44af0f7 upstream.

When a queued locker reaches the head of the queue, it claims the lock
by setting _Q_LOCKED_VAL in the lockword. If there isn't contention, it
must also clear the tail as part of this operation so that subsequent
lockers can avoid taking the slowpath altogether.

Currently this is expressed as a cmpxchg() loop that practically only
runs up to two iterations. This is confusing to the reader and unhelpful
to the compiler. Rewrite the cmpxchg() loop without the loop, so that a
failed cmpxchg() implies that there is contention and we just need to
write to _Q_LOCKED_VAL without considering the rest of the lockword.
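[ Editor's sketch: the reworked claim step as stand-alone C11, for
  illustration only. The locked-byte aliasing that the kernel gets for
  free from the qspinlock union is passed in explicitly here; the masks
  and function name are stand-ins. ]

  #include <stdatomic.h>
  #include <stdint.h>

  #define Q_TAIL_MASK   0xffff0000u   /* stand-ins for the kernel masks */
  #define Q_LOCKED_VAL  0x00000001u

  static void claim_from_head(_Atomic uint32_t *lockword,
                              _Atomic uint8_t *locked_byte,
                              uint32_t val, uint32_t tail)
  {
          /*
           * n,0,0 -> 0,0,1: if we are the only queued waiter and nobody
           * is pending, a single compare-exchange both takes the lock
           * and clears our tail code. No loop: a failure means new
           * contention arrived, and the fallback below is then correct.
           */
          if ((val & Q_TAIL_MASK) == tail &&
              atomic_compare_exchange_strong_explicit(lockword, &val,
                                                      Q_LOCKED_VAL,
                                                      memory_order_relaxed,
                                                      memory_order_relaxed))
                  return;

          /* Somebody queued behind us or pending is set: just set locked. */
          atomic_store_explicit(locked_byte, (uint8_t)Q_LOCKED_VAL,
                                memory_order_relaxed);
  }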
Signed-off-by: Will Deacon
Acked-by: Peter Zijlstra (Intel)
Acked-by: Waiman Long
Cc: Linus Torvalds
Cc: Thomas Gleixner
Cc: boqun.feng@gmail.com
Cc: linux-arm-kernel@lists.infradead.org
Cc: paulmck@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/1524738868-31318-7-git-send-email-will.deacon@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 kernel/locking/qspinlock.c | 19 ++++++++-----------
 1 file changed, 8 insertions(+), 11 deletions(-)

--
2.20.1

diff --git a/kernel/locking/qspinlock.c b/kernel/locking/qspinlock.c
index e7ab99a1f4387..ba5dc86a4d831 100644
--- a/kernel/locking/qspinlock.c
+++ b/kernel/locking/qspinlock.c
@@ -581,24 +581,21 @@ void queued_spin_lock_slowpath(struct qspinlock *lock, u32 val)
 	 * and nobody is pending, clear the tail code and grab the lock.
 	 * Otherwise, we only need to grab the lock.
 	 */
-	for (;;) {
-		/* In the PV case we might already have _Q_LOCKED_VAL set */
-		if ((val & _Q_TAIL_MASK) != tail || (val & _Q_PENDING_MASK)) {
-			set_locked(lock);
-			break;
-		}
+
+	/* In the PV case we might already have _Q_LOCKED_VAL set */
+	if ((val & _Q_TAIL_MASK) == tail) {
 		/*
 		 * The smp_cond_load_acquire() call above has provided the
-		 * necessary acquire semantics required for locking. At most
-		 * two iterations of this loop may be ran.
+		 * necessary acquire semantics required for locking.
 		 */
 		old = atomic_cmpxchg_relaxed(&lock->val, val, _Q_LOCKED_VAL);
 		if (old == val)
-			goto release; /* No contention */
-
-		val = old;
+			goto release; /* No contention */
 	}

+	/* Either somebody is queued behind us or _Q_PENDING_VAL is set */
+	set_locked(lock);
+
 	/*
 	 * contended path; wait for next if not observed yet, release.
 	 */

From patchwork Tue Dec 18 22:10:48 2018
X-Patchwork-Submitter: Sebastian Andrzej Siewior
X-Patchwork-Id: 154218
From: Sebastian Andrzej Siewior
To: stable@vger.kernel.org
Cc: Peter Zijlstra, Will Deacon, Thomas Gleixner, Daniel Wagner,
    bigeasy@linutronix.de, Waiman Long, Linus Torvalds,
    boqun.feng@gmail.com, linux-arm-kernel@lists.infradead.org,
    paulmck@linux.vnet.ibm.com, Ingo Molnar
Subject: [PATCH STABLE v4.9 09/10] locking/qspinlock/x86: Increase
 _Q_PENDING_LOOPS upper bound
Date: Tue, 18 Dec 2018 23:10:48 +0100
Message-Id: <20181218221049.6816-10-bigeasy@linutronix.de>
In-Reply-To: <20181218221049.6816-1-bigeasy@linutronix.de>
References: <20181218221049.6816-1-bigeasy@linutronix.de>

From: Will Deacon

commit b247be3fe89b6aba928bf80f4453d1c4ba8d2063 upstream.

On x86, atomic_cond_read_relaxed will busy-wait with a cpu_relax()
loop, so it is desirable to increase the number of times we spin on the
qspinlock lockword when it is found to be transitioning from pending to
locked.

According to Waiman Long:

| Ideally, the spinning times should be at least a few times the typical
| cacheline load time from memory which I think can be down to 100ns or
| so for each cacheline load with the newest systems or up to several
| hundreds ns for older systems.

which in his benchmarking corresponded to 512 iterations.

Suggested-by: Waiman Long
Signed-off-by: Will Deacon
Acked-by: Peter Zijlstra (Intel)
Acked-by: Waiman Long
Cc: Linus Torvalds
Cc: Thomas Gleixner
Cc: boqun.feng@gmail.com
Cc: linux-arm-kernel@lists.infradead.org
Cc: paulmck@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/1524738868-31318-5-git-send-email-will.deacon@arm.com
Signed-off-by: Ingo Molnar
Signed-off-by: Sebastian Andrzej Siewior
---
 arch/x86/include/asm/qspinlock.h | 2 ++
 1 file changed, 2 insertions(+)

--
2.20.1

diff --git a/arch/x86/include/asm/qspinlock.h b/arch/x86/include/asm/qspinlock.h
index e07cc206919d4..8b1ba1607091c 100644
--- a/arch/x86/include/asm/qspinlock.h
+++ b/arch/x86/include/asm/qspinlock.h
@@ -5,6 +5,8 @@
 #include
 #include

+#define _Q_PENDING_LOOPS	(1 << 9)
+
 #define queued_spin_unlock queued_spin_unlock
 /**
  * queued_spin_unlock - release a queued spinlock