From patchwork Fri Aug 26 18:40:48 2016 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Steve Muckle X-Patchwork-Id: 74829 Delivered-To: patch@linaro.org Received: by 10.140.29.52 with SMTP id a49csp506437qga; Fri, 26 Aug 2016 11:42:43 -0700 (PDT) X-Received: by 10.98.158.78 with SMTP id s75mr8461349pfd.137.1472236963133; Fri, 26 Aug 2016 11:42:43 -0700 (PDT) Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w8si22500414paj.25.2016.08.26.11.42.39; Fri, 26 Aug 2016 11:42:43 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753942AbcHZSmY (ORCPT + 27 others); Fri, 26 Aug 2016 14:42:24 -0400 Received: from mail-pf0-f178.google.com ([209.85.192.178]:35348 "EHLO mail-pf0-f178.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751742AbcHZSlu (ORCPT ); Fri, 26 Aug 2016 14:41:50 -0400 Received: by mail-pf0-f178.google.com with SMTP id x72so31089946pfd.2 for ; Fri, 26 Aug 2016 11:41:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=Jnj0W2Ss2QxnDaeQbjoIGo+Dp3Xkocp8jXjzcT84lmM=; b=YGvVp7zaD/I9ASbsrvwOJRwdEl9sN15e7cSNbg6MMqHe3Wb9CuDlE0rR+kzE/ihqRw aHRW4Q7huHHi5WdsrSbQBzIA1OUlhT9E6CU5F0w2tdzDUrM3LpaChnaAeQt5uhiZFmUj bdmxCyQNTW2kq/hPvIdf+vULlE4FLpEOsyLTs= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=Jnj0W2Ss2QxnDaeQbjoIGo+Dp3Xkocp8jXjzcT84lmM=; b=KXL/xfPrD8HA5IT0C8Cn1jdZS347OArAup7a9hXdaG2jz1eDfeoes2/O92LwMizAAQ W/6egS9rmK8Sq1CnfbpSKCYMCppnVDLITuA9KBUFmd6ta2pHXrVZPv6D5QzsBh/1kPMe pcg/AG+O1kdAKxQ2tr/XbZTPcxfJPwqWYAKt11j3ft0GVEqkIr10RKc0b4Da+X8hU+9m 5fKQFrMC2k7B4NdStUif7JmHBAn8CW8q4aE9INAfHB7wKFpbMQ8IH485D2cfayY15U5v t/8FFxwRO9VH5LREYBFvn7sN8Ri/EV+CUx1T97IeC+sn/IHU0Gj+hvgtmJxfizPDXcZB izqA== X-Gm-Message-State: AE9vXwOp42mK9wysdfGjldKE/u7zrScUax/qnLb+nU3ULTZJ4VwME9fH1tHRveTm+FpqiAJD X-Received: by 10.98.192.12 with SMTP id x12mr8499415pff.54.1472236861698; Fri, 26 Aug 2016 11:41:01 -0700 (PDT) Received: from graphite.smuckle.net (cpe-76-167-105-107.san.res.rr.com. [76.167.105.107]) by smtp.gmail.com with ESMTPSA id g27sm30480541pfd.47.2016.08.26.11.41.00 (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Fri, 26 Aug 2016 11:41:01 -0700 (PDT) From: Steve Muckle X-Google-Original-From: Steve Muckle To: Peter Zijlstra , Ingo Molnar , "Rafael J . Wysocki" Cc: linux-kernel@vger.kernel.org, linux-pm@vger.kernel.org, Vincent Guittot , Morten Rasmussen , Dietmar Eggemann , Juri Lelli , Patrick Bellasi , Steve Muckle Subject: [PATCH 2/2] sched: cpufreq: use rt_avg as estimate of required RT CPU capacity Date: Fri, 26 Aug 2016 11:40:48 -0700 Message-Id: <1472236848-17038-3-git-send-email-smuckle@linaro.org> X-Mailer: git-send-email 2.7.3 In-Reply-To: <1472236848-17038-1-git-send-email-smuckle@linaro.org> References: <1472236848-17038-1-git-send-email-smuckle@linaro.org> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org A policy of going to fmax on any RT activity will be detrimental for power on many platforms. Often RT accounts for only a small amount of CPU activity so sending the CPU frequency to fmax is overkill. Worse still, some platforms may not be able to even complete the CPU frequency change before the RT activity has already completed. Cpufreq governors have not treated RT activity this way in the past so it is not part of the expected semantics of the RT scheduling class. The DL class offers guarantees about task completion and could be used for this purpose. Modify the schedutil algorithm to instead use rt_avg as an estimate of RT utilization of the CPU. Based on previous work by Vincent Guittot . Signed-off-by: Steve Muckle --- kernel/sched/cpufreq_schedutil.c | 26 +++++++++++++++++--------- 1 file changed, 17 insertions(+), 9 deletions(-) -- 2.7.3 diff --git a/kernel/sched/cpufreq_schedutil.c b/kernel/sched/cpufreq_schedutil.c index cb8a77b1ef1b..89094a466250 100644 --- a/kernel/sched/cpufreq_schedutil.c +++ b/kernel/sched/cpufreq_schedutil.c @@ -146,13 +146,21 @@ static unsigned int get_next_freq(struct sugov_cpu *sg_cpu, unsigned long util, static void sugov_get_util(unsigned long *util, unsigned long *max) { - struct rq *rq = this_rq(); - unsigned long cfs_max; + int cpu = smp_processor_id(); + struct rq *rq = cpu_rq(cpu); + unsigned long max_cap, rt; + s64 delta; - cfs_max = arch_scale_cpu_capacity(NULL, smp_processor_id()); + max_cap = arch_scale_cpu_capacity(NULL, cpu); - *util = min(rq->cfs.avg.util_avg, cfs_max); - *max = cfs_max; + delta = rq_clock(rq) - rq->age_stamp; + if (unlikely(delta < 0)) + delta = 0; + rt = div64_u64(rq->rt_avg, sched_avg_period() + delta); + rt = (rt * max_cap) >> SCHED_CAPACITY_SHIFT; + + *util = min(rq->cfs.avg.util_avg + rt, max_cap); + *max = max_cap; } static void sugov_update_single(struct update_util_data *hook, u64 time, @@ -167,7 +175,7 @@ static void sugov_update_single(struct update_util_data *hook, u64 time, if (!sugov_should_update_freq(sg_policy, time)) return; - if (flags & SCHED_CPUFREQ_RT_DL) { + if (flags & SCHED_CPUFREQ_DL) { next_f = policy->cpuinfo.max_freq; } else { sugov_get_util(&util, &max); @@ -186,7 +194,7 @@ static unsigned int sugov_next_freq_shared(struct sugov_cpu *sg_cpu, u64 last_freq_update_time = sg_policy->last_freq_update_time; unsigned int j; - if (flags & SCHED_CPUFREQ_RT_DL) + if (flags & SCHED_CPUFREQ_DL) return max_f; for_each_cpu(j, policy->cpus) { @@ -209,7 +217,7 @@ static unsigned int sugov_next_freq_shared(struct sugov_cpu *sg_cpu, if (delta_ns > TICK_NSEC) continue; - if (j_sg_cpu->flags & SCHED_CPUFREQ_RT_DL) + if (j_sg_cpu->flags & SCHED_CPUFREQ_DL) return max_f; j_util = j_sg_cpu->util; @@ -467,7 +475,7 @@ static int sugov_start(struct cpufreq_policy *policy) if (policy_is_shared(policy)) { sg_cpu->util = 0; sg_cpu->max = 0; - sg_cpu->flags = SCHED_CPUFREQ_RT; + sg_cpu->flags = SCHED_CPUFREQ_DL; sg_cpu->last_update = 0; sg_cpu->cached_raw_freq = 0; cpufreq_add_update_util_hook(cpu, &sg_cpu->update_util,