Message ID | 20140515104055.e91844a5b75529edc560349a@gmail.com |
---|---|
State | New |
Headers | show |
On Thu, May 15, 2014 at 10:40:55AM +0200, Juri Lelli wrote: > Hi, > > > @@ -37,10 +38,7 @@ static inline int dl_time_before(u64 a, u64 b) > > > > static void cpudl_exchange(struct cpudl *cp, int a, int b) > > { > > - int cpu_a = cp->elements[a].cpu, cpu_b = cp->elements[b].cpu; > > - > > swap(cp->elements[a], cp->elements[b]); > > - swap(cp->cpu_to_idx[cpu_a], cp->cpu_to_idx[cpu_b]); > > } > > > > I think there is a problem here. Your patch "embeds" cpu_to_idx array > in elements array, but here the swap operates differently on the two. <snip> > Sorry for this long reply, but I had to convince also myself... Glad you explained it before I tried to untangle that code myself. > So, I think that having just one dynamic array is nicer and better, but > we still need to swap things separately. Something like (on top of > yours): > > diff --git a/kernel/sched/cpudeadline.c b/kernel/sched/cpudeadline.c > index 60ad0af..10dc7ab 100644 > --- a/kernel/sched/cpudeadline.c > +++ b/kernel/sched/cpudeadline.c > @@ -36,9 +36,31 @@ static inline int dl_time_before(u64 a, u64 b) > return (s64)(a - b) < 0; > } > > +static inline void swap_element(struct cpudl *cp, int a, int b) > +{ > + int cpu_tmp = cp->elements[a].cpu; > + u64 dl_tmp = cp->elements[a].dl; > + > + cp->elements[a].cpu = cp->elements[b].cpu; > + cp->elements[a].dl = cp->elements[b].dl; > + cp->elements[b].cpu = cpu_tmp; > + cp->elements[b].dl = dl_tmp; You could've just written: swap(cp->elements[a].cpu, cp->elements[b].cpu); swap(cp->elements[a].dl , cp->elements[b].dl ); The swap macro does the tmp var for you. > +} > + > +static inline void swap_idx(struct cpudl *cp, int a, int b) > +{ > + int idx_tmp = cp->elements[a].idx; > + > + cp->elements[a].idx = cp->elements[b].idx; > + cp->elements[b].idx = idx_tmp; swap(cp->elements[a].idx, cp->elements[b].idx); > +} > + > static void cpudl_exchange(struct cpudl *cp, int a, int b) > { > - swap(cp->elements[a], cp->elements[b]); > + int cpu_a = cp->elements[a].cpu, cpu_b = cp->elements[b].cpu; > + > + swap_element(cp, a, b); > + swap_idx(cp, cpu_a, cpu_b); Or just skip the lot and put the 3 swap() stmts here. > } > > static void cpudl_heapify(struct cpudl *cp, int idx) > --- > > But, I don't know if this is too ugly and we better go with two arrays > (or a better solution, as this is the fastest thing I could come up > with :)). Thanks for looking at it, and sorry for breaking it.
On Thu, 15 May 2014 10:51:56 +0200 Peter Zijlstra <peterz@infradead.org> wrote: > On Thu, May 15, 2014 at 10:40:55AM +0200, Juri Lelli wrote: > > Hi, > > > > > @@ -37,10 +38,7 @@ static inline int dl_time_before(u64 a, u64 b) > > > > > > static void cpudl_exchange(struct cpudl *cp, int a, int b) > > > { > > > - int cpu_a = cp->elements[a].cpu, cpu_b = cp->elements[b].cpu; > > > - > > > swap(cp->elements[a], cp->elements[b]); > > > - swap(cp->cpu_to_idx[cpu_a], cp->cpu_to_idx[cpu_b]); > > > } > > > > > > > I think there is a problem here. Your patch "embeds" cpu_to_idx array > > in elements array, but here the swap operates differently on the two. > > <snip> > > > Sorry for this long reply, but I had to convince also myself... > > Glad you explained it before I tried to untangle that code myself. > > > So, I think that having just one dynamic array is nicer and better, but > > we still need to swap things separately. Something like (on top of > > yours): > > > > diff --git a/kernel/sched/cpudeadline.c b/kernel/sched/cpudeadline.c > > index 60ad0af..10dc7ab 100644 > > --- a/kernel/sched/cpudeadline.c > > +++ b/kernel/sched/cpudeadline.c > > @@ -36,9 +36,31 @@ static inline int dl_time_before(u64 a, u64 b) > > return (s64)(a - b) < 0; > > } > > > > +static inline void swap_element(struct cpudl *cp, int a, int b) > > +{ > > + int cpu_tmp = cp->elements[a].cpu; > > + u64 dl_tmp = cp->elements[a].dl; > > + > > + cp->elements[a].cpu = cp->elements[b].cpu; > > + cp->elements[a].dl = cp->elements[b].dl; > > + cp->elements[b].cpu = cpu_tmp; > > + cp->elements[b].dl = dl_tmp; > > You could've just written: > > swap(cp->elements[a].cpu, cp->elements[b].cpu); > swap(cp->elements[a].dl , cp->elements[b].dl ); > > The swap macro does the tmp var for you. > > > +} > > + > > +static inline void swap_idx(struct cpudl *cp, int a, int b) > > +{ > > + int idx_tmp = cp->elements[a].idx; > > + > > + cp->elements[a].idx = cp->elements[b].idx; > > + cp->elements[b].idx = idx_tmp; > > swap(cp->elements[a].idx, cp->elements[b].idx); > > > +} > > + > > static void cpudl_exchange(struct cpudl *cp, int a, int b) > > { > > - swap(cp->elements[a], cp->elements[b]); > > + int cpu_a = cp->elements[a].cpu, cpu_b = cp->elements[b].cpu; > > + > > + swap_element(cp, a, b); > > + swap_idx(cp, cpu_a, cpu_b); > > Or just skip the lot and put the 3 swap() stmts here. > Ah, yes, sure! Maybe we could add in cpudeadline.c also a comment explaining a bit how the thing works. Something along the line of cpupri.c: /* * kernel/sched/cpudeadline.c * * Global CPU deadline management * * Author: Juri Lelli <juri.lelli@gmail.com> * * This code tracks the deadline of each CPU (i.e., the deadline of * current on the CPU) so that global migration decisions are easy * to calculate. Each CPU can be in a state as follows: * * (INVALID), WITH_DL_TASK(S) * * The system maintains this state with a max-heap implemented as * a simple array. To efficiently handle updates on intermediate nodes * the array can be thought as splitted in two parts, one that contains * heap nodes, and the other that keeps track of where nodes reside in * the first part. From kernel/sched/cpudeadline.h we conceptually * have: * * struct cpudl_item { * u64 dl; * int cpu; * ------------------- * int idx; * } * * Moreover, we keep track of CPUs in the INVALID state with a cpumask * (no need to use the array if some CPU is free). * * Let's clarify this with a simple example. Suppose at a certain * instant of time we have this situation (4CPUs box): * * CPU 1 * DL 50 * / \ * / \ * CPU 2 CPU 0 * DL 30 DL 40 * / * / * CPU 3 * DL 25 * * In this case the state is mantained as: * * elements[dl/cpu] | 50/1 | 30/2 | 40/0 | 25/3 | * ^ ^ ^ ^ * | +-----|-+ | * +---------+ | | | * +--------|------+ | | * elements[idx] | 2 | 0 | 1 | 3 | * * Operations on the heap (e.g., node updates) must thus handle * the two parts of elements array separately, see cpudl_set() * and cpudl_exchange() for details. * * This program is free software; you can redistribute it and/or * modify it under the terms of the GNU General Public License * as published by the Free Software Foundation; version 2 * of the License. */ Thanks, - Juri > > } > > > > static void cpudl_heapify(struct cpudl *cp, int idx) > > --- > > > > But, I don't know if this is too ugly and we better go with two arrays > > (or a better solution, as this is the fastest thing I could come up > > with :)). > > Thanks for looking at it, and sorry for breaking it. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
On Fri, 16 May 2014 12:43:36 +0200 Peter Zijlstra <peterz@infradead.org> wrote: > > OK I made that.. > Are the comments I proposed to add overdoing? Apart from this, Acked-by: Juri Lelli <juri.lelli@gmail.com> Thanks! - Juri > --- > > Subject: sched/cpudl: Replace NR_CPUS arrays > From: Peter Zijlstra <peterz@infradead.org> > Date: Wed May 14 16:13:56 CEST 2014 > > Tejun reported that his resume was failing due to order-3 allocations > from sched_domain building. > > Replace the NR_CPUS arrays in there with a dynamically allocated > array. > > Cc: Johannes Weiner <hannes@cmpxchg.org> > Cc: Juri Lelli <juri.lelli@gmail.com> > Reported-by: Tejun Heo <tj@kernel.org> > Signed-off-by: Peter Zijlstra <peterz@infradead.org> > --- > kernel/sched/cpudeadline.c | 33 ++++++++++++++++++++++++--------- > kernel/sched/cpudeadline.h | 6 +++--- > 2 files changed, 27 insertions(+), 12 deletions(-) > > --- a/kernel/sched/cpudeadline.c > +++ b/kernel/sched/cpudeadline.c > @@ -13,6 +13,7 @@ > > #include <linux/gfp.h> > #include <linux/kernel.h> > +#include <linux/slab.h> > #include "cpudeadline.h" > > static inline int parent(int i) > @@ -39,8 +40,10 @@ static void cpudl_exchange(struct cpudl > { > int cpu_a = cp->elements[a].cpu, cpu_b = cp->elements[b].cpu; > > - swap(cp->elements[a], cp->elements[b]); > - swap(cp->cpu_to_idx[cpu_a], cp->cpu_to_idx[cpu_b]); > + swap(cp->elements[a].cpu, cp->elements[b].cpu); > + swap(cp->elements[a].dl , cp->elements[b].dl ); > + > + swap(cp->elements[cpu_a].idx, cp->elements[cpu_b].idx); > } > > static void cpudl_heapify(struct cpudl *cp, int idx) > @@ -140,7 +143,7 @@ void cpudl_set(struct cpudl *cp, int cpu > WARN_ON(!cpu_present(cpu)); > > raw_spin_lock_irqsave(&cp->lock, flags); > - old_idx = cp->cpu_to_idx[cpu]; > + old_idx = cp->elements[cpu].idx; > if (!is_valid) { > /* remove item */ > if (old_idx == IDX_INVALID) { > @@ -155,8 +158,8 @@ void cpudl_set(struct cpudl *cp, int cpu > cp->elements[old_idx].dl = cp->elements[cp->size - 1].dl; > cp->elements[old_idx].cpu = new_cpu; > cp->size--; > - cp->cpu_to_idx[new_cpu] = old_idx; > - cp->cpu_to_idx[cpu] = IDX_INVALID; > + cp->elements[new_cpu].idx = old_idx; > + cp->elements[cpu].idx = IDX_INVALID; > while (old_idx > 0 && dl_time_before( > cp->elements[parent(old_idx)].dl, > cp->elements[old_idx].dl)) { > @@ -173,7 +176,7 @@ void cpudl_set(struct cpudl *cp, int cpu > cp->size++; > cp->elements[cp->size - 1].dl = 0; > cp->elements[cp->size - 1].cpu = cpu; > - cp->cpu_to_idx[cpu] = cp->size - 1; > + cp->elements[cpu].idx = cp->size - 1; > cpudl_change_key(cp, cp->size - 1, dl); > cpumask_clear_cpu(cpu, cp->free_cpus); > } else { > @@ -195,10 +198,21 @@ int cpudl_init(struct cpudl *cp) > memset(cp, 0, sizeof(*cp)); > raw_spin_lock_init(&cp->lock); > cp->size = 0; > - for (i = 0; i < NR_CPUS; i++) > - cp->cpu_to_idx[i] = IDX_INVALID; > - if (!alloc_cpumask_var(&cp->free_cpus, GFP_KERNEL)) > + > + cp->elements = kcalloc(nr_cpu_ids, > + sizeof(struct cpudl_item), > + GFP_KERNEL); > + if (!cp->elements) > + return -ENOMEM; > + > + if (!alloc_cpumask_var(&cp->free_cpus, GFP_KERNEL)) { > + kfree(cp->elements); > return -ENOMEM; > + } > + > + for_each_possible_cpu(i) > + cp->elements[i].idx = IDX_INVALID; > + > cpumask_setall(cp->free_cpus); > > return 0; > @@ -211,4 +225,5 @@ int cpudl_init(struct cpudl *cp) > void cpudl_cleanup(struct cpudl *cp) > { > free_cpumask_var(cp->free_cpus); > + kfree(cp->elements); > } > --- a/kernel/sched/cpudeadline.h > +++ b/kernel/sched/cpudeadline.h > @@ -5,17 +5,17 @@ > > #define IDX_INVALID -1 > > -struct array_item { > +struct cpudl_item { > u64 dl; > int cpu; > + int idx; > }; > > struct cpudl { > raw_spinlock_t lock; > int size; > - int cpu_to_idx[NR_CPUS]; > - struct array_item elements[NR_CPUS]; > cpumask_var_t free_cpus; > + struct cpudl_item *elements; > }; > > > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
On Fri, May 16, 2014 at 01:01:53PM +0200, Juri Lelli wrote: > Are the comments I proposed to add overdoing? Dunno, might be helpful, if you post them as a proper patch I'll press 'A'. > Apart from this, > > Acked-by: Juri Lelli <juri.lelli@gmail.com> Thanks!
diff --git a/kernel/sched/cpudeadline.c b/kernel/sched/cpudeadline.c index 60ad0af..10dc7ab 100644 --- a/kernel/sched/cpudeadline.c +++ b/kernel/sched/cpudeadline.c @@ -36,9 +36,31 @@ static inline int dl_time_before(u64 a, u64 b) return (s64)(a - b) < 0; } +static inline void swap_element(struct cpudl *cp, int a, int b) +{ + int cpu_tmp = cp->elements[a].cpu; + u64 dl_tmp = cp->elements[a].dl; + + cp->elements[a].cpu = cp->elements[b].cpu; + cp->elements[a].dl = cp->elements[b].dl; + cp->elements[b].cpu = cpu_tmp; + cp->elements[b].dl = dl_tmp; +} + +static inline void swap_idx(struct cpudl *cp, int a, int b) +{ + int idx_tmp = cp->elements[a].idx; + + cp->elements[a].idx = cp->elements[b].idx; + cp->elements[b].idx = idx_tmp; +} + static void cpudl_exchange(struct cpudl *cp, int a, int b) { - swap(cp->elements[a], cp->elements[b]); + int cpu_a = cp->elements[a].cpu, cpu_b = cp->elements[b].cpu; + + swap_element(cp, a, b); + swap_idx(cp, cpu_a, cpu_b); } static void cpudl_heapify(struct cpudl *cp, int idx)