Skip to content

Commit b1bfeab

Browse files
ricardonPeter Zijlstra
authored and
Peter Zijlstra
committed
sched/fair: Consider the idle state of the whole core for load balance
should_we_balance() traverses the group_balance_mask (AND'ed with lb_env:: cpus) starting from lower numbered CPUs looking for the first idle CPU. In hybrid x86 systems, the siblings of SMT cores get CPU numbers, before non-SMT cores: [0, 1] [2, 3] [4, 5] 6 7 8 9 b i b i b i b i i i In the figure above, CPUs in brackets are siblings of an SMT core. The rest are non-SMT cores. 'b' indicates a busy CPU, 'i' indicates an idle CPU. We should let a CPU on a fully idle core get the first chance to idle load balance as it has more CPU capacity than a CPU on an idle SMT CPU with busy sibling. So for the figure above, if we are running should_we_balance() to CPU 1, we should return false to let CPU 7 on idle core to have a chance first to idle load balance. A partially busy (i.e., of type group_has_spare) local group with SMT  cores will often have only one SMT sibling busy. If the destination CPU is a non-SMT core, partially busy, lower-numbered, SMT cores should not be considered when finding the first idle CPU.  However, in should_we_balance(), when we encounter idle SMT first in partially busy core, we prematurely break the search for the first idle CPU. Higher-numbered, non-SMT cores is not given the chance to have idle balance done on their behalf. Those CPUs will only be considered for idle balancing by chance via CPU_NEWLY_IDLE. Instead, consider the idle state of the whole SMT core. Signed-off-by: Ricardo Neri <ricardo.neri-calderon@linux.intel.com> Co-developed-by: Tim Chen <tim.c.chen@linux.intel.com> Signed-off-by: Tim Chen <tim.c.chen@linux.intel.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lore.kernel.org/r/807bdd05331378ea3bf5956bda87ded1036ba769.1688770494.git.tim.c.chen@linux.intel.com
1 parent 7ff1693 commit b1bfeab

File tree

1 file changed

+15
-1
lines changed

1 file changed

+15
-1
lines changed

kernel/sched/fair.c

+15-1
Original file line numberDiff line numberDiff line change
@@ -10902,7 +10902,7 @@ static int active_load_balance_cpu_stop(void *data);
1090210902
static int should_we_balance(struct lb_env *env)
1090310903
{
1090410904
struct sched_group *sg = env->sd->groups;
10905-
int cpu;
10905+
int cpu, idle_smt = -1;
1090610906

1090710907
/*
1090810908
* Ensure the balancing environment is consistent; can happen
@@ -10929,10 +10929,24 @@ static int should_we_balance(struct lb_env *env)
1092910929
if (!idle_cpu(cpu))
1093010930
continue;
1093110931

10932+
/*
10933+
* Don't balance to idle SMT in busy core right away when
10934+
* balancing cores, but remember the first idle SMT CPU for
10935+
* later consideration. Find CPU on an idle core first.
10936+
*/
10937+
if (!(env->sd->flags & SD_SHARE_CPUCAPACITY) && !is_core_idle(cpu)) {
10938+
if (idle_smt == -1)
10939+
idle_smt = cpu;
10940+
continue;
10941+
}
10942+
1093210943
/* Are we the first idle CPU? */
1093310944
return cpu == env->dst_cpu;
1093410945
}
1093510946

10947+
if (idle_smt == env->dst_cpu)
10948+
return true;
10949+
1093610950
/* Are we the first CPU of this group ? */
1093710951
return group_balance_cpu(sg) == env->dst_cpu;
1093810952
}

0 commit comments

Comments
 (0)