Message ID | cover.1734361935.git.maciej.wieczor-retman@intel.com |
---|---|
Series | selftests/resctrl: SNC kernel support discovery |
Hi Maciej,

On 12/16/24 7:18 AM, Maciej Wieczor-Retman wrote:
> Sub-NUMA Cluster divides CPUs sharing an L3 cache into separate NUMA
> nodes. Systems may support splitting into either two, three, four or six
> nodes. When SNC mode is enabled the effective amount of L3 cache
> available for allocation is divided by the number of nodes per L3.
>
> It's possible to detect which SNC mode is active by comparing the number
> of CPUs that share a cache with CPU0, with the number of CPUs on node0.
>
> Detect SNC mode once and let other tests inherit that information.
>
> Update CFLAGS after including lib.mk in the Makefile so that fallthrough
> macro can be used.
>
> To check if SNC detection is reliable one can check the
> /sys/devices/system/cpu/offline file. If it's empty, it means all cores
> are operational and the ratio should be calculated correctly. If it has
> any contents, it means the detected SNC mode can't be trusted and should
> be disabled.
>
> Check if detection was not reliable due to offline cpus. If it was skip
> running tests since the results couldn't be trusted.
>
> Co-developed-by: Tony Luck <tony.luck@intel.com>
> Signed-off-by: Tony Luck <tony.luck@intel.com>
> Signed-off-by: Maciej Wieczor-Retman <maciej.wieczor-retman@intel.com>
> ---

Thank you very much.

Reviewed-by: Reinette Chatre <reinette.chatre@intel.com>

Reinette
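For readers following the thread, the detection described in the quoted commit message can be sketched roughly as below. This is an illustrative sketch, not the code from the patch: the function names are hypothetical, and the two cpulist paths (`shared_cpu_list` for CPU0's L3 and `cpulist` for node0) are assumptions about which sysfs files to read. Only `/sys/devices/system/cpu/offline` is named in the message itself. The sketch counts the CPUs in each list, takes their ratio, and refuses to report a mode when any CPU is offline.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Assumed sysfs locations; only the "offline" path appears in the commit message. */
#define L3_SHARED_LIST "/sys/devices/system/cpu/cpu0/cache/index3/shared_cpu_list"
#define NODE0_CPU_LIST "/sys/devices/system/node/node0/cpulist"
#define OFFLINE_CPUS   "/sys/devices/system/cpu/offline"

/* Count CPUs in a cpulist string such as "0-3,8-11"; returns -1 on a parse error. */
int count_cpus_in_list(const char *list)
{
	const char *p = list;
	int count = 0;

	while (*p && *p != '\n') {
		char *end;
		long first = strtol(p, &end, 10);
		long last = first;

		if (end == p || first < 0)
			return -1;
		if (*end == '-') {
			p = end + 1;
			last = strtol(p, &end, 10);
			if (end == p || last < first)
				return -1;
		}
		count += (int)(last - first + 1);
		p = (*end == ',') ? end + 1 : end;
	}
	return count;
}

/*
 * Nodes per L3 = CPUs sharing CPU0's L3 / CPUs on node0.
 * Returns 1 (SNC off), 2, 3, 4 or 6, or 0 if the ratio is not a known mode.
 */
int snc_nodes_per_l3(int cache_cpus, int node_cpus)
{
	int ratio;

	if (cache_cpus <= 0 || node_cpus <= 0 || cache_cpus % node_cpus)
		return 0;

	ratio = cache_cpus / node_cpus;
	switch (ratio) {
	case 1: case 2: case 3: case 4: case 6:
		return ratio;
	default:
		return 0;
	}
}

/* Read the first line of a file, stripped of its newline; returns -1 if it cannot be opened. */
int read_sysfs_line(const char *path, char *buf, size_t len)
{
	FILE *fp = fopen(path, "r");

	if (!fp)
		return -1;
	if (!fgets(buf, (int)len, fp))
		buf[0] = '\0';
	fclose(fp);
	buf[strcspn(buf, "\n")] = '\0';
	return 0;
}

/* Returns the detected mode, or 0 when detection failed or cannot be trusted. */
int detect_snc_mode(void)
{
	char cache[4096], node[4096], offline[4096];

	if (read_sysfs_line(L3_SHARED_LIST, cache, sizeof(cache)) ||
	    read_sysfs_line(NODE0_CPU_LIST, node, sizeof(node)) ||
	    read_sysfs_line(OFFLINE_CPUS, offline, sizeof(offline)))
		return 0;

	/* Any offline CPU skews the ratio, so treat the result as unreliable. */
	if (offline[0] != '\0')
		return 0;

	return snc_nodes_per_l3(count_cpus_in_list(cache), count_cpus_in_list(node));
}
```

With 64 CPUs sharing the L3 and 32 CPUs on node0, `snc_nodes_per_l3(64, 32)` yields 2, i.e. SNC-2. Rejecting ratios outside the listed modes is a design choice of this sketch and may differ from how the selftest treats unexpected values.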
Hi Shuah,

On 12/16/24 7:18 AM, Maciej Wieczor-Retman wrote:
>
> Sub-Numa Clustering (SNC) allows splitting CPU cores, caches and memory
> into multiple NUMA nodes. When enabled, NUMA-aware applications can
> achieve better performance on bigger server platforms.
>
> SNC support was merged into the kernel [1]. With SNC enabled
> and kernel support in place all the tests will function normally (aside
> from effective cache size). There might be a problem when SNC is enabled
> but the system is still using an older kernel version without SNC
> support. Currently the only message displayed in that situation is a
> guess that SNC might be enabled and is causing issues. That message also
> is displayed whenever the test fails on an Intel platform.
>
> Add a mechanism to discover kernel support for SNC which will add more
> meaning and certainty to the error message.
>
> Add runtime SNC mode detection and verify how reliable that information
> is.
>
> Series was tested on Ice Lake server platforms with SNC disabled, SNC-2
> and SNC-4. The tests were also ran with and without kernel support for
> SNC.
>
> Series applies cleanly on kselftest/next.
>

Could you please consider this series for inclusion?

Thank you very much.

Reinette