linux/include at c6890f36fc49848c61d2113a3442eb1b59e0bc4b - linux - Brook's Gitea instance

bybrooklyn/linux

mirror of https://github.com/torvalds/linux.git synced 2026-04-18 06:44:00 -04:00

Files

History

Breno Leitao 5920d046f7 workqueue: add WQ_AFFN_CACHE_SHARD affinity scope

On systems where many CPUs share one LLC, unbound workqueues using
WQ_AFFN_CACHE collapse to a single worker pool, causing heavy spinlock
contention on pool->lock. For example, Chuck Lever measured 39% of
cycles lost to native_queued_spin_lock_slowpath on a 12-core shared-L3
NFS-over-RDMA system.

The existing affinity hierarchy (cpu, smt, cache, numa, system) offers
no intermediate option between per-LLC and per-SMT-core granularity.

Add WQ_AFFN_CACHE_SHARD, which subdivides each LLC into groups of at
most wq_cache_shard_size cores (default 8, tunable via boot parameter).
Shards are always split on core (SMT group) boundaries so that
Hyper-Threading siblings are never placed in different pods. Cores are
distributed across shards as evenly as possible -- for example, 36 cores
in a single LLC with max shard size 8 produces 5 shards of 8+7+7+7+7
cores.

The implementation follows the same comparator pattern as other affinity
scopes: precompute_cache_shard_ids() pre-fills the cpu_shard_id[] array
from the already-initialized WQ_AFFN_CACHE and WQ_AFFN_SMT topology,
and cpus_share_cache_shard() is passed to init_pod_type().

Benchmark on NVIDIA Grace (72 CPUs, single LLC, 50k items/thread), show
cache_shard delivers ~5x the throughput and ~6.5x lower p50 latency
compared to cache scope on this 72-core single-LLC system.

Suggested-by: Tejun Heo <tj@kernel.org>
Signed-off-by: Breno Leitao <leitao@debian.org>
Signed-off-by: Tejun Heo <tj@kernel.org>

2026-04-01 10:24:18 -10:00

..

Merge tag 'mailbox-v6.20' of git://git.kernel.org/pub/scm/linux/kernel/git/jassibrar/mailbox

2026-02-14 11:13:32 -08:00

Merge tag 'hyperv-next-signed-20260218' of git://git.kernel.org/pub/scm/linux/kernel/git/hyperv/linux

2026-02-20 08:48:31 -08:00

…

Merge tag 'net-next-7.0' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net-next

2026-02-11 19:31:52 -08:00

…

drm/pagemap: pass pagemap_addr by reference

2026-02-17 19:39:44 -05:00

Merge tag 'phy-for-7.0' of git://git.kernel.org/pub/scm/linux/kernel/git/phy/linux-phy

2026-02-17 11:40:04 -08:00

Merge tag 'hyperv-next-signed-20260218' of git://git.kernel.org/pub/scm/linux/kernel/git/hyperv/linux

2026-02-20 08:48:31 -08:00

keys/trusted_keys: establish PKWM as a trusted source

2026-01-30 09:27:26 +05:30

treewide: Replace kmalloc with kmalloc_obj for non-scalar types

2026-02-21 01:02:28 -08:00

KVM: arm64: Use standard seq_file iterator for vgic-debug debugfs

2026-02-02 10:59:25 +00:00

workqueue: add WQ_AFFN_CACHE_SHARD affinity scope

2026-04-01 10:24:18 -10:00

…

Merge tag 'media/v7.0-2' of git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media

2026-02-11 12:20:25 -08:00

…

…

Convert more 'alloc_obj' cases to default GFP_KERNEL arguments

2026-02-21 20:03:00 -08:00

…

…

Merge tag 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/rdma/rdma

2026-02-12 17:05:20 -08:00

rv: Fix multiple definition of __pcpu_unique_da_mon_this

2026-02-20 13:12:00 +01:00

Merge tag 'scsi-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

2026-02-12 15:43:02 -08:00

Merge tag 'reset-for-v6.20' of https://git.pengutronix.de/git/pza/linux into soc/drivers

2026-01-29 10:24:25 +01:00

Merge tag 'asoc-v6.20' of https://git.kernel.org/pub/scm/linux/kernel/git/broonie/sound into for-linus

2026-02-09 17:39:11 +01:00

…

Merge tag 'vfs-7.0-rc1.misc.2' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs

2026-02-16 13:00:36 -08:00

Merge tag 'drm-next-2026-02-21' of https://gitlab.freedesktop.org/drm/kernel

2026-02-20 15:36:38 -08:00

scsi: ufs: host: mediatek: Require CONFIG_PM

2026-02-03 22:28:44 -05:00

…

…

Partial revert "x86/xen: fix balloon target initialization for PVH dom0"

2026-02-02 07:31:22 +01:00

Kbuild

…