On Fri, Dec 05, 2025 at 12:01:06PM -0800, Shakeel Butt wrote: > On x86-64, this_cpu_cmpxchg() uses CMPXCHG without LOCK prefix which > means it is only safe for the local CPU and not for multiple CPUs. > Recently the commit 36df6e3dbd7e ("cgroup: make css_rstat_updated nmi > safe") make css_rstat_updated lockless and uses lockless list to allow > reentrancy. Since css_rstat_updated can invoked from process context, be > IRQ and NMI, it uses this_cpu_cmpxchg() to select the winner which will > inset the lockless lnode into the global per-cpu lockless list. insert > > However the commit missed one case where lockless node of a cgroup can > be accessed and modified by another CPU doing the flushing. Basically > llist_del_first_init() in css_process_update_tree(). > > On a cursory look, it can be questioned how css_process_update_tree() > can see a lockless node in global lockless list where the updater is at > this_cpu_cmpxchg() and before llist_add() call in css_rstat_updated(). > This can indeed happen in the presence of IRQs/NMI. > > Consider this scenario: Updater for cgroup stat C on CPU A in process > context is after llist_on_list() check and before this_cpu_cmpxchg() in > css_rstat_updated() where it get interrupted by IRQ/NMI. In the IRQ/NMI gets (sorry for another mail, when I read it I noticed those in a different buffer that may be applied if you decide for v2+)