linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Dennis Zhou <dennis@kernel.org>
To: Yafang Shao <laoar.shao@gmail.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
	linux-mm@kvack.org, Tejun Heo <tj@kernel.org>,
	Christoph Lameter <cl@linux.com>,
	Roman Gushchin <roman.gushchin@linux.dev>,
	Vasily Averin <vvs@openvz.org>
Subject: Re: [PATCH -mm] mm: percpu: fix incorrect size in pcpu_obj_full_size()
Date: Mon, 13 Feb 2023 12:12:15 -0800	[thread overview]
Message-ID: <Y+qZn3glIwRZEc6m@snowbird> (raw)
In-Reply-To: <CALOAHbCQprmAHdO-Zp+dkgr62Ai27iSP5DhxCC9uhkUQWhOdzg@mail.gmail.com>

On Sun, Feb 12, 2023 at 10:12:12PM +0800, Yafang Shao wrote:
> On Sat, Feb 11, 2023 at 6:39 AM Dennis Zhou <dennis@kernel.org> wrote:
> >
> > Hello,
> >
> > On Fri, Feb 10, 2023 at 02:05:08PM -0800, Andrew Morton wrote:
> > > On Fri, 10 Feb 2023 15:49:47 +0000 Yafang Shao <laoar.shao@gmail.com> wrote:
> > >
> > > > The extra space which is used to store the obj_cgroup membership is only
> > > > valid when kmemcg is enabled. The kmemcg can be disabled via the kernel
> > > > parameter "cgroup.memory=nokmem" at runtime.
> > > > This helper is also used in non-memcg code, for example the tracepoint,
> > > > so we should fix it.
> > > >
> > > > It was found by code review when I was implementing bpf memory usage[1].
> > > > No real issue happens in production environment.
> > > >
> > > > ...
> > > >
> > > > --- a/mm/percpu-internal.h
> > > > +++ b/mm/percpu-internal.h
> > > > @@ -4,6 +4,7 @@
> > > >
> > > >  #include <linux/types.h>
> > > >  #include <linux/percpu.h>
> > > > +#include <linux/memcontrol.h>
> > > >
> > > >  /*
> > > >   * pcpu_block_md is the metadata block struct.
> > > > @@ -125,7 +126,8 @@ static inline size_t pcpu_obj_full_size(size_t size)
> > > >     size_t extra_size = 0;
> > > >
> > > >  #ifdef CONFIG_MEMCG_KMEM
> > > > -   extra_size += size / PCPU_MIN_ALLOC_SIZE * sizeof(struct obj_cgroup *);
> > > > +   if (!mem_cgroup_kmem_disabled())
> > > > +           extra_size += size / PCPU_MIN_ALLOC_SIZE * sizeof(struct obj_cgroup *);
> > > >  #endif
> > > >
> > > >     return size * num_possible_cpus() + extra_size;
> > >
> >
> > Sorry I've been a bit mia...
> >
> > > Seems risky at the first look - enabling kmemcg at runtime will make
> > > prior calculations based on pcpu_obj_full_size) incorrect.  But as long
> > > as this is only used for accounting I guess that's OK.
> > >
> > > What happens if we do a bunch of allocations with kmemcg enabled, then
> > > disable kmemcg then free those allocations, or some such thing.  Does
> > > the accounting end up wrong?
> > >
> >
> > For now it works correctly because of 2 things. 1 - the function is only
> > called by accounting. 2 - the free path doesn't consult
> > mem_cgroup_kmem_disabled() but consults if a memcg exists for a percpu
> > allocation. If accounting is enabled, we'd always account the additional
> > memory for the memcg accounting. If it's not enabled, then percpu is
> > well unaccounted for.
> >
> > This function probably needs to be renamed a bit more carefully so it
> > doesn't bleed outside of mm/percpu.c.
> >
> 
> Do you have any suggestions on the new name ?
> 
> > In short, I don't think this change is correct.
> 
> Could you pls be more specific ?
> 

Hmmm I got ahead of myself. I misunderstood memcg_*_enabled() vs
memcg_*_disabled(). Roman clarified it just now in [1]. I was imagining
a world where we add disabled here and then eventually enabled would
propagate here too.

Anothing that was on my mind is, should a percpu object be charged for
the memcg space even if it's not in use. I now think it's yes and then
for general accounting outside of memcg, this function is correct.

Acked-by: Dennis Zhou <dennis@kernel.org>

Andrew, I have nothing queued. Do you mind picking this up?

[1] https://lore.kernel.org/linux-mm/20230213192922.1146370-1-roman.gushchin@linux.dev/T/#u

Thanks,
Dennis


  reply	other threads:[~2023-02-13 20:12 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-02-10 15:49 Yafang Shao
2023-02-10 22:05 ` Andrew Morton
2023-02-10 22:39   ` Dennis Zhou
2023-02-12 14:12     ` Yafang Shao
2023-02-13 20:12       ` Dennis Zhou [this message]
2023-02-12 14:05   ` Yafang Shao
2023-02-13 18:50     ` Andrew Morton
2023-02-14  1:57       ` Yafang Shao
2023-02-10 22:41 ` Roman Gushchin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Y+qZn3glIwRZEc6m@snowbird \
    --to=dennis@kernel.org \
    --cc=akpm@linux-foundation.org \
    --cc=cl@linux.com \
    --cc=laoar.shao@gmail.com \
    --cc=linux-mm@kvack.org \
    --cc=roman.gushchin@linux.dev \
    --cc=tj@kernel.org \
    --cc=vvs@openvz.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox