From: Andrew Morton <akpm@linux-foundation.org>
To: Joshua Hahn <joshua.hahnjy@gmail.com>
Cc: Michal Hocko <mhocko@suse.com>,
Johannes Weiner <hannes@cmpxchg.org>,
nphamcs@gmail.com, shakeel.butt@linux.dev,
roman.gushchin@linux.dev, muchun.song@linux.dev, tj@kernel.org,
lizefan.x@bytedance.com, mkoutny@suse.com, corbet@lwn.net,
lnyng@meta.com, cgroups@vger.kernel.org, linux-mm@kvack.org,
linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org,
kernel-team@meta.com
Subject: Re: [PATCH v3 1/1] memcg/hugetlb: Adding hugeTLB counters to memcg
Date: Thu, 31 Oct 2024 18:34:13 -0700 [thread overview]
Message-ID: <20241031183413.bb0bc34e8354cc14cdfc3c29@linux-foundation.org> (raw)
In-Reply-To: <CAN+CAwMioguv6itTSYVUO9__kQVv6HZO2-i0NWt10-x7f6JVSQ@mail.gmail.com>
On Thu, 31 Oct 2024 15:03:34 -0400 Joshua Hahn <joshua.hahnjy@gmail.com> wrote:
> Andrew -- I am sorry to ask again, but do you think you can replace
> the 3rd section in the patch (3. Implementation Details) with the
> following paragraphs?
No problem.
: This patch introduces a new counter to memory.stat that tracks hugeTLB
: usage, only if hugeTLB accounting is done to memory.current. This feature
: is enabled the same way hugeTLB accounting is enabled, via the
: memory_hugetlb_accounting mount flag for cgroupsv2.
:
: 1. Why is this patch necessary?
: Currently, memcg hugeTLB accounting is an opt-in feature [1] that adds
: hugeTLB usage to memory.current. However, the metric is not reported in
: memory.stat. Given that users often interpret memory.stat as a breakdown
: of the value reported in memory.current, the disparity between the two
: reports can be confusing. This patch solves this problem by including the
: metric in memory.stat as well, but only if it is also reported in
: memory.current (it would also be confusing if the value was reported in
: memory.stat, but not in memory.current)
:
: Aside from the consistency between the two files, we also see benefits in
: observability. Userspace might be interested in the hugeTLB footprint of
: cgroups for many reasons. For instance, system admins might want to
: verify that hugeTLB usage is distributed as expected across tasks: i.e.
: memory-intensive tasks are using more hugeTLB pages than tasks that don't
: consume a lot of memory, or are seen to fault frequently. Note that this
: is separate from wanting to inspect the distribution for limiting purposes
: (in which case, hugeTLB controller makes more sense).
:
: 2. We already have a hugeTLB controller. Why not use that? It is true
: that hugeTLB tracks the exact value that we want. In fact, by enabling
: the hugeTLB controller, we get all of the observability benefits that I
: mentioned above, and users can check the total hugeTLB usage, verify if it
: is distributed as expected, etc.
:
: 3. Implementation Details:
: In the alloc / free hugetlb functions, we call lruvec_stat_mod_folio
: regardless of whether memcg accounts hugetlb. mem_cgroup_commit_charge
: which is called from alloc_hugetlb_folio will set memcg for the folio
: only if the CGRP_ROOT_MEMORY_HUGETLB_ACCOUNTING cgroup mount option is
: used, so lruvec_stat_mod_folio accounts per-memcg hugetlb counters only
: if the feature is enabled. Regardless of whether memcg accounts for
: hugetlb, the newly added global counter is updated and shown in
: /proc/vmstat.
:
: The global counter is added because vmstats is the preferred framework
: for cgroup stats. It makes stat items consistent between global and
: cgroups. It also provides a per-node breakdown, which is useful.
: Because it does not use cgroup-specific hooks, we also keep generic MM
: code separate from memcg code.
:
: With this said, there are 2 problems:
: (a) They are still not reported in memory.stat, which means the
: disparity between the memcg reports are still there.
: (b) We cannot reasonably expect users to enable the hugeTLB controller
: just for the sake of hugeTLB usage reporting, especially since
: they don't have any use for hugeTLB usage enforcing [2].
:
: [1] https://lore.kernel.org/all/20231006184629.155543-1-nphamcs@gmail.com/
: [2] Of course, we can't make a new patch for every feature that can be
: duplicated. However, since the existing solution of enabling the
: hugeTLB controller is an imperfect solution that still leaves a
: discrepancy between memory.stat and memory.curent, I think that it
: is reasonable to isolate the feature in this case.
next prev parent reply other threads:[~2024-11-01 13:53 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-10-28 21:05 Joshua Hahn
2024-10-28 22:38 ` Shakeel Butt
2024-10-29 20:28 ` Roman Gushchin
2024-10-29 20:48 ` Johannes Weiner
2024-10-29 21:04 ` Nhat Pham
2024-10-30 11:35 ` Michal Hocko
2024-10-30 15:01 ` Johannes Weiner
2024-10-30 15:27 ` Michal Hocko
2024-10-30 18:30 ` Johannes Weiner
2024-10-30 20:43 ` Joshua Hahn
2024-10-30 23:26 ` Andrew Morton
2024-10-31 8:12 ` Michal Hocko
2024-10-31 16:09 ` Johannes Weiner
2024-10-31 19:03 ` Joshua Hahn
2024-11-01 1:34 ` Andrew Morton [this message]
2024-11-01 18:33 ` Joshua Hahn
2024-11-01 20:02 ` Andrew Morton
2024-11-01 20:28 ` Joshua Hahn
2024-10-30 14:52 ` Chris Down
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20241031183413.bb0bc34e8354cc14cdfc3c29@linux-foundation.org \
--to=akpm@linux-foundation.org \
--cc=cgroups@vger.kernel.org \
--cc=corbet@lwn.net \
--cc=hannes@cmpxchg.org \
--cc=joshua.hahnjy@gmail.com \
--cc=kernel-team@meta.com \
--cc=linux-doc@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lizefan.x@bytedance.com \
--cc=lnyng@meta.com \
--cc=mhocko@suse.com \
--cc=mkoutny@suse.com \
--cc=muchun.song@linux.dev \
--cc=nphamcs@gmail.com \
--cc=roman.gushchin@linux.dev \
--cc=shakeel.butt@linux.dev \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox