From: Suren Baghdasaryan <surenb@google.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: Pasha Tatashin <pasha.tatashin@soleen.com>,
g@casper.infradead.org,
Kent Overstreet <kent.overstreet@linux.dev>,
Vlastimil Babka <vbabka@suse.cz>,
lsf-pc@lists.linux-foundation.org,
linux-fsdevel <linux-fsdevel@vger.kernel.org>,
linux-mm <linux-mm@kvack.org>
Subject: Re: [LSF/MM/BPF TOPIC] Memory profiling using code tagging
Date: Sun, 21 Jan 2024 19:29:41 -0800 [thread overview]
Message-ID: <CAJuCfpGpL6-7KEFFDY=nR5fA1zMi3-8MmJG=6iuHdpUcv07qnA@mail.gmail.com> (raw)
In-Reply-To: <Za3CbL5U7dFp6aL2@casper.infradead.org>
On Sun, Jan 21, 2024 at 5:18 PM Matthew Wilcox <willy@infradead.org> wrote:
>
> On Sun, Jan 21, 2024 at 06:39:26PM -0500, Pasha Tatashin wrote:
> > On Wed, May 10, 2023 at 12:28 PM Kent Overstreet
> > <kent.overstreet@linux.dev> wrote:
> > > Hasn't been addressed yet, but we were just talking about moving the
> > > codetag pointer from page_ext to page last night for memory overhead
> > > reasons.
> > >
> > > The disadvantage then is that the memory overhead doesn't go down if you
> > > disable memory allocation profiling at boot time...
> > >
> > > But perhaps the performance overhead is low enough now that this is not
> > > something we expect to be doing as much?
> > >
> > > Choices, choices...
> >
> > I would like to participate in this discussion, specifically to
>
> Umm, this is a discussion proposal for last year, not this. I don't
> remember if a followup discussion has been proposed for this year?
My bad. I should submit a proposal for followup discussion for this
year. Will do that this coming week.
>
> > 2. Reducing the memory overhead by not using page_ext pointer, but
> > instead use n-bits in the page->flags.
> >
> > The number of buckets is actually not that large, there is no need to
> > keep 8-byte pointer in page_ext, it could be an idx in an array of a
> > specific size. There could be buckets that contain several stacks.
>
> There are a lot of people using "n bits in page->flags" and I don't
> have a good feeling for how many we really have left. MGLRU uses a
> variable number of bits. There's PG_arch_2 and PG_arch_3. There's
> PG_uncached. There's PG_young and PG_idle. And of course we have
> NUMA node (10 bits?), section (?), zone (3 bits?) I count 28 bits
> allocated with all the CONFIG enabled, then 13 for node+zone, so it
> certainly seems like there's a lot free on 64-bit, but it'd be
> nice to have it written out properly.
>
> Related, what do we think is going to happen with page_ext in a memdesc
> world (also what's going to happen with the kmsan goop in struct page?)
>
> I see page_idle_ops, page_owner_ops and page_table_check_ops.
> page_idle_ops only uses the 8 byte flags. page_owner_ops uses an extra
> 64 bytes (!). page_table_check uses an extra 8 bytes.
>
> page_idle looks to be for folios only. page_table_check seems like
> it should be folded into pgdesc. page_owner maybe gets added to every
> allocation rather than every page (but that's going to be interesting
> for memdescs which don't normally need an allocation).
>
> That seems to imply that we can get rid of page_ext entirely, which will
> be nice. I don't understand kmsan well enough to understand what to
> do about it. If it's per-allocation, we can handle it like page_owner.
> If it really is per-page, we can make it an ifdef in struct page itself.
> I think it's OK to grow struct page for such a rarely used debugging
> option.
next prev parent reply other threads:[~2024-01-22 3:29 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-02-22 19:31 Suren Baghdasaryan
2023-03-28 16:28 ` Vlastimil Babka
2023-03-28 16:55 ` Kent Overstreet
2023-05-10 16:28 ` Kent Overstreet
2024-01-21 23:39 ` Pasha Tatashin
2024-01-21 23:56 ` Kent Overstreet
2024-01-22 1:18 ` Matthew Wilcox
2024-01-22 3:29 ` Suren Baghdasaryan [this message]
2023-05-10 16:26 ` Suren Baghdasaryan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAJuCfpGpL6-7KEFFDY=nR5fA1zMi3-8MmJG=6iuHdpUcv07qnA@mail.gmail.com' \
--to=surenb@google.com \
--cc=g@casper.infradead.org \
--cc=kent.overstreet@linux.dev \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lsf-pc@lists.linux-foundation.org \
--cc=pasha.tatashin@soleen.com \
--cc=vbabka@suse.cz \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox