linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: "David Wang" <00107082@163.com>
To: "Suren Baghdasaryan" <surenb@google.com>
Cc: "Vlastimil Babka" <vbabka@suse.cz>,
	akpm@linux-foundation.org, mhocko@suse.com, jackmanb@google.com,
	hannes@cmpxchg.org, ziy@nvidia.com, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org,
	"Shakeel Butt" <shakeel.butt@linux.dev>
Subject: Re: [PATCH] mm/codetag: sub in advance when free non-compound high order pages
Date: Tue, 6 May 2025 00:42:36 +0800 (CST)	[thread overview]
Message-ID: <3be93014.40dc.196a153f521.Coremail.00107082@163.com> (raw)
In-Reply-To: <CAJuCfpHS9obp7yoxRUPoj3Yp8KkkYWCnnoXwtmAOxfynnq8aug@mail.gmail.com>




At 2025-05-05 23:33:50, "Suren Baghdasaryan" <surenb@google.com> wrote:
>On Mon, May 5, 2025 at 7:55 AM Vlastimil Babka <vbabka@suse.cz> wrote:
>>
>> On 5/5/25 16:31, David Wang wrote:
>> >
>> >
>> > At 2025-05-05 21:12:55, "Vlastimil Babka" <vbabka@suse.cz> wrote:
>> >>On 5/4/25 08:19, David Wang wrote:
>> >>> When page is non-compound, page[0] could be released by other
>> >>> thread right after put_page_testzero failed in current thread,
>> >>> pgalloc_tag_sub_pages afterwards would manipulate an invalid
>> >>> page for accounting remaining pages:
>> >>>
>> >>> [timeline]   [thread1]                     [thread2]
>> >>>   |          alloc_page non-compound
>> >>>   V
>> >>>   |                                        get_page, rf counter inc
>> >>>   V
>> >>>   |          in ___free_pages
>> >>>   |          put_page_testzero fails
>> >>>   V
>> >>>   |                                        put_page, page released
>> >>>   V
>> >>>   |          in ___free_pages,
>> >>>   |          pgalloc_tag_sub_pages
>> >>>   |          manipulate an invalid page
>> >>>   V
>> >>>   V
>> >>>
>> >>> Move the tag page accounting ahead, and only account remaining pages
>> >>> for non-compound pages with non-zero order.
>> >>>
>> >>> Signed-off-by: David Wang <00107082@163.com>
>
>Thanks for reporting!
>
>> >>
>> >>Hmm, I think the problem was introduced by 51ff4d7486f0 ("mm: avoid extra
>> >>mem_alloc_profiling_enabled() checks"). Previously we'd get the tag pointer
>> >>upfront and avoid the page use-after-free.
>
>Right, sorry I missed that.
>
>> >
>> >
>> > Oh, you're right. I forgot to check history......
>> >
>> >
>> >>
>> >>It would likely be nicer to fix it by going back to that approach for
>> >>___free_pages(), while hopefully keeping the optimisations of 51ff4d7486f0
>> >>for the other call sites where it applies?
>> >
>> > After checking that commit, I kind of feels the changes in __free_pages are
>> >  the major optimization of the commit....
>>
>> We could have both pgalloc_tag_get() to use in __free_page() as before
>> 51ff4d7486f0, and keep __pgalloc_tag_get() to use in pgalloc_tag_split() and
>> pgalloc_tag_swap().
>
>Yes, we can add back pgalloc_tag_get() which would call
>__pgalloc_tag_get() if mem_alloc_profiling_enabled() is true and
>change pgalloc_tag_sub_pages() back to use tags instead of pages.
>__free_pages() is the only user of that function, so that change
>should not affect anything else.


Adding back pgalloc_tag_get() seems just reverting 51ff4d7486f0.....
Do you want me to do it or you take over and make further adjustments?



>
>>
>> I think __free_page() didn't benefit from the stated purpose of "avoiding
>> mem_alloc_profiling_enabled() ... which is often called after that check was
>> already done"
>>
>> > What about revert that commit and make optimization by condition checks,
>> > similar to what this patch did?
>>
>> The downside of the condition checks is they make the code more complex and
>> might actually increase overhead when mem_alloc_profiling_enabled() is
>> false, as those checks add non-static branches outside of the static branch
>> that's mem_alloc_profiling_enabled().
>>
>> I think __free_pages() before 51ff4d7486f0 was quite ok.
>>
>> - pgalloc_tag_get() is done unconditionally, but its code is all inside the
>> mem_alloc_profiling_enabled() static branch so that's a no-op when profiling
>> is not enabled
>>
>> - pgalloc_tag_sub_pages() is also all behind the static branch inside. Also
>> it's a very rare path anyway, most freeing should go through the
>> put_page_testzero() being true.
>
>Yeah, the main goal of that change in __free_page() was to make
>__pgalloc_tag_get() a local function for alloc_tags and limiting the
>direct use of struct alloc_tag in the core mm code. Obviously I
>screwed up forgetting why we had to store the tag in the first place.
>An additional comment in __free_page() is probably a good idea to
>avoid confusion in the future.
>Thanks,
>Suren.
>
>>
>> > David
>> >
>>

  reply	other threads:[~2025-05-05 16:43 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-05-04  6:19 David Wang
2025-05-05 13:12 ` Vlastimil Babka
2025-05-05 14:31   ` David Wang
2025-05-05 14:55     ` Vlastimil Babka
2025-05-05 15:33       ` Suren Baghdasaryan
2025-05-05 16:42         ` David Wang [this message]
2025-05-05 16:53           ` Suren Baghdasaryan
2025-05-05 18:34             ` [PATCH v2] mm/codetag: move tag retrieval back upfront in __free_pages() David Wang
2025-05-05 19:17               ` David Wang
2025-05-05 19:30             ` [PATCH v3] " David Wang
2025-05-05 20:32               ` Suren Baghdasaryan
2025-05-06  7:58               ` Vlastimil Babka

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3be93014.40dc.196a153f521.Coremail.00107082@163.com \
    --to=00107082@163.com \
    --cc=akpm@linux-foundation.org \
    --cc=hannes@cmpxchg.org \
    --cc=jackmanb@google.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mhocko@suse.com \
    --cc=shakeel.butt@linux.dev \
    --cc=surenb@google.com \
    --cc=vbabka@suse.cz \
    --cc=ziy@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox