From: Yang Shi <shy828301@gmail.com>
To: David Hildenbrand <david@redhat.com>
Cc: Ryan Roberts <ryan.roberts@arm.com>,
Baolin Wang <baolin.wang@linux.alibaba.com>,
Bang Li <libang.li@antgroup.com>,
hughd@google.com, akpm@linux-foundation.org,
wangkefeng.wang@huawei.com, ziy@nvidia.com,
linux-kernel@vger.kernel.org, linux-mm@kvack.org
Subject: Re: [PATCH] support "THPeligible" semantics for mTHP with anonymous shmem
Date: Mon, 1 Jul 2024 11:20:27 -0700 [thread overview]
Message-ID: <CAHbLzkrv2U39oOFuuHpmcfvDOuMayjwdgXLshxtDSSPGPzOkJQ@mail.gmail.com> (raw)
In-Reply-To: <7a0bbe69-1e3d-4263-b206-da007791a5c4@redhat.com>
On Mon, Jul 1, 2024 at 3:23 AM David Hildenbrand <david@redhat.com> wrote:
>
> On 01.07.24 12:16, Ryan Roberts wrote:
> > On 01/07/2024 10:17, David Hildenbrand wrote:
> >> On 01.07.24 11:14, Ryan Roberts wrote:
> >>> On 01/07/2024 09:57, David Hildenbrand wrote:
> >>>> On 01.07.24 10:50, Ryan Roberts wrote:
> >>>>> On 01/07/2024 09:48, David Hildenbrand wrote:
> >>>>>> On 01.07.24 10:40, Ryan Roberts wrote:
> >>>>>>> On 01/07/2024 09:33, Baolin Wang wrote:
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> On 2024/7/1 15:55, Ryan Roberts wrote:
> >>>>>>>>> On 28/06/2024 11:49, Bang Li wrote:
> >>>>>>>>>> After the commit 7fb1b252afb5 ("mm: shmem: add mTHP support for
> >>>>>>>>>> anonymous shmem"), we can configure different policies through
> >>>>>>>>>> the multi-size THP sysfs interface for anonymous shmem. But
> >>>>>>>>>> currently "THPeligible" indicates only whether the mapping is
> >>>>>>>>>> eligible for allocating THP-pages as well as the THP is PMD
> >>>>>>>>>> mappable or not for anonymous shmem, we need to support semantics
> >>>>>>>>>> for mTHP with anonymous shmem similar to those for mTHP with
> >>>>>>>>>> anonymous memory.
> >>>>>>>>>>
> >>>>>>>>>> Signed-off-by: Bang Li <libang.li@antgroup.com>
> >>>>>>>>>> ---
> >>>>>>>>>> fs/proc/task_mmu.c | 10 +++++++---
> >>>>>>>>>> include/linux/huge_mm.h | 11 +++++++++++
> >>>>>>>>>> mm/shmem.c | 9 +--------
> >>>>>>>>>> 3 files changed, 19 insertions(+), 11 deletions(-)
> >>>>>>>>>>
> >>>>>>>>>> diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
> >>>>>>>>>> index 93fb2c61b154..09b5db356886 100644
> >>>>>>>>>> --- a/fs/proc/task_mmu.c
> >>>>>>>>>> +++ b/fs/proc/task_mmu.c
> >>>>>>>>>> @@ -870,6 +870,7 @@ static int show_smap(struct seq_file *m, void *v)
> >>>>>>>>>> {
> >>>>>>>>>> struct vm_area_struct *vma = v;
> >>>>>>>>>> struct mem_size_stats mss = {};
> >>>>>>>>>> + bool thp_eligible;
> >>>>>>>>>> smap_gather_stats(vma, &mss, 0);
> >>>>>>>>>> @@ -882,9 +883,12 @@ static int show_smap(struct seq_file *m, void
> >>>>>>>>>> *v)
> >>>>>>>>>> __show_smap(m, &mss, false);
> >>>>>>>>>> - seq_printf(m, "THPeligible: %8u\n",
> >>>>>>>>>> - !!thp_vma_allowable_orders(vma, vma->vm_flags,
> >>>>>>>>>> - TVA_SMAPS | TVA_ENFORCE_SYSFS, THP_ORDERS_ALL));
> >>>>>>>>>> + thp_eligible = !!thp_vma_allowable_orders(vma, vma->vm_flags,
> >>>>>>>>>> + TVA_SMAPS | TVA_ENFORCE_SYSFS, THP_ORDERS_ALL);
> >>>>>>>>>> + if (vma_is_anon_shmem(vma))
> >>>>>>>>>> + thp_eligible =
> >>>>>>>>>> !!shmem_allowable_huge_orders(file_inode(vma->vm_file),
> >>>>>>>>>> + vma, vma->vm_pgoff, thp_eligible);
> >>>>>>>>>
> >>>>>>>>> Afraid I haven't been following the shmem mTHP support work as much as I
> >>>>>>>>> would
> >>>>>>>>> have liked, but is there a reason why we need a separate function for
> >>>>>>>>> shmem?
> >>>>>>>>
> >>>>>>>> Since shmem_allowable_huge_orders() only uses shmem specific logic to
> >>>>>>>> determine
> >>>>>>>> if huge orders are allowable, there is no need to complicate the
> >>>>>>>> thp_vma_allowable_orders() function by adding more shmem related logic,
> >>>>>>>> making
> >>>>>>>> it more bloated. In my view, providing a dedicated helper
> >>>>>>>> shmem_allowable_huge_orders(), specifically for shmem, simplifies the logic.
> >>>>>>>
> >>>>>>> My point was really that a single interface (thp_vma_allowable_orders)
> >>>>>>> should be
> >>>>>>> used to get this information. I have no strong opinon on how the
> >>>>>>> implementation
> >>>>>>> of that interface looks. What you suggest below seems perfectly reasonable
> >>>>>>> to me.
> >>>>>>
> >>>>>> Right. thp_vma_allowable_orders() might require some care as discussed in
> >>>>>> other
> >>>>>> context (cleanly separate dax and shmem handling/orders). But that would be
> >>>>>> follow-up cleanups.
> >>>>>
> >>>>> Are you planning to do that, or do you want me to send a patch?
> >>>>
> >>>> I'm planning on looking into some details, especially the interaction with large
> >>>> folios in the pagecache. I'll let you know once I have a better idea what
> >>>> actually should be done :)
> >>>
> >>> OK great - I'll scrub it from my todo list... really getting things done today :)
> >>
> >> Resolved the khugepaged thiny already? :P
> >>
> >> [khugepaged not active when only enabling the sub-size via the 2M folder IIRC]
> >
> > Hmm... baby brain?
>
> :)
>
> I think I only mentioned it in a private mail at some point.
>
> >
> > Sorry about that. I've been a bit useless lately. For some reason it wasn't on
> > my list, but its there now. Will prioritise it, because I agree it's not good.
>
>
> IIRC, if you do
>
> echo never > /sys/kernel/mm/transparent_hugepage/enabled
> echo always > /sys/kernel/mm/transparent_hugepage/hugepages-2048kB/enabled
>
> khugepaged will not get activated.
khugepaged is controlled by the top level knob. But the above setting
sounds confusing, can we disable the top level knob, but enable it on
a per-order basis? TBH, it sounds weird and doesn't make too much
sense to me.
>
> --
> Cheers,
>
> David / dhildenb
>
>
next prev parent reply other threads:[~2024-07-01 18:20 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-28 10:49 Bang Li
2024-07-01 6:47 ` Baolin Wang
2024-07-01 6:55 ` David Hildenbrand
2024-07-01 7:18 ` Baolin Wang
2024-07-01 8:22 ` Bang Li
2024-07-01 6:57 ` David Hildenbrand
2024-07-01 8:24 ` Bang Li
2024-07-01 7:55 ` Ryan Roberts
2024-07-01 8:33 ` Baolin Wang
2024-07-01 8:40 ` Ryan Roberts
2024-07-01 8:46 ` Baolin Wang
2024-07-01 8:48 ` David Hildenbrand
2024-07-01 8:50 ` Ryan Roberts
2024-07-01 8:57 ` David Hildenbrand
2024-07-01 9:14 ` Ryan Roberts
2024-07-01 9:17 ` David Hildenbrand
2024-07-01 10:16 ` Ryan Roberts
2024-07-01 10:22 ` David Hildenbrand
2024-07-01 18:20 ` Yang Shi [this message]
2024-07-02 8:24 ` Ryan Roberts
2024-07-02 8:28 ` David Hildenbrand
2024-07-03 16:08 ` Yang Shi
2024-07-03 16:19 ` David Hildenbrand
2024-07-04 9:43 ` Ryan Roberts
2024-07-09 19:01 ` Yang Shi
2024-07-01 9:43 ` Bang Li
2024-07-01 11:12 ` Baolin Wang
2024-07-01 14:51 ` Bang Li
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAHbLzkrv2U39oOFuuHpmcfvDOuMayjwdgXLshxtDSSPGPzOkJQ@mail.gmail.com \
--to=shy828301@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=baolin.wang@linux.alibaba.com \
--cc=david@redhat.com \
--cc=hughd@google.com \
--cc=libang.li@antgroup.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=ryan.roberts@arm.com \
--cc=wangkefeng.wang@huawei.com \
--cc=ziy@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox