From: Zi Yan <ziy@nvidia.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: linux-mm@kvack.org, Andrew Morton <akpm@linux-foundation.org>,
"Kirill A . Shutemov" <kirill.shutemov@linux.intel.com>,
Ryan Roberts <ryan.roberts@arm.com>,
Hugh Dickins <hughd@google.com>,
David Hildenbrand <david@redhat.com>,
Yang Shi <yang@os.amperecomputing.com>,
Miaohe Lin <linmiaohe@huawei.com>,
Kefeng Wang <wangkefeng.wang@huawei.com>,
Yu Zhao <yuzhao@google.com>, John Hubbard <jhubbard@nvidia.com>,
Baolin Wang <baolin.wang@linux.alibaba.com>,
linux-kselftest@vger.kernel.org, linux-kernel@vger.kernel.org,
Kairui Song <kasong@tencent.com>
Subject: Re: [PATCH v9 2/8] mm/huge_memory: add two new (not yet used) functions for folio_split()
Date: Thu, 27 Feb 2025 10:42:54 -0500
Message-ID: <DEA9200F-AC1E-42E4-8876-404179BCA465@nvidia.com>
In-Reply-To: <Z8CBPF9_gDZuDjED@casper.infradead.org>
On 27 Feb 2025, at 10:14, Matthew Wilcox wrote:
> On Thu, Feb 27, 2025 at 05:55:43AM +0000, Matthew Wilcox wrote:
>> On Wed, Feb 26, 2025 at 04:00:25PM -0500, Zi Yan wrote:
>>> +static int __split_unmapped_folio(struct folio *folio, int new_order,
>>> + struct page *split_at, struct page *lock_at,
>>> + struct list_head *list, pgoff_t end,
>>> + struct xa_state *xas, struct address_space *mapping,
>>> + bool uniform_split)
>>> +{
>> [...]
>>> + /* complete memcg works before add pages to LRU */
>>> + split_page_memcg(&folio->page, old_order, split_order);
>>> + split_page_owner(&folio->page, old_order, split_order);
>>> + pgalloc_tag_split(folio, old_order, split_order);
>>
>> At least split_page_memcg() needs to become aware of 'uniform_split'.
>>
>> if (folio_memcg_kmem(folio))
>> obj_cgroup_get_many(__folio_objcg(folio), old_nr / new_nr - 1);
>>
>> If we're doing uniform_split, that calculation should be
>> old_order - new_order - 1
>
> umm, old_order - new_order. Anyway, here's a patch I've done on top of
> your work, but it probably needs to be massaged slightly and placed
> before your work?

Wait. uniform_split is the existing approach, which splits one order-9
folio into 512 order-0 folios in a single step, so split_page_memcg()
still works for it unchanged. For !uniform_split, split_page_memcg() is
called multiple times, each time with old_order = new_order + 1, so what
split_page_memcg() does is:

1. the order-9 folio is split into two order-8 folios, each of which gets
   its memcg, and the ref count is increased by 1;
2. one of the order-8s is split into two order-7s, each of which gets
   its memcg, and the ref count is increased by 1;
…
9. one of the order-1s is split into two order-0s, each of which gets
   its memcg, and the ref count is increased by 1.

At the end, the refcount has been increased by old_order - new_order in
total, as you described above. Let me know if that makes sense to you.
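
To make the arithmetic concrete, here is a small userspace sketch (not
kernel code). It only models the "old_nr / new_nr - 1" reference count
taken per split_page_memcg() call. The helper names refs_per_call(),
uniform_split_refs() and nonuniform_split_refs() are made up for
illustration and do not exist in the kernel.

```c
/* Sketch only: models the reference arithmetic, not the kernel code. */

/*
 * Extra references taken by one split_page_memcg()-style call:
 * old_nr / new_nr - 1, where nr = 1 << order.
 */
static unsigned int refs_per_call(unsigned int old_order,
				  unsigned int new_order)
{
	unsigned int old_nr = 1U << old_order;
	unsigned int new_nr = 1U << new_order;

	return old_nr / new_nr - 1;
}

/* Uniform split: a single call going straight from old_order to new_order. */
static unsigned int uniform_split_refs(unsigned int old_order,
				       unsigned int new_order)
{
	return refs_per_call(old_order, new_order);
}

/*
 * Non-uniform split: one call per level, each with old_order = new_order + 1,
 * so every call contributes 2 / 1 - 1 = 1 reference.
 */
static unsigned int nonuniform_split_refs(unsigned int old_order,
					  unsigned int new_order)
{
	unsigned int refs = 0;
	unsigned int order;

	for (order = old_order; order > new_order; order--)
		refs += refs_per_call(order, order - 1);

	return refs;
}
```

Under these assumptions, splitting order-9 down to order-0 adds
2^9 - 1 = 511 references in the uniform case and 9 in the non-uniform
case, without split_page_memcg() needing to know which kind of split
is in progress.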
>
> From 190e13ed77e562eb59fa1fa4bfefdefe5d0416ed Mon Sep 17 00:00:00 2001
> From: "Matthew Wilcox (Oracle)" <willy@infradead.org>
> Date: Mon, 28 Oct 2024 16:23:30 -0400
> Subject: [PATCH] mm: Separate folio_split_memcg() from split_page_memcg()
>
> Folios always use memcg_data to refer to the mem_cgroup while pages
> allocated with GFP_ACCOUNT have a pointer to the obj_cgroup. Since the
> caller already knows what it has, split the function into two and then
> we don't need to check.
>
> Signed-off-by: Matthew Wilcox (Oracle) <willy@infradead.org>
> ---
> include/linux/memcontrol.h | 7 +++++++
> mm/huge_memory.c | 6 ++++--
> mm/memcontrol.c | 18 +++++++++++++++---
> 3 files changed, 26 insertions(+), 5 deletions(-)
>
> diff --git a/include/linux/memcontrol.h b/include/linux/memcontrol.h
> index 57664e2a8fb7..155c3f81f4df 100644
> --- a/include/linux/memcontrol.h
> +++ b/include/linux/memcontrol.h
> @@ -1039,6 +1039,8 @@ static inline void memcg_memory_event_mm(struct mm_struct *mm,
> }
>
> void split_page_memcg(struct page *head, int old_order, int new_order);
> +void folio_split_memcg(struct folio *folio, unsigned old_order,
> + unsigned new_order, bool uniform_split);
>
> static inline u64 cgroup_id_from_mm(struct mm_struct *mm)
> {
> @@ -1463,6 +1465,11 @@ static inline void split_page_memcg(struct page *head, int old_order, int new_or
> {
> }
>
> +static inline void folio_split_memcg(struct folio *folio, unsigned old_order,
> + unsigned new_order, bool uniform)
> +{
> +}
> +
> static inline u64 cgroup_id_from_mm(struct mm_struct *mm)
> {
> return 0;
> diff --git a/mm/huge_memory.c b/mm/huge_memory.c
> index 1e45064046a0..75fa9c9d9ec9 100644
> --- a/mm/huge_memory.c
> +++ b/mm/huge_memory.c
> @@ -3401,6 +3401,9 @@ static void __split_folio_to_order(struct folio *folio, int new_order)
> folio_set_young(new_folio);
> if (folio_test_idle(folio))
> folio_set_idle(new_folio);
> +#ifdef CONFIG_MEMCG
> + new_folio->memcg_data = folio->memcg_data;
> +#endif
>
> folio_xchg_last_cpupid(new_folio, folio_last_cpupid(folio));
> }
> @@ -3529,8 +3532,7 @@ static int __split_unmapped_folio(struct folio *folio, int new_order,
> }
> }
>
> - /* complete memcg works before add pages to LRU */
> - split_page_memcg(&folio->page, old_order, split_order);
> + folio_split_memcg(folio, old_order, split_order, uniform_split);
> split_page_owner(&folio->page, old_order, split_order);
> pgalloc_tag_split(folio, old_order, split_order);
>
> diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> index 16f3bdbd37d8..c2d41e1337cb 100644
> --- a/mm/memcontrol.c
> +++ b/mm/memcontrol.c
> @@ -3064,10 +3064,22 @@ void split_page_memcg(struct page *head, int old_order, int new_order)
> for (i = new_nr; i < old_nr; i += new_nr)
> folio_page(folio, i)->memcg_data = folio->memcg_data;
>
> - if (folio_memcg_kmem(folio))
> - obj_cgroup_get_many(__folio_objcg(folio), old_nr / new_nr - 1);
> + obj_cgroup_get_many(__folio_objcg(folio), old_nr / new_nr - 1);
> +}
> +
> +void folio_split_memcg(struct folio *folio, unsigned old_order,
> + unsigned new_order, bool uniform_split)
> +{
> + unsigned new_refs;
> +
> + if (mem_cgroup_disabled() || !folio_memcg_charged(folio))
> + return;
> +
> + if (uniform_split)
> + new_refs = (1 << (old_order - new_order)) - 1;
> else
> - css_get_many(&folio_memcg(folio)->css, old_nr / new_nr - 1);
> + new_refs = old_order - new_order;
> + css_get_many(&__folio_memcg(folio)->css, new_refs);
> }
>
> unsigned long mem_cgroup_usage(struct mem_cgroup *memcg, bool swap)
> --
> 2.47.2
Best Regards,
Yan, Zi