From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 5B785C2BD09 for ; Mon, 1 Jul 2024 08:33:49 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id C1D7F6B0098; Mon, 1 Jul 2024 04:33:48 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id BCD3A6B0099; Mon, 1 Jul 2024 04:33:48 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id A95276B009A; Mon, 1 Jul 2024 04:33:48 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id 896B36B0098 for ; Mon, 1 Jul 2024 04:33:48 -0400 (EDT) Received: from smtpin22.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id 40E94161A64 for ; Mon, 1 Jul 2024 08:33:48 +0000 (UTC) X-FDA: 82290520536.22.11CFB8A Received: from out30-99.freemail.mail.aliyun.com (out30-99.freemail.mail.aliyun.com [115.124.30.99]) by imf13.hostedemail.com (Postfix) with ESMTP id E8D622000B for ; Mon, 1 Jul 2024 08:33:42 +0000 (UTC) Authentication-Results: imf13.hostedemail.com; dkim=pass header.d=linux.alibaba.com header.s=default header.b=MhLFeZkV; spf=pass (imf13.hostedemail.com: domain of baolin.wang@linux.alibaba.com designates 115.124.30.99 as permitted sender) smtp.mailfrom=baolin.wang@linux.alibaba.com; dmarc=pass (policy=none) header.from=linux.alibaba.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1719822806; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=BUFOSpzDuAC43YmekwvjTOLx8yMs9durOLK1l/9bwYU=; b=ZjTnL/c7isqxWDRi8bOnEd3Z02QXW9wFNUl91n+Z5HdXNUsszPQ5X1BavGHNRb07g523Kz vv7zgbxhzR6f1CMz7+1SRRcpsM9Sl9nii384ToJyJ9BBOJKnjK2JaYMbmffoIiSaP6h2Vi O1zI90gpk/ND7OcqBJkCGMtc629iADg= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1719822806; a=rsa-sha256; cv=none; b=xboXfrZpM02r8pZIMmFOBCSBQVloCgnMCHYEVHvu6RooF1peNkwTHixkJo2j17mWvwYhYl Ugl221aSR/YaN18jpJp5c+D2FMgVKjClcKxWLPUNRndh1mtaAGGbK7OKphQQA0Po1K48CX SzujTlBYTgb0lFlr1x7EZUArRfCHCMA= ARC-Authentication-Results: i=1; imf13.hostedemail.com; dkim=pass header.d=linux.alibaba.com header.s=default header.b=MhLFeZkV; spf=pass (imf13.hostedemail.com: domain of baolin.wang@linux.alibaba.com designates 115.124.30.99 as permitted sender) smtp.mailfrom=baolin.wang@linux.alibaba.com; dmarc=pass (policy=none) header.from=linux.alibaba.com DKIM-Signature:v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.alibaba.com; s=default; t=1719822816; h=Message-ID:Date:MIME-Version:Subject:To:From:Content-Type; bh=BUFOSpzDuAC43YmekwvjTOLx8yMs9durOLK1l/9bwYU=; b=MhLFeZkVLUmdAYv/IviF/IwxuI8SzvujBI1+8aaeX1Uyxozoxl65TFqws0iolAttbmnBcrSY/42nQz34myCZwsmXvgK6CSrKKa2PfIbHuxtYj7YEq72hLR8DH1U6ssWRYlQFcC2bUcIiZAeJAnVvWKlCDZoMJPg/KSsUwmci+O0= X-Alimail-AntiSpam:AC=PASS;BC=-1|-1;BR=01201311R551e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=maildocker-contentspam033037067112;MF=baolin.wang@linux.alibaba.com;NM=1;PH=DS;RN=9;SR=0;TI=SMTPD_---0W9c.k7Y_1719822814; Received: from 30.97.56.67(mailfrom:baolin.wang@linux.alibaba.com fp:SMTPD_---0W9c.k7Y_1719822814) by smtp.aliyun-inc.com; Mon, 01 Jul 2024 16:33:34 +0800 Message-ID: <4d54880e-03f4-460a-94b9-e21b8ad13119@linux.alibaba.com> Date: Mon, 1 Jul 2024 16:33:33 +0800 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH] support "THPeligible" semantics for mTHP with anonymous shmem To: Ryan Roberts , Bang Li , hughd@google.com, akpm@linux-foundation.org Cc: david@redhat.com, wangkefeng.wang@huawei.com, ziy@nvidia.com, linux-kernel@vger.kernel.org, linux-mm@kvack.org References: <20240628104926.34209-1-libang.li@antgroup.com> <4b38db15-0716-4ffb-a38b-bd6250eb93da@arm.com> From: Baolin Wang In-Reply-To: <4b38db15-0716-4ffb-a38b-bd6250eb93da@arm.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: E8D622000B X-Stat-Signature: zjimpuf1ihfyuij3c7z45okw34g8fu9h X-Rspamd-Server: rspam09 X-Rspam-User: X-HE-Tag: 1719822822-223808 X-HE-Meta: U2FsdGVkX1+PJOYQ0lUQaGPdwHnKUdS9N1RCCS27OfCZ5DrL7nZjMThutgEyRjmHiNtpvsgN9ZwwZ1E6badKur0odSED4p5YtPyYH5WS0xTm2GPqNQsT78fYKeYw5UnMzp2VIgsI8PHfAnnLPQOHuu7jMVWtcFaSj5INSlRYkdRo+aH6P+VbhFeACdvwj/J8J3zmgOFeC5FxMAASbaPYspL5I7PEtZPZ1SO6kcj1KA9jenI3Sc7aOLwXfl8qpcxx5wyH9W7pmLH28wtNhIXsUy4et0e4pFhouyCvWrwnsk2Mvqq/pvozE190UX9lCl4pqegofKlygS13Cek+0U5z2Zc4UkLWC9s62HFeGxBSmujOKPIGIHyiCrfvsqWIK/nlusEY4cWuBnGteA4Q0sb+BUQCM3eIjTH/9I8jEgAQE6M5FSp2YlEYiFwqaPWxPYvYVlShd7JL7ggLNjB0Fd6RMjeShdBv9gNz12GmxMp6UwTIu4B2Od6nHtmLZinI2AE3or3Zvo7Hy1cnUM/z8Nha9XtDGyf7q1CpMUNiYWGXEx+ASJW2Z6bxfHZ8kOJT2XeHw0LwD5b8sVQqZ77MHHNOK9Lb7q/nnZqr2cgw+/vubgS+estJ89cSK532tdOhK7cBn3mkT8hC320jaWi5dJsND2Xc4GNHgjLX5ojxtu40C6iQkPuuRFcYu51+tV0l/gqxHM7ktc8K7sGej+tYJrFPhlSE0bbZsPR2UTG0b2RcjWOp8/TRw1Bplmzu7JaqR0SUXNZOKSELtGJlTbNXCgEC0JrMoN7ieZPTA6eWuP1FtoCKWEkubwG+JjE7FaSxB2ipH5OPrh5qXt2QxFhL1mAoh/CkD3Gzbg3MyMGOpNMSArWyeWKk4SA6n5XW1RVCD7KfSwdUOLuAcgp64C/79Ivjw+WkhVLzGnBfClO6WU1fGoZXDv+cO/mY2STVxMOmVBeYwAulJMZaIB9hdB32HY7 MbrZIrtQ l+q0sC14hzVByOOr8gqrR/iNxQdD/SXu6eX3r71Totckcq1PDjvlgEYd6pXuDYKdsetdL/XG167twEjmoTzuQh3w0fQp27cg0dcBTMHOhQjIow81RKPYrpmE0baxWcg4rIAtSz6z7nZ42yP1XRvHaVuBvXeo7Qz+JPqOZA1iW55Z/jk5dVvx4XUBVsSK6dLeBSussQWNAOVQa0oIMFNX60YyIvmZzLLMa1mbURgBt089U5PoTMY8g+MXWOHx07OgX5cJC0H0f0sK5QrhZMt32BKi2YUi2uksbuEkwe3QANES6/sM/9TyMtEmnOLFF+FyVp+BvtU+o0tDaT/dPtjlbnurc082CK8DDCUPPn0lWdixDpBo= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 2024/7/1 15:55, Ryan Roberts wrote: > On 28/06/2024 11:49, Bang Li wrote: >> After the commit 7fb1b252afb5 ("mm: shmem: add mTHP support for >> anonymous shmem"), we can configure different policies through >> the multi-size THP sysfs interface for anonymous shmem. But >> currently "THPeligible" indicates only whether the mapping is >> eligible for allocating THP-pages as well as the THP is PMD >> mappable or not for anonymous shmem, we need to support semantics >> for mTHP with anonymous shmem similar to those for mTHP with >> anonymous memory. >> >> Signed-off-by: Bang Li >> --- >> fs/proc/task_mmu.c | 10 +++++++--- >> include/linux/huge_mm.h | 11 +++++++++++ >> mm/shmem.c | 9 +-------- >> 3 files changed, 19 insertions(+), 11 deletions(-) >> >> diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c >> index 93fb2c61b154..09b5db356886 100644 >> --- a/fs/proc/task_mmu.c >> +++ b/fs/proc/task_mmu.c >> @@ -870,6 +870,7 @@ static int show_smap(struct seq_file *m, void *v) >> { >> struct vm_area_struct *vma = v; >> struct mem_size_stats mss = {}; >> + bool thp_eligible; >> >> smap_gather_stats(vma, &mss, 0); >> >> @@ -882,9 +883,12 @@ static int show_smap(struct seq_file *m, void *v) >> >> __show_smap(m, &mss, false); >> >> - seq_printf(m, "THPeligible: %8u\n", >> - !!thp_vma_allowable_orders(vma, vma->vm_flags, >> - TVA_SMAPS | TVA_ENFORCE_SYSFS, THP_ORDERS_ALL)); >> + thp_eligible = !!thp_vma_allowable_orders(vma, vma->vm_flags, >> + TVA_SMAPS | TVA_ENFORCE_SYSFS, THP_ORDERS_ALL); >> + if (vma_is_anon_shmem(vma)) >> + thp_eligible = !!shmem_allowable_huge_orders(file_inode(vma->vm_file), >> + vma, vma->vm_pgoff, thp_eligible); > > Afraid I haven't been following the shmem mTHP support work as much as I would > have liked, but is there a reason why we need a separate function for shmem? Since shmem_allowable_huge_orders() only uses shmem specific logic to determine if huge orders are allowable, there is no need to complicate the thp_vma_allowable_orders() function by adding more shmem related logic, making it more bloated. In my view, providing a dedicated helper shmem_allowable_huge_orders(), specifically for shmem, simplifies the logic. IIUC, I agree with David's suggestion that the shmem_allowable_huge_orders() helper function could be used in thp_vma_allowable_orders() to support shmem mTHP. Something like: diff --git a/mm/huge_memory.c b/mm/huge_memory.c index c7ce28f6b7f3..9677fe6cf478 100644 --- a/mm/huge_memory.c +++ b/mm/huge_memory.c @@ -151,10 +151,13 @@ unsigned long __thp_vma_allowable_orders(struct vm_area_struct *vma, * Must be done before hugepage flags check since shmem has its * own flags. */ - if (!in_pf && shmem_file(vma->vm_file)) - return shmem_is_huge(file_inode(vma->vm_file), vma->vm_pgoff, - !enforce_sysfs, vma->vm_mm, vm_flags) - ? orders : 0; + if (!in_pf && shmem_file(vma->vm_file)) { + bool global_huge = shmem_is_huge(file_inode(vma->vm_file), vma->vm_pgoff, + !enforce_sysfs, vma->vm_mm, vm_flags); + + return shmem_allowable_huge_orders(file_inode(vma->vm_file), + vma, vma->vm_pgoff, global_huge); + } if (!vma_is_anonymous(vma)) { /* > Couldn't (shouldn't) thp_vma_allowable_orders() be taught to handle shmem too? > >> + seq_printf(m, "THPeligible: %8u\n", thp_eligible); >> >> if (arch_pkeys_enabled()) >> seq_printf(m, "ProtectionKey: %8u\n", vma_pkey(vma)); >> diff --git a/include/linux/huge_mm.h b/include/linux/huge_mm.h >> index 212cca384d7e..f87136f38aa1 100644 >> --- a/include/linux/huge_mm.h >> +++ b/include/linux/huge_mm.h >> @@ -267,6 +267,10 @@ unsigned long thp_vma_allowable_orders(struct vm_area_struct *vma, >> return __thp_vma_allowable_orders(vma, vm_flags, tva_flags, orders); >> } >> >> +unsigned long shmem_allowable_huge_orders(struct inode *inode, >> + struct vm_area_struct *vma, pgoff_t index, >> + bool global_huge); >> + >> struct thpsize { >> struct kobject kobj; >> struct list_head node; >> @@ -460,6 +464,13 @@ static inline unsigned long thp_vma_allowable_orders(struct vm_area_struct *vma, >> return 0; >> } >> >> +static inline unsigned long shmem_allowable_huge_orders(struct inode *inode, >> + struct vm_area_struct *vma, pgoff_t index, >> + bool global_huge) >> +{ >> + return 0; >> +} >> + >> #define transparent_hugepage_flags 0UL >> >> #define thp_get_unmapped_area NULL >> diff --git a/mm/shmem.c b/mm/shmem.c >> index d495c0701a83..aa85df9c662a 100644 >> --- a/mm/shmem.c >> +++ b/mm/shmem.c >> @@ -1622,7 +1622,7 @@ static gfp_t limit_gfp_mask(gfp_t huge_gfp, gfp_t limit_gfp) >> } >> >> #ifdef CONFIG_TRANSPARENT_HUGEPAGE >> -static unsigned long shmem_allowable_huge_orders(struct inode *inode, >> +unsigned long shmem_allowable_huge_orders(struct inode *inode, >> struct vm_area_struct *vma, pgoff_t index, >> bool global_huge) >> { >> @@ -1707,13 +1707,6 @@ static unsigned long shmem_suitable_orders(struct inode *inode, struct vm_fault >> return orders; >> } >> #else >> -static unsigned long shmem_allowable_huge_orders(struct inode *inode, >> - struct vm_area_struct *vma, pgoff_t index, >> - bool global_huge) >> -{ >> - return 0; >> -} >> - >> static unsigned long shmem_suitable_orders(struct inode *inode, struct vm_fault *vmf, >> struct address_space *mapping, pgoff_t index, >> unsigned long orders)