From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id E7809EB64DC for ; Mon, 17 Jul 2023 13:13:59 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 7DEDB8D0001; Mon, 17 Jul 2023 09:13:59 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 78F216B0074; Mon, 17 Jul 2023 09:13:59 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 657338D0001; Mon, 17 Jul 2023 09:13:59 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0016.hostedemail.com [216.40.44.16]) by kanga.kvack.org (Postfix) with ESMTP id 590D56B0072 for ; Mon, 17 Jul 2023 09:13:59 -0400 (EDT) Received: from smtpin18.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay09.hostedemail.com (Postfix) with ESMTP id 26B81801E3 for ; Mon, 17 Jul 2023 13:13:59 +0000 (UTC) X-FDA: 81021146598.18.7CFD003 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by imf08.hostedemail.com (Postfix) with ESMTP id 0D13F160003 for ; Mon, 17 Jul 2023 13:13:56 +0000 (UTC) Authentication-Results: imf08.hostedemail.com; dkim=none; spf=pass (imf08.hostedemail.com: domain of ryan.roberts@arm.com designates 217.140.110.172 as permitted sender) smtp.mailfrom=ryan.roberts@arm.com; dmarc=pass (policy=none) header.from=arm.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1689599637; a=rsa-sha256; cv=none; b=alZ+96qbCqulXbJgzZWCtnSW/X4wC80dvIiBZnALaUbMtcAi3HJailf9IQCLDVWZJrvxnJ A+Cs+saVTvtWDeRw+s5mGWSHcBl6tPYsbId6QlKm7zPYb1d/BbuCHASz2WwNiMnKIM5Tbk /aZ84IVvkYMhZVWw7F48xc0OXZ88dgs= ARC-Authentication-Results: i=1; imf08.hostedemail.com; dkim=none; spf=pass (imf08.hostedemail.com: domain of ryan.roberts@arm.com designates 217.140.110.172 as permitted sender) smtp.mailfrom=ryan.roberts@arm.com; dmarc=pass (policy=none) header.from=arm.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1689599637; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=POiBVqMa07XN85IUqH9eVE8G0AMaiGCpqsC9LycXc+A=; b=oQNAjAuQiX6xvgjChfZJLLNHvEE6PZWU4BexQ0JRo1geS9wjA58xhJunz3O/rF2FB6YWNK CsS7da6loBMPp/O2V1MRWYyxjSUi8oNpM3gaGAYXveuTVpIueodLY4M5Jnhc+PDawzyWQN H3xRVUcNYL99P16+UuGmq2q9nrgRXl0= Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 481F9C15; Mon, 17 Jul 2023 06:14:39 -0700 (PDT) Received: from [10.57.76.30] (unknown [10.57.76.30]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id A63E23F67D; Mon, 17 Jul 2023 06:13:53 -0700 (PDT) Message-ID: Date: Mon, 17 Jul 2023 14:13:52 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.13.0 Subject: Re: [PATCH v3 1/4] mm: Non-pmd-mappable, large folios for folio_add_new_anon_rmap() To: David Hildenbrand , Andrew Morton , Matthew Wilcox , "Kirill A. Shutemov" , Yin Fengwei , Yu Zhao , Catalin Marinas , Will Deacon , Anshuman Khandual , Yang Shi , "Huang, Ying" , Zi Yan , Luis Chamberlain Cc: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org References: <20230714160407.4142030-1-ryan.roberts@arm.com> <20230714161733.4144503-1-ryan.roberts@arm.com> From: Ryan Roberts In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Rspam-User: X-Rspamd-Server: rspam06 X-Rspamd-Queue-Id: 0D13F160003 X-Stat-Signature: ku5o4acege5yya3a4iyy36tw5u5zxwf5 X-HE-Tag: 1689599636-462682 X-HE-Meta: U2FsdGVkX1/Av4v6DBZX8J8ynPQyOhA8zBPgQAuVZ46aLjiYIkrTJY9wKMubhqeeZaoLv1wZcK4M3qnEsLPSkFHSOhbcQD78kW5O2GlOgnpRXyQhlqwVfH7UUJQ1WpR73ci4GNgsE2TRl0WOmpG2Bu59KW4xrxok/voTY0qEmZnE/lb+suA4w7xE5HsBvqoLJk5wsj85PiQVy1iqWAGdga180EtAZmLcK0HyR4uuSznANO+iPI11XQ1GlKjvVA55iUU+b02jRE4tPoeAfttrJ7D/o/X1QWfivBxiP/kRh4gJ1X1+sKkQ4ED1kcSXtgq/GlK8ZpnlTm8yk6nEdZH+TfDRqtnO5Lm5GArDZTew98vv7sfuiEG2XgL4dJTIEKCBExBaZ4l+hZ98Koyo4dfW/qRkR6zSeQ+tCLw8ovueE4Y/UFCL9HYILX1qqEMEZNOxN0wu1dpu7jmy3KO+glRDpZHyowFb9JM3AU5fTV2WrXK00GSakW/vQ+2Jxk3AW9vYUTklvP/i+bl5Vbp4VW1oRbmhzZTp1ckFWDH6i4yA69xETFV1g+rYrg5lQV1ZxzP8LMQCp4GwMZI/oLUHKeO3gbbFKHHZnLZFf5GVKNnivdtJmabR7rSGfEGTp5y+7iBEliM4Y+qBjqQ4LRe3fqGZ49lK3UAZ1E6L0dYolSy/EYhQSfdO+Spa7VPJrkUpaeTY5DXNCwdXrzo8XqYH2yvPoxSRyu96BgshysLzPqRv0oJq601+ls1qn9XPfBhcfH3FKp2Jbpyxfh2t/ppXVCETKmpMzl9A6R7rElW6YPAP+kaoKOs/YQTwsHkHtE1OAkcT8obtbNaSZwVKApSPdM+joFwsaFsKy6Ubkr71R3bWM4+G9Nj0W9ahW8MEkgHH+w0yqaJHBnR4zvAgBFO5OFyr7e3P3mVRWtq4gprAWCtp22AWJrsMZCQfRHj422wVQYPrfr3901JNLl4tEs8oqAl noQ3+WCr nK9MK6jJGGIX9F0C8AGCMi97Xjp3+DI+A310NGG5MYk5DV8o3r0eV5UqntQtO0OJZLV1pJqczQWAATB95utF1EfFWiMV6aoDXJjGZgRsnFIBqOh1+R7aFKjuYkNnCJs36xzpoj2Ws1Jtz8h1M47DgQVas0PLS2jFhHbyJ+SQ7xqW+puUDrqdf/5E8Vggr8b1J9pJ+5e0xCv2WMLobjYhClTuqaPN12M4LjsBYiOb6lJREu//2/g49GeZN+fIDpTtENHq3kZJNkfpoITo= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On 17/07/2023 14:00, David Hildenbrand wrote: > On 14.07.23 18:17, Ryan Roberts wrote: >> In preparation for FLEXIBLE_THP support, improve >> folio_add_new_anon_rmap() to allow a non-pmd-mappable, large folio to be >> passed to it. In this case, all contained pages are accounted using the >> order-0 folio (or base page) scheme. >> >> Signed-off-by: Ryan Roberts >> Reviewed-by: Yu Zhao >> Reviewed-by: Yin Fengwei >> --- >>   mm/rmap.c | 28 +++++++++++++++++++++------- >>   1 file changed, 21 insertions(+), 7 deletions(-) >> >> diff --git a/mm/rmap.c b/mm/rmap.c >> index 0c0d8857dfce..f293d072368a 100644 >> --- a/mm/rmap.c >> +++ b/mm/rmap.c >> @@ -1278,31 +1278,45 @@ void page_add_anon_rmap(struct page *page, struct >> vm_area_struct *vma, >>    * This means the inc-and-test can be bypassed. >>    * The folio does not have to be locked. >>    * >> - * If the folio is large, it is accounted as a THP.  As the folio >> + * If the folio is pmd-mappable, it is accounted as a THP.  As the folio >>    * is new, it's assumed to be mapped exclusively by a single process. >>    */ >>   void folio_add_new_anon_rmap(struct folio *folio, struct vm_area_struct *vma, >>           unsigned long address) >>   { >> -    int nr; >> +    int nr = folio_nr_pages(folio); >> >> -    VM_BUG_ON_VMA(address < vma->vm_start || address >= vma->vm_end, vma); >> +    VM_BUG_ON_VMA(address < vma->vm_start || >> +            address + (nr << PAGE_SHIFT) > vma->vm_end, vma); >>       __folio_set_swapbacked(folio); >> >> -    if (likely(!folio_test_pmd_mappable(folio))) { >> +    if (!folio_test_large(folio)) { > > Why remove the "likely" here? The patch itself does not change anything about > that condition. Good question; I'm not sure why. Will have to put it down to bad copy/paste fixup. Will put it back in the next version. > >>           /* increment count (starts at -1) */ >>           atomic_set(&folio->_mapcount, 0); >> -        nr = 1; >> +        __page_set_anon_rmap(folio, &folio->page, vma, address, 1); >> +    } else if (!folio_test_pmd_mappable(folio)) { >> +        int i; >> + >> +        for (i = 0; i < nr; i++) { >> +            struct page *page = folio_page(folio, i); >> + >> +            /* increment count (starts at -1) */ >> +            atomic_set(&page->_mapcount, 0); >> +            __page_set_anon_rmap(folio, page, vma, >> +                    address + (i << PAGE_SHIFT), 1); >> +        } >> + >> +        /* increment count (starts at 0) */ > > That comment is a bit misleading. We're not talking about a mapcount as in the > other cases here. Correct, I'm talking about _nr_pages_mapped, which starts 0, not -1 like _mapcount. The comment was intended to be in the style used in other similar places in rmap.c. I could change it to: "_nr_pages_mapped is 0-based, so set it to the number of pages in the folio" or remove it entirely? What do you prefer? > >> +        atomic_set(&folio->_nr_pages_mapped, nr); >>       } else { >>           /* increment count (starts at -1) */ >>           atomic_set(&folio->_entire_mapcount, 0); >>           atomic_set(&folio->_nr_pages_mapped, COMPOUND_MAPPED); >> -        nr = folio_nr_pages(folio); >> +        __page_set_anon_rmap(folio, &folio->page, vma, address, 1); >>           __lruvec_stat_mod_folio(folio, NR_ANON_THPS, nr); >>       } >> > > Apart from that, LGTM. >