Date: Tue, 18 Jul 2023 10:51:04 +0100
Subject: Re: [PATCH v1 2/3] mm: Implement folio_remove_rmap_range()
To: "Huang, Ying"
Cc: Andrew Morton, Matthew Wilcox, Yin Fengwei, David Hildenbrand, Yu Zhao, Yang Shi, Zi Yan, linux-kernel@vger.kernel.org, linux-mm@kvack.org
References: <20230717143110.260162-1-ryan.roberts@arm.com> <20230717143110.260162-3-ryan.roberts@arm.com> <874jm1d9ic.fsf@yhuang6-desk2.ccr.corp.intel.com>
From: Ryan Roberts
In-Reply-To: <874jm1d9ic.fsf@yhuang6-desk2.ccr.corp.intel.com>

On 18/07/2023 07:22, Huang, Ying wrote:
> Ryan Roberts writes:
>
>> Like page_remove_rmap() but batch-removes the rmap for a range of pages
>> belonging to a folio. This can provide a small speedup due to less
>> manipulation of the various counters. But more crucially, if removing the
>> rmap for all pages of a folio in a batch, there is no need to
>> (spuriously) add it to the deferred split list, which saves significant
>> cost when there is contention for the split queue lock.
>>
>> All contained pages are accounted using the order-0 folio (or base page)
>> scheme.
>>
>> Signed-off-by: Ryan Roberts
>> ---
>>  include/linux/rmap.h |  2 ++
>>  mm/rmap.c            | 65 ++++++++++++++++++++++++++++++++++++++++++++
>>  2 files changed, 67 insertions(+)
>>
>> diff --git a/include/linux/rmap.h b/include/linux/rmap.h
>> index b87d01660412..f578975c12c0 100644
>> --- a/include/linux/rmap.h
>> +++ b/include/linux/rmap.h
>> @@ -200,6 +200,8 @@ void page_add_file_rmap(struct page *, struct vm_area_struct *,
>>  		bool compound);
>>  void page_remove_rmap(struct page *, struct vm_area_struct *,
>>  		bool compound);
>> +void folio_remove_rmap_range(struct folio *folio, struct page *page,
>> +		int nr, struct vm_area_struct *vma);
>>
>>  void hugepage_add_anon_rmap(struct page *, struct vm_area_struct *,
>>  		unsigned long address, rmap_t flags);
>> diff --git a/mm/rmap.c b/mm/rmap.c
>> index 2baf57d65c23..1da05aca2bb1 100644
>> --- a/mm/rmap.c
>> +++ b/mm/rmap.c
>> @@ -1359,6 +1359,71 @@ void page_add_file_rmap(struct page *page, struct vm_area_struct *vma,
>>  	mlock_vma_folio(folio, vma, compound);
>>  }
>>
>> +/*
>> + * folio_remove_rmap_range - take down pte mappings from a range of pages
>> + * belonging to a folio. All pages are accounted as small pages.
>> + * @folio:	folio that all pages belong to
>> + * @page:	first page in range to remove mapping from
>> + * @nr:	number of pages in range to remove mapping from
>> + * @vma:	the vm area from which the mapping is removed
>> + *
>> + * The caller needs to hold the pte lock.
>> + */
>> +void folio_remove_rmap_range(struct folio *folio, struct page *page,
>> +		int nr, struct vm_area_struct *vma)
>> +{
>> +	atomic_t *mapped = &folio->_nr_pages_mapped;
>> +	int nr_unmapped = 0;
>> +	int nr_mapped;
>> +	bool last;
>> +	enum node_stat_item idx;
>> +
>> +	if (unlikely(folio_test_hugetlb(folio))) {
>> +		VM_WARN_ON_FOLIO(1, folio);
>> +		return;
>> +	}
>> +
>> +	if (!folio_test_large(folio)) {
>> +		/* Is this the page's last map to be removed? */
>> +		last = atomic_add_negative(-1, &page->_mapcount);
>> +		nr_unmapped = last;
>> +	} else {
>> +		for (; nr != 0; nr--, page++) {
>> +			/* Is this the page's last map to be removed? */
>> +			last = atomic_add_negative(-1, &page->_mapcount);
>> +			if (last) {
>> +				/* Page still mapped if folio mapped entirely */
>> +				nr_mapped = atomic_dec_return_relaxed(mapped);
>> +				if (nr_mapped < COMPOUND_MAPPED)
>> +					nr_unmapped++;
>> +			}
>> +		}
>> +	}
>> +
>> +	if (nr_unmapped) {
>> +		idx = folio_test_anon(folio) ? NR_ANON_MAPPED : NR_FILE_MAPPED;
>> +		__lruvec_stat_mod_folio(folio, idx, -nr_unmapped);
>> +
>> +		/*
>> +		 * Queue anon THP for deferred split if we have just unmapped at
>
> Just some nitpicks.  So feel free to ignore.
>
> s/anon THP/large folio/ ?

ACK

>
>> +		 * least 1 page, while at least 1 page remains mapped.
>> +		 */
>> +		if (folio_test_large(folio) && folio_test_anon(folio))
>> +			if (nr_mapped)
>
> 		if (folio_test_large(folio) && folio_test_anon(folio) && nr_mapped) ?

ACK: I'll make these changes for the next version.

>
>> +				deferred_split_folio(folio);
>> +	}
>> +
>> +	/*
>> +	 * It would be tidy to reset folio_test_anon mapping when fully
>> +	 * unmapped, but that might overwrite a racing page_add_anon_rmap
>> +	 * which increments mapcount after us but sets mapping before us:
>> +	 * so leave the reset to free_pages_prepare, and remember that
>> +	 * it's only reliable while mapped.
>> +	 */
>> +
>> +	munlock_vma_folio(folio, vma, false);
>> +}
>> +
>>  /**
>>   * page_remove_rmap - take down pte mapping from a page
>>   * @page:	page to remove mapping from
>
> Best Regards,
> Huang, Ying