From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2C96BC433F5 for ; Mon, 9 May 2022 15:43:31 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 500F96B0072; Mon, 9 May 2022 11:43:30 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 489F86B0073; Mon, 9 May 2022 11:43:30 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 2DE4B6B0074; Mon, 9 May 2022 11:43:30 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id 1745D6B0072 for ; Mon, 9 May 2022 11:43:30 -0400 (EDT) Received: from smtpin08.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay11.hostedemail.com (Postfix) with ESMTP id DE29581013 for ; Mon, 9 May 2022 15:43:29 +0000 (UTC) X-FDA: 79446624138.08.8A46921 Received: from smtp-out1.suse.de (smtp-out1.suse.de [195.135.220.28]) by imf12.hostedemail.com (Postfix) with ESMTP id C7DFD40087 for ; Mon, 9 May 2022 15:43:08 +0000 (UTC) Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id 5125D21C54; Mon, 9 May 2022 15:43:28 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_rsa; t=1652111008; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=G8T6pl1iznxs13Ey9cez14qQxbbfQ6jd4EKhTAHfONQ=; b=OIX4wvitWGHcNBQ8JmpcU3rPsZU1gR6+LHcNOwZdDXt/ctTFk3ovA/alCFpxrJgZHkU0jO j8H8ca2us26w1OtLpELOr66VCMVrsGfD9mSVAmr9PH2eHNxXsyMiymy4TEEYmN0veUZ4rO jmPneqJd3q3wmPSpiG3aBLWFqPbxo1E= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_ed25519; t=1652111008; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=G8T6pl1iznxs13Ey9cez14qQxbbfQ6jd4EKhTAHfONQ=; b=jK3ZXCl1D7XaTIz1+9uo9WVUNqIOJAFF6WB0oqleV2JaHMus92O7168s829hFh2XwbYV9M 8GbAzUyq/BtpAbBw== Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by imap2.suse-dmz.suse.de (Postfix) with ESMTPS id 245CD13AA5; Mon, 9 May 2022 15:43:28 +0000 (UTC) Received: from dovecot-director2.suse.de ([192.168.254.65]) by imap2.suse-dmz.suse.de with ESMTPSA id b11GCKA2eWJIIgAAMHmgww (envelope-from ); Mon, 09 May 2022 15:43:28 +0000 Message-ID: <0d8f368a-54bb-b4e7-931f-9bf3ae24af4c@suse.cz> Date: Mon, 9 May 2022 17:43:27 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.8.1 Subject: Re: [v3 PATCH 8/8] mm: mmap: register suitable readonly file vmas for khugepaged Content-Language: en-US To: Yang Shi , kirill.shutemov@linux.intel.com, linmiaohe@huawei.com, songliubraving@fb.com, riel@surriel.com, willy@infradead.org, ziy@nvidia.com, tytso@mit.edu, akpm@linux-foundation.org Cc: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org References: <20220404200250.321455-1-shy828301@gmail.com> <20220404200250.321455-9-shy828301@gmail.com> From: Vlastimil Babka In-Reply-To: <20220404200250.321455-9-shy828301@gmail.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: C7DFD40087 X-Stat-Signature: 9zc7ee3wcuypo93g11kc8su8sgxk6yc8 Authentication-Results: imf12.hostedemail.com; dkim=pass header.d=suse.cz header.s=susede2_rsa header.b=OIX4wvit; dkim=pass header.d=suse.cz header.s=susede2_ed25519 header.b=jK3ZXCl1; dmarc=none; spf=pass (imf12.hostedemail.com: domain of vbabka@suse.cz designates 195.135.220.28 as permitted sender) smtp.mailfrom=vbabka@suse.cz X-Rspam-User: X-Rspamd-Server: rspam08 X-HE-Tag: 1652110988-228827 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On 4/4/22 22:02, Yang Shi wrote: > The readonly FS THP relies on khugepaged to collapse THP for suitable > vmas. But it is kind of "random luck" for khugepaged to see the > readonly FS vmas (https://lore.kernel.org/linux-mm/00f195d4-d039-3cf2-d3a1-a2c88de397a0@suse.cz/) > since currently the vmas are registered to khugepaged when: > - Anon huge pmd page fault > - VMA merge > - MADV_HUGEPAGE > - Shmem mmap > > If the above conditions are not met, even though khugepaged is enabled > it won't see readonly FS vmas at all. MADV_HUGEPAGE could be specified > explicitly to tell khugepaged to collapse this area, but when khugepaged > mode is "always" it should scan suitable vmas as long as VM_NOHUGEPAGE > is not set. > > So make sure readonly FS vmas are registered to khugepaged to make the > behavior more consistent. > > Registering suitable vmas in common mmap path, that could cover both > readonly FS vmas and shmem vmas, so removed the khugepaged calls in > shmem.c. > > Still need to keep the khugepaged call in vma_merge() since vma_merge() > is called in a lot of places, for example, madvise, mprotect, etc. > > Reported-by: Vlastimil Babka > Signed-off-by: Yang Shi Acked-by: Vlastimil Babka > --- > mm/mmap.c | 6 ++++++ > mm/shmem.c | 4 ---- > 2 files changed, 6 insertions(+), 4 deletions(-) > > diff --git a/mm/mmap.c b/mm/mmap.c > index 604c8dece5dd..616ebbc2d052 100644 > --- a/mm/mmap.c > +++ b/mm/mmap.c > @@ -1842,6 +1842,12 @@ unsigned long mmap_region(struct file *file, unsigned long addr, > } > > vma_link(mm, vma, prev, rb_link, rb_parent); > + > + /* > + * vma_merge() calls khugepaged_enter_vma() either, the below > + * call covers the non-merge case. > + */ > + khugepaged_enter_vma(vma, vma->vm_flags); > /* Once vma denies write, undo our temporary denial count */ > unmap_writable: > if (file && vm_flags & VM_SHARED) > diff --git a/mm/shmem.c b/mm/shmem.c > index 92eca974771d..0c448080d210 100644 > --- a/mm/shmem.c > +++ b/mm/shmem.c > @@ -34,7 +34,6 @@ > #include > #include > #include > -#include > #include > #include > #include > @@ -2239,7 +2238,6 @@ static int shmem_mmap(struct file *file, struct vm_area_struct *vma) > > file_accessed(file); > vma->vm_ops = &shmem_vm_ops; > - khugepaged_enter_vma(vma, vma->vm_flags); > return 0; > } > > @@ -4132,8 +4130,6 @@ int shmem_zero_setup(struct vm_area_struct *vma) > vma->vm_file = file; > vma->vm_ops = &shmem_vm_ops; > > - khugepaged_enter_vma(vma, vma->vm_flags); > - > return 0; > } >