From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id D600BC369B1 for ; Wed, 16 Apr 2025 08:52:07 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id AC0366B0202; Wed, 16 Apr 2025 04:52:06 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id A49C46B0204; Wed, 16 Apr 2025 04:52:06 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 875576B0205; Wed, 16 Apr 2025 04:52:06 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0010.hostedemail.com [216.40.44.10]) by kanga.kvack.org (Postfix) with ESMTP id 63E3E6B0202 for ; Wed, 16 Apr 2025 04:52:06 -0400 (EDT) Received: from smtpin18.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay03.hostedemail.com (Postfix) with ESMTP id D3D68AD539 for ; Wed, 16 Apr 2025 08:52:06 +0000 (UTC) X-FDA: 83339289852.18.D2C5E57 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by imf21.hostedemail.com (Postfix) with ESMTP id 50CCD1C0004 for ; Wed, 16 Apr 2025 08:52:04 +0000 (UTC) Authentication-Results: imf21.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=QtH3dt8n; dmarc=pass (policy=quarantine) header.from=redhat.com; spf=pass (imf21.hostedemail.com: domain of david@redhat.com designates 170.10.129.124 as permitted sender) smtp.mailfrom=david@redhat.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1744793524; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=NhVqw4Ar1jXZxr2wztDNhgWh0er+EtWycdOEp3EQ4gM=; b=mOInghVX4lLPvVTyJczCSy+fxs4iry9RYEh9CjkImkkoaFYIxUpy0suqAW/Il9XmsyFKIp VRBI4wSxDnmEDn+v30GuGxgUai14koeZ9OeqyhjPKOrthBsofqTLqic2dm6khkayJWoe0B cgKEpPA5GSBIzbVtmtAKB8JjDDdaBdk= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1744793524; a=rsa-sha256; cv=none; b=Sh5i2QSb3RVEpoYfYqJFGk1SPtq8GzIoxcLd40Wx56ULR00ODU+2sUJFEcxjMYxWc8ejKX VtTql1gqZzBQXmZjlMhGqO66VariDLTonJIIueUScnR2k6QLhha6iPYCL2L3JjnnB5Juc/ hhQ4q7gIrbMpQR6iF9EewBre9v8Ywvc= ARC-Authentication-Results: i=1; imf21.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=QtH3dt8n; dmarc=pass (policy=quarantine) header.from=redhat.com; spf=pass (imf21.hostedemail.com: domain of david@redhat.com designates 170.10.129.124 as permitted sender) smtp.mailfrom=david@redhat.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1744793523; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:autocrypt:autocrypt; bh=NhVqw4Ar1jXZxr2wztDNhgWh0er+EtWycdOEp3EQ4gM=; b=QtH3dt8n33NVUmE4a6QSLaZ4AEgvdGjZW/BiHjc5+ZNVfNVswQoIoCKpl9+61Qt4eWKszf lk28t9opQwJcDtTkmD2RX21iydRLvWBIYSygYAZ4gORWcbeYgVLOhJ6C92eT4MYtIObwwE bwHuH6mLNL2pNdcYmUHw7FPHGz1spLE= Received: from mail-wr1-f72.google.com (mail-wr1-f72.google.com [209.85.221.72]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-453-eL2c0jrxMvOaelG68K84vA-1; Wed, 16 Apr 2025 04:52:01 -0400 X-MC-Unique: eL2c0jrxMvOaelG68K84vA-1 X-Mimecast-MFC-AGG-ID: eL2c0jrxMvOaelG68K84vA_1744793521 Received: by mail-wr1-f72.google.com with SMTP id ffacd0b85a97d-39ee4b91d1cso210332f8f.0 for ; Wed, 16 Apr 2025 01:52:01 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1744793521; x=1745398321; h=content-transfer-encoding:in-reply-to:organization:autocrypt :content-language:from:references:cc:to:subject:user-agent :mime-version:date:message-id:x-gm-message-state:from:to:cc:subject :date:message-id:reply-to; bh=NhVqw4Ar1jXZxr2wztDNhgWh0er+EtWycdOEp3EQ4gM=; b=Pu85V8yoc8Upr+uyV5fmQ9/jjjKWx0+pPJteXp6ui5hQLIrrSHFbWa6V42qycBNW7z NdOtkLeM6RjaSz7TrA0h8gTep8iwF/qwmJSlguBnc7f4xfhoX77LUQEuhMorR8OQ+dyt UQ7xfnLvE0EETdqzHLXRAaii0WYCeKCdpVHWhtVXlY5lo30Wd+l8/QVGV9zXRpu34lBc q8gZhSvnnUhVtB+8eKSuWpTO0ChduwKDPnpJbTER1sN4l6JqKNC2/ZHR+M8/s4VthZkt A7IQmyRFGEivOZy7lxSGyud7eYMXCHsAZb2CHamyOzbHiai/LBv/vj9vamkzMeJdDzkL inig== X-Forwarded-Encrypted: i=1; AJvYcCWC5hgcYXukuDtUoSEt2+w0SYzOIdA30wxZlLB2DsiNvrlBnPJ7NHYYF0jFrRxYfW+NeKW+vQ3l7A==@kvack.org X-Gm-Message-State: AOJu0Yw8NEu+UyS1vfTynhgGH07MMQOtcJBNwKPTUYXrlEx6vg7xjoHX WL66eEV0zib9NQOOmp1sqlng794eM/0qwfiFp8Ff15lsIFWEbo98Sex3MaOsw8pyQblV+1hiTUl GxJATEXPXqtZI1bk4xBZ4TL36aPAKb2NHIj9FQ2GrzdPVFSRA X-Gm-Gg: ASbGnctwN2ryC//Qs+TLvPQoiB7B7aL81R3ei2UhonmVj/OzsAk6ESBa2OGlrJabjfb XcuKyaFv3CDYVQLFo1Ke6cz/COph89ESn158Vu6QW5Gs7c2S1nQ/AsktPNtFGMkpbfQFsBLTHh5 C9o/hceTaiR6eOopFmvfKC1SqH8GQbXw+6Izh/3KW4PS+XI3lCqJe0+nr1KkrjvQk/lCr3+qqjW 6S1K+bzhaSNGLtRGkBkr8Hc/Dtjdy1Lv2vQvJCAvqE9CotcAxvOnpDvKdKAZVnsfPB9HedyBLso 2Ta0Gsx1wJGlnIyjrl4IRBIHuFMEEP4ZWZJc7tsFMWkNkc+O1vxviORaMgja3/ZMf3LgZnXzCcD CjKHbSVIY2CqKJYG3aYojkF2AVgbcVzswZHVAJg== X-Received: by 2002:a05:6000:184c:b0:391:3207:2e68 with SMTP id ffacd0b85a97d-39ee5e9b09amr962416f8f.9.1744793520731; Wed, 16 Apr 2025 01:52:00 -0700 (PDT) X-Google-Smtp-Source: AGHT+IE5qajaPEsTgVC5aViq9Iz00BX9NTLaVHSwJmqi5hZECXRe/JgmtMk48u1duqUBA1TBUwBPOQ== X-Received: by 2002:a05:6000:184c:b0:391:3207:2e68 with SMTP id ffacd0b85a97d-39ee5e9b09amr962397f8f.9.1744793520320; Wed, 16 Apr 2025 01:52:00 -0700 (PDT) Received: from ?IPV6:2003:d8:2f02:2900:f54f:bad7:c5f4:9404? (p200300d82f022900f54fbad7c5f49404.dip0.t-ipconnect.de. [2003:d8:2f02:2900:f54f:bad7:c5f4:9404]) by smtp.gmail.com with ESMTPSA id 5b1f17b1804b1-4405b4f39b1sm14556675e9.22.2025.04.16.01.51.59 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 16 Apr 2025 01:51:59 -0700 (PDT) Message-ID: <8b387a53-40e0-40d1-8bfa-b7524657a7dd@redhat.com> Date: Wed, 16 Apr 2025 10:51:58 +0200 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v3] mempolicy: Optimize queue_folios_pte_range by PTE batching To: Baolin Wang , Dev Jain , akpm@linux-foundation.org Cc: ryan.roberts@arm.com, willy@infradead.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, hughd@google.com, vishal.moola@gmail.com, yang@os.amperecomputing.com, ziy@nvidia.com References: <20250416053048.96479-1-dev.jain@arm.com> <7f96283b-11b3-49ee-9d2d-5ad977325cb0@linux.alibaba.com> <019d1c4a-ffd0-4602-b2ba-cf07379dab17@redhat.com> <7392a21b-10bf-4ce9-a6fd-807ed954c138@linux.alibaba.com> From: David Hildenbrand Autocrypt: addr=david@redhat.com; keydata= xsFNBFXLn5EBEAC+zYvAFJxCBY9Tr1xZgcESmxVNI/0ffzE/ZQOiHJl6mGkmA1R7/uUpiCjJ dBrn+lhhOYjjNefFQou6478faXE6o2AhmebqT4KiQoUQFV4R7y1KMEKoSyy8hQaK1umALTdL QZLQMzNE74ap+GDK0wnacPQFpcG1AE9RMq3aeErY5tujekBS32jfC/7AnH7I0v1v1TbbK3Gp XNeiN4QroO+5qaSr0ID2sz5jtBLRb15RMre27E1ImpaIv2Jw8NJgW0k/D1RyKCwaTsgRdwuK Kx/Y91XuSBdz0uOyU/S8kM1+ag0wvsGlpBVxRR/xw/E8M7TEwuCZQArqqTCmkG6HGcXFT0V9 PXFNNgV5jXMQRwU0O/ztJIQqsE5LsUomE//bLwzj9IVsaQpKDqW6TAPjcdBDPLHvriq7kGjt WhVhdl0qEYB8lkBEU7V2Yb+SYhmhpDrti9Fq1EsmhiHSkxJcGREoMK/63r9WLZYI3+4W2rAc UucZa4OT27U5ZISjNg3Ev0rxU5UH2/pT4wJCfxwocmqaRr6UYmrtZmND89X0KigoFD/XSeVv jwBRNjPAubK9/k5NoRrYqztM9W6sJqrH8+UWZ1Idd/DdmogJh0gNC0+N42Za9yBRURfIdKSb B3JfpUqcWwE7vUaYrHG1nw54pLUoPG6sAA7Mehl3nd4pZUALHwARAQABzSREYXZpZCBIaWxk ZW5icmFuZCA8ZGF2aWRAcmVkaGF0LmNvbT7CwZgEEwEIAEICGwMGCwkIBwMCBhUIAgkKCwQW AgMBAh4BAheAAhkBFiEEG9nKrXNcTDpGDfzKTd4Q9wD/g1oFAl8Ox4kFCRKpKXgACgkQTd4Q 9wD/g1oHcA//a6Tj7SBNjFNM1iNhWUo1lxAja0lpSodSnB2g4FCZ4R61SBR4l/psBL73xktp rDHrx4aSpwkRP6Epu6mLvhlfjmkRG4OynJ5HG1gfv7RJJfnUdUM1z5kdS8JBrOhMJS2c/gPf wv1TGRq2XdMPnfY2o0CxRqpcLkx4vBODvJGl2mQyJF/gPepdDfcT8/PY9BJ7FL6Hrq1gnAo4 3Iv9qV0JiT2wmZciNyYQhmA1V6dyTRiQ4YAc31zOo2IM+xisPzeSHgw3ONY/XhYvfZ9r7W1l pNQdc2G+o4Di9NPFHQQhDw3YTRR1opJaTlRDzxYxzU6ZnUUBghxt9cwUWTpfCktkMZiPSDGd KgQBjnweV2jw9UOTxjb4LXqDjmSNkjDdQUOU69jGMUXgihvo4zhYcMX8F5gWdRtMR7DzW/YE BgVcyxNkMIXoY1aYj6npHYiNQesQlqjU6azjbH70/SXKM5tNRplgW8TNprMDuntdvV9wNkFs 9TyM02V5aWxFfI42+aivc4KEw69SE9KXwC7FSf5wXzuTot97N9Phj/Z3+jx443jo2NR34XgF 89cct7wJMjOF7bBefo0fPPZQuIma0Zym71cP61OP/i11ahNye6HGKfxGCOcs5wW9kRQEk8P9 M/k2wt3mt/fCQnuP/mWutNPt95w9wSsUyATLmtNrwccz63XOwU0EVcufkQEQAOfX3n0g0fZz Bgm/S2zF/kxQKCEKP8ID+Vz8sy2GpDvveBq4H2Y34XWsT1zLJdvqPI4af4ZSMxuerWjXbVWb T6d4odQIG0fKx4F8NccDqbgHeZRNajXeeJ3R7gAzvWvQNLz4piHrO/B4tf8svmRBL0ZB5P5A 2uhdwLU3NZuK22zpNn4is87BPWF8HhY0L5fafgDMOqnf4guJVJPYNPhUFzXUbPqOKOkL8ojk CXxkOFHAbjstSK5Ca3fKquY3rdX3DNo+EL7FvAiw1mUtS+5GeYE+RMnDCsVFm/C7kY8c2d0G NWkB9pJM5+mnIoFNxy7YBcldYATVeOHoY4LyaUWNnAvFYWp08dHWfZo9WCiJMuTfgtH9tc75 7QanMVdPt6fDK8UUXIBLQ2TWr/sQKE9xtFuEmoQGlE1l6bGaDnnMLcYu+Asp3kDT0w4zYGsx 5r6XQVRH4+5N6eHZiaeYtFOujp5n+pjBaQK7wUUjDilPQ5QMzIuCL4YjVoylWiBNknvQWBXS lQCWmavOT9sttGQXdPCC5ynI+1ymZC1ORZKANLnRAb0NH/UCzcsstw2TAkFnMEbo9Zu9w7Kv AxBQXWeXhJI9XQssfrf4Gusdqx8nPEpfOqCtbbwJMATbHyqLt7/oz/5deGuwxgb65pWIzufa N7eop7uh+6bezi+rugUI+w6DABEBAAHCwXwEGAEIACYCGwwWIQQb2cqtc1xMOkYN/MpN3hD3 AP+DWgUCXw7HsgUJEqkpoQAKCRBN3hD3AP+DWrrpD/4qS3dyVRxDcDHIlmguXjC1Q5tZTwNB boaBTPHSy/Nksu0eY7x6HfQJ3xajVH32Ms6t1trDQmPx2iP5+7iDsb7OKAb5eOS8h+BEBDeq 3ecsQDv0fFJOA9ag5O3LLNk+3x3q7e0uo06XMaY7UHS341ozXUUI7wC7iKfoUTv03iO9El5f XpNMx/YrIMduZ2+nd9Di7o5+KIwlb2mAB9sTNHdMrXesX8eBL6T9b+MZJk+mZuPxKNVfEQMQ a5SxUEADIPQTPNvBewdeI80yeOCrN+Zzwy/Mrx9EPeu59Y5vSJOx/z6OUImD/GhX7Xvkt3kq Er5KTrJz3++B6SH9pum9PuoE/k+nntJkNMmQpR4MCBaV/J9gIOPGodDKnjdng+mXliF3Ptu6 3oxc2RCyGzTlxyMwuc2U5Q7KtUNTdDe8T0uE+9b8BLMVQDDfJjqY0VVqSUwImzTDLX9S4g/8 kC4HRcclk8hpyhY2jKGluZO0awwTIMgVEzmTyBphDg/Gx7dZU1Xf8HFuE+UZ5UDHDTnwgv7E th6RC9+WrhDNspZ9fJjKWRbveQgUFCpe1sa77LAw+XFrKmBHXp9ZVIe90RMe2tRL06BGiRZr jPrnvUsUUsjRoRNJjKKA/REq+sAnhkNPPZ/NNMjaZ5b8Tovi8C0tmxiCHaQYqj7G2rgnT0kt WNyWQQ== Organization: Red Hat In-Reply-To: <7392a21b-10bf-4ce9-a6fd-807ed954c138@linux.alibaba.com> X-Mimecast-Spam-Score: 0 X-Mimecast-MFC-PROC-ID: bh56dEmaZjQGLeCj9oA3mlq1lokB9BzQ5W7msIEF4FI_1744793521 X-Mimecast-Originator: redhat.com Content-Language: en-US Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: 50CCD1C0004 X-Rspam-User: X-Stat-Signature: er4514m6444ufpm5sf3b873d6hbqebqm X-HE-Tag: 1744793524-874112 X-HE-Meta: U2FsdGVkX19ndKBwmH8eeXX+NgUx8mIcsoronan6Jqt1n+V9ft6wOOvwkcOszFNBMAgMHpgxKSiGFWzCAeOSgbOXx0pC6bI9D/EjLtros+2iFiihZZbEkUsDC70A5Uk2reZ6NAW4dssCerOqLAtxIYcfWp70SOaKRfjLgVVeYa5fbVwpunnTwJLWBH8OkD5HRHOrHO2D46TOgaoh0EmpunVw84mD+c+bNT9jPFIX7+cYtgFpt8fVjIQrj1gCY83hvW6kTz7O+EGdYchOGFk4c+R6hiWHZv4qKE/DcXL7GmfzPBG9WD6VterbBUsEkZeGP/4YwUwjovAJZkRycfn2snyBQ0l8uTauD4d544G54SEP4/LXDceaunumgn7Jc1m+3t7XqnMjZDXfByicdiAfw9+W5hLZj/IlrRU+paCZEUiie22qipTXRtz9rYg5h3CGsvzDM8HXZhnSvl40lfCgkBD++Is5kREjP7539Cr5xwJUhNro7Sg8Ifk5Ulb/fuCPO67qIIZqWAdDEMTgZRdPgxo5DiXH+2lz8MzhOkwjo1r8HlJ5cfJfcACKp5eeYuPmKWxBHpGPZkVg0l0diZEWDHMyLqhFtMdzWjUmFXkAF7afBSUblT1sAqlb93qQOYkOzbraLbZHCYCf+LEJCLyFOloWEoOVJ/KE9Qoli/ahJQuoOvDCMixzWiX9H0JnFVpCVmCO3RQjckbpHhXYTr2m8C5e8vCRzui5VMJfaTiyYQM85ioLJGgC2MeKwMjaX/a5wYhDyKJWfxi0TJWj4e4jlB2tubQ4XpuaW2YZsCYbwd8sq9x3lCYhDs/39LK3BdHAIQDnhEh/DjKkP/renR2kUK5UzNgdtcYMhwwCoUylTSOJ3yWJ/gwqWIWrUKbNQKmFhaK1TxAuTlqqMBS5xfMAmSWe1TzvTFAs6Ypz9b8aU5twZ6ScsKHXzEdK2HnepRqDd87ef5wRhD/HWyG2ZsS WDLNyKNI WVqx8T0I1hOxRDx+be2poDglVOYBHjzstiV91Xuy93rT7dwj1ZB57iCLBdSGXFQq1tBamxTyHxii5z1ZXGtZSAGkH03SbbwEGwC3orwLkwyoWfl2UUNBMkIxnoEdnavp5l2f+xcWUL3kQxregY870eOqEsWsXp2Dh2yXH54TfXDvNfbBJiyiU1Rj9kZOa+cDdiPFHr7ojGZOqhuTWO0bOeXsTWqn1ovaueN62dUDUL1vzExTQwp7NAa7zFenRZebYoT1kcTYb+PB/Prm/HAOSE8b37kw4o2OQLSfpIE+yuNG/hpVFHv0rI7qFVqk8OKDrsfXbccY0YCGiIjnjCexd3wkgBHYcVsXHWKWCAVWOeJJWEu2mLwS6deex3/ce6zxKqEmtQwjwXfAmmU4es4HHaSvaqoJFuyHfnfjwoIa5u0gHu7YNzA8KzKGhz6adx9ldcOxNWjKZYnStn+Xo0gx+vHQC/32Zf06Hb3cBwySVYeEAx04KgUxaB0uQBG162xUXuzc8nbrld0y2gwwkBaGOJFtNYBL8tTjzFEpwNdPi8fFUMamCrLmBrfnfxLNMRrQMeZcp88lQFMwpyxKGV9Jk/udTdS6568jStozUoKNyH09GWRc59l3TjJcijg== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 16.04.25 10:41, Baolin Wang wrote: > > > On 2025/4/16 16:22, David Hildenbrand wrote: >> On 16.04.25 08:32, Baolin Wang wrote: >>> >>> >>> On 2025/4/16 13:30, Dev Jain wrote: >>>> After the check for queue_folio_required(), the code only cares about >>>> the >>>> folio in the for loop, i.e the PTEs are redundant. Therefore, optimize >>>> this loop by skipping over a PTE batch mapping the same folio. >>>> >>>> With a test program migrating pages of the calling process, which >>>> includes >>>> a mapped VMA of size 4GB with pte-mapped large folios of order-9, and >>>> migrating once back and forth node-0 and node-1, the average execution >>>> time reduces from 7.5 to 4 seconds, giving an approx 47% speedup. >>>> >>>> v2->v3: >>>>    - Don't use assignment in if condition >>>> >>>> v1->v2: >>>>    - Follow reverse xmas tree declarations >>>>    - Don't initialize nr >>>>    - Move folio_pte_batch() immediately after retrieving a normal folio >>>>    - increment nr_failed in one shot >>>> >>>> Acked-by: David Hildenbrand >>>> Signed-off-by: Dev Jain >>>> --- >>>>    mm/mempolicy.c | 12 ++++++++++-- >>>>    1 file changed, 10 insertions(+), 2 deletions(-) >>>> >>>> diff --git a/mm/mempolicy.c b/mm/mempolicy.c >>>> index b28a1e6ae096..4d2dc8b63965 100644 >>>> --- a/mm/mempolicy.c >>>> +++ b/mm/mempolicy.c >>>> @@ -566,6 +566,7 @@ static void queue_folios_pmd(pmd_t *pmd, struct >>>> mm_walk *walk) >>>>    static int queue_folios_pte_range(pmd_t *pmd, unsigned long addr, >>>>                unsigned long end, struct mm_walk *walk) >>>>    { >>>> +    const fpb_t fpb_flags = FPB_IGNORE_DIRTY | FPB_IGNORE_SOFT_DIRTY; >>>>        struct vm_area_struct *vma = walk->vma; >>>>        struct folio *folio; >>>>        struct queue_pages *qp = walk->private; >>>> @@ -573,6 +574,7 @@ static int queue_folios_pte_range(pmd_t *pmd, >>>> unsigned long addr, >>>>        pte_t *pte, *mapped_pte; >>>>        pte_t ptent; >>>>        spinlock_t *ptl; >>>> +    int max_nr, nr; >>>>        ptl = pmd_trans_huge_lock(pmd, vma); >>>>        if (ptl) { >>>> @@ -586,7 +588,9 @@ static int queue_folios_pte_range(pmd_t *pmd, >>>> unsigned long addr, >>>>            walk->action = ACTION_AGAIN; >>>>            return 0; >>>>        } >>>> -    for (; addr != end; pte++, addr += PAGE_SIZE) { >>>> +    for (; addr != end; pte += nr, addr += nr * PAGE_SIZE) { >>>> +        max_nr = (end - addr) >> PAGE_SHIFT; >>>> +        nr = 1; >>>>            ptent = ptep_get(pte); >>>>            if (pte_none(ptent)) >>>>                continue; >>>> @@ -598,6 +602,10 @@ static int queue_folios_pte_range(pmd_t *pmd, >>>> unsigned long addr, >>>>            folio = vm_normal_folio(vma, addr, ptent); >>>>            if (!folio || folio_is_zone_device(folio)) >>>>                continue; >>>> +        if (folio_test_large(folio) && max_nr != 1) >>>> +            nr = folio_pte_batch(folio, addr, pte, ptent, >>>> +                         max_nr, fpb_flags, >>>> +                         NULL, NULL, NULL); >>>>            /* >>>>             * vm_normal_folio() filters out zero pages, but there might >>>>             * still be reserved folios to skip, perhaps in a VDSO. >>>> @@ -630,7 +638,7 @@ static int queue_folios_pte_range(pmd_t *pmd, >>>> unsigned long addr, >>>>            if (!(flags & (MPOL_MF_MOVE | MPOL_MF_MOVE_ALL)) || >>>>                !vma_migratable(vma) || >>>>                !migrate_folio_add(folio, qp->pagelist, flags)) { >>>> -            qp->nr_failed++; >>>> +            qp->nr_failed += nr; >>> >>> Sorry for chiming in late, but I am not convinced that 'qp->nr_failed' >>> should add 'nr' when isolation fails. >> >> This patch does not change the existing behavior. But I stumbled over >> that as well ... and scratched my head. >> >>> >>>   From the comments of queue_pages_range(): >>> " >>> * >0 - this number of misplaced folios could not be queued for moving >>>    *      (a hugetlbfs page or a transparent huge page being counted >>> as 1). >>> " >>> >>> That means if a large folio is failed to isolate, we should only add '1' >>> for qp->nr_failed instead of the number of pages in this large folio. >>> Right? >> >> I think what the doc really meant is "PMD-mapped THP". PTE-mapped THPs >> always had the same behavior: per PTE of the THP we would increment >> nr_failed by 1. > > No? For pte-mapped THPs, it only adds 1 for the large folio, since we > have below check in queue_folios_pte_range(). > > if (folio == qp->large) > continue; > > Or I missed anything else? Ah, I got confused by that and thought it would only be for LRU isolation purposes. Yeah, it will kind-of work for now and I think you are correct that we would only increment nr_failed by 1. I still think that counting nr_failed that way is dubious. We should be counting pages, which is something that user space from migrate_pages() could understand. Having it count arbitrary THPs/large folio sizes is really questionable. But that is indeed a separate issue to resolve. -- Cheers, David / dhildenb