Message-ID: <648aa9dc-fc42-4f28-af9a-b24adfdcd43d@intel.com>
Date: Fri, 10 Nov 2023 09:57:13 +0800
From: "Yin, Fengwei"
To: Yang Shi, "zhangpeng (AS)"
CC: Matthew Wilcox, Nanyong Sun, Kefeng Wang
Subject: Re: [Question]: major faults are still triggered after mlockall when numa balancing
References: <9e62fd9a-bee0-52bf-50a7-498fa17434ee@huawei.com>

On 11/10/2023 6:54 AM, Yang Shi wrote:
> On Thu, Nov 9, 2023 at 5:48 AM zhangpeng (AS) wrote:
>>
>> Hi everyone,
>>
>> There is a performance issue that has been bothering us recently.
>> This problem can reproduce in the latest mainline version (Linux 6.6).
>>
>> We use mlockall(MCL_CURRENT | MCL_FUTURE) in the user mode process
>> to avoid performance problems caused by major fault.
>>
>> There is a stage in numa fault which will set pte as 0 in do_numa_page() :
>> ptep_modify_prot_start() will clear the vmf->pte, until
>> ptep_modify_prot_commit() assign a value to the vmf->pte.
>>
>> For the data segment of the user-mode program, the global variable area
>> is a private mapping. After the pagecache is loaded, the private
>> anonymous page is generated after the COW is triggered. Mlockall can
>> lock COW pages (anonymous pages), but the original file pages cannot
>> be locked and may be reclaimed. If the global variable (private anon page)
>> is accessed when vmf->pte is zero which is concurrently set by numa fault,
>> a file page fault will be triggered.
>>
>> At this time, the original private file page may have been reclaimed.
>> If the page cache is not available at this time, a major fault will be
>> triggered and the file will be read, causing additional overhead.
>>
>> Our problem scenario is as follows:
>>
>> task 1                      task 2
>> ------                      ------
>> /* scan global variables */
>> do_numa_page()
>>   spin_lock(vmf->ptl)
>>   ptep_modify_prot_start()
>>   /* set vmf->pte as null */
>>                             /* Access global variables */
>>                             handle_pte_fault()
>>                               /* no pte lock */
>>                               do_pte_missing()
>>                                 do_fault()
>>                                   do_read_fault()
>>   ptep_modify_prot_commit()
>>   /* ptep update done */
>>   pte_unmap_unlock(vmf->pte, vmf->ptl)
>>                                     do_fault_around()
>>                                     __do_fault()
>>                                       filemap_fault()
>>                                         /* page cache is not available
>>                                         and a major fault is triggered */
>>                                         do_sync_mmap_readahead()
>>                                         /* page_not_uptodate and goto
>>                                         out_retry. */
>>
>> Is there any way to avoid such a major fault?
>
> IMHO I don't think it is a bug. The man page quoted by Willy says "All
> mapped pages are guaranteed to be resident in RAM when the call
> returns successfully", but the later COW already made the file page
> unmapped, right? The PTE pointed to the COW'ed anon page.
> Hypothetically if we kept the file page mlocked and unmapped,
> munlock() would have not munlocked the file page at all, it would be
> mlocked in memory forever.

But in this case the COW page itself is mlocked. There is a small
window in do_numa_page() during which the PTE is set to null. A data
segment access that lands in this window (an access to the COW page,
which has nothing to do with the original page cache) triggers
filemap_fault(), which faults the original page cache page back in.

I had thought about adding a second check in do_read_fault() for
whether vmf->pte is NULL, but that is not reliable enough. Matthew's
idea of using protnone to block both hardware access and
do_pte_missing() looks more promising to me.

Regards
Yin, Fengwei

>
>>
>> --
>> Best Regards,
>> Peng
>>
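
To make the window a bit more concrete, here is a rough userspace model of
how I understand the handler selection in handle_pte_fault(): an empty entry
goes to do_pte_missing() (and from there possibly to filemap_fault()), while
a protnone entry goes to do_numa_page(). Every toy_* name below is made up
for illustration. This is not kernel code, and it deliberately ignores swap
entries, the VMA accessibility check and the PTE lock recheck.

```c
#include <assert.h>
#include <stdio.h>

/* Toy PTE states; illustrative names only, not kernel types. */
enum toy_pte_state {
	TOY_PTE_NONE,     /* cleared entry, what a racing access sees today */
	TOY_PTE_PROTNONE, /* entry kept in place but made inaccessible */
	TOY_PTE_PRESENT,  /* normal mapped entry */
};

/* Which handler the toy fault dispatcher would pick. */
enum toy_fault_path {
	TOY_PATH_NO_FAULT,
	TOY_PATH_DO_PTE_MISSING, /* file fault path, may reach filemap_fault() */
	TOY_PATH_DO_NUMA_PAGE,   /* NUMA hinting fault path */
};

/* Simplified handler selection, assumed to mirror handle_pte_fault(). */
enum toy_fault_path toy_handle_pte_fault(enum toy_pte_state pte)
{
	if (pte == TOY_PTE_NONE)
		return TOY_PATH_DO_PTE_MISSING;
	if (pte == TOY_PTE_PROTNONE)
		return TOY_PATH_DO_NUMA_PAGE;
	return TOY_PATH_NO_FAULT;
}

/*
 * Returns 1 if an access that races with the PTE update, and observes
 * in_window, can end up on the file fault path.
 */
int toy_window_can_major_fault(enum toy_pte_state in_window)
{
	return toy_handle_pte_fault(in_window) == TOY_PATH_DO_PTE_MISSING;
}

/* Print the outcome for one choice of in-window state. */
void toy_report(const char *label, enum toy_pte_state in_window)
{
	printf("%s: file fault path reachable = %d\n",
	       label, toy_window_can_major_fault(in_window));
}

/* Sanity check of the model: cleared entry is exposed, protnone is not. */
void toy_self_check(void)
{
	assert(toy_window_can_major_fault(TOY_PTE_NONE));
	assert(!toy_window_can_major_fault(TOY_PTE_PROTNONE));
}
```

In this model, toy_window_can_major_fault(TOY_PTE_NONE) returns 1, which
corresponds to the current behaviour, and
toy_window_can_major_fault(TOY_PTE_PROTNONE) returns 0, because the racing
access would be routed to do_numa_page() instead of the file fault path.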