From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 0562EC3DA45 for ; Wed, 10 Jul 2024 12:04:06 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 22EF46B0085; Wed, 10 Jul 2024 08:04:06 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 1DFCC6B0088; Wed, 10 Jul 2024 08:04:06 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 00B4D6B0089; Wed, 10 Jul 2024 08:04:05 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0014.hostedemail.com [216.40.44.14]) by kanga.kvack.org (Postfix) with ESMTP id D4FE76B0085 for ; Wed, 10 Jul 2024 08:04:05 -0400 (EDT) Received: from smtpin06.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay09.hostedemail.com (Postfix) with ESMTP id 7095181CEE for ; Wed, 10 Jul 2024 12:04:05 +0000 (UTC) X-FDA: 82323709650.06.EF38DF3 Received: from NAM11-DM6-obe.outbound.protection.outlook.com (mail-dm6nam11on2068.outbound.protection.outlook.com [40.107.223.68]) by imf17.hostedemail.com (Postfix) with ESMTP id 6A55A40034 for ; Wed, 10 Jul 2024 12:04:02 +0000 (UTC) Authentication-Results: imf17.hostedemail.com; dkim=pass header.d=amd.com header.s=selector1 header.b=MtUGYhyg; arc=pass ("microsoft.com:s=arcselector9901:i=1"); dmarc=pass (policy=quarantine) header.from=amd.com; spf=pass (imf17.hostedemail.com: domain of bharata@amd.com designates 40.107.223.68 as permitted sender) smtp.mailfrom=bharata@amd.com ARC-Seal: i=2; s=arc-20220608; d=hostedemail.com; t=1720613016; a=rsa-sha256; cv=pass; b=RVHQFvDR6WovifKgyU4aV1xuJvAL1ZHsoNSUpo6aWDvKj94k21oGIqbvnRYoAEKkuRQn65 O54JlIuPQNuZ6FCZyxOjN/tE1VWRwEgMVwbi044p60F8DLfx2jU7R6R48uv8YU0S6sEujm qGlxMgRgrGNgxATZwSa9gbGi89cngOE= ARC-Authentication-Results: i=2; imf17.hostedemail.com; dkim=pass header.d=amd.com header.s=selector1 header.b=MtUGYhyg; arc=pass ("microsoft.com:s=arcselector9901:i=1"); dmarc=pass (policy=quarantine) header.from=amd.com; spf=pass (imf17.hostedemail.com: domain of bharata@amd.com designates 40.107.223.68 as permitted sender) smtp.mailfrom=bharata@amd.com ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1720613016; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=NI0OAN9qpw5kvL46/YxSKPqdM4s4XeT2CCebjitDP6o=; b=dNT8OKsOhCfm6NguOEKPRHAkfbXUQga/6Nu+KqSvw7wbIqbkdZ5xn+cOlLcWF4FwBKSvBs UBQlnozpNlW8QoobALuPKG4zXlYlozQ59qbQHPeo2rmceFhfw1nyRvZXlrf53RVFXMX2N/ lFFFVfoIrcgJ/JTeNYI8bsnNY/VI6Tk= ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Y2nI/RsqQy7pjPFZFvW61+Gb6hvKZx0pYYvDUmLDukoJ8xdB1ONd5RDNQTKi4PnxOFFYZKXmte8HgJDhuh8fMt4Q9J3Z/nwDw+BqIFuKSWZaYxdoZ7tMTKmUjPaFCMtuKo64cGeHYv2fuWzssAQMnTeNKSOcqXm/INzbpUcRFELx+9czN2B+BCNrD42S7bDLZqFooQ/FZHFozW1qu0Vq0VMFjhdjURpbZtkOUscnaQDK2jN+Ff0+Ygl/VoCoIN+sFVIstuy4IR8eLEUAt/7yheXtRubYBGDJ8ASewkLjh0ehbZANVYI6dZCyKN+brsf4UK373bZbQY2Y8cqQjjppDQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=NI0OAN9qpw5kvL46/YxSKPqdM4s4XeT2CCebjitDP6o=; b=hNt+aSj0/km+qBROmUhCbXG0WQ714I1X0VMAyGgf+gP3FMQiAJ04HvcxGb+gFU3c0jtdeHy49iKJDJaYYhPqudz+XOOo2csxHbKqGji5FJ2Z3Inoitn8ApoNgq6qD47c62HEtvb79WvbZCMx/YeOoH6/uUOerOnCS9idMDIXcOzy86I17V3HQa33pGF4V0CvsFToPf7EFqpDf5tWcxb3wZd1EkBcKjDCQj0tZwkidJdx+rQ0cVv0xpd2bM3+IkQXm4TtjCCOf3SGmYnIFb/YIgpCkrimMFD0MJ9aXlOVYzt3XA2wL0ysC+dT8Rocz7842G5SptQew9fmvn9bN78BgQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=amd.com; dmarc=pass action=none header.from=amd.com; dkim=pass header.d=amd.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amd.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=NI0OAN9qpw5kvL46/YxSKPqdM4s4XeT2CCebjitDP6o=; b=MtUGYhygXM+sj4xbujiwGuhQ7DRWb75JoZbQGCaB9nEFcUEjHuXfhS6UzGl45IfRKKEYchBbkTlpO4UbYwChxU27w5vWnlpwHIi5YYRHWBnLToA9l1nsNV0KCYjScOrjw2kEXp9v3IILkifswfTPCeCFTXsrbXYqcE+0j3DV4Rs= Received: from IA1PR12MB6434.namprd12.prod.outlook.com (2603:10b6:208:3ae::10) by BL1PR12MB5825.namprd12.prod.outlook.com (2603:10b6:208:394::20) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7741.36; Wed, 10 Jul 2024 12:03:59 +0000 Received: from IA1PR12MB6434.namprd12.prod.outlook.com ([fe80::dbf7:e40c:4ae9:8134]) by IA1PR12MB6434.namprd12.prod.outlook.com ([fe80::dbf7:e40c:4ae9:8134%3]) with mapi id 15.20.7741.033; Wed, 10 Jul 2024 12:03:59 +0000 Message-ID: <4307e984-a593-4495-b4cc-8ef509ddda03@amd.com> Date: Wed, 10 Jul 2024 17:33:47 +0530 User-Agent: Mozilla Thunderbird Subject: Re: Hard and soft lockups with FIO and LTP runs on a large system To: Yu Zhao , mjguzik@gmail.com, david@fromorbit.com, kent.overstreet@linux.dev Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, nikunj@amd.com, "Upadhyay, Neeraj" , Andrew Morton , David Hildenbrand , willy@infradead.org, vbabka@suse.cz, kinseyho@google.com, Mel Gorman , linux-fsdevel@vger.kernel.org References: Content-Language: en-US From: Bharata B Rao In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-ClientProxiedBy: MA1PR01CA0166.INDPRD01.PROD.OUTLOOK.COM (2603:1096:a00:71::36) To IA1PR12MB6434.namprd12.prod.outlook.com (2603:10b6:208:3ae::10) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: IA1PR12MB6434:EE_|BL1PR12MB5825:EE_ X-MS-Office365-Filtering-Correlation-Id: 83c0c832-e241-48f4-fa52-08dca0d86153 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0;ARA:13230040|376014|7416014|1800799024|366016; X-Microsoft-Antispam-Message-Info: =?utf-8?B?OTlxclJEMFVxT1pZTC9oZDRnSTZwYjdxRmtGeDc4SGljYUNoNTc2b3VvZzRT?= =?utf-8?B?TysyM3ZoOFJyblRtQmpGSXExbkVZc1lQUVJCc3FTeE4zUVYvaW1aTjNlWTJF?= =?utf-8?B?aHJMMllPYmQzSXMwRWJBVXI5UnpobWtSQnZaNDVCVDR3ZW5ldUtrUnY5akZD?= =?utf-8?B?MGNUL3ZhenpwamQ1eGNqZC9qS3ZGTnRhYnBlVmJpdG5HdXFsQ3A1UzNiYU1R?= =?utf-8?B?YmxWZGJtTlFJcmcyRXUrSFdmc1cwc3J6OFJnOXJDNVJ3KzJHbmNoUXVjRlFJ?= =?utf-8?B?L0RTcFlRNnFhaVNTeURoU1pQSEtQNjd2VFh5dmtEek1MU0xNcVArTFA3ZmI0?= =?utf-8?B?eEQ4UjNnM2RTbkJvR3RKaThmTjlJYkViQTVvMjBpUTU5VGt0aFBFRU5kV3Z0?= =?utf-8?B?MkUxMTJMWTltbDhFakZJcFo0MDAwSVYyejV6aldJbHlzbFhXd1pleElRREpC?= =?utf-8?B?dTg1UE1kbktkZW5mWGZkQ2FGaVg5MmxwUkhtK1RFYWE5OXlycmV2NjdzU09S?= =?utf-8?B?OVQySXl3SG1iSUtwV3JFdG5HdjA2RVZwUmNaUFNzZ2VHU2VjRlpmZS9XYTJn?= =?utf-8?B?Ym5qK095bkhWOHR1d2Mwc0JqTTR5Vi9NOXF0OVpCMElaSEtxOStheFE3Q0tq?= =?utf-8?B?M2pYV0JiUGRoRTNOUjdsOEh4dXZRREEzOE53UEw3aVRHeTdkK2lrZ3l6OHM4?= =?utf-8?B?OS9xVXZhdFlQTFJ2REdmZUxOZW41Q1lXdmppanV5ZUVvVFNoajQzbEFkMk9R?= =?utf-8?B?ZEsrV2d6QkdqbTB5azZwZDM2NVVVZitEQ1NyMVdsRkJhb2Y0WWJKNXdKTkx3?= =?utf-8?B?TDZSblZmOFo5MHM2VDFFN0RSL0RmKzZpNmZPRGZPTzBaNVBSNXVlc2NVR094?= =?utf-8?B?c2VzTFNVQ0NtZENyT2JvRyt3K2lQSzFDbUgzbEVOd2ltYjBueXBJR0RrNnJH?= =?utf-8?B?TmdMOVFwSFkwcWpCN0FUV28yRDFaNnpCMjl2Y0hjR3dONVpuSzB3MEljcTNT?= =?utf-8?B?TExSVWJHVjIvM25rQ1p2N2o4ZzFRc04ydG1NM1NpbDR6RTJLZEtKdVlmc3Nt?= =?utf-8?B?ekNWM0ZUN3dicngxaHlvb21GRGl2MXdJS1JIRXEyZFdVb3E2L1d6Q0lXVWR6?= =?utf-8?B?M1hwZXcxcDIwY01xaGY1M1o1czlUVHFJU2pZR1I2SFYrUnQxRzNqUjlGNnZD?= =?utf-8?B?UHRWeXF4N3pCOTF1ditjbC9rLzZiK05rZ0FVUUl6RUgyb3dZaHJZQ0RkY0s5?= =?utf-8?B?cWtTdUFORy9XZDRKRWpWcHRycjkvYW1tS1JRNnY1TXVFRmZaSHhVTFY3NWJh?= =?utf-8?B?R3o2VmlraTBhQmFhdHdFbjJZUW9oQjNmZExCNlBVeVJWbFhlR1VSeTZxbExC?= =?utf-8?B?aHJRSitlWmRjVXB6UEU5a1BhOWVDUUUvRWpERWwvb1M1RXp6eUUwbXZVMFlj?= =?utf-8?B?aWhiSXJ1UktpcEp3N0xDT3JaUDBkUElQQjU5ekc1NnczeVRvRy9rYUpReTdU?= =?utf-8?B?ODB0V1JWZ01QYzY2MTloZVdyTnQyN2FQaGpVUzBQNGNtaXNsU2k4Nm1pdjda?= =?utf-8?B?cjhYMEc2eDZ4czFYNWhwZUpDQzJXTm1xZnRlOFpuRG5rSGFXOFhGUTNEdHBD?= =?utf-8?B?SmRSNGs3K1M4RmExZXZxMEFCL3greEhtWklhVCsxblRDZTlObUkwdjZNcVpU?= =?utf-8?B?L0dBUmhaSHNDM0trVW1PbUd4eGpFYlc5RFZtVUtmVzQ1QmJIK2lkdDN1cU1U?= =?utf-8?Q?v7yCjGeSwP3YLdQrmk=3D?= X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:IA1PR12MB6434.namprd12.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230040)(376014)(7416014)(1800799024)(366016);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?UmVXb1lhNE9sMXBaQjdCN0ZKWTQ4YU8yRUYwU0NYdngxUkIwR0Z1MFdNNTVE?= =?utf-8?B?NmxTTm41dDJZZkRsK2poRW4yb3VIQmk0dUQrc2dOVlgzTm1ieHlmRkxITjFw?= =?utf-8?B?SFgvMytUKzhpRzZtZXZITTJ2c1dxdFpsaHdjRUNmMkFaemhZV1had25VYUs4?= =?utf-8?B?SjJ2dGQvUGgwTG5rVStQRTNtV2Z6UXpCTjlsTTlBNnBwWGh5RmlzTFVUaTFD?= =?utf-8?B?S3oyNCtNMzhQVFBNRXFXYUpwVGhwK2o3L0hzRWRTMWI5eGJrVDhVdTVYZFgr?= =?utf-8?B?Q2lpU1FnQ256d1BpZkc1WEZzNGpLYldBSnBYMm9nMStvZkJoVnB4Qy9jMStv?= =?utf-8?B?Ymg4Y1dUTmJSTnl1WUhLOTdydm8vYllrMzEweE5VNGpmdjVGZ0dpbGgraHpo?= =?utf-8?B?VWwwSkNmaDhBQWY1ckFJTGN6WE5HUEg2T2VpQXEzdmJEYnZ6bll6NE9LUU5n?= =?utf-8?B?eXZEVUNLenZlS3BBQndBMWVnYjBpRmpnTkI0cktTTzQ3UVRZV3d5K2JmM1U3?= =?utf-8?B?a0h2Vk1sejNEcGkyYjFDcmdPKzZQaDRGSHcyMkMvKzd2QmROZS9Zd0pPWXR0?= =?utf-8?B?bGtMN281QlZKTlB3TXJ3TEU0WUxvVHQ4THRaQVZOTENhRklUNExZV1dQTjJr?= =?utf-8?B?ZGNMVEROMUgvV3VKcytZbUNkRjNvVWhLOCtMUElJcnpVQ3N6ZlkxMFJoUXBQ?= =?utf-8?B?TExpeWo4U1pGSzYrN01yd1pqLysrS04rV09ibmVZQTVhSDZDSml6NE5ndTFl?= =?utf-8?B?bUo2Q2R3RUJSL0xOVG1SUDRrVjRSK0JCQmluSWJ0emJZZ0wrNkFVV2hkMmU5?= =?utf-8?B?K3JVUHh0R3N6eHZKQXJnano0a1BtVjlrNXB1VnF3Tm1HVTF1SW51UVBCeGNH?= =?utf-8?B?SHowQWlaYTJaMjZGbUFyQjNWQlBYdzMwL001NlNPUTY3dWVacEdvdGxER0hV?= =?utf-8?B?ek9LL2t6ZURGR2ZvYzk1dlo5Tms2cDRvc2haQ05pa2N2MXhrWHlKbGdtVTVi?= =?utf-8?B?cW9pbnhOZHJ2NW1FeDMzTUdqUkZGQlV0TU5pN2NMTTlQTkMwQnRZUzRucE5w?= =?utf-8?B?U0JIN1hhZTZOQ285UTdMZWFaQTMzcG1XckJkaU54WDc2Z1Jld29ydlBtSWRI?= =?utf-8?B?bkJoa3BGM1R0NC8xM0NHZHYxYTFaRGphd1dLNENsU0lNT1AydVZTby83QUti?= =?utf-8?B?NSt6bTFFMDFzWWpnVTdqSTFYYU5sMUVZQ2pEQmZFWUhjdTFQS3YvS2RqWFVr?= =?utf-8?B?b2tZZUMzcVQzcFF1M3VocFVoVk1YK2czOWt0cFdEa2lJZTNVQ3lJd21neEZs?= =?utf-8?B?dmVmQVVmVDlaTkt1Ym9IcVQzZXBQOHR3bFRPZG9UbWNXcmd1RWpOdEF1ejdJ?= =?utf-8?B?YWVxMjltc1VuLy9UNWRZY0ZFVGJhMmEyZlhxU0dJUGd6RG9mRUh0eHhJek5V?= =?utf-8?B?VUNaR1gza3JvRlNPejBmNGZwTXZYUkVDS0xvK2xBSVJkeUZVK3RUQlhUTHQ5?= =?utf-8?B?M1N6bElHbHhaZTF1N1BWcVNJa1JqRHBDeUZsTk5QYnRFYy85MmtYWnQ5SWV0?= =?utf-8?B?eDF0dVVWbEtzMjNMK01NY0xBUml6YWFyN3VCVTloMDErQ1Zkdmo3ZW9zeU9X?= =?utf-8?B?ZnVrb1NXdFY4ck1rZXlIVE5BZW1CUjdWdXMrTDRqbGIxMURVc0daRTB6Sms1?= =?utf-8?B?eGhVQjZWNUlrdVdteC9YN3NucEJpTDNZK1c2Ymowbno3ZW1SOEdNR3VERHNs?= =?utf-8?B?SlBDNWg0VVdaTVdDYkxkUnB0Zys3Q3dIbVRUYjF6NHpldTJjbUdPSjUvOU9p?= =?utf-8?B?THlXcUlaczF6N0d2WEJPZmk3OHpaTjZDRlI4c1BPN21aL0h6WGxHWEdCK1Zt?= =?utf-8?B?NXBocC9LcTY2Yy9wUjRDR29zS1FGaXRkUkcxUDF1U04rdXhrRUwvSDc1MDRr?= =?utf-8?B?MnkvU1dkbzZQR0lYUnZUSWtIZTQybnZrcHpFSWx5S0Nra1J1UGY2K3RiMEFw?= =?utf-8?B?aU42em5HQzFodFNRU2dDVldMcEVsTytoMTZ6cEZaTDZobUVCTUlETVNMRmxB?= =?utf-8?B?b2pkRGoxSmh5aVp4UWdMaGtsb0RkUjl3U3pCVzFYQjhrTnpQcU9IeVBHNXhr?= =?utf-8?Q?GoS/TQKwvIIS7Tt4dTtdMD8Gv?= X-OriginatorOrg: amd.com X-MS-Exchange-CrossTenant-Network-Message-Id: 83c0c832-e241-48f4-fa52-08dca0d86153 X-MS-Exchange-CrossTenant-AuthSource: IA1PR12MB6434.namprd12.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 10 Jul 2024 12:03:58.9634 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: 3dd8961f-e488-4e60-8e11-a82d994e183d X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: r4NKPsr105lVIb49jTD8MSijKTVTLY+LscW5sI+IvRKqvg23tLDN8KJFeYbHztuSzsNVNlUHSS3zRarPXOGtxQ== X-MS-Exchange-Transport-CrossTenantHeadersStamped: BL1PR12MB5825 X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: 6A55A40034 X-Stat-Signature: hm79ix8nw93ftj5ynyfgu6yykih8574m X-Rspam-User: X-HE-Tag: 1720613042-477684 X-HE-Meta: U2FsdGVkX1+2VwER6Bxa6+SgUsz5hEf5kwB/1SoCF3lFmNQJI8GvwYGdyy50vV+09rnmyUkss/tKdKY/w4YyKDbDErT7f7KunUcA35Lr+ubFzV43J4aIbZ1/ISfw2nltqstfFKQtlQyT2doTNhgkwxhsbPeL9znIPLPg1zbwW6+35wwuFz30yhv4Deq7fQsu9mEqH099nVKl9HfGK6KBSQxofSBVsk1AfGfhdFU5yVO2pK3ykU+Wy3qync/GQRvJNaQctgTDf+S1sxgsO0TNpSTsbCkVyJFiAzLXS4mH78Bul0N8HOuN3lBnmY8Eae+3wTuDb7Ylw1Q+o93dUVUChRhmZK5/3003zTi3HCOcZ4YGs1Rj44cS+7ZwPDtHo/KlxVZLrQzU+mHszMKOTQEEpM3saTGx5ZuQ9Lggm5mg71awlUp284MRQsWTYyeack+bngazdjsN8SbEP+Ptyi0r19kBfWFm7rTHeJmr4fGwYUAGFmHSjoxhqYMdMSPEHNFbF5DEZZx3sMOaFaGJwKzMTGBWgyV15k59uirk+tNFX9u4+WmK0g4oLwXC9VCcfR7RYxL8hJterDY3x0s9kURTN15GmSKGtmz80iYHqeXpJcTN24HfEVzBlpmyN+DMrAGTCsBzcKQaR0K5/86lZf327/EL40+nbuZ3BW+NZQxl/6EnrEwTi9kvrWMIu/mOEqvzE7k/CKlGJQSDP2Lvr4tVRfo6Oy0VVYrs19mpVvGaSvpWSbfauYI1AKyhuFkQRi4Rz44MKfHAwfs3+4pborJhngN3SFrLWF5NxkRqjKMsDDsLDXzWe++ma6B5j51YFKtuNnc1MjExMatPrlrFVEtYbYbpO8PTEb9CkJixIYdHV0ou2CYWFYrxnfohj9LeLJbwN8ORwxCx0R1YeWbPZU3GjFitunIyKD2zpv2eNWDil0Rkr/vrVKXvWw4fU824FHwPc9bN3Ta/rLkN25bUMch nES+n3P0 Q6pD6u3d10PY+OmxE7ru6qkpOxipNaY/iwAQc+ouK9VSezUuMwQga1Ci52D5VS6vb0GJVz3RugnIcoz4nGuP/pKDhY8eviN0YI+Pda5cPCDTe3RxrENlWz/wURNLCi1aEwstIDo20SMYMZrbvUq3osO54+G2m3BtvgrMiGj5VR4qHi9fFRFekH9G8o+yWdAlpu3VjzbB9NCuYjOdoOvVHno2nwlY8G3bqGoBFDp2C5K5Yd/nbTqdV/6Lc0CyaSOs6suwkE9jp2BIAGTxHUyu9PsNnOlt3C/6fO5yfABmhLEmhNVuF3thFNs3LDuixNCPvFOrVsII0lCHpH87EEpKA3JjZ8tjqUcrWUkl24p3/75jZCpSQb1jp3P/UfCfBOh7+AvxKxvDA/PgKaVsyBbby9CQQWiml7J/Xc67YiTm3m7XS92E= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000017, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 07-Jul-24 4:12 AM, Yu Zhao wrote: >> Some experiments tried >> ====================== >> 1) When MGLRU was enabled many soft lockups were observed, no hard >> lockups were seen for 48 hours run. Below is once such soft lockup. >> Below preemptirqsoff trace points to preemption being disabled for more >> than 10s and the lock in picture is lruvec spinlock. > > Also if you could try the other patch (mglru.patch) please. It should > help reduce unnecessary rotations from deactivate_file_folio(), which > in turn should reduce the contention on the LRU lock for MGLRU. Thanks. With mglru.patch on a MGLRU-enabled system, the below latency trace record is no longer seen for a 30hr workload run. > >> # tracer: preemptirqsoff >> # >> # preemptirqsoff latency trace v1.1.5 on 6.10.0-rc3-mglru-irqstrc >> # -------------------------------------------------------------------- >> # latency: 10382682 us, #4/4, CPU#128 | (M:desktop VP:0, KP:0, SP:0 >> HP:0 #P:512) >> # ----------------- >> # | task: fio-2701523 (uid:0 nice:0 policy:0 rt_prio:0) >> # ----------------- >> # => started at: deactivate_file_folio >> # => ended at: deactivate_file_folio >> # >> # >> # _------=> CPU# >> # / _-----=> irqs-off/BH-disabled >> # | / _----=> need-resched >> # || / _---=> hardirq/softirq >> # ||| / _--=> preempt-depth >> # |||| / _-=> migrate-disable >> # ||||| / delay >> # cmd pid |||||| time | caller >> # \ / |||||| \ | / >> fio-2701523 128...1. 0us$: deactivate_file_folio >> <-deactivate_file_folio >> fio-2701523 128.N.1. 10382681us : deactivate_file_folio >> <-deactivate_file_folio >> fio-2701523 128.N.1. 10382683us : tracer_preempt_on >> <-deactivate_file_folio >> fio-2701523 128.N.1. 10382691us : >> => deactivate_file_folio >> => mapping_try_invalidate >> => invalidate_mapping_pages >> => invalidate_bdev >> => blkdev_common_ioctl >> => blkdev_ioctl >> => __x64_sys_ioctl >> => x64_sys_call >> => do_syscall_64 >> => entry_SYSCALL_64_after_hwframe However the contention now has shifted to inode_hash_lock. Around 55 softlockups in ilookup() were observed: # tracer: preemptirqsoff # # preemptirqsoff latency trace v1.1.5 on 6.10.0-rc3-trnmglru # -------------------------------------------------------------------- # latency: 10620430 us, #4/4, CPU#260 | (M:desktop VP:0, KP:0, SP:0 HP:0 #P:512) # ----------------- # | task: fio-3244715 (uid:0 nice:0 policy:0 rt_prio:0) # ----------------- # => started at: ilookup # => ended at: ilookup # # # _------=> CPU# # / _-----=> irqs-off/BH-disabled # | / _----=> need-resched # || / _---=> hardirq/softirq # ||| / _--=> preempt-depth # |||| / _-=> migrate-disable # ||||| / delay # cmd pid |||||| time | caller # \ / |||||| \ | / fio-3244715 260...1. 0us$: _raw_spin_lock <-ilookup fio-3244715 260.N.1. 10620429us : _raw_spin_unlock <-ilookup fio-3244715 260.N.1. 10620430us : tracer_preempt_on <-ilookup fio-3244715 260.N.1. 10620440us : => _raw_spin_unlock => ilookup => blkdev_get_no_open => blkdev_open => do_dentry_open => vfs_open => path_openat => do_filp_open => do_sys_openat2 => __x64_sys_openat => x64_sys_call => do_syscall_64 => entry_SYSCALL_64_after_hwframe It appears that scalability issues with inode_hash_lock has been brought up multiple times in the past and there were patches to address the same. https://lore.kernel.org/all/20231206060629.2827226-9-david@fromorbit.com/ https://lore.kernel.org/lkml/20240611173824.535995-2-mjguzik@gmail.com/ CC'ing FS folks/list for awareness/comments. Regards, Bharata.