From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1D2C5C61D85 for ; Tue, 21 Nov 2023 15:48:01 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id B11EF6B0496; Tue, 21 Nov 2023 10:48:00 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id AC0B36B049A; Tue, 21 Nov 2023 10:48:00 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 9B0406B049C; Tue, 21 Nov 2023 10:48:00 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id 8A9B86B0496 for ; Tue, 21 Nov 2023 10:48:00 -0500 (EST) Received: from smtpin09.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay01.hostedemail.com (Postfix) with ESMTP id 6AC0F1CB3B9 for ; Tue, 21 Nov 2023 15:48:00 +0000 (UTC) X-FDA: 81482392320.09.5CF39AA Received: from out-183.mta1.migadu.com (out-183.mta1.migadu.com [95.215.58.183]) by imf14.hostedemail.com (Postfix) with ESMTP id 3A94910001F for ; Tue, 21 Nov 2023 15:47:57 +0000 (UTC) Authentication-Results: imf14.hostedemail.com; dkim=pass header.d=linux.dev header.s=key1 header.b=rvCdEZdn; spf=pass (imf14.hostedemail.com: domain of chengming.zhou@linux.dev designates 95.215.58.183 as permitted sender) smtp.mailfrom=chengming.zhou@linux.dev; dmarc=pass (policy=none) header.from=linux.dev ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1700581678; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=zVdBdlkIsnH6I2p3E5sm8f7fpOQeckAW/foyxumZ7OY=; b=EnD+B1kCi3f1Wxn0iByN5sMdti6zJJx7BFZvSsVEHf6mS2tKeu4sDzWEmJjNVVy6tRai9T dZw3BCAGd0ikVmeL8loCXVEskmRTbKrBwjD2Mfb2lbloqrsuBlbbfUiVFZYt9xm3QJN4Ov k6MypG0GVmc495hTUeD6Ja05LhXgTMM= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1700581678; a=rsa-sha256; cv=none; b=C2IflEVzfq/e6v9KnhPwhnFyDN07WDtiHRGdS+ORYFZ+z7RwZNVqodCLgP0ZXTQsMB71NT tbw9qZfekJZRJUJ5z86FQpq8HzWVO2E8EqKCUHCapMFqbLvi8U2c9YOnXcrezEroQYL4y8 0tfiyL878yrHJ2Qm4OfWV5QIny3TDHI= ARC-Authentication-Results: i=1; imf14.hostedemail.com; dkim=pass header.d=linux.dev header.s=key1 header.b=rvCdEZdn; spf=pass (imf14.hostedemail.com: domain of chengming.zhou@linux.dev designates 95.215.58.183 as permitted sender) smtp.mailfrom=chengming.zhou@linux.dev; dmarc=pass (policy=none) header.from=linux.dev Message-ID: DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.dev; s=key1; t=1700581676; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=zVdBdlkIsnH6I2p3E5sm8f7fpOQeckAW/foyxumZ7OY=; b=rvCdEZdneqNOUqfvNgM8d99zC5r7Hm7vbnpqQFZJWCW0yCRnq4Mebspr9XdHZ8UY5+giaE HTGgWBHAIqQCryqFsD7hJeOB7HWW9cZr4dJMhVVMAI/Lns9q20GyA8hbbwYm4nWX/hWBGk 0AMg52MLYLOzj2rDPE0PqBd2xWyR3Zk= Date: Tue, 21 Nov 2023 23:47:26 +0800 MIME-Version: 1.0 Subject: Re: [PATCH v5 6/9] slub: Delay freezing of partial slabs Content-Language: en-US To: Mark Brown Cc: vbabka@suse.cz, cl@linux.com, penberg@kernel.org, rientjes@google.com, iamjoonsoo.kim@lge.com, akpm@linux-foundation.org, roman.gushchin@linux.dev, 42.hyeyoo@gmail.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Chengming Zhou References: <20231102032330.1036151-1-chengming.zhou@linux.dev> <20231102032330.1036151-7-chengming.zhou@linux.dev> <4f3bc1bd-ea87-465d-b58a-0ed57b15187b@sirena.org.uk> X-Report-Abuse: Please report any abuse attempt to abuse@migadu.com and include these headers. From: Chengming Zhou In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Migadu-Flow: FLOW_OUT X-Stat-Signature: wywo7cmc3i1tonu3z1sqyu7pbir5gr78 X-Rspamd-Server: rspam10 X-Rspamd-Queue-Id: 3A94910001F X-Rspam-User: X-HE-Tag: 1700581677-955097 X-HE-Meta: U2FsdGVkX19ytPx/p0ooapzsIQrh3WOZUimsoquxoPyrstUR1HIQ7pFgS/5jQdu5M2WlD33xjhEKUH12F/Sx8mbsn8RSt6pnQmLIn6IvPZkxXKogg7aYEuj10StsDbhmm35o6WELsGXwK4dLJ1KZRRfuSknHB19BOS+ExZHGzsp7ZxVvJGADpFsD7SiyoHC2lEpT5JJ3Gr+jHsbIeex66N/oElA1F9NekuSgwx8Yq6Sqq9XVl7KhVRaLMgbFygmMmrxtrEmZAUaA9lD5Nr1rZoM+WREYNFRFhASARY3y8edT+JG17LUEmKv1MehhULrDXFx8aOxKAI3V1SqYqC5H2lqQIU//b+rxUfjwnUyY7FZl2ag/sajd+9G1IeG0qj2uxbMZw0x5tSfl0Fj6MOZSPCz7zaAJD9RDkxDbsoeAbk6KEadejFftBVR6mubonMzVKtDJROQzV4yP7TelDz/GEjC2w7WzEK/gvCS+CVlSAqim3W8o3cVFFoyCnIiszQmxJm1BOD3UZEUp6Sy5Q7mSpcsGWJMoh4agv2JYZi5SriTbFaWNFQpmCURt7Dg7fsavD10MB15LrpOwHrgHpp3zZR666zJdZkB1O9MNspIQfvoXFqr4i3TpDPyasfceZZnEKmYl4Ypse9proYjFmTEMMSUZg8l4xnrLtpQTWv0hkUxy0h+1uXkdhZwClpmbbEDXVv0628uGGJUHYc63KRH8kmU9CXx8Ef/iU8TcVcQIvZcVjRAwfeIUWqDXZnW5tZ2Et3R49XDdLGAcpH4/4SRTet/SsD38E7hP1VShMhQV6pHeKIKguZw3cUcurNwjpm38cxw1BO6Mola30tDT5lWyfr8i8gUILsyTmI3ypxaalltpHeqhYvSNM43PAEQJW5RTuHn9ZAKZRFGTZp76SevgXfELDzjwFVsC4ocPHQd4bAGGuATSGYzoSX0OuvowsbvI7Eo+kQzpLPe8/gKSAYQ rqsT1kyl Zt9hg59KJ8GPn7qMjqad/UTIykvktDvOdPu3OhnpYAoEp8x1ydKY3QVlI9Rn06+bff3YK0pZ8qE4UU9ZO8xiE2BM9djMKCM0gjlWm4EQRsVpyF8wGJYxQIuu5k6M/dQZLGD1UH9UrV6xWZNK9Bn8u16YyFw04Q8PGbzSzXigkOs/gkUfTLsHuAqorSWJMpdGnZLsSsf9t4yhjsLJxnbeNH3u3SXuqnWMB9ypT X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 2023/11/21 09:29, Mark Brown wrote: > On Tue, Nov 21, 2023 at 08:58:40AM +0800, Chengming Zhou wrote: >> On 2023/11/21 02:49, Mark Brown wrote: >>> On Thu, Nov 02, 2023 at 03:23:27AM +0000, chengming.zhou@linux.dev wrote: > >>> When we see problems we see RCU stalls while logging in, for example: > >>> [ 46.453323] rcu: INFO: rcu_sched detected stalls on CPUs/tasks: >>> [ 46.459361] rcu: 3-...0: (1 GPs behind) idle=def4/1/0x40000000 softirq=1304/1304 fqs=951 >>> [ 46.467669] rcu: (detected by 0, t=2103 jiffies, g=1161, q=499 ncpus=4) >>> [ 46.474472] Sending NMI from CPU 0 to CPUs 3: > >> IIUC, here should print the backtrace of CPU 3, right? It looks like CPU 3 is the cause, >> but we couldn't see what it's doing from the log. > > AIUI yes, but it looks like we've just completely lost the CPU - there's > more attempts to talk to it visible in the log: > >>> A full log for that run can be seen at: >>> >>> https://validation.linaro.org/scheduler/job/4017095 > > but none of them appear to cause CPU 3 to respond. Note that 32 bit ARM > is just using a regular IPI rather than something that's actually a NMI > so this isn't hugely out of the ordinary, I'd guess it's stuck with > interrupts masked. Ah yes, there is no NMI on ARM, so CPU 3 maybe running somewhere with interrupts disabled. I searched the full log, but still haven't a clue. And there is no any WARNING or BUG related to SLUB in the log. I wonder how to reproduce it locally with a Qemu VM since I don't have the ARM machine. Thanks!