From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id D3DCBC3ABD8 for ; Fri, 16 May 2025 07:45:30 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id C03646B00E2; Fri, 16 May 2025 03:45:29 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id B65EC6B00E4; Fri, 16 May 2025 03:45:29 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 9B9A66B00E5; Fri, 16 May 2025 03:45:29 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0014.hostedemail.com [216.40.44.14]) by kanga.kvack.org (Postfix) with ESMTP id 747286B00E2 for ; Fri, 16 May 2025 03:45:29 -0400 (EDT) Received: from smtpin29.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay06.hostedemail.com (Postfix) with ESMTP id A5312E2F20 for ; Fri, 16 May 2025 07:45:29 +0000 (UTC) X-FDA: 83447985978.29.19CA4DF Received: from mail-lf1-f41.google.com (mail-lf1-f41.google.com [209.85.167.41]) by imf05.hostedemail.com (Postfix) with ESMTP id A38F310000D for ; Fri, 16 May 2025 07:45:27 +0000 (UTC) Authentication-Results: imf05.hostedemail.com; dkim=pass header.d=gmail.com header.s=20230601 header.b=Dk2OMcYB; dmarc=pass (policy=none) header.from=gmail.com; spf=pass (imf05.hostedemail.com: domain of ryncsn@gmail.com designates 209.85.167.41 as permitted sender) smtp.mailfrom=ryncsn@gmail.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1747381527; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=hI46q2/HMrgYXwAsW6z4GpzR4isBPsAyHmvgP/K9kHI=; b=utX4APK+H9A03wI5fv85WwvHAMSOLPsnsPi8O2nb0WC3GgHV5HJTrBdHkx9bA5q6und4IY Hb05KzWABX/KlMuJeZd74/UkcUb3uo6xSpFaP8S+4QzDk98mXW13vCsqeWyYCIXS3dYDXF uYxeWSrSsPe0pktLc2EDhqMwr2v6Waw= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1747381527; a=rsa-sha256; cv=none; b=LwdT7XpEs5DoW6+Nr2LmYNh2GJDqRUnWdcNhrveuEGBWEPQrzGgyt8+SW7OLpbCEE23YyL YvUhdKZFdYzQLon02c0+0BPgVtIEH7h600GgkRRHJW6AFgAE0NEJFeoITQZZiy5Q3mep12 f/tmEPWaCzYfgLDIEQ2IZUlLIrpcWHY= ARC-Authentication-Results: i=1; imf05.hostedemail.com; dkim=pass header.d=gmail.com header.s=20230601 header.b=Dk2OMcYB; dmarc=pass (policy=none) header.from=gmail.com; spf=pass (imf05.hostedemail.com: domain of ryncsn@gmail.com designates 209.85.167.41 as permitted sender) smtp.mailfrom=ryncsn@gmail.com Received: by mail-lf1-f41.google.com with SMTP id 2adb3069b0e04-54b166fa41bso2351599e87.0 for ; Fri, 16 May 2025 00:45:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1747381526; x=1747986326; darn=kvack.org; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=hI46q2/HMrgYXwAsW6z4GpzR4isBPsAyHmvgP/K9kHI=; b=Dk2OMcYB8+dGYZviZ8djnVl5nNgTqcuVMKC+M2t0E5tMIlPZ7WkGyuKPR5Dghfhd5w fUwpeahkyXDKAlvo1hgYjazLbA8boUsqB/jEs9Xe2hWYAgs3d3QXw/sQMgcY+pyx1C3Z qU68s2oJtK1RDOw9CqYXQYG5mRN2WtqgNN9GDh6NenMAsX8i0f00ivHnYP60jgBbvT2+ GFa2q/rqov2JCdJdHb4oatsgAtuFUswx+6zCvvg2lxSh7t/74PxYSx1YI2jOjxycEYBS wOdyLFldJglmZqQwAz2vFa5OyT1hcQo6v/Q0W9OpV0A3VjbH50T1CGX83mxogLi0APWh Y9Jg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1747381526; x=1747986326; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=hI46q2/HMrgYXwAsW6z4GpzR4isBPsAyHmvgP/K9kHI=; b=qc6PROtdgDuFK0HtJSlETnYhOeQ+ROJvgSdb4QvmbB0qOS014+J8u9e4IHHej3j8jA q2xllwEGe9SYfRGD3LqFv6gEKoCfjH4Q+coeYd/X1o8CmJzubhx1/zX7CCLwhCj2BTnT NiuLYvXmseM8UXkaSHqZkYjOyDGPMmAL+Nmd1IlHck/VHxhYy0uZlPI6KnhaDoizHzTB E1f/D7qYP9gflN5S0nuj2K5rerMMznng0LZ/AC2o7b+cstg025VDF3dW7+j78f2oBaQQ ayuS8SLLfFejtYVFRSyPE4aDadoCy8BmIaFFhLjGsW4FQS21tdMsXmV3aNFsWile/iGU N5NQ== X-Forwarded-Encrypted: i=1; AJvYcCUnWTVWGU8lQNEczouLM/6BFHUR1R+8iPOc5p213kpg03+EwKKBy8jFO5sjYnYoMCdNZEXLfqnq7Q==@kvack.org X-Gm-Message-State: AOJu0YyZMSnjCJ78xeQlGORFtmP1EedvJl5RRtssmEDhz7vUWc2atPNd 3Y+9O8cQNebKJzuVZN9KRwp+Y2lqZ+JB5wSav+nOf4JDDANmPQ5M3OGnW1hKjTQqBHRg3xE8WoO fS43p/8eGMwwjON5aMlZn9E0aNchcxZs= X-Gm-Gg: ASbGncse9IUJciodoJsNBaBqrrWchcFezh7LtZj+t8BQVPom2EwtxETN5+tvVs5E4hZ 57gLISB8bJrSQY7dxtkifSNmSqNhMtXqSZAB24+oZUviewwZfAsa0XH7AiKIOwTd9UMcGCQlk49 PHbkvGIWTKcUHGZ5fpakMEBIymz3lXeWs= X-Google-Smtp-Source: AGHT+IE8OTrafodRzlwsNQ0LbGEoU7Z4K2jZxv2+PaWYwl+yJohaTcyWEEq/j5hYkoQ+jvirHnqjLCLO/khyRjYncEA= X-Received: by 2002:a2e:a98e:0:b0:308:f860:7c1 with SMTP id 38308e7fff4ca-328077a3117mr8773871fa.30.1747381525559; Fri, 16 May 2025 00:45:25 -0700 (PDT) MIME-Version: 1.0 References: <202505161438.9009cf47-lkp@intel.com> In-Reply-To: From: Kairui Song Date: Fri, 16 May 2025 15:45:07 +0800 X-Gm-Features: AX0GCFswC6j0TLVd98lh9U5fR-YYcAHDxe-Vt4VAjeRl7WcdhnGzLfFtf6HCWLY Message-ID: Subject: Re: [linus:master] [mm, swap] b487a2da35: BUG:soft_lockup-CPU##stuck_for#s![stress-ng-swap:#] To: Kemeng Shi Cc: kernel test robot , oe-lkp@lists.linux.dev, lkp@intel.com, linux-kernel@vger.kernel.org, Andrew Morton , Baolin Wang , Baoquan He , Barry Song , Chris Li , "Huang, Ying" , Hugh Dickins , Johannes Weiner , Kalesh Singh , Matthew Wilcow , Nhat Pham , Yosry Ahmed , linux-mm@kvack.org Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Stat-Signature: gztcd5fbuaftsjoqkca78779gni7fj9r X-Rspam-User: X-Rspamd-Server: rspam05 X-Rspamd-Queue-Id: A38F310000D X-HE-Tag: 1747381527-202091 X-HE-Meta: U2FsdGVkX1/KHbEfxRhap6F9m4/L0fJb4D45oInJE9feT4HrIvVt0SrVBPAgd/sVdboMKYgAYb+8odQuqgo+Om3rvn0OLELxPUUXCIQAXOGxpvbkq7vQi8qlGCGHu+1c5zXuLQRQwhCfI/jmX1UaZDtt6sOpCIgdtE5+RfIb0tATgstCMWsr85NTdI6S/c6mUfbX0TJWCAyN+Y9jKWRDosWxOPrMtSN4DsdNidBCV51qmBqwnqfT3mVk7uP/DNvyodK0r2Anm7h0CuCny/zOT+X8Y6GGU7JozOi/LVSA8fBLSbFxCHHOIU7EbB7BF/Os0ZonI1Owmt+DqMHS9Gjnkfmj8GNOHpZ2JHKoR3eP+E+sp2c/WAqVegydae7JRLi6chXEGodPLB20aVlmbXB/A73DOu4QH5skvkmfvfKY36y5cOQydiea6gGXo7C8qNwtmWAE3Kz1epy5MvmRKIwZID9wk/SLBrlpFrwx7VqbGzXZaUrcS/y2ToveMNHTj5NJ36jndYf8gVbRnEa0NA8g1MCW8BStixvWgaLoUtPMNkUKAmYly1JU2dt6GAioRpvqxjy7e7skyA0PFUQrbUDdM0f6v7CH54SfNwDu/fwEp46hnn5x6iR3q/1L+eL+iagmkIkGSL+l+tsrvliXuG7edaIdxyG/wbAMGIRgXc89DI+9FtFyxdzKlL6gvq3smgtSlnA6k+xfeXKUoSBgZnBQp4yvOTg++CeBISyPQjb7XcIZb46Wc9HjM7ULn5hs6yRgTeRWsdvngPvpODtyZ6Zo3Mb4YIKO6PueCBfXrT5KXO1G/kHsZ25L27FnnJMmfkgBhvMD5U10xxa8/eHy+GV+oSTJZxB6e0+xXA+9nWP3vMz030AwpslsNFyb85MtXJXIBE0UYJdQGz7CE95SvE7aSFgyc2sFx65V94Cnq8CQMjooFIYhNmCsyCERVeGdnGQtCul/wTLh3mUWweaVG6W QFJgRiE9 qO23YzW1KnFDrFC4H8m/HqyUU/q+S2cg9o/cyU0NTKU2Y2fPMGf4jGXGErs2F7+qT5i9xT4YI43+vdyRC0e7N8RMrQyY4nSokEu+0gxQ+pEIz/KBp/LXjaLfPsVP2Jfu/4P6LePjjDEK+Jjjdfpcu2Y/8nRuBEbg32hEUGketCh8FfcLxwxXC2BOgqTdRd9f6Stz1OJmfx0LYxdVP3e8kJW5ttiTnLpaDDPqdTTKwWDzGDvHXVlk0u4/d75TwNnS4kVJ1yqUQbXgmctOHdJGFIkTvsRwY0sp5vjeSQ3PtY1a/+9mMsMtX+DKbx6C3iNed6ymiXyHkeBTR3JxGkvwEmlQIeYL5/1veC+G6qe4OXg534uvNJIHZ9QUVmrG0qfUIqXpk3MlZBPnEh6qhKlPo9OCkZl7HK6BOdCUMca8eozCjwb63SylNv5OGXgXIqjknLI2X8dR3e+3RIc5GRlRc86QUk2sOA2MMX4Q8Pt4Ezlhon/qe+FMvH8fA71Tm5gf9AxlVPT2HesowSfsKlhzFdummQyc3tzUKaF+uEsKIRypWwXDT5Qwztw2VSA== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On Fri, May 16, 2025 at 3:40=E2=80=AFPM Kemeng Shi wrote: > > > > on 5/16/2025 3:19 PM, Kairui Song wrote: > > On Fri, May 16, 2025 at 2:37=E2=80=AFPM kernel test robot wrote: > >> > >> > >> > >> Hello, > >> > >> kernel test robot noticed "BUG:soft_lockup-CPU##stuck_for#s![stress-ng= -swap:#]" on: > >> > >> commit: b487a2da3575b6cdfb6d6559311830c8fea70bb9 ("mm, swap: simplify = folio swap allocation") > >> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master > >> > >> [test failed on linus/master fee3e843b309444f48157e2188efa6818bae= 85cf] > >> [test failed on linux-next/master 484803582c77061b470ac64a634f25f89715= be3f] > >> > >> in testcase: stress-ng > >> version: stress-ng-x86_64-f76f86ffb-1_20250412 > >> with following parameters: > >> > >> nr_threads: 100% > >> disk: 1HDD > >> testtime: 60s > >> fs: xfs > >> test: swap > >> cpufreq_governor: performance > >> > >> > >> > >> config: x86_64-rhel-9.4 > >> compiler: gcc-12 > >> test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.= 10GHz (Ice Lake) with 256G memory > >> > >> (please refer to attached dmesg/kmsg for entire log/backtrace) > >> > >> > >> > >> If you fix the issue in a separate patch/commit (i.e. not just a new v= ersion of > >> the same patch/commit), kindly add following tags > >> | Reported-by: kernel test robot > >> | Closes: https://lore.kernel.org/oe-lkp/202505161438.9009cf47-lkp@int= el.com > >> > >> > >> [ 76.825919][ C17] watchdog: BUG: soft lockup - CPU#17 stuck for 2= 6s! [stress-ng-swap:3017] > >> [ 76.825921][ C17] Modules linked in: xfs ipmi_ssif intel_rapl_msr= intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common i10= nm_edac skx_edac_common nfit libnvdimm x86_pkg_temp_thermal coretemp kvm_in= tel btrfs kvm sd_mod sg snd_pcm blake2b_generic ghash_clmulni_intel xor dax= _hmem ast raid6_pq snd_timer rapl cxl_acpi drm_client_lib ahci intel_cstate= cxl_port drm_shmem_helper snd libahci cxl_core intel_th_gth nvme acpi_powe= r_meter isst_if_mbox_pci isst_if_mmio soundcore intel_th_pci mei_me intel_u= ncore ioatdma i2c_i801 megaraid_sas ipmi_si einj acpi_ipmi pcspkr libata dr= m_kms_helper nvme_core mei isst_if_common intel_th intel_pch_thermal i2c_sm= bus intel_vsec dca wmi ipmi_devintf ipmi_msghandler joydev binfmt_misc drm = fuse dm_mod loop ip_tables > >> [ 76.825949][ C17] CPU: 17 UID: 0 PID: 3017 Comm: stress-ng-swap N= ot tainted 6.14.0-rc6-00228-gb487a2da3575 #1 > >> [ 76.825951][ C17] Hardware name: Inspur NF5180M6/NF5180M6, BIOS 0= 6.00.04 04/12/2022 > >> [ 76.825952][ C17] RIP: 0010:shmem_unuse (include/linux/list.h:119 inc= lude/linux/list.h:215 include/linux/list.h:287 mm/shmem.c:1497) > >> [ 76.825959][ C17] Code: 30 48 8d 6b 30 4c 8d 60 d0 48 81 fd 20 e3 26 = 83 0f 84 8a 00 00 00 48 83 7b 18 00 75 89 48 8b 43 38 48 8b 53 30 48 39 28 = 75 6c <48> 39 6a 08 75 66 48 89 42 08 48 89 10 48 89 6b 30 48 89 6b 38 4c > >> All code > >> =3D=3D=3D=3D=3D=3D=3D=3D > >> 0: 30 48 8d xor %cl,-0x73(%rax) > >> 3: 6b 30 4c imul $0x4c,(%rax),%esi > >> 6: 8d 60 d0 lea -0x30(%rax),%esp > >> 9: 48 81 fd 20 e3 26 83 cmp $0xffffffff8326e320,%rbp > >> 10: 0f 84 8a 00 00 00 je 0xa0 > >> 16: 48 83 7b 18 00 cmpq $0x0,0x18(%rbx) > >> 1b: 75 89 jne 0xffffffffffffffa6 > >> 1d: 48 8b 43 38 mov 0x38(%rbx),%rax > >> 21: 48 8b 53 30 mov 0x30(%rbx),%rdx > >> 25: 48 39 28 cmp %rbp,(%rax) > >> 28: 75 6c jne 0x96 > >> 2a:* 48 39 6a 08 cmp %rbp,0x8(%rdx) <-- tr= apping instruction > >> 2e: 75 66 jne 0x96 > >> 30: 48 89 42 08 mov %rax,0x8(%rdx) > >> 34: 48 89 10 mov %rdx,(%rax) > >> 37: 48 89 6b 30 mov %rbp,0x30(%rbx) > >> 3b: 48 89 6b 38 mov %rbp,0x38(%rbx) > >> 3f: 4c rex.WR > >> > >> Code starting with the faulting instruction > >> =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > >> 0: 48 39 6a 08 cmp %rbp,0x8(%rdx) > >> 4: 75 66 jne 0x6c > >> 6: 48 89 42 08 mov %rax,0x8(%rdx) > >> a: 48 89 10 mov %rdx,(%rax) > >> d: 48 89 6b 30 mov %rbp,0x30(%rbx) > >> 11: 48 89 6b 38 mov %rbp,0x38(%rbx) > >> 15: 4c rex.WR > >> [ 76.825961][ C17] RSP: 0018:ffa00000232ffce0 EFLAGS: 00000246 > >> [ 76.825964][ C17] RAX: ff110020c063bde0 RBX: ff110020c063bdb0 RCX= : 0000000000000006 > >> [ 76.825965][ C17] RDX: ff110020c063bde0 RSI: 0000000000000000 RDI= : ff110020c063be2c > >> [ 76.825966][ C17] RBP: ff110020c063bde0 R08: ffffffffffffffc0 R09= : 0000000000000000 > >> [ 76.825967][ C17] R10: 000000000000ffff R11: ffa000000cafa000 R12= : ff110020c063bdb0 > >> [ 76.825968][ C17] R13: ff110020c063be2c R14: 0000000000000014 R15= : ff110020c063bdb0 > >> [ 76.825969][ C17] FS: 00007fcb691d2580(0000) GS:ff11003fbac80000= (0000) knlGS:0000000000000000 > >> [ 76.825970][ C17] CS: 0010 DS: 0000 ES: 0000 CR0: 00000000800500= 33 > >> [ 76.825971][ C17] CR2: 00007fcb6490f000 CR3: 00000001897e4006 CR4= : 0000000000773ef0 > >> [ 76.825972][ C17] DR0: 0000000000000000 DR1: 0000000000000000 DR2= : 0000000000000000 > >> [ 76.825973][ C17] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7= : 0000000000000400 > >> [ 76.825973][ C17] PKRU: 55555554 > >> [ 76.825974][ C17] Call Trace: > >> [ 76.825975][ C17] > >> [ 76.825976][ C17] ? watchdog_timer_fn (kernel/watchdog.c:770) > >> [ 76.825980][ C17] ? __pfx_watchdog_timer_fn (kernel/watchdog.c:685) > >> [ 76.825981][ C17] ? __hrtimer_run_queues (kernel/time/hrtimer.c:1801 = kernel/time/hrtimer.c:1865) > >> [ 76.825985][ C17] ? hrtimer_interrupt (kernel/time/hrtimer.c:1930) > >> [ 76.825986][ C17] ? __sysvec_apic_timer_interrupt (arch/x86/kernel/ap= ic/apic.c:1038 arch/x86/kernel/apic/apic.c:1055) > >> [ 76.825989][ C17] ? sysvec_apic_timer_interrupt (arch/x86/kernel/apic= /apic.c:1049 arch/x86/kernel/apic/apic.c:1049) > >> [ 76.825992][ C17] > >> [ 76.825992][ C17] > >> [ 76.825993][ C17] ? asm_sysvec_apic_timer_interrupt (arch/x86/include= /asm/idtentry.h:702) > >> [ 76.825997][ C17] ? shmem_unuse (include/linux/list.h:119 include/lin= ux/list.h:215 include/linux/list.h:287 mm/shmem.c:1497) > >> [ 76.825999][ C17] try_to_unuse (mm/swapfile.c:2295) > >> [ 76.826003][ C17] __do_sys_swapoff (mm/swapfile.c:2736) > >> [ 76.826005][ C17] do_syscall_64 (arch/x86/entry/common.c:52 arch/x86/= entry/common.c:83) > >> [ 76.826008][ C17] ? __rseq_handle_notify_resume (kernel/rseq.c:420) > >> [ 76.826012][ C17] ? syscall_exit_to_user_mode (include/linux/rseq.h:3= 8 include/linux/resume_user_mode.h:62 kernel/entry/common.c:114 include/lin= ux/entry-common.h:329 kernel/entry/common.c:207 kernel/entry/common.c:218) > >> [ 76.826013][ C17] ? do_syscall_64 (arch/x86/entry/common.c:102) > >> [ 76.826015][ C17] ? __perf_sw_event (kernel/events/internal.h:229 ker= nel/events/core.c:10210 kernel/events/core.c:10235) > >> [ 76.826016][ C17] ? handle_mm_fault (include/linux/perf_event.h:1503 = mm/memory.c:6106 mm/memory.c:6233) > >> [ 76.826020][ C17] ? do_user_addr_fault (arch/x86/mm/fault.c:1338) > >> [ 76.826022][ C17] ? clear_bhb_loop (arch/x86/entry/entry_64.S:1538) > >> [ 76.826024][ C17] ? clear_bhb_loop (arch/x86/entry/entry_64.S:1538) > >> [ 76.826026][ C17] ? clear_bhb_loop (arch/x86/entry/entry_64.S:1538) > >> [ 76.826028][ C17] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entr= y_64.S:130) > >> [ 76.826030][ C17] RIP: 0033:0x7fcb69976fc7 > >> [ 76.826031][ C17] Code: 73 01 c3 48 8b 0d 39 2e 0d 00 f7 d8 64 89 01 = 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 b8 a8 00 00 00 = 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 09 2e 0d 00 f7 d8 64 89 01 48 > >> All code > >> =3D=3D=3D=3D=3D=3D=3D=3D > >> 0: 73 01 jae 0x3 > >> 2: c3 ret > >> 3: 48 8b 0d 39 2e 0d 00 mov 0xd2e39(%rip),%rcx # 0xd= 2e43 > >> a: f7 d8 neg %eax > >> c: 64 89 01 mov %eax,%fs:(%rcx) > >> f: 48 83 c8 ff or $0xffffffffffffffff,%rax > >> 13: c3 ret > >> 14: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > >> 1b: 00 00 00 > >> 1e: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1) > >> 23: b8 a8 00 00 00 mov $0xa8,%eax > >> 28: 0f 05 syscall > >> 2a:* 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax = <-- trapping instruction > >> 30: 73 01 jae 0x33 > >> 32: c3 ret > >> 33: 48 8b 0d 09 2e 0d 00 mov 0xd2e09(%rip),%rcx # 0xd= 2e43 > >> 3a: f7 d8 neg %eax > >> 3c: 64 89 01 mov %eax,%fs:(%rcx) > >> 3f: 48 rex.W > >> > >> Code starting with the faulting instruction > >> =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > >> 0: 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax > >> 6: 73 01 jae 0x9 > >> 8: c3 ret > >> 9: 48 8b 0d 09 2e 0d 00 mov 0xd2e09(%rip),%rcx # 0xd= 2e19 > >> 10: f7 d8 neg %eax > >> 12: 64 89 01 mov %eax,%fs:(%rcx) > >> 15: 48 rex.W > >> [ 76.826032][ C17] RSP: 002b:00007ffd6d92dbd8 EFLAGS: 00000206 ORI= G_RAX: 00000000000000a8 > >> [ 76.826033][ C17] RAX: ffffffffffffffda RBX: 0000000000000019 RCX= : 00007fcb69976fc7 > >> [ 76.826034][ C17] RDX: fffffffffffffccc RSI: 000000000003d000 RDI= : 00007ffd6d92e690 > >> [ 76.826035][ C17] RBP: 00007fcb64910000 R08: 0000000000000007 R09= : 0000559500e1f530 > >> [ 76.826036][ C17] R10: c7c9ff097bf742d3 R11: 0000000000000206 R12= : 00007fcb691d2180 > >> [ 76.826036][ C17] R13: 00007ffd6d92e690 R14: 0000000000000001 R15= : 0000000000000000 > >> [ 76.826037][ C17] > >> [ 76.826038][ C17] Kernel panic - not syncing: softlockup: hung ta= sks > >> [ 77.239533][ C17] CPU: 17 UID: 0 PID: 3017 Comm: stress-ng-swap T= ainted: G L 6.14.0-rc6-00228-gb487a2da3575 #1 > >> [ 77.251353][ C17] Tainted: [L]=3DSOFTLOCKUP > >> [ 77.255672][ C17] Hardware name: Inspur NF5180M6/NF5180M6, BIOS 0= 6.00.04 04/12/2022 > >> [ 77.263635][ C17] Call Trace: > >> [ 77.266915][ C17] > >> [ 77.269756][ C17] panic (kernel/panic.c:354) > >> [ 77.273643][ C17] watchdog_timer_fn (kernel/watchdog.c:735) > >> [ 77.278568][ C17] ? __pfx_watchdog_timer_fn (kernel/watchdog.c:685) > >> [ 77.284015][ C17] __hrtimer_run_queues (kernel/time/hrtimer.c:1801 ke= rnel/time/hrtimer.c:1865) > >> [ 77.289201][ C17] hrtimer_interrupt (kernel/time/hrtimer.c:1930) > >> [ 77.294040][ C17] __sysvec_apic_timer_interrupt (arch/x86/kernel/apic= /apic.c:1038 arch/x86/kernel/apic/apic.c:1055) > >> [ 77.299835][ C17] sysvec_apic_timer_interrupt (arch/x86/kernel/apic/a= pic.c:1049 arch/x86/kernel/apic/apic.c:1049) > >> [ 77.305454][ C17] > >> [ 77.308382][ C17] > >> [ 77.311313][ C17] asm_sysvec_apic_timer_interrupt (arch/x86/include/a= sm/idtentry.h:702) > >> [ 77.317275][ C17] RIP: 0010:shmem_unuse (include/linux/list.h:119 inc= lude/linux/list.h:215 include/linux/list.h:287 mm/shmem.c:1497) > >> [ 77.322373][ C17] Code: 30 48 8d 6b 30 4c 8d 60 d0 48 81 fd 20 e3 26 = 83 0f 84 8a 00 00 00 48 83 7b 18 00 75 89 48 8b 43 38 48 8b 53 30 48 39 28 = 75 6c <48> 39 6a 08 75 66 48 89 42 08 48 89 10 48 89 6b 30 48 89 6b 38 4c > >> All code > >> =3D=3D=3D=3D=3D=3D=3D=3D > >> 0: 30 48 8d xor %cl,-0x73(%rax) > >> 3: 6b 30 4c imul $0x4c,(%rax),%esi > >> 6: 8d 60 d0 lea -0x30(%rax),%esp > >> 9: 48 81 fd 20 e3 26 83 cmp $0xffffffff8326e320,%rbp > >> 10: 0f 84 8a 00 00 00 je 0xa0 > >> 16: 48 83 7b 18 00 cmpq $0x0,0x18(%rbx) > >> 1b: 75 89 jne 0xffffffffffffffa6 > >> 1d: 48 8b 43 38 mov 0x38(%rbx),%rax > >> 21: 48 8b 53 30 mov 0x30(%rbx),%rdx > >> 25: 48 39 28 cmp %rbp,(%rax) > >> 28: 75 6c jne 0x96 > >> 2a:* 48 39 6a 08 cmp %rbp,0x8(%rdx) <-- tr= apping instruction > >> 2e: 75 66 jne 0x96 > >> 30: 48 89 42 08 mov %rax,0x8(%rdx) > >> 34: 48 89 10 mov %rdx,(%rax) > >> 37: 48 89 6b 30 mov %rbp,0x30(%rbx) > >> 3b: 48 89 6b 38 mov %rbp,0x38(%rbx) > >> 3f: 4c rex.WR > >> > >> > >> The kernel config and materials to reproduce are available at: > >> https://download.01.org/0day-ci/archive/20250516/202505161438.9009cf47= -lkp@intel.com > >> > >> > >> > >> -- > >> 0-DAY CI Kernel Test Service > >> https://github.com/intel/lkp-tests/wiki > >> > >> > > > > It seems this is exactly the problem "mm/shmem: Fix potential dead > > loop in shmem_unuse" trying to fix? > > https://lore.kernel.org/linux-mm/20250515154758.956521-4-shikemeng@huaw= eicloud.com/ > > > > Hi Kemeng, > > > > Can you help take a look? And maybe add a "Closes: > > https://lore.kernel.org/oe-lkp/202505161438.9009cf47-lkp@intel.com" in > > your patch? > > > > BTW I also just found there is another potential problem with shmem, > > the "list_del_init(&info->swaplist);" in shmem_writepage could lead to > > leaked shmem inode on swapoff, it's unrelated to this problem though, > > I can fix it posting one patch for that after double checking. > > > Hi Kairui, > The "list_del_init(&info->swaplist);" issue is also fixed in my patchset,= see > https://lore.kernel.org/linux-mm/20250515154758.956521-5-shikemeng@huawei= cloud.com/. > This issue could also lead to a dead loop. > > I have not taken a deep look, but I think the "list_del_init(&info->swapl= ist);" issue > is more likely to blame as it happens to be introduced in > b487a2da3575b6cdfb6d6559311830c8fea70bb9 ("mm, swap: simplify folio swap= allocation"). > > With these information helps. Yes, thanks, I think you can add a Closes for this report, with both patches applied this should be fixed just fine.