From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id F05DAC30653 for ; Thu, 4 Jul 2024 20:12:54 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 4DF3D6B0099; Thu, 4 Jul 2024 16:12:54 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 48F086B00C5; Thu, 4 Jul 2024 16:12:54 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 307B16B00C8; Thu, 4 Jul 2024 16:12:54 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0015.hostedemail.com [216.40.44.15]) by kanga.kvack.org (Postfix) with ESMTP id 115CE6B0099 for ; Thu, 4 Jul 2024 16:12:54 -0400 (EDT) Received: from smtpin02.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id BF60A161442 for ; Thu, 4 Jul 2024 20:12:53 +0000 (UTC) X-FDA: 82303168626.02.DDEF1D0 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by imf12.hostedemail.com (Postfix) with ESMTP id 00C5D4000F for ; Thu, 4 Jul 2024 20:12:50 +0000 (UTC) Authentication-Results: imf12.hostedemail.com; dkim=pass header.d=kernel.org header.s=k20201202 header.b=cGOz8Nxu; spf=pass (imf12.hostedemail.com: domain of hawk@kernel.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=hawk@kernel.org; dmarc=pass (policy=none) header.from=kernel.org ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1720123945; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=hYsSuv1IrdFpqM/l4jLk8VLOcnwfps5PLuoNvM4effA=; b=2P6IsH9DdIWGLNWkD291OZXcKotr8NQfMkpTu+UIlJmDeqN16DCCqKJUn4zCfL8MpQPGb1 F2xShsncHbsviLGQEzA7+5pnfQ9xG1xr/81tJzC/zhSomOviZV+gbNpsT3PjG7peOFscxK iFliUMnbjeL1JwJCSsQelgMFk+buqr4= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1720123945; a=rsa-sha256; cv=none; b=fkFUQj931oW4CPlMvjKlQ11jK4SiTNlgOkw3sHwxN8YDXi6lGkQdSwtjXAuPUjKON+xaYK tCpOWq698wHErRRh4SQSK2/EAPee0uToIXQoIuamGoxneuSagjxStwiPF0A0aT/8V+AW6p MePUNN/2xM5KIlL1pErzMm59hJv5e80= ARC-Authentication-Results: i=1; imf12.hostedemail.com; dkim=pass header.d=kernel.org header.s=k20201202 header.b=cGOz8Nxu; spf=pass (imf12.hostedemail.com: domain of hawk@kernel.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=hawk@kernel.org; dmarc=pass (policy=none) header.from=kernel.org Received: from smtp.kernel.org (transwarp.subspace.kernel.org [100.75.92.58]) by dfw.source.kernel.org (Postfix) with ESMTP id CCFE560BBA; Thu, 4 Jul 2024 20:12:49 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id D7682C3277B; Thu, 4 Jul 2024 20:12:46 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1720123969; bh=WW3qk8d9UMfloUEb6y9UsZPVb7ExeGyuGO0bhcs2Bqk=; h=Date:Subject:To:References:From:In-Reply-To:From; b=cGOz8Nxu+wJTMztZ48PWxT0Q9xHkEUGSUzWMaEi+FgDUSchNZ1b3vKLivGHKe/DFw N//M4PWrtBnbkP7JUPZCBS9APKgXgggQ7ib2V2MIpuAeJ1KrMBpX1KBzUGJkQGW/pg cbegEb6wlOZNK7O5JVr/pb/jkr2Ca4hraqef4p/HIWN/xaVWIenxvdU3WSyAJ37Y1I D+jx8vDGok5CgMx35anoCJQNYJQppiLeJE4/9mQ+f9Bz93KKks+jeH8+8u5r9nps35 5rdumcbAFMxJzKufPnanE+s6GiPg8FeNlcoWlmxk/8tjyKHTadFR+Kh4YV2+H7Vfc4 nqFUMkrDCYUlw== Message-ID: <95930836-5b56-4c40-b2a0-2ddd4a59ae74@kernel.org> Date: Thu, 4 Jul 2024 22:12:45 +0200 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [syzbot] [mm?] possible deadlock in __mmap_lock_do_trace_released To: syzbot , akpm@linux-foundation.org, cgroups@vger.kernel.org, hannes@cmpxchg.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-trace-kernel@vger.kernel.org, lizefan.x@bytedance.com, mathieu.desnoyers@efficios.com, mhiramat@kernel.org, netdev@vger.kernel.org, rostedt@goodmis.org, syzkaller-bugs@googlegroups.com, tj@kernel.org References: <0000000000002be09b061c483ea1@google.com> Content-Language: en-US From: Jesper Dangaard Brouer In-Reply-To: <0000000000002be09b061c483ea1@google.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Rspam-User: X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 00C5D4000F X-Stat-Signature: f8m17fdtz3efm7x4em98ker5wf45gxpd X-HE-Tag: 1720123970-35652 X-HE-Meta: U2FsdGVkX18XVUiqcxOxjsTKTixRYlwzol6d4Hd1nNaUG2FOpj+HTnIOiSeqksqIgOJVExdJ3BAgunMEqG/5Pq7NHlSx+1QHnacZwvrY4eSLFyJo1WKopqgBIJevbh+xFlyiqAfU0uDZFdYhBiJUsIb9dbSfCgqYU7n1Z+MShZMzHpxe+2SEK2AkvPqp1g92hQjNLCRS4cggCRiw55nzzMmja+GEQXtRSS/H5X3lx6Lka3VIlZgopK4TMdghd3oiGSvifXy9ObQQs6W1LUd/toVA4R3p/ZBsc+/lUAbIKdf1WZ+l+0hmSGzBvuwyBAWC5Ua/AACNTwh+OtPEYHRuKrghMCcFbjHDYwqSpxUMt7F9LeniZg/f7vnBnkoKaTLo0U8z0Z22EkU/bfcMEghhQaAzg2vxd/AEuKZN1wj8lhYjhto5XvDE6nsACMaPfhdY2PacXwj0PMIi3/g02nIIOGJz1V7qDinoKYJMU4uyNYdwktbeXk9lKmXxZ0jRuj1xc3w+IvT2F3L4GoG8jCe/MSaZQYm6kusBQ/OLfypYIWI9oNm1DqPDyoHDQYJV0cMkWSIdoXNNt5KfxqGEwIE6XWnI8bOKeESD3R7tdyPBbR/Lp5jAGbgKIR/gUdeG17JKn94pgKPYFhLegjggy6ndVUZe11pwJiGB39vQ0iqRMBIexUMdZeiE3OfbymDwd85BhLiiAcbjVp+YRhByP0e/seJzxi6v3rwxcRcP7fXMQeRroiR1b/vAW3mFKxL9oDplbg/HrvFt1l5hN9YISBQ6TmefdSvPCSHSY/KwCb9LuE47vVe4k1ZcEUmVBTaBybeGHwBXlulbBS2NsogKioLYhPgB3OY2Id3R+xTCRYlpcGzwMemuFtjJI/VFjXeCIZEqg9x5PdW7O/9WG7zTcwYdsuqJIfvQTibupQCb7BK9JzoEPaO4Qg5LcitFl5YvygOaiDafCVBECTlz2logd9g 3Oh2HhX3 x/r9LuZU/BmJ875MEQTpkXD8pPeSzVqkFeZyp4NMmLyEZFVBmb/o7wGcw1+W9BRHdJyOjFkybNgUySsWPFhXmEu26wSWcopsOqL0qM7CWTO+NuCiZTAD5pfE9UF4TWGE9uUNvNsNs0+iJlxtzu1Yj3elr+XvNZngg1FufheaAkepCafkBWfEdT5kz1xhViBExwwK2ESq5MjBavQSLfxGrM0zvtyDXrNdY9hEylCY6JaLHh0Xquc44V0Q3CIgRoTOHV7fWlRLB0J1tDe6vwPcSsu1WSyLU+ZsAgTA6OzHpPqJJqjNTEaZZxG1WIQYkRjoVhC7lAh65Skaqlg2DCDRkyRZOAAAlmUIqzTDF44jaGmNl6+ali3eOMdYMc2R4K9Tbz9k5wrAn45ympcZx6tdGDDPE/hVF/5/uyqZal+T31jZVTT+tqtOvqHc+wTcy7jzhZ79jIx3j209P4Db3MEDHfykVkC42FX4xRfnBRCz9/iWu3fISN0LXoFfK9DQW6pCS03ExOGLvcHL+qU2VzyhwTmBnemctAKK1yhbWaSxeTOspHMCpqqWVjvpvHTBg8VYM06iUTtku5WhNCI3szyVBsjgtIDN6pSdamzXZvsAuWIEZfEKvZGXFMfLUzrTt6guD6QARdkSWpv0itT7IMEyPdoMaOAR8slweBkRYuZ3V7tgvxR8bMnqyMdIN1xbf/t7opJ6gGnk2dOqS2ztU/JA1clv0ATxZdppPL65+gpqc7xdUrjRCQauosJAqT6+bw7V6QCiM+nPAuV6pmaua0Q9IOR2tO2DILLaOggyvzpV997LKJrYsFUMJmRsrOeEY9kDhE0/b6SOpDxZrOiBk48yHw+8ZZGL/RAbQP4bCG0mSlWya3xC2W75jB1NobNJw1hIMINuqAoMfnjIdM2mRDfQVlBtyjHr/C/mGjM99aI1xpoiSTm737bDhSItTx3kEy/Hmgyt4fWuj1f8ww+6fg5COHREMbVX7 PFkHWpNx Ha/knnxlvKW3rzVwMkbfP4JJ43Sgk+g0jW5CrUsJzmJQvOqVsDU94rExLY5S3GCFMOo3edXNSwkyWw/mPgxOflh3p5IzicNFuW6PPIzEJm1TdHKO2rIHxQ== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 02/07/2024 20.54, syzbot wrote: > Hello, > > syzbot found the following issue on: > > HEAD commit: a12978712d90 selftests/bpf: Move ARRAY_SIZE to bpf_misc.h > git tree: bpf-next > console+strace: https://syzkaller.appspot.com/x/log.txt?x=130457fa980000 > kernel config: https://syzkaller.appspot.com/x/.config?x=736daf12bd72e034 > dashboard link: https://syzkaller.appspot.com/bug?extid=16b6ab88e66b34d09014 > compiler: Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40 > syz repro: https://syzkaller.appspot.com/x/repro.syz?x=125718be980000 > C reproducer: https://syzkaller.appspot.com/x/repro.c?x=14528876980000 I cannot reproduce with reproducer on my testlab. (More below) > > Downloadable assets: > disk image: https://storage.googleapis.com/syzbot-assets/9d845a55bf58/disk-a1297871.raw.xz > vmlinux: https://storage.googleapis.com/syzbot-assets/12cb27bdb2de/vmlinux-a1297871.xz > kernel image: https://storage.googleapis.com/syzbot-assets/db09a1fa448c/bzImage-a1297871.xz > > The issue was bisected to: > > commit 21c38a3bd4ee3fb7337d013a638302fb5e5f9dc2 > Author: Jesper Dangaard Brouer > Date: Wed May 1 14:04:11 2024 +0000 > > cgroup/rstat: add cgroup_rstat_cpu_lock helpers and tracepoints > > bisection log: https://syzkaller.appspot.com/x/bisect.txt?x=14ecc085980000 > final oops: https://syzkaller.appspot.com/x/report.txt?x=16ecc085980000 > console output: https://syzkaller.appspot.com/x/log.txt?x=12ecc085980000 > > IMPORTANT: if you fix the issue, please add the following tag to the commit: > Reported-by: syzbot+16b6ab88e66b34d09014@syzkaller.appspotmail.com > Fixes: 21c38a3bd4ee ("cgroup/rstat: add cgroup_rstat_cpu_lock helpers and tracepoints") > > ============================================ > WARNING: possible recursive locking detected > 6.10.0-rc2-syzkaller-00797-ga12978712d90 #0 Not tainted > -------------------------------------------- > syz-executor646/5097 is trying to acquire lock: > ffff8880b94387e8 (lock#9){+.+.}-{2:2}, at: local_lock_acquire include/linux/local_lock_internal.h:29 [inline] > ffff8880b94387e8 (lock#9){+.+.}-{2:2}, at: __mmap_lock_do_trace_released+0x83/0x620 mm/mmap_lock.c:243 > > but task is already holding lock: > ffff8880b94387e8 (lock#9){+.+.}-{2:2}, at: local_lock_acquire include/linux/local_lock_internal.h:29 [inline] > ffff8880b94387e8 (lock#9){+.+.}-{2:2}, at: __mmap_lock_do_trace_released+0x83/0x620 mm/mmap_lock.c:243 > > other info that might help us debug this: > Possible unsafe locking scenario: > > CPU0 > ---- > lock(lock#9); > lock(lock#9); > > *** DEADLOCK *** > > May be due to missing lock nesting notation > To me, this looks like a lockdep false-positive, but I might be wrong. Could someone with more LOCKDEP knowledge give their interpretation? The commit[1] adds a fairly standard trylock scheme. Do I need to lockdep annotate trylock's in some special way? [1] https://git.kernel.org/torvalds/c/21c38a3bd4ee3fb733 Also notice change uses raw_spin_lock, which might be harder for lockdep? So, I also enabled CONFIG_PROVE_RAW_LOCK_NESTING in my testlab to help with this, and CONFIG_PROVE_LOCKING. (And obviously I also enabled LOCKDEP*) --Jesper > 5 locks held by syz-executor646/5097: > #0: ffff8880182eb118 (&mm->mmap_lock){++++}-{3:3}, at: mmap_read_lock include/linux/mmap_lock.h:144 [inline] > #0: ffff8880182eb118 (&mm->mmap_lock){++++}-{3:3}, at: acct_collect+0x1cf/0x830 kernel/acct.c:563 > #1: ffff8880b94387e8 (lock#9){+.+.}-{2:2}, at: local_lock_acquire include/linux/local_lock_internal.h:29 [inline] > #1: ffff8880b94387e8 (lock#9){+.+.}-{2:2}, at: __mmap_lock_do_trace_released+0x83/0x620 mm/mmap_lock.c:243 > #2: ffffffff8e333fa0 (rcu_read_lock){....}-{1:2}, at: rcu_lock_acquire include/linux/rcupdate.h:329 [inline] > #2: ffffffff8e333fa0 (rcu_read_lock){....}-{1:2}, at: rcu_read_lock include/linux/rcupdate.h:781 [inline] > #2: ffffffff8e333fa0 (rcu_read_lock){....}-{1:2}, at: get_memcg_path_buf mm/mmap_lock.c:139 [inline] > #2: ffffffff8e333fa0 (rcu_read_lock){....}-{1:2}, at: get_mm_memcg_path+0xb1/0x600 mm/mmap_lock.c:209 > #3: ffffffff8e333fa0 (rcu_read_lock){....}-{1:2}, at: trace_call_bpf+0xbc/0x8a0 > #4: ffff8880182eb118 (&mm->mmap_lock){++++}-{3:3}, at: mmap_read_trylock include/linux/mmap_lock.h:163 [inline] > #4: ffff8880182eb118 (&mm->mmap_lock){++++}-{3:3}, at: stack_map_get_build_id_offset+0x237/0x9d0 kernel/bpf/stackmap.c:141 > > stack backtrace: > CPU: 0 PID: 5097 Comm: syz-executor646 Not tainted 6.10.0-rc2-syzkaller-00797-ga12978712d90 #0 > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024 > Call Trace: > > __dump_stack lib/dump_stack.c:88 [inline] > dump_stack_lvl+0x241/0x360 lib/dump_stack.c:114 > check_deadlock kernel/locking/lockdep.c:3062 [inline] > validate_chain+0x15d3/0x5900 kernel/locking/lockdep.c:3856 > __lock_acquire+0x1346/0x1fd0 kernel/locking/lockdep.c:5137 > lock_acquire+0x1ed/0x550 kernel/locking/lockdep.c:5754 > local_lock_acquire include/linux/local_lock_internal.h:29 [inline] > __mmap_lock_do_trace_released+0x9c/0x620 mm/mmap_lock.c:243 > __mmap_lock_trace_released include/linux/mmap_lock.h:42 [inline] > mmap_read_unlock include/linux/mmap_lock.h:170 [inline] > bpf_mmap_unlock_mm kernel/bpf/mmap_unlock_work.h:52 [inline] > stack_map_get_build_id_offset+0x9c7/0x9d0 kernel/bpf/stackmap.c:173 > __bpf_get_stack+0x4ad/0x5a0 kernel/bpf/stackmap.c:449 > bpf_prog_e6cf5f9c69743609+0x42/0x46 > bpf_dispatcher_nop_func include/linux/bpf.h:1243 [inline] > __bpf_prog_run include/linux/filter.h:691 [inline] > bpf_prog_run include/linux/filter.h:698 [inline] > bpf_prog_run_array include/linux/bpf.h:2104 [inline] > trace_call_bpf+0x369/0x8a0 kernel/trace/bpf_trace.c:147 > perf_trace_run_bpf_submit+0x7c/0x1d0 kernel/events/core.c:10269 > perf_trace_mmap_lock+0x3d7/0x510 include/trace/events/mmap_lock.h:16 > trace_mmap_lock_released include/trace/events/mmap_lock.h:50 [inline] > __mmap_lock_do_trace_released+0x5bb/0x620 mm/mmap_lock.c:243 > __mmap_lock_trace_released include/linux/mmap_lock.h:42 [inline] > mmap_read_unlock include/linux/mmap_lock.h:170 [inline] > acct_collect+0x81d/0x830 kernel/acct.c:566 > do_exit+0x936/0x27e0 kernel/exit.c:853 > do_group_exit+0x207/0x2c0 kernel/exit.c:1023 > __do_sys_exit_group kernel/exit.c:1034 [inline] > __se_sys_exit_group kernel/exit.c:1032 [inline] > __x64_sys_exit_group+0x3f/0x40 kernel/exit.c:1032 > do_syscall_x64 arch/x86/entry/common.c:52 [inline] > do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83 > entry_SYSCALL_64_after_hwframe+0x77/0x7f > RIP: 0033:0x7f8fac26d039 > Code: 90 49 c7 c0 b8 ff ff ff be e7 00 00 00 ba 3c 00 00 00 eb 12 0f 1f 44 00 00 89 d0 0f 05 48 3d 00 f0 ff ff 77 1c f4 89 f0 0f 05 <48> 3d 00 f0 ff ff 76 e7 f7 d8 64 41 89 00 eb df 0f 1f 80 00 00 00 > RSP: 002b:00007ffd95d56e68 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7 > RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f8fac26d039 > RDX: 000000000000003c RSI: 00000000000000e7 RDI: 0000000000000000 > RBP: 00007f8fac2e82b0 R08: ffffffffffffffb8 R09: 00000000000000a0 > R10: 0000000000000000 R11: 0000000000000246 R12: 00007f8fac2e82b0 > R13: 0000000000000000 R14: 00007f8fac2e8d20 R15: 00007f8fac23e1e0 > > > > --- > This report is generated by a bot. It may contain errors. > See https://goo.gl/tpsmEJ for more information about syzbot. > syzbot engineers can be reached at syzkaller@googlegroups.com. > > syzbot will keep track of this issue. See: > https://goo.gl/tpsmEJ#status for how to communicate with syzbot. > For information about bisection process see: https://goo.gl/tpsmEJ#bisection > > If the report is already addressed, let syzbot know by replying with: > #syz fix: exact-commit-title > > If you want syzbot to run the reproducer, reply with: > #syz test: git://repo/address.git branch-or-commit-hash > If you attach or paste a git patch, syzbot will apply it before testing. > > If you want to overwrite report's subsystems, reply with: > #syz set subsystems: new-subsystem > (See the list of subsystem names on the web dashboard) > > If the report is a duplicate of another one, reply with: > #syz dup: exact-subject-of-another-report > > If you want to undo deduplication, reply with: > #syz undup