* Re: [syzbot] [bpf?] KASAN: vmalloc-out-of-bounds Write in vrealloc_noprof (2)
[not found] <68213ddf.050a0220.f2294.0045.GAE@google.com>
@ 2025-05-12 22:51 ` Andrii Nakryiko
2025-05-13 8:13 ` Dmitry Vyukov
0 siblings, 1 reply; 3+ messages in thread
From: Andrii Nakryiko @ 2025-05-12 22:51 UTC (permalink / raw)
To: syzbot, Linux Memory Management List
Cc: andrii, ast, bpf, daniel, eddyz87, haoluo, john.fastabend, jolsa,
kpsingh, linux-kernel, martin.lau, sdf, song, syzkaller-bugs,
yonghong.song
On Sun, May 11, 2025 at 5:16 PM syzbot
<syzbot+659fcc0678e5a1193143@syzkaller.appspotmail.com> wrote:
>
> Hello,
>
> syzbot found the following issue on:
>
> HEAD commit: 707df3375124 Merge tag 'media/v6.15-2' of git://git.kernel..
> git tree: upstream
> console output: https://syzkaller.appspot.com/x/log.txt?x=16b1b2bc580000
> kernel config: https://syzkaller.appspot.com/x/.config?x=91c351a0f6229e67
> dashboard link: https://syzkaller.appspot.com/bug?extid=659fcc0678e5a1193143
> compiler: Debian clang version 20.1.2 (++20250402124445+58df0ef89dd6-1~exp1~20250402004600.97), Debian LLD 20.1.2
>
> Unfortunately, I don't have any reproducer for this issue yet.
>
> Downloadable assets:
> disk image (non-bootable): https://storage.googleapis.com/syzbot-assets/d900f083ada3/non_bootable_disk-707df337.raw.xz
> vmlinux: https://storage.googleapis.com/syzbot-assets/bc3944720ea5/vmlinux-707df337.xz
> kernel image: https://storage.googleapis.com/syzbot-assets/7bc2f45ae23f/bzImage-707df337.xz
>
> IMPORTANT: if you fix the issue, please add the following tag to the commit:
> Reported-by: syzbot+659fcc0678e5a1193143@syzkaller.appspotmail.com
>
> syz.0.0 uses obsolete (PF_INET,SOCK_PACKET)
> ==================================================================
> BUG: KASAN: vmalloc-out-of-bounds in vrealloc_noprof+0x396/0x430 mm/vmalloc.c:4093
> Write of size 4064 at addr ffffc9000efa1020 by task syz.0.0/5317
>
A while back I sent a fix for kasan handling of vrealloc ([0]), but
this issue came back even with my changes in [0]. Can anyone from mm
side take a look at vrealloc_noprof() and see if we are missing
anything else to convince KASAN that we are using vrealloc()
correctly?
Seems like kasan_poison_vmalloc() + kasan_unpoison_vmalloc() dance
isn't covering all cases? Or am I missing something? It's doubtful
that there is any BPF-side bug in using kvrealloc().
[0] https://lore.kernel.org/linux-mm/20241126005206.3457974-1-andrii@kernel.org/
> CPU: 0 UID: 0 PID: 5317 Comm: syz.0.0 Not tainted 6.15.0-rc5-syzkaller-00038-g707df3375124 #0 PREEMPT(full)
> Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
> Call Trace:
> <TASK>
> dump_stack_lvl+0x189/0x250 lib/dump_stack.c:120
> print_address_description mm/kasan/report.c:408 [inline]
> print_report+0xb4/0x290 mm/kasan/report.c:521
> kasan_report+0x118/0x150 mm/kasan/report.c:634
> check_region_inline mm/kasan/generic.c:-1 [inline]
> kasan_check_range+0x29a/0x2b0 mm/kasan/generic.c:189
> __asan_memset+0x22/0x50 mm/kasan/shadow.c:84
> vrealloc_noprof+0x396/0x430 mm/vmalloc.c:4093
> push_insn_history+0x184/0x650 kernel/bpf/verifier.c:3874
> do_check+0x597/0xd630 kernel/bpf/verifier.c:19450
> do_check_common+0x168d/0x20b0 kernel/bpf/verifier.c:22776
> do_check_main kernel/bpf/verifier.c:22867 [inline]
> bpf_check+0x13679/0x19a70 kernel/bpf/verifier.c:24033
> bpf_prog_load+0x1318/0x1930 kernel/bpf/syscall.c:2971
> __sys_bpf+0x5f1/0x860 kernel/bpf/syscall.c:5834
> __do_sys_bpf kernel/bpf/syscall.c:5941 [inline]
> __se_sys_bpf kernel/bpf/syscall.c:5939 [inline]
> __x64_sys_bpf+0x7c/0x90 kernel/bpf/syscall.c:5939
> do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> do_syscall_64+0xf6/0x210 arch/x86/entry/syscall_64.c:94
> entry_SYSCALL_64_after_hwframe+0x77/0x7f
> RIP: 0033:0x7f649c58e969
> Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
> RSP: 002b:00007f649d4dd038 EFLAGS: 00000246 ORIG_RAX: 0000000000000141
> RAX: ffffffffffffffda RBX: 00007f649c7b5fa0 RCX: 00007f649c58e969
> RDX: 0000000000000048 RSI: 00002000000017c0 RDI: 0000000000000005
> RBP: 00007f649c610ab1 R08: 0000000000000000 R09: 0000000000000000
> R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
> R13: 0000000000000000 R14: 00007f649c7b5fa0 R15: 00007fff542287e8
> </TASK>
>
> The buggy address belongs to the virtual mapping at
> [ffffc9000ef81000, ffffc9000efa3000) created by:
> kvrealloc_noprof+0x82/0xe0 mm/slub.c:5109
>
> The buggy address belongs to the physical page:
> page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x3ffd0 pfn:0x3efe5
> flags: 0x4fff00000000000(node=1|zone=1|lastcpupid=0x7ff)
> raw: 04fff00000000000 0000000000000000 dead000000000122 0000000000000000
> raw: 000000000003ffd0 0000000000000000 00000001ffffffff 0000000000000000
> page dumped because: kasan: bad access detected
> page_owner tracks the page as allocated
> page last allocated via order 0, migratetype Unmovable, gfp_mask 0x102cc2(GFP_HIGHUSER|__GFP_NOWARN), pid 5317, tgid 5316 (syz.0.0), ts 82587533383, free_ts 81110216781
> set_page_owner include/linux/page_owner.h:32 [inline]
> post_alloc_hook+0x1d8/0x230 mm/page_alloc.c:1718
> prep_new_page mm/page_alloc.c:1726 [inline]
> get_page_from_freelist+0x21ce/0x22b0 mm/page_alloc.c:3688
> __alloc_pages_slowpath+0x2fe/0xcc0 mm/page_alloc.c:4509
> __alloc_frozen_pages_noprof+0x319/0x370 mm/page_alloc.c:4983
> alloc_pages_mpol+0x232/0x4a0 mm/mempolicy.c:2301
> alloc_frozen_pages_noprof mm/mempolicy.c:2372 [inline]
> alloc_pages_noprof+0xa9/0x190 mm/mempolicy.c:2392
> vm_area_alloc_pages mm/vmalloc.c:3591 [inline]
> __vmalloc_area_node mm/vmalloc.c:3669 [inline]
> __vmalloc_node_range_noprof+0x8fe/0x12c0 mm/vmalloc.c:3844
> __kvmalloc_node_noprof+0x3a0/0x5e0 mm/slub.c:5034
> kvrealloc_noprof+0x82/0xe0 mm/slub.c:5109
> push_insn_history+0x184/0x650 kernel/bpf/verifier.c:3874
> do_check+0x597/0xd630 kernel/bpf/verifier.c:19450
> do_check_common+0x168d/0x20b0 kernel/bpf/verifier.c:22776
> do_check_main kernel/bpf/verifier.c:22867 [inline]
> bpf_check+0x13679/0x19a70 kernel/bpf/verifier.c:24033
> bpf_prog_load+0x1318/0x1930 kernel/bpf/syscall.c:2971
> __sys_bpf+0x5f1/0x860 kernel/bpf/syscall.c:5834
> __do_sys_bpf kernel/bpf/syscall.c:5941 [inline]
> __se_sys_bpf kernel/bpf/syscall.c:5939 [inline]
> __x64_sys_bpf+0x7c/0x90 kernel/bpf/syscall.c:5939
> page last free pid 82 tgid 82 stack trace:
> reset_page_owner include/linux/page_owner.h:25 [inline]
> free_pages_prepare mm/page_alloc.c:1262 [inline]
> free_unref_folios+0xb81/0x14a0 mm/page_alloc.c:2782
> shrink_folio_list+0x3053/0x4e90 mm/vmscan.c:1552
> evict_folios+0x417b/0x5110 mm/vmscan.c:4698
> try_to_shrink_lruvec+0x705/0x990 mm/vmscan.c:4859
> shrink_one+0x21b/0x7c0 mm/vmscan.c:4904
> shrink_many mm/vmscan.c:4967 [inline]
> lru_gen_shrink_node mm/vmscan.c:5045 [inline]
> shrink_node+0x3139/0x3750 mm/vmscan.c:6016
> kswapd_shrink_node mm/vmscan.c:6867 [inline]
> balance_pgdat mm/vmscan.c:7050 [inline]
> kswapd+0x1675/0x2970 mm/vmscan.c:7315
> kthread+0x70e/0x8a0 kernel/kthread.c:464
> ret_from_fork+0x4b/0x80 arch/x86/kernel/process.c:153
> ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:245
>
> Memory state around the buggy address:
> ffffc9000efa0f00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> ffffc9000efa0f80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >ffffc9000efa1000: 00 00 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> ^
> ffffc9000efa1080: f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> ffffc9000efa1100: f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> ==================================================================
>
>
> ---
> This report is generated by a bot. It may contain errors.
> See https://goo.gl/tpsmEJ for more information about syzbot.
> syzbot engineers can be reached at syzkaller@googlegroups.com.
>
> syzbot will keep track of this issue. See:
> https://goo.gl/tpsmEJ#status for how to communicate with syzbot.
>
> If the report is already addressed, let syzbot know by replying with:
> #syz fix: exact-commit-title
>
> If you want to overwrite report's subsystems, reply with:
> #syz set subsystems: new-subsystem
> (See the list of subsystem names on the web dashboard)
>
> If the report is a duplicate of another one, reply with:
> #syz dup: exact-subject-of-another-report
>
> If you want to undo deduplication, reply with:
> #syz undup
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [syzbot] [bpf?] KASAN: vmalloc-out-of-bounds Write in vrealloc_noprof (2)
2025-05-12 22:51 ` [syzbot] [bpf?] KASAN: vmalloc-out-of-bounds Write in vrealloc_noprof (2) Andrii Nakryiko
@ 2025-05-13 8:13 ` Dmitry Vyukov
2025-05-13 16:20 ` Andrii Nakryiko
0 siblings, 1 reply; 3+ messages in thread
From: Dmitry Vyukov @ 2025-05-13 8:13 UTC (permalink / raw)
To: Andrii Nakryiko
Cc: syzbot, Linux Memory Management List, andrii, ast, bpf, daniel,
eddyz87, haoluo, john.fastabend, jolsa, kpsingh, linux-kernel,
martin.lau, sdf, song, syzkaller-bugs, yonghong.song
On Tue, 13 May 2025 at 00:52, Andrii Nakryiko <andrii.nakryiko@gmail.com> wrote:
>
> On Sun, May 11, 2025 at 5:16 PM syzbot
> <syzbot+659fcc0678e5a1193143@syzkaller.appspotmail.com> wrote:
> >
> > Hello,
> >
> > syzbot found the following issue on:
> >
> > HEAD commit: 707df3375124 Merge tag 'media/v6.15-2' of git://git.kernel..
> > git tree: upstream
> > console output: https://syzkaller.appspot.com/x/log.txt?x=16b1b2bc580000
> > kernel config: https://syzkaller.appspot.com/x/.config?x=91c351a0f6229e67
> > dashboard link: https://syzkaller.appspot.com/bug?extid=659fcc0678e5a1193143
> > compiler: Debian clang version 20.1.2 (++20250402124445+58df0ef89dd6-1~exp1~20250402004600.97), Debian LLD 20.1.2
> >
> > Unfortunately, I don't have any reproducer for this issue yet.
> >
> > Downloadable assets:
> > disk image (non-bootable): https://storage.googleapis.com/syzbot-assets/d900f083ada3/non_bootable_disk-707df337.raw.xz
> > vmlinux: https://storage.googleapis.com/syzbot-assets/bc3944720ea5/vmlinux-707df337.xz
> > kernel image: https://storage.googleapis.com/syzbot-assets/7bc2f45ae23f/bzImage-707df337.xz
> >
> > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > Reported-by: syzbot+659fcc0678e5a1193143@syzkaller.appspotmail.com
> >
> > syz.0.0 uses obsolete (PF_INET,SOCK_PACKET)
> > ==================================================================
> > BUG: KASAN: vmalloc-out-of-bounds in vrealloc_noprof+0x396/0x430 mm/vmalloc.c:4093
> > Write of size 4064 at addr ffffc9000efa1020 by task syz.0.0/5317
> >
>
> A while back I sent a fix for kasan handling of vrealloc ([0]), but
> this issue came back even with my changes in [0]. Can anyone from mm
> side take a look at vrealloc_noprof() and see if we are missing
> anything else to convince KASAN that we are using vrealloc()
> correctly?
>
> Seems like kasan_poison_vmalloc() + kasan_unpoison_vmalloc() dance
> isn't covering all cases? Or am I missing something? It's doubtful
> that there is any BPF-side bug in using kvrealloc().
>
> [0] https://lore.kernel.org/linux-mm/20241126005206.3457974-1-andrii@kernel.org/
Hi Andrii,
The report flags the very memset that's visible in this patch chunk, right?
https://lore.kernel.org/linux-mm/20241126005206.3457974-1-andrii@kernel.org/
Unless I am missing something obvious, the unpoison is added _after_
the memset, so it can't help. The unpoison should be done _before_ the
memset.
> > CPU: 0 UID: 0 PID: 5317 Comm: syz.0.0 Not tainted 6.15.0-rc5-syzkaller-00038-g707df3375124 #0 PREEMPT(full)
> > Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
> > Call Trace:
> > <TASK>
> > dump_stack_lvl+0x189/0x250 lib/dump_stack.c:120
> > print_address_description mm/kasan/report.c:408 [inline]
> > print_report+0xb4/0x290 mm/kasan/report.c:521
> > kasan_report+0x118/0x150 mm/kasan/report.c:634
> > check_region_inline mm/kasan/generic.c:-1 [inline]
> > kasan_check_range+0x29a/0x2b0 mm/kasan/generic.c:189
> > __asan_memset+0x22/0x50 mm/kasan/shadow.c:84
> > vrealloc_noprof+0x396/0x430 mm/vmalloc.c:4093
> > push_insn_history+0x184/0x650 kernel/bpf/verifier.c:3874
> > do_check+0x597/0xd630 kernel/bpf/verifier.c:19450
> > do_check_common+0x168d/0x20b0 kernel/bpf/verifier.c:22776
> > do_check_main kernel/bpf/verifier.c:22867 [inline]
> > bpf_check+0x13679/0x19a70 kernel/bpf/verifier.c:24033
> > bpf_prog_load+0x1318/0x1930 kernel/bpf/syscall.c:2971
> > __sys_bpf+0x5f1/0x860 kernel/bpf/syscall.c:5834
> > __do_sys_bpf kernel/bpf/syscall.c:5941 [inline]
> > __se_sys_bpf kernel/bpf/syscall.c:5939 [inline]
> > __x64_sys_bpf+0x7c/0x90 kernel/bpf/syscall.c:5939
> > do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > do_syscall_64+0xf6/0x210 arch/x86/entry/syscall_64.c:94
> > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > RIP: 0033:0x7f649c58e969
> > Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
> > RSP: 002b:00007f649d4dd038 EFLAGS: 00000246 ORIG_RAX: 0000000000000141
> > RAX: ffffffffffffffda RBX: 00007f649c7b5fa0 RCX: 00007f649c58e969
> > RDX: 0000000000000048 RSI: 00002000000017c0 RDI: 0000000000000005
> > RBP: 00007f649c610ab1 R08: 0000000000000000 R09: 0000000000000000
> > R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
> > R13: 0000000000000000 R14: 00007f649c7b5fa0 R15: 00007fff542287e8
> > </TASK>
> >
> > The buggy address belongs to the virtual mapping at
> > [ffffc9000ef81000, ffffc9000efa3000) created by:
> > kvrealloc_noprof+0x82/0xe0 mm/slub.c:5109
> >
> > The buggy address belongs to the physical page:
> > page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x3ffd0 pfn:0x3efe5
> > flags: 0x4fff00000000000(node=1|zone=1|lastcpupid=0x7ff)
> > raw: 04fff00000000000 0000000000000000 dead000000000122 0000000000000000
> > raw: 000000000003ffd0 0000000000000000 00000001ffffffff 0000000000000000
> > page dumped because: kasan: bad access detected
> > page_owner tracks the page as allocated
> > page last allocated via order 0, migratetype Unmovable, gfp_mask 0x102cc2(GFP_HIGHUSER|__GFP_NOWARN), pid 5317, tgid 5316 (syz.0.0), ts 82587533383, free_ts 81110216781
> > set_page_owner include/linux/page_owner.h:32 [inline]
> > post_alloc_hook+0x1d8/0x230 mm/page_alloc.c:1718
> > prep_new_page mm/page_alloc.c:1726 [inline]
> > get_page_from_freelist+0x21ce/0x22b0 mm/page_alloc.c:3688
> > __alloc_pages_slowpath+0x2fe/0xcc0 mm/page_alloc.c:4509
> > __alloc_frozen_pages_noprof+0x319/0x370 mm/page_alloc.c:4983
> > alloc_pages_mpol+0x232/0x4a0 mm/mempolicy.c:2301
> > alloc_frozen_pages_noprof mm/mempolicy.c:2372 [inline]
> > alloc_pages_noprof+0xa9/0x190 mm/mempolicy.c:2392
> > vm_area_alloc_pages mm/vmalloc.c:3591 [inline]
> > __vmalloc_area_node mm/vmalloc.c:3669 [inline]
> > __vmalloc_node_range_noprof+0x8fe/0x12c0 mm/vmalloc.c:3844
> > __kvmalloc_node_noprof+0x3a0/0x5e0 mm/slub.c:5034
> > kvrealloc_noprof+0x82/0xe0 mm/slub.c:5109
> > push_insn_history+0x184/0x650 kernel/bpf/verifier.c:3874
> > do_check+0x597/0xd630 kernel/bpf/verifier.c:19450
> > do_check_common+0x168d/0x20b0 kernel/bpf/verifier.c:22776
> > do_check_main kernel/bpf/verifier.c:22867 [inline]
> > bpf_check+0x13679/0x19a70 kernel/bpf/verifier.c:24033
> > bpf_prog_load+0x1318/0x1930 kernel/bpf/syscall.c:2971
> > __sys_bpf+0x5f1/0x860 kernel/bpf/syscall.c:5834
> > __do_sys_bpf kernel/bpf/syscall.c:5941 [inline]
> > __se_sys_bpf kernel/bpf/syscall.c:5939 [inline]
> > __x64_sys_bpf+0x7c/0x90 kernel/bpf/syscall.c:5939
> > page last free pid 82 tgid 82 stack trace:
> > reset_page_owner include/linux/page_owner.h:25 [inline]
> > free_pages_prepare mm/page_alloc.c:1262 [inline]
> > free_unref_folios+0xb81/0x14a0 mm/page_alloc.c:2782
> > shrink_folio_list+0x3053/0x4e90 mm/vmscan.c:1552
> > evict_folios+0x417b/0x5110 mm/vmscan.c:4698
> > try_to_shrink_lruvec+0x705/0x990 mm/vmscan.c:4859
> > shrink_one+0x21b/0x7c0 mm/vmscan.c:4904
> > shrink_many mm/vmscan.c:4967 [inline]
> > lru_gen_shrink_node mm/vmscan.c:5045 [inline]
> > shrink_node+0x3139/0x3750 mm/vmscan.c:6016
> > kswapd_shrink_node mm/vmscan.c:6867 [inline]
> > balance_pgdat mm/vmscan.c:7050 [inline]
> > kswapd+0x1675/0x2970 mm/vmscan.c:7315
> > kthread+0x70e/0x8a0 kernel/kthread.c:464
> > ret_from_fork+0x4b/0x80 arch/x86/kernel/process.c:153
> > ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:245
> >
> > Memory state around the buggy address:
> > ffffc9000efa0f00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > ffffc9000efa0f80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > >ffffc9000efa1000: 00 00 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> > ^
> > ffffc9000efa1080: f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> > ffffc9000efa1100: f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> > ==================================================================
> >
> >
> > ---
> > This report is generated by a bot. It may contain errors.
> > See https://goo.gl/tpsmEJ for more information about syzbot.
> > syzbot engineers can be reached at syzkaller@googlegroups.com.
> >
> > syzbot will keep track of this issue. See:
> > https://goo.gl/tpsmEJ#status for how to communicate with syzbot.
> >
> > If the report is already addressed, let syzbot know by replying with:
> > #syz fix: exact-commit-title
> >
> > If you want to overwrite report's subsystems, reply with:
> > #syz set subsystems: new-subsystem
> > (See the list of subsystem names on the web dashboard)
> >
> > If the report is a duplicate of another one, reply with:
> > #syz dup: exact-subject-of-another-report
> >
> > If you want to undo deduplication, reply with:
> > #syz undup
>
> --
> You received this message because you are subscribed to the Google Groups "syzkaller-bugs" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to syzkaller-bugs+unsubscribe@googlegroups.com.
> To view this discussion visit https://groups.google.com/d/msgid/syzkaller-bugs/CAEf4BzbsmHonD-G45-Jo8RQHPjDYEz-Nwx0MGtsk427tgsqGkg%40mail.gmail.com.
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [syzbot] [bpf?] KASAN: vmalloc-out-of-bounds Write in vrealloc_noprof (2)
2025-05-13 8:13 ` Dmitry Vyukov
@ 2025-05-13 16:20 ` Andrii Nakryiko
0 siblings, 0 replies; 3+ messages in thread
From: Andrii Nakryiko @ 2025-05-13 16:20 UTC (permalink / raw)
To: Dmitry Vyukov
Cc: syzbot, Linux Memory Management List, andrii, ast, bpf, daniel,
eddyz87, haoluo, john.fastabend, jolsa, kpsingh, linux-kernel,
martin.lau, sdf, song, syzkaller-bugs, yonghong.song
On Tue, May 13, 2025 at 1:13 AM Dmitry Vyukov <dvyukov@google.com> wrote:
>
> On Tue, 13 May 2025 at 00:52, Andrii Nakryiko <andrii.nakryiko@gmail.com> wrote:
> >
> > On Sun, May 11, 2025 at 5:16 PM syzbot
> > <syzbot+659fcc0678e5a1193143@syzkaller.appspotmail.com> wrote:
> > >
> > > Hello,
> > >
> > > syzbot found the following issue on:
> > >
> > > HEAD commit: 707df3375124 Merge tag 'media/v6.15-2' of git://git.kernel..
> > > git tree: upstream
> > > console output: https://syzkaller.appspot.com/x/log.txt?x=16b1b2bc580000
> > > kernel config: https://syzkaller.appspot.com/x/.config?x=91c351a0f6229e67
> > > dashboard link: https://syzkaller.appspot.com/bug?extid=659fcc0678e5a1193143
> > > compiler: Debian clang version 20.1.2 (++20250402124445+58df0ef89dd6-1~exp1~20250402004600.97), Debian LLD 20.1.2
> > >
> > > Unfortunately, I don't have any reproducer for this issue yet.
> > >
> > > Downloadable assets:
> > > disk image (non-bootable): https://storage.googleapis.com/syzbot-assets/d900f083ada3/non_bootable_disk-707df337.raw.xz
> > > vmlinux: https://storage.googleapis.com/syzbot-assets/bc3944720ea5/vmlinux-707df337.xz
> > > kernel image: https://storage.googleapis.com/syzbot-assets/7bc2f45ae23f/bzImage-707df337.xz
> > >
> > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > Reported-by: syzbot+659fcc0678e5a1193143@syzkaller.appspotmail.com
> > >
> > > syz.0.0 uses obsolete (PF_INET,SOCK_PACKET)
> > > ==================================================================
> > > BUG: KASAN: vmalloc-out-of-bounds in vrealloc_noprof+0x396/0x430 mm/vmalloc.c:4093
> > > Write of size 4064 at addr ffffc9000efa1020 by task syz.0.0/5317
> > >
> >
> > A while back I sent a fix for kasan handling of vrealloc ([0]), but
> > this issue came back even with my changes in [0]. Can anyone from mm
> > side take a look at vrealloc_noprof() and see if we are missing
> > anything else to convince KASAN that we are using vrealloc()
> > correctly?
> >
> > Seems like kasan_poison_vmalloc() + kasan_unpoison_vmalloc() dance
> > isn't covering all cases? Or am I missing something? It's doubtful
> > that there is any BPF-side bug in using kvrealloc().
> >
> > [0] https://lore.kernel.org/linux-mm/20241126005206.3457974-1-andrii@kernel.org/
>
> Hi Andrii,
>
> The report flags the very memset that's visible in this patch chunk, right?
> https://lore.kernel.org/linux-mm/20241126005206.3457974-1-andrii@kernel.org/
> Unless I am missing something obvious, the unpoison is added _after_
> the memset, so it can't help. The unpoison should be done _before_ the
> memset.
So that's the case when we realloc to a size that's smaller than
previously alloc'ed vma. So presumably the previous allocation should
have unpoisoned that. But I think you are right, there is a disconnect
between requested size of allocation (which doesn't have to be a
multiple of PAGE_SIZE), and actual page size-aligned VMA size. We
don't seem to keep track of the original requested memory size.
So yes, a simple "fix" would be to temporarily unpoison and memset.
I'll send a patch, don't know if mm/kasan folks would have any better
suggestions. Thanks for suggestion, Dmitry!
diff --git a/mm/vmalloc.c b/mm/vmalloc.c
index 3ed720a787ec..93b4c1758498 100644
--- a/mm/vmalloc.c
+++ b/mm/vmalloc.c
@@ -4089,8 +4089,11 @@ void *vrealloc_noprof(const void *p, size_t
size, gfp_t flags)
*/
if (size <= old_size) {
/* Zero out spare memory. */
- if (want_init_on_alloc(flags))
+ if (want_init_on_alloc(flags)) {
+ kasan_unpoison_vmalloc(p + size, old_size - size,
+ KASAN_VMALLOC_PROT_NORMAL);
memset((void *)p + size, 0, old_size - size);
+ }
kasan_poison_vmalloc(p + size, old_size - size);
kasan_unpoison_vmalloc(p, size, KASAN_VMALLOC_PROT_NORMAL);
return (void *)p;
(note, the diff formatting will be butchered courtesy of gmail, so
don't try to actually apply that)
>
>
> > > CPU: 0 UID: 0 PID: 5317 Comm: syz.0.0 Not tainted 6.15.0-rc5-syzkaller-00038-g707df3375124 #0 PREEMPT(full)
> > > Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
> > > Call Trace:
> > > <TASK>
> > > dump_stack_lvl+0x189/0x250 lib/dump_stack.c:120
> > > print_address_description mm/kasan/report.c:408 [inline]
> > > print_report+0xb4/0x290 mm/kasan/report.c:521
> > > kasan_report+0x118/0x150 mm/kasan/report.c:634
> > > check_region_inline mm/kasan/generic.c:-1 [inline]
> > > kasan_check_range+0x29a/0x2b0 mm/kasan/generic.c:189
> > > __asan_memset+0x22/0x50 mm/kasan/shadow.c:84
> > > vrealloc_noprof+0x396/0x430 mm/vmalloc.c:4093
> > > push_insn_history+0x184/0x650 kernel/bpf/verifier.c:3874
> > > do_check+0x597/0xd630 kernel/bpf/verifier.c:19450
> > > do_check_common+0x168d/0x20b0 kernel/bpf/verifier.c:22776
> > > do_check_main kernel/bpf/verifier.c:22867 [inline]
> > > bpf_check+0x13679/0x19a70 kernel/bpf/verifier.c:24033
> > > bpf_prog_load+0x1318/0x1930 kernel/bpf/syscall.c:2971
> > > __sys_bpf+0x5f1/0x860 kernel/bpf/syscall.c:5834
> > > __do_sys_bpf kernel/bpf/syscall.c:5941 [inline]
> > > __se_sys_bpf kernel/bpf/syscall.c:5939 [inline]
> > > __x64_sys_bpf+0x7c/0x90 kernel/bpf/syscall.c:5939
> > > do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
> > > do_syscall_64+0xf6/0x210 arch/x86/entry/syscall_64.c:94
> > > entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > RIP: 0033:0x7f649c58e969
> > > Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
> > > RSP: 002b:00007f649d4dd038 EFLAGS: 00000246 ORIG_RAX: 0000000000000141
> > > RAX: ffffffffffffffda RBX: 00007f649c7b5fa0 RCX: 00007f649c58e969
> > > RDX: 0000000000000048 RSI: 00002000000017c0 RDI: 0000000000000005
> > > RBP: 00007f649c610ab1 R08: 0000000000000000 R09: 0000000000000000
> > > R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
> > > R13: 0000000000000000 R14: 00007f649c7b5fa0 R15: 00007fff542287e8
> > > </TASK>
> > >
> > > The buggy address belongs to the virtual mapping at
> > > [ffffc9000ef81000, ffffc9000efa3000) created by:
> > > kvrealloc_noprof+0x82/0xe0 mm/slub.c:5109
> > >
> > > The buggy address belongs to the physical page:
> > > page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x3ffd0 pfn:0x3efe5
> > > flags: 0x4fff00000000000(node=1|zone=1|lastcpupid=0x7ff)
> > > raw: 04fff00000000000 0000000000000000 dead000000000122 0000000000000000
> > > raw: 000000000003ffd0 0000000000000000 00000001ffffffff 0000000000000000
> > > page dumped because: kasan: bad access detected
> > > page_owner tracks the page as allocated
> > > page last allocated via order 0, migratetype Unmovable, gfp_mask 0x102cc2(GFP_HIGHUSER|__GFP_NOWARN), pid 5317, tgid 5316 (syz.0.0), ts 82587533383, free_ts 81110216781
> > > set_page_owner include/linux/page_owner.h:32 [inline]
> > > post_alloc_hook+0x1d8/0x230 mm/page_alloc.c:1718
> > > prep_new_page mm/page_alloc.c:1726 [inline]
> > > get_page_from_freelist+0x21ce/0x22b0 mm/page_alloc.c:3688
> > > __alloc_pages_slowpath+0x2fe/0xcc0 mm/page_alloc.c:4509
> > > __alloc_frozen_pages_noprof+0x319/0x370 mm/page_alloc.c:4983
> > > alloc_pages_mpol+0x232/0x4a0 mm/mempolicy.c:2301
> > > alloc_frozen_pages_noprof mm/mempolicy.c:2372 [inline]
> > > alloc_pages_noprof+0xa9/0x190 mm/mempolicy.c:2392
> > > vm_area_alloc_pages mm/vmalloc.c:3591 [inline]
> > > __vmalloc_area_node mm/vmalloc.c:3669 [inline]
> > > __vmalloc_node_range_noprof+0x8fe/0x12c0 mm/vmalloc.c:3844
> > > __kvmalloc_node_noprof+0x3a0/0x5e0 mm/slub.c:5034
> > > kvrealloc_noprof+0x82/0xe0 mm/slub.c:5109
> > > push_insn_history+0x184/0x650 kernel/bpf/verifier.c:3874
> > > do_check+0x597/0xd630 kernel/bpf/verifier.c:19450
> > > do_check_common+0x168d/0x20b0 kernel/bpf/verifier.c:22776
> > > do_check_main kernel/bpf/verifier.c:22867 [inline]
> > > bpf_check+0x13679/0x19a70 kernel/bpf/verifier.c:24033
> > > bpf_prog_load+0x1318/0x1930 kernel/bpf/syscall.c:2971
> > > __sys_bpf+0x5f1/0x860 kernel/bpf/syscall.c:5834
> > > __do_sys_bpf kernel/bpf/syscall.c:5941 [inline]
> > > __se_sys_bpf kernel/bpf/syscall.c:5939 [inline]
> > > __x64_sys_bpf+0x7c/0x90 kernel/bpf/syscall.c:5939
> > > page last free pid 82 tgid 82 stack trace:
> > > reset_page_owner include/linux/page_owner.h:25 [inline]
> > > free_pages_prepare mm/page_alloc.c:1262 [inline]
> > > free_unref_folios+0xb81/0x14a0 mm/page_alloc.c:2782
> > > shrink_folio_list+0x3053/0x4e90 mm/vmscan.c:1552
> > > evict_folios+0x417b/0x5110 mm/vmscan.c:4698
> > > try_to_shrink_lruvec+0x705/0x990 mm/vmscan.c:4859
> > > shrink_one+0x21b/0x7c0 mm/vmscan.c:4904
> > > shrink_many mm/vmscan.c:4967 [inline]
> > > lru_gen_shrink_node mm/vmscan.c:5045 [inline]
> > > shrink_node+0x3139/0x3750 mm/vmscan.c:6016
> > > kswapd_shrink_node mm/vmscan.c:6867 [inline]
> > > balance_pgdat mm/vmscan.c:7050 [inline]
> > > kswapd+0x1675/0x2970 mm/vmscan.c:7315
> > > kthread+0x70e/0x8a0 kernel/kthread.c:464
> > > ret_from_fork+0x4b/0x80 arch/x86/kernel/process.c:153
> > > ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:245
> > >
> > > Memory state around the buggy address:
> > > ffffc9000efa0f00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > > ffffc9000efa0f80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > > >ffffc9000efa1000: 00 00 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> > > ^
> > > ffffc9000efa1080: f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> > > ffffc9000efa1100: f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8 f8
> > > ==================================================================
> > >
> > >
> > > ---
> > > This report is generated by a bot. It may contain errors.
> > > See https://goo.gl/tpsmEJ for more information about syzbot.
> > > syzbot engineers can be reached at syzkaller@googlegroups.com.
> > >
> > > syzbot will keep track of this issue. See:
> > > https://goo.gl/tpsmEJ#status for how to communicate with syzbot.
> > >
> > > If the report is already addressed, let syzbot know by replying with:
> > > #syz fix: exact-commit-title
> > >
> > > If you want to overwrite report's subsystems, reply with:
> > > #syz set subsystems: new-subsystem
> > > (See the list of subsystem names on the web dashboard)
> > >
> > > If the report is a duplicate of another one, reply with:
> > > #syz dup: exact-subject-of-another-report
> > >
> > > If you want to undo deduplication, reply with:
> > > #syz undup
> >
> > --
> > You received this message because you are subscribed to the Google Groups "syzkaller-bugs" group.
> > To unsubscribe from this group and stop receiving emails from it, send an email to syzkaller-bugs+unsubscribe@googlegroups.com.
> > To view this discussion visit https://groups.google.com/d/msgid/syzkaller-bugs/CAEf4BzbsmHonD-G45-Jo8RQHPjDYEz-Nwx0MGtsk427tgsqGkg%40mail.gmail.com.
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2025-05-13 16:21 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
[not found] <68213ddf.050a0220.f2294.0045.GAE@google.com>
2025-05-12 22:51 ` [syzbot] [bpf?] KASAN: vmalloc-out-of-bounds Write in vrealloc_noprof (2) Andrii Nakryiko
2025-05-13 8:13 ` Dmitry Vyukov
2025-05-13 16:20 ` Andrii Nakryiko
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox