* Re: ❌ FAIL: Test report for kernel 5.14.0-rc1 (arm-next, e6f85cbe) [not found] <cki.552CEC4C85.JGTIJ5O0Q4@redhat.com> @ 2021-07-16 9:03 ` Will Deacon 2021-07-16 10:01 ` Jan Kara 0 siblings, 1 reply; 3+ messages in thread From: Will Deacon @ 2021-07-16 9:03 UTC (permalink / raw) To: CKI Project Cc: skt-results-master, catalin.marinas, linux-arm-kernel, Fendy Tjahjadi, Memory Management, Jan Stancek, guro, willy, tj, dennis, jack, linux-mm On Thu, Jul 15, 2021 at 11:55:34PM -0000, CKI Project wrote: > All kernel binaries, config files, and logs are available for download here: > > https://arr-cki-prod-datawarehouse-public.s3.amazonaws.com/index.html?prefix=datawarehouse-public/2021/07/15/337870374 > > One or more kernel tests failed: > > aarch64: > 💥 Podman system integration test - as user This is a NULL pointer dereference in the recently introduced cleanup_offline_cgwbs_workfn() function [c22d70a162d3 ("writeback, cgroup: release dying cgwbs by switching attached inodes")]. Adding some of those folks to CC. addr2line gives: | linux/./include/linux/atomic-arch-fallback.h:2265 | linux/./include/linux/atomic-arch-fallback.h:2290 | linux/./include/asm-generic/atomic-instrumented.h:1149 | linux/./include/asm-generic/atomic-long.h:491 | linux/percpu-refcount.h:247 | linux/percpu-refcount.h:266 | linux/backing-dev-defs.h:227 | linux/mm/backing-dev.c:679 and the crash (this is an arm64 machine) is: [ 4794.406516] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000 [ 4794.415307] Mem abort info: [ 4794.418088] ESR = 0x96000004 [ 4794.421130] EC = 0x25: DABT (current EL), IL = 32 bits [ 4794.426428] SET = 0, FnV = 0 [ 4794.429470] EA = 0, S1PTW = 0 [ 4794.432597] FSC = 0x04: level 0 translation fault [ 4794.437461] Data abort info: [ 4794.440329] ISV = 0, ISS = 0x00000004 [ 4794.444151] CM = 0, WnR = 0 [ 4794.447105] user pgtable: 4k pages, 48-bit VAs, pgdp=00000009072be000 [ 4794.453534] [0000000000000000] pgd=0000000000000000, p4d=0000000000000000 [ 4794.460312] Internal error: Oops: 96000004 [#1] SMP [ 4794.465179] Modules linked in: xt_nat xt_addrtype xt_mark nft_chain_nat xt_MASQUERADE nf_nat xt_comment veth bridge stp llc loop tun af_key crypto_user scsi_transport_iscsi xt_multiport ip_gre ip_tunnel gre overlay xt_CONNSECMARK xt_SECMARK nft_counter xt_state xt_conntrack nft_compat ah6 ah4 nft_objref nft_ct nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnetlink jfs sctp ip6_udp_tunnel udp_tunnel dm_log_writes dm_flakey rfkill mlx5_ib ib_uverbs ib_core sunrpc coresight_etm4x i2c_smbus coresight_replicator coresight_tpiu coresight_tmc joydev mlx5_core acpi_ipmi mlxfw ipmi_ssif psample ipmi_devintf ipmi_msghandler coresight_funnel coresight thunderx2_pmu vfat fat fuse zram ip_tables xfs crct10dif_ce ast ghash_ce i2c_algo_bit drm_vram_helper drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec drm_ttm_helper ttm drm gpio_xlp i2c_xlp9xx uas usb_storage aes_neon_bs [last unloaded: raid10] [ 4794.545404] CPU: 86 PID: 1121023 Comm: kworker/u513:9 Tainted: G OE 5.14.0-rc1 #1 [ 4794.554090] Hardware name: HPE Apollo 70 /C01_APACHE_MB , BIOS L50_5.13_1.15 05/08/2020 [ 4794.563817] Workqueue: events_unbound cleanup_offline_cgwbs_workfn [ 4794.569990] pstate: 004000c9 (nzcv daIF +PAN -UAO -TCO BTYPE=--) [ 4794.575985] pc : cleanup_offline_cgwbs_workfn+0x2d8/0x350 [ 4794.581372] lr : cleanup_offline_cgwbs_workfn+0xc8/0x350 [ 4794.586672] sp : ffff800043ae3d40 [ 4794.589973] x29: ffff800043ae3d40 x28: 0000000000000000 x27: 0000000000000001 [ 4794.597097] x26: 0000000000000000 x25: 00000000000000e0 x24: ffff800011f01438 [ 4794.604221] x23: ffff800043ae3db0 x22: ffff800011f01428 x21: ffff80001228c5f0 [ 4794.611345] x20: ffff000817299800 x19: ffff000817299a80 x18: 000000000000000e [ 4794.618469] x17: 0000000079c37e0f x16: 0000000000010000 x15: fffffc002084ac02 [ 4794.625593] x14: 0000000000000000 x13: 0000000000000030 x12: 0000000000000040 [ 4794.632716] x11: ffff000812333d98 x10: ffff000812333d9a x9 : ffff8000102dfd84 [ 4794.639840] x8 : ffff000812331b88 x7 : 0000000000000000 x6 : ffff000812331d48 [ 4794.646964] x5 : 0000000000000000 x4 : ffff00086f2a4a80 x3 : 0000000000000000 [ 4794.654088] x2 : ffff800043ae3db0 x1 : ffff00086f2a4a80 x0 : 0000000000000003 [ 4794.661212] Call trace: [ 4794.663646] cleanup_offline_cgwbs_workfn+0x2d8/0x350 [ 4794.668685] process_one_work+0x1f0/0x4ac [ 4794.672686] worker_thread+0x180/0x400 [ 4794.676423] kthread+0x110/0x114 [ 4794.679641] ret_from_fork+0x10/0x18 [ 4794.683209] Code: d63f0020 97f9c259 17ffffb1 f8588263 (f9400061) [ 4794.689290] ---[ end trace 8c5d2f7872fef978 ]--- Will ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: ❌ FAIL: Test report for kernel 5.14.0-rc1 (arm-next, e6f85cbe) 2021-07-16 9:03 ` ❌ FAIL: Test report for kernel 5.14.0-rc1 (arm-next, e6f85cbe) Will Deacon @ 2021-07-16 10:01 ` Jan Kara 2021-07-16 10:16 ` Will Deacon 0 siblings, 1 reply; 3+ messages in thread From: Jan Kara @ 2021-07-16 10:01 UTC (permalink / raw) To: Will Deacon Cc: CKI Project, skt-results-master, catalin.marinas, linux-arm-kernel, Fendy Tjahjadi, Memory Management, Jan Stancek, guro, willy, tj, dennis, jack, linux-mm On Fri 16-07-21 10:03:25, Will Deacon wrote: > On Thu, Jul 15, 2021 at 11:55:34PM -0000, CKI Project wrote: > > All kernel binaries, config files, and logs are available for download here: > > > > https://arr-cki-prod-datawarehouse-public.s3.amazonaws.com/index.html?prefix=datawarehouse-public/2021/07/15/337870374 > > > > One or more kernel tests failed: > > > > aarch64: > > 💥 Podman system integration test - as user > > This is a NULL pointer dereference in the recently introduced > cleanup_offline_cgwbs_workfn() function [c22d70a162d3 ("writeback, cgroup: > release dying cgwbs by switching attached inodes")]. Adding some of those > folks to CC. Thanks for report! This bug was already reported here [1] and Roman already has a tenative fix. Stay tuned :) [1] https://lore.kernel.org/linux-fsdevel/YO5kCzI133B/fHiS@carbon.dhcp.thefacebook.com Honza > addr2line gives: > > | linux/./include/linux/atomic-arch-fallback.h:2265 > | linux/./include/linux/atomic-arch-fallback.h:2290 > | linux/./include/asm-generic/atomic-instrumented.h:1149 > | linux/./include/asm-generic/atomic-long.h:491 > | linux/percpu-refcount.h:247 > | linux/percpu-refcount.h:266 > | linux/backing-dev-defs.h:227 > | linux/mm/backing-dev.c:679 > > and the crash (this is an arm64 machine) is: > > [ 4794.406516] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000 > [ 4794.415307] Mem abort info: > [ 4794.418088] ESR = 0x96000004 > [ 4794.421130] EC = 0x25: DABT (current EL), IL = 32 bits > [ 4794.426428] SET = 0, FnV = 0 > [ 4794.429470] EA = 0, S1PTW = 0 > [ 4794.432597] FSC = 0x04: level 0 translation fault > [ 4794.437461] Data abort info: > [ 4794.440329] ISV = 0, ISS = 0x00000004 > [ 4794.444151] CM = 0, WnR = 0 > [ 4794.447105] user pgtable: 4k pages, 48-bit VAs, pgdp=00000009072be000 > [ 4794.453534] [0000000000000000] pgd=0000000000000000, p4d=0000000000000000 > [ 4794.460312] Internal error: Oops: 96000004 [#1] SMP > [ 4794.465179] Modules linked in: xt_nat xt_addrtype xt_mark nft_chain_nat xt_MASQUERADE nf_nat xt_comment veth bridge stp llc loop tun af_key crypto_user scsi_transport_iscsi xt_multiport ip_gre ip_tunnel gre overlay xt_CONNSECMARK xt_SECMARK nft_counter xt_state xt_conntrack nft_compat ah6 ah4 nft_objref nft_ct nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnetlink jfs sctp ip6_udp_tunnel udp_tunnel dm_log_writes dm_flakey rfkill mlx5_ib ib_uverbs ib_core sunrpc coresight_etm4x i2c_smbus coresight_replicator coresight_tpiu coresight_tmc joydev mlx5_core acpi_ipmi mlxfw ipmi_ssif psample ipmi_devintf ipmi_msghandler coresight_funnel coresight thunderx2_pmu vfat fat fuse zram ip_tables xfs crct10dif_ce ast ghash_ce i2c_algo_bit drm_vram_helper drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec drm_ttm_helper ttm drm gpio_xlp i2c_xlp9xx uas usb_storage aes_neon_bs [last unloaded: raid10] > [ 4794.545404] CPU: 86 PID: 1121023 Comm: kworker/u513:9 Tainted: G OE 5.14.0-rc1 #1 > [ 4794.554090] Hardware name: HPE Apollo 70 /C01_APACHE_MB , BIOS L50_5.13_1.15 05/08/2020 > [ 4794.563817] Workqueue: events_unbound cleanup_offline_cgwbs_workfn > [ 4794.569990] pstate: 004000c9 (nzcv daIF +PAN -UAO -TCO BTYPE=--) > [ 4794.575985] pc : cleanup_offline_cgwbs_workfn+0x2d8/0x350 > [ 4794.581372] lr : cleanup_offline_cgwbs_workfn+0xc8/0x350 > [ 4794.586672] sp : ffff800043ae3d40 > [ 4794.589973] x29: ffff800043ae3d40 x28: 0000000000000000 x27: 0000000000000001 > [ 4794.597097] x26: 0000000000000000 x25: 00000000000000e0 x24: ffff800011f01438 > [ 4794.604221] x23: ffff800043ae3db0 x22: ffff800011f01428 x21: ffff80001228c5f0 > [ 4794.611345] x20: ffff000817299800 x19: ffff000817299a80 x18: 000000000000000e > [ 4794.618469] x17: 0000000079c37e0f x16: 0000000000010000 x15: fffffc002084ac02 > [ 4794.625593] x14: 0000000000000000 x13: 0000000000000030 x12: 0000000000000040 > [ 4794.632716] x11: ffff000812333d98 x10: ffff000812333d9a x9 : ffff8000102dfd84 > [ 4794.639840] x8 : ffff000812331b88 x7 : 0000000000000000 x6 : ffff000812331d48 > [ 4794.646964] x5 : 0000000000000000 x4 : ffff00086f2a4a80 x3 : 0000000000000000 > [ 4794.654088] x2 : ffff800043ae3db0 x1 : ffff00086f2a4a80 x0 : 0000000000000003 > [ 4794.661212] Call trace: > [ 4794.663646] cleanup_offline_cgwbs_workfn+0x2d8/0x350 > [ 4794.668685] process_one_work+0x1f0/0x4ac > [ 4794.672686] worker_thread+0x180/0x400 > [ 4794.676423] kthread+0x110/0x114 > [ 4794.679641] ret_from_fork+0x10/0x18 > [ 4794.683209] Code: d63f0020 97f9c259 17ffffb1 f8588263 (f9400061) > [ 4794.689290] ---[ end trace 8c5d2f7872fef978 ]--- > > Will -- Jan Kara <jack@suse.com> SUSE Labs, CR ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: ❌ FAIL: Test report for kernel 5.14.0-rc1 (arm-next, e6f85cbe) 2021-07-16 10:01 ` Jan Kara @ 2021-07-16 10:16 ` Will Deacon 0 siblings, 0 replies; 3+ messages in thread From: Will Deacon @ 2021-07-16 10:16 UTC (permalink / raw) To: Jan Kara Cc: CKI Project, skt-results-master, catalin.marinas, linux-arm-kernel, Fendy Tjahjadi, Memory Management, Jan Stancek, guro, willy, tj, dennis, linux-mm On Fri, Jul 16, 2021 at 12:01:59PM +0200, Jan Kara wrote: > On Fri 16-07-21 10:03:25, Will Deacon wrote: > > On Thu, Jul 15, 2021 at 11:55:34PM -0000, CKI Project wrote: > > > All kernel binaries, config files, and logs are available for download here: > > > > > > https://arr-cki-prod-datawarehouse-public.s3.amazonaws.com/index.html?prefix=datawarehouse-public/2021/07/15/337870374 > > > > > > One or more kernel tests failed: > > > > > > aarch64: > > > 💥 Podman system integration test - as user > > > > This is a NULL pointer dereference in the recently introduced > > cleanup_offline_cgwbs_workfn() function [c22d70a162d3 ("writeback, cgroup: > > release dying cgwbs by switching attached inodes")]. Adding some of those > > folks to CC. > > Thanks for report! This bug was already reported here [1] and Roman already > has a tenative fix. Stay tuned :) Perfect, thanks! I tried searching lkml for "cleanup_offline_cgwbs_workfn" but didn't think to try fsdevel, so sorry for the duplicate report. Will ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2021-07-16 10:16 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
[not found] <cki.552CEC4C85.JGTIJ5O0Q4@redhat.com>
2021-07-16 9:03 ` ❌ FAIL: Test report for kernel 5.14.0-rc1 (arm-next, e6f85cbe) Will Deacon
2021-07-16 10:01 ` Jan Kara
2021-07-16 10:16 ` Will Deacon
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox