From: Baolin Wang
Date: Mon, 6 May 2024 17:39:56 +0800
Subject: Re: get_pageblock_bitmap() sometimes returns a NULL pointer which needs to be properly handled
To: Markus Gothe, "linux-mm@kvack.org"
Cc: "akpm@linux-foundation.org", David Hildenbrand

Ccing David.

On 2024/5/3 00:02, Markus Gothe wrote:
> Hi,
> under some rare occasion I run into the following crash:
>
> [ 41.417606] pstate: 80400005 (Nzcv daif +PAN -UAO)
> [ 41.422406] pc : set_pfnblock_flags_mask+0x50/0x94
> [ 41.427193] lr : compaction_alloc+0x220/0x804
> [ 41.431544] sp : ffffffc01104bb10
> [ 41.434852] x29: ffffffc01104bb10 x28: ffffffc010e5b500
> [ 41.440165] x27: 0000000000098000 x26: ffffffc010e5b500
> [ 41.445477] x25: 0000000000000066 x24: 0000000000090800
> [ 41.450789] x23: 0000000000000200 x22: 0000000000084000
> [ 41.456093] x21: ffffffc010e82000 x20: ffffffc010b88000
> [ 41.461396] x19: ffffffc01104bd70 x18: 0000000000000000
> [ 41.466700] x17: f1f24e35df34dda4 x16: 6b3f63a0e1157268
> [ 41.472004] x15: 4b3990ec2568ada0 x14: 757ebc126939cb5f
> [ 41.477308] x13: 9df9488aba179ccb x12: 0000000000000000
> [ 41.482612] x11: 0000000000000000 x10: ffffffc010c5fc30
> [ 41.487916] x9 : ffffff801eea7c00 x8 : 000000001bf00000
> [ 41.493219] x7 : 0000000000000000 x6 : 000000000000003f
> [ 41.498525] x5 : 0000000000000108 x4 : 1000000000000000
> [ 41.503835] x3 : 0000000000000021 x2 : 000000000000003c
> [ 41.509139] x1 : 0000000000000001 x0 : 0000000000000003
> [ 41.514443] Call trace:
> [ 41.516887] set_pfnblock_flags_mask+0x50/0x94
> [ 41.521330] migrate_pages+0x90/0x7f0
> [ 41.524992] compact_zone+0x854/0x9f0
> [ 41.528647] kcompactd_do_work+0x168/0x230
> [ 41.532734] kcompactd+0x58/0x140
> [ 41.536043] kthread+0x120/0x124
> [ 41.539263] ret_from_fork+0x10/0x24
> [ 41.542835] Code: d346fc43 4b0000c2 8b030ce5 9ac22084 (f86378e0)
> [ 41.548925] ---[ end trace 731400a587304db3 ]---
>
>
> I've pin-pointed it down to pageblock_flags pointer being initialized to NULL under certain conditions. I don't know why this happens.
> Maybe it is some obscure race condition which only shows up on my system.

Is there memory hotplug in your test? It seems to be caused by the race
between memory hotplug and PFN walkers (such as compaction), which is
already a known issue.

> Nonetheless, I've made a fix for this in the attached patch. It adds sanity checking and protects the system from crashing.
> I think it is better to be safe than sorry.
>
> --
> Best Regards,
> Markus
>
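
For illustration only, the kind of sanity check described above might look
like the following userspace sketch. It is not the attached patch and not
kernel code: struct fake_section, fake_set_block_flags() and the FAKE_*
constants are hypothetical stand-ins, assumed here purely to show a
NULL-checked bitmap update. Such a check only avoids the dereference; it
would not by itself address a race between memory hotplug and PFN walkers.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace stand-in for a memory section (assumption, not
 * the kernel's struct mem_section). A NULL pageblock_flags models a
 * section whose bitmap has not been set up or has already gone away. */
struct fake_section {
	unsigned long *pageblock_flags;
};

/* Assumed layout for this sketch: 4 flag bits per pageblock. */
#define FAKE_BITS_PER_BLOCK 4UL
#define FAKE_BITS_PER_LONG  (8UL * sizeof(unsigned long))

/*
 * Update the flag bits selected by 'mask' for pageblock number 'block'.
 * Returns false, without touching memory, when there is no bitmap,
 * instead of dereferencing a NULL pointer.
 */
bool fake_set_block_flags(struct fake_section *sec, unsigned long block,
			  unsigned long flags, unsigned long mask)
{
	unsigned long bitidx, word, shift;

	/* The sanity check: bail out if the bitmap is missing. */
	if (sec == NULL || sec->pageblock_flags == NULL)
		return false;

	bitidx = block * FAKE_BITS_PER_BLOCK;
	word = bitidx / FAKE_BITS_PER_LONG;
	shift = bitidx % FAKE_BITS_PER_LONG;

	/* Clear the masked bits for this block, then set the new value. */
	sec->pageblock_flags[word] &= ~(mask << shift);
	sec->pageblock_flags[word] |= (flags & mask) << shift;
	return true;
}
```

A caller in this model would treat a false return as "skip this pageblock"
rather than as a fatal error.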