From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wi0-f169.google.com (mail-wi0-f169.google.com [209.85.212.169]) by kanga.kvack.org (Postfix) with ESMTP id 663AB6B0253 for ; Wed, 14 Oct 2015 08:54:28 -0400 (EDT) Received: by wicge5 with SMTP id ge5so235260591wic.0 for ; Wed, 14 Oct 2015 05:54:28 -0700 (PDT) Received: from mx2.suse.de (mx2.suse.de. [195.135.220.15]) by mx.google.com with ESMTPS id bx5si29447875wib.48.2015.10.14.05.54.25 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Wed, 14 Oct 2015 05:54:25 -0700 (PDT) Subject: Re: [PATCHv4] mm: Don't offset memmap for flatmem References: <1444253335-5811-1-git-send-email-labbott@fedoraproject.org> From: Vlastimil Babka Message-ID: <561E507E.2090309@suse.cz> Date: Wed, 14 Oct 2015 14:54:22 +0200 MIME-Version: 1.0 In-Reply-To: <1444253335-5811-1-git-send-email-labbott@fedoraproject.org> Content-Type: text/plain; charset=iso-8859-2 Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org List-ID: To: Laura Abbott , Srinivas Kandagatla , Bjorn Andersson Cc: Laura Abbott , Santosh Shilimkar , Russell King , Kevin Hilman , Arnd Bergman , Stephen Boyd , Andy Gross , Mel Gorman , Steven Rostedt , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Andrew Morton , Laura Abbott On 10/07/2015 11:28 PM, Laura Abbott wrote: > From: Laura Abbott > > Srinivas Kandagatla reported bad page messages when trying to > remove the bottom 2MB on an ARM based IFC6410 board > > BUG: Bad page state in process swapper pfn:fffa8 > page:ef7fb500 count:0 mapcount:0 mapping: (null) index:0x0 > flags: 0x96640253(locked|error|dirty|active|arch_1|reclaim|mlocked) > page dumped because: PAGE_FLAGS_CHECK_AT_FREE flag(s) set > bad because of flags: > flags: 0x200041(locked|active|mlocked) > Modules linked in: > CPU: 0 PID: 0 Comm: swapper Not tainted 3.19.0-rc3-00007-g412f9ba-dirty #816 > Hardware name: Qualcomm (Flattened Device Tree) > [] (unwind_backtrace) from [] (show_stack+0x20/0x24) > [] (show_stack) from [] (dump_stack+0x80/0x9c) > [] (dump_stack) from [] (bad_page+0xc8/0x128) > [] (bad_page) from [] (free_pages_prepare+0x168/0x1e0) > [] (free_pages_prepare) from [] (free_hot_cold_page+0x3c/0x174) > [] (free_hot_cold_page) from [] (__free_pages+0x54/0x58) > [] (__free_pages) from [] (free_highmem_page+0x38/0x88) > [] (free_highmem_page) from [] (mem_init+0x240/0x430) > [] (mem_init) from [] (start_kernel+0x1e4/0x3c8) > [] (start_kernel) from [<80208074>] (0x80208074) > Disabling lock debugging due to kernel taint > > Removing the lower 2MB made the start of the lowmem zone to no longer > be page block aligned. IFC6410 uses CONFIG_FLATMEM where > alloc_node_mem_map allocates memory for the mem_map. alloc_node_mem_map > will offset for unaligned nodes with the assumption the pfn/page > translation functions will account for the offset. The functions for > CONFIG_FLATMEM do not offset however, resulting in overrunning > the memmap array. Just use the allocated memmap without any offset > when running with CONFIG_FLATMEM to avoid the overrun. > > Signed-off-by: Laura Abbott > Signed-off-by: Laura Abbott > Reported-by: Srinivas Kandagatla > Tested-by: Srinivas Kandagatla > Acked-by: Vlastimil Babka > Tested-by: Bjorn Andersson > > > --- > I was reminded at Linaro Connect that I never followed up on this patch. > v3 (http://marc.info/?l=linux-mm&m=142629173021316&w=2) > was dropped due to crashing (https://lkml.org/lkml/2015/3/20/546) > The fundamentals of the patch were fine, the VM_BUG_ON I added turned out > to be incorrect. It was supposed to be a sanity check but it didn't > actually hold true in all cases (or even most cases...) Yeah, sorry about the VM_BUG_ON (I think I suggested it originally). It's indeed wrong, I can see it now... the self-referentiality of "page_to_pfn(mem_map)" on FLATMEM makes my head hurt. > v4 just drops > the sanity check. I verified that the offset is working on both ARM, x86, > and i386 QEMU with the config provided by Steven Rostedt. Great, thanks. > Since this was minor diff from the previous patch, I kept all the previous > Acked-by/Tested-by. > --- > mm/page_alloc.c | 9 ++++++--- > 1 file changed, 6 insertions(+), 3 deletions(-) > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c > index 40e2942..9e5ba5e 100644 > --- a/mm/page_alloc.c > +++ b/mm/page_alloc.c > @@ -4953,6 +4953,8 @@ static void __paginginit free_area_init_core(struct pglist_data *pgdat, > > static void __init_refok alloc_node_mem_map(struct pglist_data *pgdat) > { > + unsigned long __maybe_unused offset = 0; > + > /* Skip empty nodes */ > if (!pgdat->node_spanned_pages) > return; > @@ -4969,6 +4971,7 @@ static void __init_refok alloc_node_mem_map(struct pglist_data *pgdat) > * for the buddy allocator to function correctly. > */ > start = pgdat->node_start_pfn & ~(MAX_ORDER_NR_PAGES - 1); > + offset = pgdat->node_start_pfn - start; > end = pgdat_end_pfn(pgdat); > end = ALIGN(end, MAX_ORDER_NR_PAGES); > size = (end - start) * sizeof(struct page); > @@ -4976,7 +4979,7 @@ static void __init_refok alloc_node_mem_map(struct pglist_data *pgdat) > if (!map) > map = memblock_virt_alloc_node_nopanic(size, > pgdat->node_id); > - pgdat->node_mem_map = map + (pgdat->node_start_pfn - start); > + pgdat->node_mem_map = map + offset; > } > #ifndef CONFIG_NEED_MULTIPLE_NODES > /* > @@ -4984,9 +4987,9 @@ static void __init_refok alloc_node_mem_map(struct pglist_data *pgdat) > */ > if (pgdat == NODE_DATA(0)) { > mem_map = NODE_DATA(0)->node_mem_map; > -#ifdef CONFIG_HAVE_MEMBLOCK_NODE_MAP > +#if defined(CONFIG_HAVE_MEMBLOCK_NODE_MAP) || defined(CONFIG_FLATMEM) > if (page_to_pfn(mem_map) != pgdat->node_start_pfn) > - mem_map -= (pgdat->node_start_pfn - ARCH_PFN_OFFSET); > + mem_map -= offset; > #endif /* CONFIG_HAVE_MEMBLOCK_NODE_MAP */ > } > #endif > -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org