From: Yinghai Lu
To: Sasha Levin
Cc: Tejun Heo, Andrew Morton, David Miller, hpa@linux.intel.com, linux-mm, "linux-kernel@vger.kernel.org"
Date: Thu, 14 Jun 2012 16:57:18 -0700
Subject: Re: Early boot panic on machine with lots of memory
In-Reply-To: <1339709672.3321.11.camel@lappy>
References: <1339623535.3321.4.camel@lappy>
 <20120614032005.GC3766@dhcp-172-17-108-109.mtv.corp.google.com>
 <1339667440.3321.7.camel@lappy>
 <1339709672.3321.11.camel@lappy>

On Thu, Jun 14, 2012 at 2:34 PM, Sasha Levin wrote:
> On Thu, 2012-06-14 at 13:56 -0700, Yinghai Lu wrote:
>> On Thu, Jun 14, 2012 at 2:50 AM, Sasha Levin wrote:
>> > On Thu, 2012-06-14 at 12:20 +0900, Tejun Heo wrote:
>> >> On Wed, Jun 13, 2012 at 11:38:55PM +0200, Sasha Levin wrote:
>> >> > Hi all,
>> >> >
>> >> > I'm seeing the following when booting a KVM guest with 65gb of RAM, on latest linux-next.
>> >> >
>> >> > Note that it happens with numa=off.
>> >> >
>> >> > [    0.000000] BUG: unable to handle kernel paging request at ffff88102febd948
>> >> > [    0.000000] IP: [] __next_free_mem_range+0x9b/0x155
>> >>
>> >> Can you map it back to the source line please?
>> >
>> > mm/memblock.c:583
>> >
>> >                         phys_addr_t r_start = ri ? r[-1].base + r[-1].size : 0;
>> >  97:   85 d2                   test   %edx,%edx
>> >  99:   74 08                   je     a3 <__next_free_mem_range+0xa3>
>> >  9b:   49 8b 48 f0             mov    -0x10(%r8),%rcx
>> >  9f:   49 03 48 e8             add    -0x18(%r8),%rcx
>> >
>> > It's the deref on 9b (r8=ffff88102febd958).
>>
>> that reserved.region is allocated by memblock.
>>
>> can you boot with "memblock=debug debug ignore_loglevel" and post
>> whole boot log?
>
> Attached below. I've also noticed it doesn't always happen, but
> increasing the vcpu count (to something around 254) makes it happen
> almost every time.
>
...
[    0.000000] memblock: reserved array is doubled to 512 at [0x102febc080-0x102febf07f]
[    0.000000] memblock_free: [0x0000102febf080-0x0000102fec0880] memblock_double_array+0x1b0/0x1e2
[    0.000000] memblock_reserve: [0x0000102febc080-0x0000102febf080] memblock_double_array+0x1c5/0x1e2

The reserved regions array gets doubled twice, to 512.

....
> [    0.000000]    memblock_free: [0x0000102febc080-0x0000102febf080] memblock_free_reserved_regions+0x37/0x39
> [    0.000000] BUG: unable to handle kernel paging request at ffff88102febd948
> [    0.000000] IP: [] __next_free_mem_range+0x9b/0x155
> [    0.000000] PGD 4826063 PUD cf67a067 PMD cf7fa067 PTE 800000102febd160

The page tables for that range are:

[    0.000000] kernel direct mapping tables up to 0x102fffffff @ [mem 0xc7e3e000-0xcfffffff]
[    0.000000] memblock_reserve: [0x000000c7e3e000-0x000000cf7fb000] native_pagetable_reserve+0xc/0xe

The only nearby allocation is swiotlb:

[    0.000000] __ex_table already sorted, skipping sort
[    0.000000] memblock_reserve: [0x000000c3e3e000-0x000000c7e3e000] __alloc_memory_core_early+0x5c/0x73
...
[    0.000000] memblock_reserve: [0x000000cfff8000-0x000000d0000000] __alloc_memory_core_early+0x5c/0x73
[    0.000000] Checking aperture...

So the memblock allocation looks OK.

Can you please boot with "memtest" to see if there is any memory problem?

Thanks

Yinghai
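
For anyone following the addresses in this thread, here is a small sketch of the arithmetic that ties the oops to the memblock debug output. It assumes the x86_64 direct map starts at 0xffff880000000000 (PAGE_OFFSET on kernels of this period) and that one reserved-region entry is 24 bytes, which is only inferred from the -0x18/-0x10 offsets in the disassembly. The helper names are made up for illustration. It is a cross-check of the numbers, not a diagnosis.

```python
# Assumption: direct-map base (PAGE_OFFSET) on x86_64 kernels of this era.
PAGE_OFFSET = 0xFFFF880000000000

# Assumption: size of one reserved-region entry, inferred from the
# -0x18(%r8) access for r[-1].base in the disassembly.
REGION_ENTRY_SIZE = 0x18


def virt_to_phys(vaddr: int) -> int:
    """Translate a direct-map virtual address to a physical address."""
    return vaddr - PAGE_OFFSET


def in_range(addr: int, start: int, end: int) -> bool:
    """Return True if addr lies in the half-open range [start, end)."""
    return start <= addr < end


# Values copied from the oops and the memblock=debug log in the thread.
r8 = 0xFFFF88102FEBD958
reserved_array = (0x102FEBC080, 0x102FEBF080)  # memblock_reserve range
entries = 512                                   # "doubled to 512"

# The faulting instruction is: mov -0x10(%r8),%rcx
fault_vaddr = r8 - 0x10
fault_paddr = virt_to_phys(fault_vaddr)

# Size of the doubled array, under the 24-byte entry assumption.
array_size = entries * REGION_ENTRY_SIZE

print(hex(fault_vaddr))
print(hex(fault_paddr))
print(in_range(fault_paddr, *reserved_array))
print(hex(array_size), hex(reserved_array[1] - reserved_array[0]))
```

With these numbers the faulting address equals r8 - 0x10, and its physical address falls inside the range reported by the memblock_free_reserved_regions line in the log.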