Subject: Re: [Bug 12134] New: can't shmat() 1GB hugepage segment from second process more than one time
From: Adam Litke
Date: Fri, 05 Dec 2008 12:57:42 -0600
Message-Id: <1228503462.13428.36.camel@localhost.localdomain>
In-Reply-To: <6.2.5.6.2.20081205124907.01c38670@binnacle.cx>
To: starlight@binnacle.cx
Cc: Andrew Morton, linux-mm@kvack.org, bugme-daemon@bugzilla.kernel.org, Andy Whitcroft, David Gibson

On Fri, 2008-12-05 at 12:49 -0500, starlight@binnacle.cx wrote:
> At 11:17 12/5/2008 -0600, you wrote:
> >On Wed, 2008-12-03 at 22:15 -0500, starlight@binnacle.cx wrote:
> >> At 13:24 12/2/2008 -0600, Adam Litke wrote:
> >> >starlight@binnacle.cx: I need more information
> >> >to reproduce this bug.
> >>
> >> I'm too swamped to build a test case, but here are straces
> >> that show the relevant system calls and the failure.
> >
> >Starlight,
> >
> >Thanks for the strace output. As I suspected, this is more
> >complex than it first appeared. There are several hugetlb
> >shared memory segments involved. Couple that with threading and
> >an interesting approach to mlocking the address space, and I've
> >got a scenario that is very difficult to reproduce. Is it
> >possible/practical for me to have access to your program?
>
> Sorry, I'm not permitted to share the code.
>
> The program fork/execs a script in addition to creating many
> worker threads (I have contemplated switching to posix_spawn(),
> but it seems that does a fork/exec internally anyway). I wonder
> if that has anything to do with it. I will try disabling that,
> and then disabling the mlock() calls, to see if either
> eliminates the issue. I doubt that worker-thread creation is a
> factor.

Great. I was going to ask you to disable mlock() as well. Is this
the same machine that was running your workload successfully on
RHEL4?

One theory I've been contemplating is that, with all of the
mlocking and threads, you might be running out of memory for page
tables, and that perhaps the hugetlb code is not handling that case
correctly. When do the bad pmd messages appear? When the daemon
starts? When the first separate process attaches? When the second
one does? Or later?

> >If so, I could quickly bisect the kernel and identify the guilty
> >patch. Without the program, I am left stabbing in the dark.
> >Could you try a 2.6.18 kernel to see whether it works?
> >Thanks.
>
> Any particular version of 2.6.18?

Nothing specific. You could try 2.6.18.8 (the latest -stable).
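In case it helps, here is a minimal sketch of the sequence I believe
is failing, pieced together from the bug title and your straces. To
be clear, this is not your program; the segment size, permissions,
touch pattern, and the mlock() placement are all guesses on my part:

    /*
     * Hypothetical reproducer sketch (assumed details, not the
     * reporter's actual code): a first process creates and mlocks a
     * hugetlb SysV shm segment, then a second process attaches it
     * more than once.
     */
    #include <stdio.h>
    #include <string.h>
    #include <sys/ipc.h>
    #include <sys/shm.h>
    #include <sys/mman.h>
    #include <sys/wait.h>
    #include <unistd.h>

    #ifndef SHM_HUGETLB
    #define SHM_HUGETLB 04000   /* octal value from the kernel headers */
    #endif

    #define SEG_SIZE (1024UL * 1024 * 1024)   /* 1GB segment, assumed */

    int main(void)
    {
        /* Create a hugetlb-backed shared memory segment. */
        int shmid = shmget(IPC_PRIVATE, SEG_SIZE,
                           SHM_HUGETLB | IPC_CREAT | 0600);
        if (shmid < 0) {
            perror("shmget");
            return 1;
        }

        /* First process (the daemon) attaches and mlocks it. */
        void *addr = shmat(shmid, NULL, 0);
        if (addr == (void *)-1) {
            perror("shmat (first process)");
            return 1;
        }
        mlock(addr, SEG_SIZE);   /* assumed, per the report */

        if (fork() == 0) {
            /* Second process: attach, touch, detach, attach again.
             * The bug title suggests the repeat attach misbehaves. */
            void *a = shmat(shmid, NULL, 0);
            if (a == (void *)-1) { perror("shmat #1"); _exit(1); }
            memset(a, 0, SEG_SIZE);   /* instantiate the pages */
            shmdt(a);

            a = shmat(shmid, NULL, 0);
            if (a == (void *)-1) { perror("shmat #2"); _exit(1); }
            shmdt(a);
            _exit(0);
        }
        wait(NULL);
        shmctl(shmid, IPC_RMID, NULL);
        return 0;
    }

If this sequence (or something close to it) triggers the bad pmd
messages on your kernel, it would give us a self-contained test case
to bisect with.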
We could probably bisect this in approximately 8 kernel
build-boot-test cycles if you are willing to take that on. I am
looking forward to your disabled-mlock() results.

--
Adam Litke - (agl at us.ibm.com)
IBM Linux Technology Center