From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from d03relay04.boulder.ibm.com (d03relay04.boulder.ibm.com [9.17.195.106])
	by e33.co.us.ibm.com (8.13.8/8.13.8) with ESMTP id m4NKUFqP030198
	for <linux-mm@kvack.org>; Fri, 23 May 2008 16:30:15 -0400
Received: from d03av02.boulder.ibm.com (d03av02.boulder.ibm.com [9.17.195.168])
	by d03relay04.boulder.ibm.com (8.13.8/8.13.8/NCO v8.7) with ESMTP id m4NKUEcT063526
	for <linux-mm@kvack.org>; Fri, 23 May 2008 14:30:14 -0600
Received: from d03av02.boulder.ibm.com (loopback [127.0.0.1])
	by d03av02.boulder.ibm.com (8.12.11.20060308/8.13.3) with ESMTP id m4NKUCRg018230
	for <linux-mm@kvack.org>; Fri, 23 May 2008 14:30:13 -0600
Date: Fri, 23 May 2008 13:30:07 -0700
From: Nishanth Aravamudan <nacc@us.ibm.com>
Subject: Re: [patch 06/18] hugetlb: multi hstate proc files
Message-ID: <20080523203007.GB23924@us.ibm.com>
References: <20080423015302.745723000@nick.local0.net> <20080423015430.311388000@nick.local0.net> <20080502195311.GD26273@us.ibm.com> <20080523052215.GF13071@wotan.suse.de>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20080523052215.GF13071@wotan.suse.de>
Sender: owner-linux-mm@kvack.org
Return-Path: <owner-linux-mm@kvack.org>
To: Nick Piggin <npiggin@suse.de>
Cc: akpm@linux-foundation.org, linux-mm@kvack.org, andi@firstfloor.org, kniht@linux.vnet.ibm.com, abh@cray.com, wli@holomorphy.com
List-ID: <linux-mm.kvack.org>

On 23.05.2008 [07:22:15 +0200], Nick Piggin wrote:
> On Fri, May 02, 2008 at 12:53:11PM -0700, Nishanth Aravamudan wrote:
> > On 23.04.2008 [11:53:08 +1000], npiggin@suse.de wrote:
> > > Convert /proc output code over to report multiple hstates
> > > 
> > > I chose to just report the numbers in a row, in the hope 
> > > to minimze breakage of existing software. The "compat" page size
> > > is always the first number.
> > 
> > Only if add_huge_hstate() is called first for the compat page size,
> > right? That seems bad if we depend on an ordering.
> > 
> > For instance, for power, I think Jon is calling huge_add_hstate() from
> > the arch/powerpc/mm/hugetlbpage.c init routine. Which runs before
> > hugetlb_init, which means that if he adds hugepages like
> > 
> > huge_add_hstate(64k-order);
> > huge_add_hstate(16m-order);
> > huge_add_hstate(16g-order);
> > 
> > We'll get 64k as the first field in meminfo.
> > 
> > So perhaps what we should do is:
> > 
> > 1) architectures define HPAGE_* as the default (compat) hugepage values
> > 2) architectures have a call into generic code at their init time to
> > specify what sizes they support
> > 3) the core is the only place that actually does huge_add_hstate() and
> > it always does it first for the compat order?
> > 
> > I wonder if this might lead to issues in timing between processing
> > hugepagesz= (in arch code) and hugepages= (in generic code). Not sure. I
> > guess if we always add all hugepage sizes, we should have all the
> > hstates we know about ready to configure and as long as hugetlb_init
> > runs before hugepages= processing, we should be fine? Dunno.
> 
> You're right I think. The other thing is that we could just have
> a small map from the hstate array to reporting order for sysctls.
> We could report them in the order specified on the cmdline, with
> the default size first if it was not specified on the cmdline.
> 
> Hmm, I'll see how that looks.

Yeah, either way is fine. I just wanted to make sure any implicit
assumptions were laid out clearly (and should be spelled out in
kernel-parameters.txt and vm/hugetlbpage.txt, probably).

And if we really are worried about backwards compatibility, then we
should be careful about any ordering issues.

Thanks,
Nish

-- 
Nishanth Aravamudan <nacc@us.ibm.com>
IBM Linux Technology Center

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>