linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
To: Nishanth Aravamudan <nacc@us.ibm.com>
Cc: Mel Gorman <mel@csn.ul.ie>,
	Andrew Morton <akpm@linux-foundation.org>,
	agl@us.ibm.com, wli@holomorphy.com, clameter@sgi.com, ak@suse.de,
	kamezawa.hiroyu@jp.fujitsu.com, rientjes@google.com,
	linux-mm@kvack.org, eric.whitney@hp.com
Subject: Re: [PATCH] 2.6.25-rc3-mm1 - Mempolicy:  make dequeue_huge_page_vma() obey MPOL_BIND nodemask
Date: Thu, 06 Mar 2008 10:38:00 -0500	[thread overview]
Message-ID: <1204817880.5294.28.camel@localhost> (raw)
In-Reply-To: <20080306010440.GE28746@us.ibm.com>

On Wed, 2008-03-05 at 17:04 -0800, Nishanth Aravamudan wrote:
> On 05.03.2008 [14:02:53 -0500], Lee Schermerhorn wrote:
> > On Wed, 2008-03-05 at 10:03 -0800, Nishanth Aravamudan wrote:
> > > On 05.03.2008 [11:06:34 -0500], Lee Schermerhorn wrote:
> > > > PATCH Mempolicy - make dequeue_huge_page_vma() obey MPOL_BIND nodemask
> > > > 
> > > > dequeue_huge_page_vma() is not obeying the MPOL_BIND nodemask
> > > > with the zonelist rework.  It needs to search only zones in 
> > > > the mempolicy nodemask for hugepages.
> > > > 
> > > > Use for_each_zone_zonelist_nodemask() instead of
> > > > for_each_zone_zonelist().
> > > > 
> > > > Note:  this will bloat mm/hugetlb.o a bit until Mel reworks the
> > > > inlining of the for_each_zone... macros and helpers.
> > > > 
> > > > Added mempolicy helper function mpol_bind_nodemask() to hide
> > > > the details of mempolicy from hugetlb and to avoid
> > > > #ifdef CONFIG_NUMA in dequeue_huge_page_vma().
> > > > 
> > > > Signed-off-by:  Lee Schermerhorn <lee.schermerhorn@hp.com>
> > > > 
> > > >  include/linux/mempolicy.h |   13 +++++++++++++
> > > >  mm/hugetlb.c              |    4 +++-
> > > >  2 files changed, 16 insertions(+), 1 deletion(-)
> > > > 
> > > > Index: linux-2.6.25-rc3-mm1/mm/hugetlb.c
> > > > ===================================================================
> > > > --- linux-2.6.25-rc3-mm1.orig/mm/hugetlb.c	2008-03-05 10:35:12.000000000 -0500
> > > > +++ linux-2.6.25-rc3-mm1/mm/hugetlb.c	2008-03-05 10:37:09.000000000 -0500
> > > > @@ -99,8 +99,10 @@ static struct page *dequeue_huge_page_vm
> > > >  					htlb_alloc_mask, &mpol);
> > > >  	struct zone *zone;
> > > >  	struct zoneref *z;
> > > > +	nodemask_t *nodemask = mpol_bind_nodemask(mpol);
> > > 
> > > We get this mpol from huge_zonelist(). Would it perhaps make sense to
> > > pass the nodemask as a parameter, too, to huge_zonelist(), rather than
> > > adding mpol_bind_nodemask()? This is the only user of it in-tree.
> > 
> > Nish:
> > 
> > I thought of that.  I didn't go that way because I'd either need to
> > pass a [pointer to a pointer to] a nodemask in addition to the
> > [pointer to a pointer to] the mpol, so that I can release the
> > reference on the mpol after the allocation is finished;
> 
> See I looked at that and thought: "We're already passing a pointer to a
> pointer to mpol, so a pointer to a pointer to a nodemask shouldn't be
> that big of deal. 

:-)  I looked at that and thought:  "Yuck!  we're already passing a
pointer to a pointer, ...  I don't want to add another one."  Don't know
why I have such an aversion to passing results back like that.  


> This is the one call-site, as well. The idea being,
> we've pushed as much of the zonelist/nodemask knowledge into
> huge_zonelist(), keeping hugetlb.c relatively clear of it. Maybe it
> doesn't matter, was really just a question. 

No, I do see your point.  As I say, I had the same thoughts.

> Not sure what other folks
> think.

> 
> > or I'd need to copy the nodemask [which can get pretty big] in the
> > allocation path.  I wanted to avoid both of those.  I suppose I could
> > be convinced that one or the other of those options is better than the
> > single use helper function.  What do you think?
> 
> What you have is fine, I guess -- and has been picked up by Andrew.

We'll need more work in this area, I think.  I can fix it up then.  I'll
try a patch and see how it "feels"...

Thanks for your attention,
Lee

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2008-03-06 15:38 UTC|newest]

Thread overview: 37+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-02-27 21:47 [PATCH 0/6] Use two zonelists per node instead of multiple zonelists v11r3 Lee Schermerhorn, Mel Gorman
2008-02-27 21:47 ` [PATCH 1/6] Use zonelists instead of zones when direct reclaiming pages Lee Schermerhorn, Mel Gorman
2008-02-27 21:47 ` [PATCH 2/6] Introduce node_zonelist() for accessing the zonelist for a GFP mask Lee Schermerhorn, Mel Gorman
2008-02-27 21:47 ` [PATCH 3/6] Remember what the preferred zone is for zone_statistics Lee Schermerhorn, Mel Gorman
2008-02-27 22:00   ` Christoph Lameter
2008-02-28 17:45     ` Lee Schermerhorn
2008-02-29 14:19     ` Mel Gorman
2008-02-29  2:30   ` KAMEZAWA Hiroyuki
2008-02-29 14:32     ` Mel Gorman
2008-02-27 21:47 ` [PATCH 4/6] Use two zonelist that are filtered by GFP mask Lee Schermerhorn, Mel Gorman
2008-02-28 21:32   ` Andrew Morton
2008-02-28 21:53     ` Lee Schermerhorn
2008-02-29  2:37       ` KAMEZAWA Hiroyuki
2008-02-29 14:50     ` Mel Gorman
2008-02-29 15:48       ` Lee Schermerhorn
2008-02-29 21:07         ` Christoph Lameter
2008-03-04 18:01         ` Mel Gorman
2008-03-05 16:06           ` [PATCH] 2.6.25-rc3-mm1 - Mempolicy: make dequeue_huge_page_vma() obey MPOL_BIND nodemask Lee Schermerhorn
2008-03-05 18:03             ` Nishanth Aravamudan
2008-03-05 19:02               ` Lee Schermerhorn
2008-03-06  1:04                 ` Nishanth Aravamudan
2008-03-06 15:38                   ` Lee Schermerhorn [this message]
2008-03-06 21:24                   ` [PATCH] Mempolicy: make dequeue_huge_page_vma() obey MPOL_BIND nodemask rework Lee Schermerhorn
2008-03-07 17:35                     ` Nishanth Aravamudan
2008-03-07 18:31                       ` Lee Schermerhorn
2008-03-08  0:27                         ` Nishanth Aravamudan
2008-03-06  0:39             ` [PATCH] 2.6.25-rc3-mm1 - Mempolicy: make dequeue_huge_page_vma() obey MPOL_BIND nodemask Andrew Morton
2008-03-06 15:17               ` Lee Schermerhorn
2008-03-06 18:41     ` [PATCH 4/6] Use two zonelist that are filtered by GFP mask Mel Gorman
2008-02-27 21:47 ` [PATCH 5/6] Have zonelist contains structs with both a zone pointer and zone_idx Lee Schermerhorn, Mel Gorman
2008-02-29  7:49   ` KOSAKI Motohiro
2008-02-27 21:47 ` [PATCH 6/6] Filter based on a nodemask as well as a gfp_mask Lee Schermerhorn, Mel Gorman
2008-02-29  2:59   ` KAMEZAWA Hiroyuki
2008-03-07 11:56     ` Mel Gorman
2008-02-29  8:48   ` KOSAKI Motohiro
2008-02-27 21:53 ` [PATCH 0/6] Use two zonelists per node instead of multiple zonelists v11r3 Lee Schermerhorn
2008-02-29 14:12 ` Mel Gorman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1204817880.5294.28.camel@localhost \
    --to=lee.schermerhorn@hp.com \
    --cc=agl@us.ibm.com \
    --cc=ak@suse.de \
    --cc=akpm@linux-foundation.org \
    --cc=clameter@sgi.com \
    --cc=eric.whitney@hp.com \
    --cc=kamezawa.hiroyu@jp.fujitsu.com \
    --cc=linux-mm@kvack.org \
    --cc=mel@csn.ul.ie \
    --cc=nacc@us.ibm.com \
    --cc=rientjes@google.com \
    --cc=wli@holomorphy.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox