From mboxrd@z Thu Jan  1 00:00:00 1970
Date: Fri, 21 Mar 2008 13:32:22 -0700 (PDT)
From: David Rientjes <rientjes@google.com>
Subject: Re: Couple of questions about mempolicy rebinding
In-Reply-To: <1205786207.5297.30.camel@localhost>
Message-ID: <alpine.DEB.1.00.0803211318430.3009@chino.kir.corp.google.com>
References: <200803122118.03942.ak@suse.de>  <alpine.DEB.1.00.0803131219380.28673@chino.kir.corp.google.com>  <1205437802.5300.69.camel@localhost>  <alpine.DEB.1.00.0803131255150.32474@chino.kir.corp.google.com> <1205786207.5297.30.camel@localhost>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-linux-mm@kvack.org
Return-Path: <owner-linux-mm@kvack.org>
To: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
Cc: Paul Jackson <pj@sgi.com>, Andi Kleen <ak@suse.de>, Christoph Lameter <clameter@sgi.com>, cpw@sgi.com, linux-mm <linux-mm@kvack.org>
List-ID: <linux-mm.kvack.org>

On Mon, 17 Mar 2008, Lee Schermerhorn wrote:

> Like the subject says, I have a couple of questions that arose while
> looking through the mempolicy code for corner cases that might be
> affected by some pending cleanup patches.  Nothing major, I think, I
> just want to understand.  I'm looking at 25-rc5-mm1 plus the recent
> patch:  "disallow static or relative flags for local preferred".
> 

Sorry for the delay in getting to this email.

> 1) In __mpol_copy():  when the "current_cpuset_is_being_rebound", why do
> we rebind the old policy policy and then copy it to the new?  Seems like
> the old policy will get rebound in due time if, indeed, it needs to be
> rebound.  I don't see any usage now, where it won't, but this seems less
> general than just rebinding the new copy.  E.g., the old mempolicy being
> copied may be a context-free policy that shouln't be rebound.   I think
> we should at least add a comment to warn future callers.  Comments?
> 

I don't quite understand.  The mempolicy being returned by __mpol_copy() 
is brand new for a forked task and has no relation to the old policy 
except it shares all of the same characteristics.  So if we have yet to 
call mpol_rebind_mm() after updating current->cpuset->mems_allowed in 
update_nodemask(), we'll get the old nodemask in the policy returned by 
__mpol_copy().

> 2) In mpol_rebind_nodemask():  this function is shared by bind and
> interleave policy, and is used to rebind both task and vma policies.
> However, we unconditionally update current->il_next.  Probably not an
> issue for interleave vs bind because, in the case of bind policy,
> il_next is meaningless.  However, il_next is used only to interleave
> based on task policy [for page cache and slab allocations].  Any
> allocation based on a vma policy will use the address offset into the
> vma to interleave.  So, I think it's technically incorrect to update
> il_next when we're rebinding a vma policy.  It may contain nodes that
> are not in the task policy and vice versa.  
> 

I don't see how this behavior has recently changed from what is currently 
in HEAD.  Are you concerned about an unnecessary update to 
current->il_next that affects the next node for a task's interleave if a 
VMA with a VMA policy gets rebound but there is an underlying task policy 
also of interleave?

To avoid updating current->il_next for MPOL_INTERLEAVE vs. MPOL_BIND, we 
can simply test pol->policy in mpol_rebind_nodemask().

		David

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>