linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl
@ 2007-08-01 19:02 Lee Schermerhorn
  2007-08-02  0:44 ` KAMEZAWA Hiroyuki
  0 siblings, 1 reply; 6+ messages in thread
From: Lee Schermerhorn @ 2007-08-01 19:02 UTC (permalink / raw)
  To: Andrew Morton
  Cc: linux-kernel, linux-mm, Kamezawa Hiroyuki, Christoph Lameter

Fix missing numa_zonelist_order sysctl config

Against 2.6.23-rc1-mm1.

Found this testing Mel Gorman's patch for the issue with
"policy_zone" and ZONE_MOVABLE.

Misplaced #endif is hiding the numa_zonelist_order sysctl
when !SECURITY.

[But, maybe reordering the zonelists is not such a good idea
when ZONE_MOVABLE is populated?]

Signed-off-by:  Lee Schermerhorn <lee.schermerhorn@hp.com>

 kernel/sysctl.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Index: Linux/kernel/sysctl.c
===================================================================
--- Linux.orig/kernel/sysctl.c	2007-07-25 09:29:50.000000000 -0400
+++ Linux/kernel/sysctl.c	2007-08-01 13:29:18.000000000 -0400
@@ -1068,6 +1068,7 @@ static ctl_table vm_table[] = {
 		.mode		= 0644,
 		.proc_handler	= &proc_doulongvec_minmax,
 	},
+#endif
 #ifdef CONFIG_NUMA
 	{
 		.ctl_name	= CTL_UNNUMBERED,
@@ -1079,7 +1080,6 @@ static ctl_table vm_table[] = {
 		.strategy	= &sysctl_string,
 	},
 #endif
-#endif
 #if defined(CONFIG_X86_32) || \
    (defined(CONFIG_SUPERH) && defined(CONFIG_VSYSCALL))
 	{


--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl
  2007-08-01 19:02 [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl Lee Schermerhorn
@ 2007-08-02  0:44 ` KAMEZAWA Hiroyuki
  2007-08-02 15:07   ` Lee Schermerhorn
  2007-08-02 16:14   ` Mel Gorman
  0 siblings, 2 replies; 6+ messages in thread
From: KAMEZAWA Hiroyuki @ 2007-08-02  0:44 UTC (permalink / raw)
  To: Lee Schermerhorn; +Cc: Andrew Morton, linux-kernel, linux-mm, Christoph Lameter

On Wed, 01 Aug 2007 15:02:51 -0400
Lee Schermerhorn <Lee.Schermerhorn@hp.com> wrote:
> [But, maybe reordering the zonelists is not such a good idea
> when ZONE_MOVABLE is populated?]
> 

It's case-by-case I think. In zone order with ZONE_MOVABLE case,
user's page cache will not use ZONE_NORMAL until ZONE_MOVABLE in all node
is exhausted. This is an expected behavior, I think.

I think the real problem is the scheme for "How to set zone movable size to
appropriate value for the system". This needs more study and documentation.
(but maybe depends on system configuration to some extent.)

Thanks,
-Kame

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl
  2007-08-02  0:44 ` KAMEZAWA Hiroyuki
@ 2007-08-02 15:07   ` Lee Schermerhorn
  2007-08-03  0:27     ` KAMEZAWA Hiroyuki
  2007-08-02 16:14   ` Mel Gorman
  1 sibling, 1 reply; 6+ messages in thread
From: Lee Schermerhorn @ 2007-08-02 15:07 UTC (permalink / raw)
  To: KAMEZAWA Hiroyuki
  Cc: Andrew Morton, linux-kernel, linux-mm, Christoph Lameter

On Thu, 2007-08-02 at 09:44 +0900, KAMEZAWA Hiroyuki wrote:
> On Wed, 01 Aug 2007 15:02:51 -0400
> Lee Schermerhorn <Lee.Schermerhorn@hp.com> wrote:
> > [But, maybe reordering the zonelists is not such a good idea
> > when ZONE_MOVABLE is populated?]
> > 
> 
> It's case-by-case I think. In zone order with ZONE_MOVABLE case,
> user's page cache will not use ZONE_NORMAL until ZONE_MOVABLE in all node
> is exhausted. This is an expected behavior, I think.
> 
> I think the real problem is the scheme for "How to set zone movable size to
> appropriate value for the system". This needs more study and documentation.
> (but maybe depends on system configuration to some extent.)

Yes.  Having thought about it a bit more, maybe zone order IS what we
want if we desire the remainder of the zone from which is was taken
[ZONE_MOVABLE-1] to be reserved for non-movable kernel use as long as
possible--similar to the dma zone.  I had made the non-movable zone very
large for testing, so that I could create a segment that used all of the
movable zones on all the nodes and then dip into the non-movable/normal
zone.  If I used a more reasonable [much smaller] amount of kernelcore,
the interleave would have worked as "expected".  

Of course, I don't have any idea of what is a "reasonable amount".
Guess I could look at non-movable zone memory usage in a system at
typical or peak load to get an idea.  Anyone have any data in this
regard?

Lee


--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl
  2007-08-02  0:44 ` KAMEZAWA Hiroyuki
  2007-08-02 15:07   ` Lee Schermerhorn
@ 2007-08-02 16:14   ` Mel Gorman
  2007-08-02 16:18     ` Mel Gorman
  1 sibling, 1 reply; 6+ messages in thread
From: Mel Gorman @ 2007-08-02 16:14 UTC (permalink / raw)
  To: KAMEZAWA Hiroyuki
  Cc: Lee Schermerhorn, Andrew Morton, linux-kernel, linux-mm,
	Christoph Lameter

On (02/08/07 09:44), KAMEZAWA Hiroyuki didst pronounce:
> On Wed, 01 Aug 2007 15:02:51 -0400
> Lee Schermerhorn <Lee.Schermerhorn@hp.com> wrote:
> > [But, maybe reordering the zonelists is not such a good idea
> > when ZONE_MOVABLE is populated?]
> > 
> 
> It's case-by-case I think. In zone order with ZONE_MOVABLE case,
> user's page cache will not use ZONE_NORMAL until ZONE_MOVABLE in all node
> is exhausted. This is an expected behavior, I think.
> 

This is expected behaviour. I see no reason for lower zones to be used
for allocations that use memory from a higher zone with free memory.

> I think the real problem is the scheme for "How to set zone movable size to
> appropriate value for the system". This needs more study and documentation.
> (but maybe depends on system configuration to some extent.)
> 

It depends on the system configuration and the workload requirements.
Right now, there isn't exact information available on what size the zone
should be. It'll need to be studied over a period of time.

-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl
  2007-08-02 16:14   ` Mel Gorman
@ 2007-08-02 16:18     ` Mel Gorman
  0 siblings, 0 replies; 6+ messages in thread
From: Mel Gorman @ 2007-08-02 16:18 UTC (permalink / raw)
  To: KAMEZAWA Hiroyuki
  Cc: Lee Schermerhorn, Andrew Morton, linux-kernel, linux-mm,
	Christoph Lameter

On (02/08/07 17:14), Mel Gorman didst pronounce:
> On (02/08/07 09:44), KAMEZAWA Hiroyuki didst pronounce:
> > On Wed, 01 Aug 2007 15:02:51 -0400
> > Lee Schermerhorn <Lee.Schermerhorn@hp.com> wrote:
> > > [But, maybe reordering the zonelists is not such a good idea
> > > when ZONE_MOVABLE is populated?]
> > > 
> > 
> > It's case-by-case I think. In zone order with ZONE_MOVABLE case,
> > user's page cache will not use ZONE_NORMAL until ZONE_MOVABLE in all node
> > is exhausted. This is an expected behavior, I think.
> > 
> 
> This is expected behaviour. I see no reason for lower zones to be used
> for allocations that use memory from a higher zone with free memory.
> 

Bah. I should have thought of this better.

If you are using ZONE_MOVABLE and the zonelist is in zone order, one would
use memory from remote nodes when suitable local memory was available. I don't
have a quick answer on how this should be handled. The answer may be
something like;

o When ordering zonelists by nodes, order them so that the movable zone
  is paired with the next highest zones in a zonelist before moving to the
  next node

> > I think the real problem is the scheme for "How to set zone movable size to
> > appropriate value for the system". This needs more study and documentation.
> > (but maybe depends on system configuration to some extent.)
> > 
> 
> It depends on the system configuration and the workload requirements.
> Right now, there isn't exact information available on what size the zone
> should be. It'll need to be studied over a period of time.
> 
> -- 
> Mel Gorman
> Part-time Phd Student                          Linux Technology Center
> University of Limerick                         IBM Dublin Software Lab
> 
> --
> To unsubscribe, send a message with 'unsubscribe linux-mm' in
> the body to majordomo@kvack.org.  For more info on Linux MM,
> see: http://www.linux-mm.org/ .
> Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

-- 
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl
  2007-08-02 15:07   ` Lee Schermerhorn
@ 2007-08-03  0:27     ` KAMEZAWA Hiroyuki
  0 siblings, 0 replies; 6+ messages in thread
From: KAMEZAWA Hiroyuki @ 2007-08-03  0:27 UTC (permalink / raw)
  To: Lee Schermerhorn; +Cc: Andrew Morton, linux-kernel, linux-mm, Christoph Lameter

On Thu, 02 Aug 2007 11:07:38 -0400
Lee Schermerhorn <Lee.Schermerhorn@hp.com> wrote:

> Of course, I don't have any idea of what is a "reasonable amount".
> Guess I could look at non-movable zone memory usage in a system at
> typical or peak load to get an idea.  Anyone have any data in this
> regard?
> 
I'm sorry that I have no data and idea. 
ZONE_MOVABLE is too young to be used under business workload...

just I feel...
Considering i686 which divides memory into NORMAL and HIGHMEM, it seems
that 4G to 8G servers looks stable under various workload in my experience.

Then, at least, 12.5% to 25% of "Total Memory - Hugepages" memory should be
under ZONE_NORMAL. But this is from experience of 32bit/SMP :(

Thanks,
-Kame

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2007-08-03  0:27 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-08-01 19:02 [PATCH] 2.6.23-rc1-mm1 - fix missing numa_zonelist_order sysctl Lee Schermerhorn
2007-08-02  0:44 ` KAMEZAWA Hiroyuki
2007-08-02 15:07   ` Lee Schermerhorn
2007-08-03  0:27     ` KAMEZAWA Hiroyuki
2007-08-02 16:14   ` Mel Gorman
2007-08-02 16:18     ` Mel Gorman

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox