From: David Rientjes <rientjes@google.com>
To: Andrew Morton <akpm@osdl.org>
Cc: Paul Jackson <pj@sgi.com>,
	clameter@sgi.com, linux-mm@kvack.org, rientjes@google.com
Subject: Re: [PATCH] GFP_THISNODE for the slab allocator
Date: Sat, 16 Sep 2006 20:45:36 -0700 (PDT)	[thread overview]
Message-ID: <Pine.LNX.4.63.0609161734220.16748@chino.corp.google.com> (raw)
In-Reply-To: <Pine.LNX.4.63.0609151601230.9416@chino.corp.google.com>

On Fri, 15 Sep 2006, David Rientjes wrote:

> On Fri, 15 Sep 2006, Andrew Morton wrote:
> 
> > Yes.  Speeding up get_page_from_freelist() is less than totally trivial. 
> > I've been putting off thinking about it until we're pretty sure that there
> > aren't any other showstoppers.
> > 
> > I'm (very) impressed at how well the infrastructure which you and Christoph
> > have put together has held up under this.
> > 
> 
> I used numa=fake=64 for 64 nodes of 48M each (with my numa=fake fix).  I 
> created a 2G cpuset with 43 nodes (43*48M = ~2G) and attached 'usemem -m 
> 1500 -s 10000000 &' to it for 1.5G of anonymous memory.  I then used 
> readprofile to time and profile a kernel build of 2.6.18-rc5 with x86_64 
> defconfig in the remaining 21 nodes.
> 
> Kernel build within the 2G container:
> 	real	5m23.057s
> 	user	9m13.395s
> 	sys	1m15.417s
> 

Kernel build within 2G cpuset with Christoph's patch:
	real	5m15.758s
	user	9m10.398s
	sys	1m9.072s

> Unrestricted kernel build (no NUMA emulation or usemem):
> 	real	5m3.213s
> 	user	9m19.483s
> 	sys	0m32.014s	<-- more than twice as fast
> 
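To make the comparison of the three builds concrete, here is a minimal sketch that converts the reported 'sys' times to seconds and expresses each as a multiple of the unrestricted build. The helper name to_seconds and the dictionary labels are illustrative only. The timings are copied from this message, and each comes from a single run, so the ratios are rough.

```python
def to_seconds(stamp: str) -> float:
    """Convert a time(1)-style stamp such as '1m15.417s' to seconds."""
    minutes, seconds = stamp.rstrip("s").split("m")
    return int(minutes) * 60 + float(seconds)


# 'sys' times copied from the three kernel builds reported in this message.
sys_times = {
    "2G cpuset": "1m15.417s",
    "2G cpuset + patch": "1m9.072s",
    "unrestricted": "0m32.014s",
}

baseline = to_seconds(sys_times["unrestricted"])

for label, stamp in sys_times.items():
    secs = to_seconds(stamp)
    # Ratio of this build's system time to the unrestricted build's.
    print(f"{label:20s} {secs:8.3f}s  {secs / baseline:.2f}x unrestricted")
```

On these numbers the patched cpuset build still spends a little over twice the system time of the unrestricted build, so the patch narrows the gap but does not close it.
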
> In 2G container:
> 	10599 __cpuset_zone_allowed			50.4714
> 	 3521 mwait_idle				45.1410
> 	 1149 clear_page				20.1579
> 	   24 clear_page_end				 3.4286
> 	  215 find_get_page				 3.0282
> 	  110 pfn_to_page				 2.3913
> 	  130 __down_read_trylock			 1.9697
> 	   86 page_remove_rmap				 1.9545
> 	  150 find_vma					 1.7241
> 	   46 __strnlen_user				 1.1795
> 	   32 nr_free_pages				 1.1034
> 	   55 page_to_pfn				 1.0784
> 	   22 page_add_file_rmap			 1.0000
> 	  829 get_page_from_freelist			 0.8904
> 	 1548 do_page_fault				 0.8586
> 	   17 file_ra_state_init			 0.8500
> 	   63 _atomic_dec_and_lock			 0.7500
> 	   85 ia32_sysenter_target			 0.7083
> 	   47 cond_resched				 0.6912
> 	  198 copy_user_generic				 0.6644
> 

In 2G cpuset with Christoph's patch:
	  9232 __cpuset_zone_allowed                     43.9619
	  2083 mwait_idle                                26.7051
	   973 clear_page                                17.0702
	   131 pfn_to_page                                2.8478
	   201 find_get_page                              2.8310
	    18 clear_page_end                             2.5714
	   142 __down_read_trylock                        2.1515
	    87 page_remove_rmap                           1.9773
	   149 find_vma                                   1.7126
	    85 page_to_pfn                                1.6667
	    51 __strnlen_user                             1.3077
	    34 nr_free_pages                              1.1724
	    25 page_add_file_rmap                         1.1364
	    18 fput                                       1.0588
	    21 file_ra_state_init                         1.0500
	   108 ia32_sysenter_target                       0.9000
	    74 _atomic_dec_and_lock                       0.8810
	   818 get_page_from_freelist                     0.8786
	   133 zone_watermark_ok                          0.8526
	  1524 do_page_fault                              0.8453
	...
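
As a rough way to read the two cpuset profiles side by side, the sketch below computes the percentage change in the first column (which readprofile reports as clock ticks) for a few functions that appear in both listings. The dictionary layout and the helper percent_change are illustrative only. The tick counts are copied from this message, and since the two profiles come from separate runs, small differences may be noise.

```python
# (ticks without the patch, ticks with the patch), copied from the two
# cpuset profiles in this message.
ticks = {
    "__cpuset_zone_allowed": (10599, 9232),
    "clear_page": (1149, 973),
    "get_page_from_freelist": (829, 818),
    "do_page_fault": (1548, 1524),
}


def percent_change(before: int, after: int) -> float:
    """Relative change from 'before' to 'after', in percent."""
    return 100.0 * (after - before) / before


for name, (before, after) in ticks.items():
    change = percent_change(before, after)
    print(f"{name:26s} {before:6d} -> {after:6d}  {change:+.1f}%")
```

Even with the patch, __cpuset_zone_allowed remains the top entry in the cpuset profile, while it does not appear at all in the unrestricted listing.
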

> Unrestricted:
> 	 3719 mwait_idle				47.6795
> 	 1083 clear_page				19.0000
> 	   20 clear_page_end				 2.8571
> 	  175 find_get_page				 2.4648
> 	   77 page_remove_rmap				 1.7500
> 	  114 __down_read_trylock			 1.7273
> 	   77 pfn_to_page				 1.6739
> 	  144 find_vma					 1.6552
> 	   60 __strnlen_user				 1.5385
> 	   71 page_to_pfn				 1.3922
> 	   24 page_add_file_rmap			 1.0909
> 	   17 fput					 1.0000
> 	   80 _atomic_dec_and_lock			 0.9524
> 	    4 up_write					 0.8000
> 	 1439 do_page_fault				 0.7981
> 	   13 compat_sys_open				 0.7647
> 	  227 copy_user_generic				 0.7617
> 	   89 ia32_sysenter_target			 0.7417
> 	   21 memcmp					 0.6562
> 	   13 file_ra_state_init			 0.6500
> 	...
> 	  389 get_page_from_freelist			 0.4178
> 

		David

