From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail138.messagelabs.com (mail138.messagelabs.com [216.82.249.35]) by kanga.kvack.org (Postfix) with ESMTP id 4C2BE6B00CA for ; Fri, 7 Jan 2011 17:24:08 -0500 (EST) Received: from kpbe18.cbf.corp.google.com (kpbe18.cbf.corp.google.com [172.25.105.82]) by smtp-out.google.com with ESMTP id p07MO5qi017465 for ; Fri, 7 Jan 2011 14:24:05 -0800 Received: from pwj4 (pwj4.prod.google.com [10.241.219.68]) by kpbe18.cbf.corp.google.com with ESMTP id p07MO3cn018421 for ; Fri, 7 Jan 2011 14:24:03 -0800 Received: by pwj4 with SMTP id 4so2343650pwj.10 for ; Fri, 07 Jan 2011 14:24:03 -0800 (PST) Date: Fri, 7 Jan 2011 14:23:59 -0800 (PST) From: David Rientjes Subject: Re: [RFC][PATCH 0/2] Tunable watermark In-Reply-To: <65795E11DBF1E645A09CEC7EAEE94B9C3A30A295@USINDEVS02.corp.hds.com> Message-ID: References: <65795E11DBF1E645A09CEC7EAEE94B9C3A30A295@USINDEVS02.corp.hds.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-linux-mm@kvack.org To: Satoru Moriya Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org, Andrew Morton , Mel Gorman , KOSAKI Motohiro , Randy Dunlap , dle-develop@lists.sourceforge.net, Seiji Aguchi List-ID: On Fri, 7 Jan 2011, Satoru Moriya wrote: > This patchset introduces a new knob to control each watermark > separately. > > [Purpose] > To control the timing at which kswapd/direct reclaim starts(ends) > based on memory pressure and/or application characteristics > because direct reclaim makes a memory alloc/access latency worse. > (We'd like to avoid direct reclaim to keep latency low even if > under the high memory pressure.) > > [Problem] > The thresholds kswapd/direct reclaim starts(ends) depend on > watermark[min,low,high] and currently all watermarks are set > based on min_free_kbytes. min_free_kbytes is the amount of > free memory that Linux VM should keep at least. > Not completely, it also depends on the amount of lowmem (because of the reserve setup next) and the amount of memory in each zone. > This means the difference between thresholds at which kswapd > starts and direct reclaim starts depends on the amount of free > memory. > > On the other hand, the amount of required memory depends on > applications. Therefore when it allocates/access memory more > than the difference between watemark[low] and watermark[min], > kernel sometimes runs direct reclaim before allocation and > it makes application latency bigger. > > [Solution] > To avoid the situation above, this patch set introduces new > tunables /proc/sys/vm/wmark_min_kbytes, wmark_low_kbytes and > wmark_high_kbytes. Each entry controls watermark[min], > watermark[low] and watermark[high] separately. > By using these parameters one can make the difference between > min and low bigger than the amount of memory which applications > require. > I really dislike this because it adds additional tunables that should already be handled correctly by the VM and it's very difficult for users to know what to tune these values to; these watermarks (with the exception of min) are supposed to be internal to the VM implementation. You didn't mention why it wouldn't be possible to modify setup_per_zone_wmarks() in some way for your configuration so this happens automatically. If you can find a deterministic way to set these watermarks from userspace, you should be able to do it in the kernel as well based on the configuration. I think we should invest time in making sure the VM works for any type of workload thrown at it instead of relying on userspace making lots of adjustments. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/ Don't email: email@kvack.org