From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from psmtp.com (na3sys010amx198.postini.com [74.125.245.198]) by kanga.kvack.org (Postfix) with SMTP id 2006D6B0006 for ; Fri, 8 Mar 2013 10:01:07 -0500 (EST) Message-ID: <5139FD27.1030208@symas.com> Date: Fri, 08 Mar 2013 07:00:55 -0800 From: Howard Chu MIME-Version: 1.0 Subject: Re: mmap vs fs cache References: <5136320E.8030109@symas.com> <20130307154312.GG6723@quack.suse.cz> <20130308020854.GC23767@cmpxchg.org> <5139975F.9070509@symas.com> <20130308084246.GA4411@shutemov.name> <5139B214.3040303@symas.com> <5139FA13.8090305@genband.com> In-Reply-To: <5139FA13.8090305@genband.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org List-ID: To: Chris Friesen Cc: "Kirill A. Shutemov" , Johannes Weiner , Jan Kara , linux-kernel , linux-mm@kvack.org Chris Friesen wrote: > On 03/08/2013 03:40 AM, Howard Chu wrote: > >> There is no way that a process that is accessing only 30GB of a mmap >> should be able to fill up 32GB of RAM. There's nothing else running on >> the machine, I've killed or suspended everything else in userland >> besides a couple shells running top and vmstat. When I manually >> drop_caches repeatedly, then eventually slapd RSS/SHR grows to 30GB and >> the physical I/O stops. > > Is it possible that the kernel is doing some sort of automatic > readahead, but it ends up reading pages corresponding to data that isn't > ever queried and so doesn't get mapped by the application? Yes, that's what I was thinking. I added a posix_madvise(..POSIX_MADV_RANDOM) but that had no effect on the test. First obvious conclusion - kswapd is being too aggressive. When free memory hits the low watermark, the reclaim shrinks slapd down from 25GB to 18-19GB, while the page cache still contains ~7GB of unmapped pages. Ideally I'd like a tuning knob so I can say to keep no more than 2GB of unmapped pages in the cache. (And the desired effect of that would be to allow user processes to grow to 30GB total, in this case.) I mentioned this "unmapped page cache control" post already http://lwn.net/Articles/436010/ but it seems that the idea was ultimately rejected. Is there anything else similar in current kernels? -- -- Howard Chu CTO, Symas Corp. http://www.symas.com Director, Highland Sun http://highlandsun.com/hyc/ Chief Architect, OpenLDAP http://www.openldap.org/project/ -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org