From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wi0-f173.google.com (mail-wi0-f173.google.com [209.85.212.173]) by kanga.kvack.org (Postfix) with ESMTP id 2C5D96B0253 for ; Mon, 3 Aug 2015 16:56:13 -0400 (EDT) Received: by wibud3 with SMTP id ud3so151392765wib.1 for ; Mon, 03 Aug 2015 13:56:12 -0700 (PDT) Received: from gum.cmpxchg.org (gum.cmpxchg.org. [85.214.110.215]) by mx.google.com with ESMTPS id hw4si27855072wjb.135.2015.08.03.13.56.11 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 03 Aug 2015 13:56:11 -0700 (PDT) Date: Mon, 3 Aug 2015 16:55:32 -0400 From: Johannes Weiner Subject: Re: [PATCH 2/3] mm: make workingset detection logic memcg aware Message-ID: <20150803205532.GA19478@cmpxchg.org> References: <9662034e14549b9e1445684f674063ce8b092cb0.1438599199.git.vdavydov@parallels.com> <20150803132358.GA18399@cmpxchg.org> <20150803135229.GA11971@esperanza> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20150803135229.GA11971@esperanza> Sender: owner-linux-mm@kvack.org List-ID: To: Vladimir Davydov Cc: Andrew Morton , Michal Hocko , Minchan Kim , Rik van Riel , Mel Gorman , linux-mm@kvack.org, linux-kernel@vger.kernel.org On Mon, Aug 03, 2015 at 04:52:29PM +0300, Vladimir Davydov wrote: > On Mon, Aug 03, 2015 at 09:23:58AM -0400, Johannes Weiner wrote: > > On Mon, Aug 03, 2015 at 03:04:22PM +0300, Vladimir Davydov wrote: > > > @@ -179,8 +180,9 @@ static void unpack_shadow(void *shadow, > > > eviction = entry; > > > > > > *zone = NODE_DATA(nid)->node_zones + zid; > > > + *lruvec = mem_cgroup_page_lruvec(page, *zone); > > > > > > - refault = atomic_long_read(&(*zone)->inactive_age); > > > + refault = atomic_long_read(&(*lruvec)->inactive_age); > > > mask = ~0UL >> (NODES_SHIFT + ZONES_SHIFT + > > > RADIX_TREE_EXCEPTIONAL_SHIFT); > > > /* > > > > You can not compare an eviction shadow entry from one lruvec with the > > inactive age of another lruvec. The inactive ages are not related and > > might differ significantly: memcgs are created ad hoc, memory hotplug, > > page allocator fairness drift. In those cases the result will be pure > > noise. > > That's true. If a page is evicted in one cgroup and then refaulted in > another, the activation will be random. However, is it a frequent event > when a page used by and evicted from one cgroup is refaulted in another? > If there is no active file sharing (is it common?), this should only > happen to code pages, but those will most likely end up in the cgroup > that has the greatest limit, so they shouldn't be evicted and refaulted > frequently. So the question is can we tolerate some noise here? It's not just the memcg, it's also the difference between zones themselves. > > As much as I would like to see a simpler way, I am pessimistic that > > there is a way around storing memcg ids in the shadow entries. > > On 32 bit there is too little space for storing memcg id. We can shift > the distance so that it would fit and still contain something meaningful > though, but that would take much more code, so I'm trying to try the > simplest way first. It should be easy to trim quite a few bits from the timestamp, both in terms of available memory as well as in terms of distance granularity. We probably don't care if the refault distance is only accurate to say 2MB, and how many pages do we have to represent on 32-bit in the first place? Once we trim that, we should be able to fit a CSS ID. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org