From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-ed1-f70.google.com (mail-ed1-f70.google.com [209.85.208.70]) by kanga.kvack.org (Postfix) with ESMTP id 6F0D36B4C40 for ; Wed, 29 Aug 2018 11:27:19 -0400 (EDT) Received: by mail-ed1-f70.google.com with SMTP id g18-v6so2426371edg.14 for ; Wed, 29 Aug 2018 08:27:19 -0700 (PDT) Received: from mx1.suse.de (mx2.suse.de. [195.135.220.15]) by mx.google.com with ESMTPS id b50-v6si4253468edc.408.2018.08.29.08.27.18 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 29 Aug 2018 08:27:18 -0700 (PDT) Date: Wed, 29 Aug 2018 17:27:16 +0200 From: Michal Hocko Subject: Re: Caching/buffers become useless after some time Message-ID: <20180829152716.GB10223@dhcp22.suse.cz> References: <6ef03395-6baa-a6e5-0d5a-63d4721e6ec0@suse.cz> <20180823122111.GG29735@dhcp22.suse.cz> <76c6e92b-df49-d4b5-27f7-5f2013713727@suse.cz> <8b211f35-0722-cd94-1360-a2dd9fba351e@suse.cz> <20180829150136.GA10223@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Sender: owner-linux-mm@kvack.org List-ID: To: Marinko Catovic Cc: Vlastimil Babka , Christopher Lameter , linux-mm@kvack.org On Wed 29-08-18 17:13:59, Marinko Catovic wrote: > > > > trace data which starts _before_ the cache dropdown starts and while it > > is decreasing should be the first step. Ideally along with /proc/vmstat > > gathered at the same time. I am pretty sure you have some high order > > memory consumer which forces the reclaim and we over reclaim. Last data > > was not really conclusive as it didn't really captured the dropdown > > IIRC. > > > > with before you mean in a totally healthy state? yep > as I can not tell when decreasing starts this would mean collecting data > over days perhaps. however, I have no issue with that. yeah, you can pipe the trace buffer to gzip and reduce the output considerably. > As I do not want to miss anything that might help you, could you please > provide the commands for all the data you require? Use the same set of commands for tracing I have provided earlier + add the compresssion cat /debug/trace/trace_pipe | gzip > file.gz + the loop to gather vmstat while true do cp /proc/vmstat vmstat.$(date +%s) sleep 5s done > one host is at a healthy state right now, I'd run that over there immediately. Let's see what we can get from here. -- Michal Hocko SUSE Labs