From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail191.messagelabs.com (mail191.messagelabs.com [216.82.242.19]) by kanga.kvack.org (Postfix) with SMTP id 3F7F58D0039 for ; Sun, 30 Jan 2011 19:02:06 -0500 (EST) Received: from m4.gw.fujitsu.co.jp (unknown [10.0.50.74]) by fgwmail5.fujitsu.co.jp (Postfix) with ESMTP id 75A493EE0BD for ; Mon, 31 Jan 2011 09:02:03 +0900 (JST) Received: from smail (m4 [127.0.0.1]) by outgoing.m4.gw.fujitsu.co.jp (Postfix) with ESMTP id 595152AEA81 for ; Mon, 31 Jan 2011 09:02:03 +0900 (JST) Received: from s4.gw.fujitsu.co.jp (s4.gw.fujitsu.co.jp [10.0.50.94]) by m4.gw.fujitsu.co.jp (Postfix) with ESMTP id 35C6645DE55 for ; Mon, 31 Jan 2011 09:02:03 +0900 (JST) Received: from s4.gw.fujitsu.co.jp (localhost.localdomain [127.0.0.1]) by s4.gw.fujitsu.co.jp (Postfix) with ESMTP id 24E991DB8037 for ; Mon, 31 Jan 2011 09:02:03 +0900 (JST) Received: from ml13.s.css.fujitsu.com (ml13.s.css.fujitsu.com [10.249.87.103]) by s4.gw.fujitsu.co.jp (Postfix) with ESMTP id D5E3A1DB803B for ; Mon, 31 Jan 2011 09:02:02 +0900 (JST) Date: Mon, 31 Jan 2011 08:55:47 +0900 From: KAMEZAWA Hiroyuki Subject: Re: [BUGFIX][PATCH 0/4] Fixes for memcg with THP Message-Id: <20110131085547.960f6702.kamezawa.hiroyu@jp.fujitsu.com> In-Reply-To: References: <20110128122229.6a4c74a2.kamezawa.hiroyu@jp.fujitsu.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org To: Balbir Singh Cc: "linux-mm@kvack.org" , "linux-kernel@vger.kernel.org" , "hannes@cmpxchg.org" , "nishimura@mxp.nes.nec.co.jp" List-ID: On Sat, 29 Jan 2011 18:17:56 +0530 Balbir Singh wrote: > On Fri, Jan 28, 2011 at 8:52 AM, KAMEZAWA Hiroyuki > wrote: > > > > On recent -mm, when I run make -j 8 under 200M limit of memcg, as > > == > > # mount -t cgroup none /cgroup/memory -o memory > > # mkdir /cgroup/memory/A > > # echo 200M > /cgroup/memory/A/memory.limit_in_bytes > > # echo $$ > /cgroup/memory/A/tasks > > # make -j 8 kernel > > == > > > > I see hangs with khugepaged. That's because memcg's memory reclaim > > routine doesn't handle HUGE_PAGE request in proper way. And khugepaged > > doesn't know about memcg. > > > > This patch set is for fixing above hang. Patch 1-3 seems obvious and > > has the same concept as patches in RHEL. > > Do you have any backtraces? Are they in the specific patches? > Jan 18 10:28:29 rhel6-test kernel: [56245.286007] INFO: rcu_sched_state detected stall on CPU 0 (t=60000 jiffies) Jan 18 10:28:29 rhel6-test kernel: [56245.286007] sending NMI to all CPUs: Jan 18 10:28:29 rhel6-test kernel: [56245.286007] NMI backtrace for cpu 0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] CPU 0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] arch_trigger_all_cpu_bac ktrace+0x5e/0xa0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] __rcu_pending+0x169/0x3b 0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? tick_sched_timer+0x0/0 xc0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] rcu_check_callbacks+0x6c /0x120 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] update_process_times+0x4 8/0x90 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] tick_sched_timer+0x66/0x c0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] __run_hrtimer+0x90/0x1e0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? kvm_clock_get_cycles+0 x9/0x10 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] hrtimer_interrupt+0xde/0 x240 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] smp_apic_timer_interrupt +0x6b/0x9b Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] apic_timer_interrupt+0x13/0x20 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? res_counter_charge+0xda/0x100 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] __mem_cgroup_try_charge+0x199/0x5d0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] mem_cgroup_newpage_charge+0x45/0x50 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] khugepaged+0x924/0x1430 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? autoremove_wake_function+0x0/0x40 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? khugepaged+0x0/0x1430 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] kthread+0x96/0xa0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] kernel_thread_helper+0x4/0x10 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? kthread+0x0/0xa0 Jan 18 10:28:29 rhel6-test kernel: [56245.286007] [] ? kernel_thread_helper+0x0/0x10 Thanks, -Kame -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/ Don't email: email@kvack.org