From: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
To: balbir@linux.vnet.ibm.com
Cc: "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"linux-mm@kvack.org" <linux-mm@kvack.org>,
"nishimura@mxp.nes.nec.co.jp" <nishimura@mxp.nes.nec.co.jp>,
cl@linux-foundation.org,
"akpm@linux-foundation.org" <akpm@linux-foundation.org>
Subject: Re: [PATCH 0/2] memcg make use of new percpu implementations
Date: Mon, 9 Nov 2009 16:34:20 +0900 [thread overview]
Message-ID: <20091109163420.5a2fa803.kamezawa.hiroyu@jp.fujitsu.com> (raw)
In-Reply-To: <20091109070421.GD3042@balbir.in.ibm.com>
On Mon, 9 Nov 2009 12:34:21 +0530
Balbir Singh <balbir@linux.vnet.ibm.com> wrote:
> * KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> [2009-11-06 17:52:42]:
>
> > Hi,
> >
> > Recent updates on dynamic percpu allocation looks good and I tries to rewrite
> > memcg's poor implementation of percpu status counter.
> > (It's not NUMA-aware ...)
> > Thanks for great works.
> >
> > For this time. I added Christoph to CC because I'm not fully sure my usage of
> > __this_cpu_xxx is correct...I'm glad if you check the usage when you have time.
> >
> >
> > Patch 1/2 is just clean up (prepare for patch 2/2)
> > Patch 2/2 is for percpu.
> >
> > Tested on my 8cpu box and works well.
> > Pathcesa are against the latest mmotm.
>
> How do the test results look? DO you see a significant boost?
Because my test enviroment is just an SMP (not NUMA), improvement will
not be siginificant (visible), I think.
But It's good to make use of recent updates of per_cpu implementation. We
can make use of offset calculation methods of them and one-instruction-access
macros.
This is code size (PREEMPT=y)
[Before patch]
[kamezawa@bluextal mmotm-2.6.32-Nov2]$ size mm/memcontrol.o
text data bss dec hex filename
22403 3420 4132 29955 7503 mm/memcontrol.o
[After patch]
[kamezawa@bluextal mmotm-2.6.32-Nov2]$ size mm/memcontrol.o
text data bss dec hex filename
22188 3420 4132 29740 742c mm/memcontrol.o
Then, text size is surely reduced.
One example is mem_cgroup_swap_statistics(). This function is inlined
after this patch.
this code: mem_cgroup_uncharge_swap(), modifies percpu counter.
==
memcg = mem_cgroup_lookup(id);
if (memcg) {
/*
* We uncharge this because swap is freed.
* This memcg can be obsolete one. We avoid calling css_tryget
*/
if (!mem_cgroup_is_root(memcg))
res_counter_uncharge(&memcg->memsw, PAGE_SIZE);
mem_cgroup_swap_statistics(memcg, false);
mem_cgroup_put(memcg);
}
==
[before patch]
mem_cgroup_swap_statistics() is not inlined and uses 0x69 bytes.
0000000000001d30 <mem_cgroup_swap_statistics>:
1d30: push %rbp
1d31: mov %rsp,%rbp
1d34: push %r12
<snip>
1d88: callq 1d8d <mem_cgroup_swap_statistics+0x5d>
1d8d: nopl (%rax)
1d90: jmp 1d83 <mem_cgroup_swap_statistics+0x53>
1d92: nopw %cs:0x0(%rax,%rax,1)
1d99:
[After patch]
mem_cgroup_uncharge_swap()'s inlined code.
3b67: cmp 0x0(%rip),%rax # 3b6e <mem_cgroup_uncharge_swap+0
xbe>
3b6e: je 3b81 <mem_cgroup_uncharge_swap+0xd1> <=== check mem_cgroup_is_root
3b70: lea 0x90(%rax),%rdi
3b77: mov $0x1000,%esi
3b7c: callq 3b81 <mem_cgroup_uncharge_swap+0xd1> <=== calling res_counter_uncahrge()
3b81: 48 89 df mov %rbx,%rdi
3b84: 48 8b 83 70 01 00 00 mov 0x170(%rbx),%rax <=== get offset of mem->cpustat
3b8b: 65 48 83 40 30 ff addq $0xffffffffffffffff,%gs:0x30(%rax) mem->cpustat.count[index]--;
3b91: e8 6a e0 ff ff callq 1c00 <mem_cgroup_put>
This uses 2 instruction.
Then, code size reduction is enough large, I think.
>BTW, I've been experimenting a bit with the earlier percpu counter patches,
> I might post an iteration once I have some good results.
>
Thank you, it's helpful.
Regards,
-Kame
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
prev parent reply other threads:[~2009-11-09 7:37 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-11-06 8:52 KAMEZAWA Hiroyuki
2009-11-06 8:54 ` [PATCH 1/2] memcg : rename index to short name KAMEZAWA Hiroyuki
2009-11-06 8:55 ` [PATCH 2/2] memcg : rewrite percpu countings with new interfaces KAMEZAWA Hiroyuki
2009-11-06 17:27 ` Christoph Lameter
2009-11-06 18:44 ` KAMEZAWA Hiroyuki
2009-11-09 6:44 ` [PATCH 2/2] memcg : rewrite percpu countings with new interfaces v2 KAMEZAWA Hiroyuki
2009-11-09 7:07 ` [PATCH 2/2] memcg : rewrite percpu countings with new interfaces Balbir Singh
2009-11-09 8:36 ` KAMEZAWA Hiroyuki
2009-11-09 7:04 ` [PATCH 0/2] memcg make use of new percpu implementations Balbir Singh
2009-11-09 7:34 ` KAMEZAWA Hiroyuki [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20091109163420.5a2fa803.kamezawa.hiroyu@jp.fujitsu.com \
--to=kamezawa.hiroyu@jp.fujitsu.com \
--cc=akpm@linux-foundation.org \
--cc=balbir@linux.vnet.ibm.com \
--cc=cl@linux-foundation.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=nishimura@mxp.nes.nec.co.jp \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox