From: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
To: Dave Chinner <david@fromorbit.com>
Cc: Greg Thelen <gthelen@google.com>, Vivek Goyal <vgoyal@redhat.com>,
Curt Wohlgemuth <curtw@google.com>,
James Bottomley <James.Bottomley@hansenpartnership.com>,
lsf@lists.linux-foundation.org, linux-fsdevel@vger.kernel.org,
linux-mm@kvack.org
Subject: Re: [Lsf] IO less throttling and cgroup aware writeback
Date: Tue, 12 Apr 2011 12:17:58 +0900 [thread overview]
Message-ID: <20110412121758.02d52668.kamezawa.hiroyu@jp.fujitsu.com> (raw)
In-Reply-To: <20110408012556.GU31057@dastard>
On Fri, 8 Apr 2011 11:25:56 +1000
Dave Chinner <david@fromorbit.com> wrote:
> On Thu, Apr 07, 2011 at 05:59:35PM -0700, Greg Thelen wrote:
> > cc: linux-mm
> >
> > Dave Chinner <david@fromorbit.com> writes:
> > If we later find that this supposed uncommon shared inode case is
> > important then we can either implement the previously described lru
> > scanning in mem_cgroup_balance_dirty_pages() or consider extending the
> > bdi/memcg/inode data structures (perhaps with a memcg_mapping) to
> > describe such sharing.
>
> Hmm, another idea I just had. What we're trying to avoid is needing
> to a) track inodes in multiple lists, and b) scanning to find
> something appropriate to write back.
>
> Rather than tracking at page or inode granularity, how about
> tracking "associated" memcgs at the memcg level? i.e. when we detect
> an inode is already dirty in another memcg, link the current memcg
> to the one that contains the inode. Hence if we get a situation
> where a memcg is throttling with no dirty inodes, it can quickly
> find and start writeback in an "associated" memcg that it _knows_
> contain shared dirty inodes. Once we've triggered writeback on an
> associated memcg, it is removed from the list....
>
Thank you for an idea. I think we can start from following.
0. add some feature to set 'preferred inode' for memcg.
I think
fadvise(fd, MAKE_THIF_FILE_UNDER_MY_MEMCG)
or
echo fd > /memory.move_file_here
can be added.
1. account dirty pages for a memcg. as Greg does.
2. at the same time, account dirty pages made dirty by threads in a memcg.
(to check which internal/external thread made page dirty.)
3. calculate internal/external dirty pages gap.
With gap, we can have several choices.
4-a. If it exceeds some thresh, do some notify.
userland daemon can decide to move pages to some memcg or not.
(Of coruse, if the _shared_ dirty can be caught before making page dirty,
user daemon can move inode before making it dirty by inotify().)
I like helps of userland because it can be more flexible than kernel,
it can eat config files.
4-b. set some flag to memcg as 'this memcg is dirty busy because of some extarnal
threads'. When a page is newly dirtied, check the thread's memcg.
If the memcg of a thread and a page is different from each other,
write a memo as 'please check this memcgid , too' in task_struct and
do double-memcg-check in balance_dirty_pages().
(How to clear per-task flag is difficult ;)
I don't want to handle 3-100 threads does shared write case..;)
we'll need 4-a.
Thanks,
-Kame
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
prev parent reply other threads:[~2011-04-12 3:24 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20110331222756.GC2904@dastard>
[not found] ` <20110401171838.GD20986@redhat.com>
[not found] ` <20110401214947.GE6957@dastard>
[not found] ` <20110405131359.GA14239@redhat.com>
[not found] ` <20110405225639.GB31057@dastard>
[not found] ` <BANLkTikDPHcpjmb-EAiX+MQcu7hfE730DQ@mail.gmail.com>
[not found] ` <20110406153954.GB18777@redhat.com>
[not found] ` <xr937hb7568t.fsf@gthelen.mtv.corp.google.com>
[not found] ` <20110406233602.GK31057@dastard>
[not found] ` <20110407192424.GE27778@redhat.com>
[not found] ` <20110407234249.GE30279@dastard>
2011-04-08 0:59 ` Greg Thelen
2011-04-08 1:25 ` Dave Chinner
2011-04-12 3:17 ` KAMEZAWA Hiroyuki [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110412121758.02d52668.kamezawa.hiroyu@jp.fujitsu.com \
--to=kamezawa.hiroyu@jp.fujitsu.com \
--cc=James.Bottomley@hansenpartnership.com \
--cc=curtw@google.com \
--cc=david@fromorbit.com \
--cc=gthelen@google.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lsf@lists.linux-foundation.org \
--cc=vgoyal@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox