From: Gioh Kim <gioh.kim@lge.com>
To: Jan Kara <jack@suse.cz>
Cc: Peter Zijlstra <peterz@infradead.org>,
Alexander Viro <viro@zeniv.linux.org.uk>,
Andrew Morton <akpm@linux-foundation.org>,
"Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
Theodore Ts'o <tytso@mit.edu>,
Andreas Dilger <adilger.kernel@dilger.ca>,
linux-ext4@vger.kernel.org, linux-mm@kvack.org,
Minchan Kim <minchan@kernel.org>, Joonsoo Kim <js1304@gmail.com>
Subject: Re: [PATCH 0/2] new API to allocate buffer-cache for superblock in non-movable area
Date: Fri, 01 Aug 2014 09:07:11 +0900 [thread overview]
Message-ID: <53DADA2F.1020404@lge.com> (raw)
In-Reply-To: <20140731122114.GA5240@quack.suse.cz>
2014-07-31 i??i?? 9:21, Jan Kara i?' e,?:
> On Thu 31-07-14 09:37:15, Gioh Kim wrote:
>>
>>
>> 2014-07-31 i??i ? 9:03, Jan Kara i?' e,?:
>>> On Thu 31-07-14 08:54:40, Gioh Kim wrote:
>>>> 2014-07-30 i??i?? 7:11, Jan Kara i?' e,?:
>>>>> On Wed 30-07-14 16:44:24, Gioh Kim wrote:
>>>>>> 2014-07-22 i??i?? 6:38, Jan Kara i?' e,?:
>>>>>>> On Tue 22-07-14 09:30:05, Peter Zijlstra wrote:
>>>>>>>> On Tue, Jul 22, 2014 at 02:18:47PM +0900, Gioh Kim wrote:
>>>>>>>>> Hello,
>>>>>>>>>
>>>>>>>>> This patch try to solve problem that a long-lasting page cache of
>>>>>>>>> ext4 superblock disturbs page migration.
>>>>>>>>>
>>>>>>>>> I've been testing CMA feature on my ARM-based platform
>>>>>>>>> and found some pages for page caches cannot be migrated.
>>>>>>>>> Some of them are page caches of superblock of ext4 filesystem.
>>>>>>>>>
>>>>>>>>> Current ext4 reads superblock with sb_bread(). sb_bread() allocates page
>>>>>>>> >from movable area. But the problem is that ext4 hold the page until
>>>>>>>>> it is unmounted. If root filesystem is ext4 the page cannot be migrated forever.
>>>>>>>>>
>>>>>>>>> I introduce a new API for allocating page from non-movable area.
>>>>>>>>> It is useful for ext4 and others that want to hold page cache for a long time.
>>>>>>>>
>>>>>>>> There's no word on why you can't teach ext4 to still migrate that page.
>>>>>>>> For all I know it might be impossible, but at least mention why.
>>>>>>
>>>>>> I am very sorry for lacking of details.
>>>>>>
>>>>>> In ext4_fill_super() the buffer-head of superblock is stored in sbi->s_sbh.
>>>>>> The page belongs to the buffer-head is allocated from movable area.
>>>>>> To migrate the page the buffer-head should be released via brelse().
>>>>>> But brelse() is not called until unmount.
>>>>> Hum, I don't see where in the code do we check buffer_head use count. Can
>>>>> you please point me? Thanks.
>>>>
>>>> Filesystem code does not check buffer_head use count. sb_bread() returns
>>>> the buffer_head that is included in bh_lru and has non-zero use count.
>>>> You can see the bh_lru code in buffer.c: __find_get_clock() and
>>>> lookup_bh_lru(). bh_lru_install() inserts the buffer_head into the
>>>> bh_lru(). It first calls get_bh() to increase the use count and insert
>>>> bh into the lru array.
>>>>
>>>> The buffer_head use count is non-zero until brelse() is called.
>>> So I probably didn't phrase the question precisely enough. What I was
>>> asking about is where exactly *migration* code checks buffer use count?
>>> Because as I'm looking at buffer_migrate_page() we lock the buffers on a
>>> migrated page but we don't look at buffer use counts... So it seems to me
>>> that migration of a page with buffers should succeed even if buffer head
>>> has an elevated use count. Now I think that it *should* check the buffer
>>> use counts (it is dangerous to migrate buffers someone holds reference to)
>>> but I just cannot find that place. Or does CMA use some other migration
>>> function for buffer pages than buffer_migrate_page()?
>>
>> CMA allocation function is cma_alloc().
>> Function flow is alloc_contig_range() -> __alloc_contig_migrate_range() -> migrate_pages -> unmap_and_move
>> -> __unmap_and_move -> try_to_free_buffers -> drop_buffers -> buffer_busy.
>>
>> The buffer_busy() is checking b_count.
>> If buffer is busy buffer-cache cannot be removed.
>> So the page that includes buffer_head and the page that is refered by
>> buffer_head are not movable.
>>
>> Is this what you need?
> Yes, this is what I was asking about. Thanks! But as I'm looking into
> __unmap_and_move() it calls try_to_free_buffers() only if page->mapping ==
> NULL. As the comment before that test states, this can happen only for swap
> cache (not our case) or for pagecache pages that were truncated and not yet
> fully cleaned up. But superblock page cannot really be truncated. So I
> somewhat doubt you can hit the above path for a page holding superblock...
I printed the address of busy buffer_head in drop_buffers() that is called by try_to_free_buffers().
And I printed the address of sb buffer_head.
They were the same.
I'm going to check page->mapping.
>
> Honza
>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2014-08-01 0:07 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <53CDF437.4090306@lge.com>
[not found] ` <20140722073005.GT3935@laptop>
2014-07-22 9:38 ` Jan Kara
2014-07-30 7:44 ` Gioh Kim
2014-07-30 7:57 ` Kyungmin Park
2014-07-30 10:11 ` Jan Kara
2014-07-30 10:19 ` Peter Zijlstra
2014-07-30 23:45 ` Gioh Kim
2014-07-30 23:54 ` Gioh Kim
2014-07-31 0:03 ` Jan Kara
2014-07-31 0:37 ` Gioh Kim
2014-07-31 12:21 ` Jan Kara
2014-08-01 0:07 ` Gioh Kim [this message]
2014-08-01 1:06 ` Gioh Kim
2014-08-01 9:57 ` Jan Kara
2014-08-01 13:36 ` Peter Zijlstra
2014-08-01 15:24 ` Jan Kara
2014-08-01 16:04 ` Peter Zijlstra
2014-08-06 6:15 ` Gioh Kim
2014-08-01 8:34 ` Joonsoo Kim
2014-08-01 9:15 ` Jan Kara
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=53DADA2F.1020404@lge.com \
--to=gioh.kim@lge.com \
--cc=adilger.kernel@dilger.ca \
--cc=akpm@linux-foundation.org \
--cc=jack@suse.cz \
--cc=js1304@gmail.com \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=minchan@kernel.org \
--cc=paulmck@linux.vnet.ibm.com \
--cc=peterz@infradead.org \
--cc=tytso@mit.edu \
--cc=viro@zeniv.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox