From: Qu Wenruo <wqu@suse.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: Linux Memory Management List <linux-mm@kvack.org>
Subject: Re: Would it be possible to add FGP_FIXED_ORDER for __filemap_get_folio()?
Date: Wed, 13 Dec 2023 20:33:32 +1030 [thread overview]
Message-ID: <929f3dbf-7ebf-4d05-bddb-527069a13498@suse.com> (raw)
In-Reply-To: <ZXk1Hbv1YHv3i+De@casper.infradead.org>
[-- Attachment #1.1.1: Type: text/plain, Size: 2134 bytes --]
On 2023/12/13 15:07, Matthew Wilcox wrote:
> On Wed, Dec 13, 2023 at 08:52:47AM +1030, Qu Wenruo wrote:
>> I'm recently converting btrfs (for now only metadata) to go larger (order >
>> 0) folios, and so far it looks pretty good (can already pass local fstests
>> runs).
>>
>> But to achieve that btrfs metadata is not utilizing __filemap_get_folio() at
>> all, but manually allocating larger folios, then attaching them to the
>> filemap.
>>
>> This is due to 2 factors:
>>
>> - We want fixed folio size
>> - We have our own fallback path
>>
>> The whole idea is go large or go bust, either we got a large folio
>> matching our request exactly, or we fall back to the old per-page
>> allocation (the cross-page handling infrastructure is here anyway).
>
> In the happy case, are all folios attached to the mapping of the same
> order?
Yep.
> Or might you have one folio of eg, order-5 and another of
> order-2 because they're different types of metadata?
Only happy case order (4 for example), or 0 order for sad cases.
All other cases are not valid.
In the future, we may get rid of sad cases completely, if the ENOMEM
rate is low enough and we have handled all ENOMEM allocation properly.
>
> I ask because we have patches out (not merged yet) that allow for
> setting min/max order, and what you're asking for sounds like it could
> be accommodated by that.
For the current usage, we're asking for something like a allowed order
bitmap. (We only set 1 for order 0 and target order).
But that allowance for order 0 is purely because we already have
cross-page handling (only for metadata).
For the future usage, I really believe most filesystem won't be happy
with some random smaller-than-expected orders, which means they need to
implement their own cross-folio handling.
(If they have already implemented such handling, they would already
support multi-page block size, which doesn't look true for now)
Although that min/max would still help for the future multi-page data
support, we would just set the same value for min/max.
Thanks,
Qu
[-- Attachment #1.1.2: OpenPGP public key --]
[-- Type: application/pgp-keys, Size: 7027 bytes --]
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 495 bytes --]
prev parent reply other threads:[~2023-12-13 10:03 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-12-12 22:22 Qu Wenruo
2023-12-13 4:37 ` Matthew Wilcox
2023-12-13 10:03 ` Qu Wenruo [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=929f3dbf-7ebf-4d05-bddb-527069a13498@suse.com \
--to=wqu@suse.com \
--cc=linux-mm@kvack.org \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox