From: Qu Wenruo <wqu@suse.com>
To: Matthew Wilcox <willy@infradead.org>, Qu Wenruo <quwenruo.btrfs@gmx.com>
Cc: linux-btrfs@vger.kernel.org,
Linux Memory Management List <linux-mm@kvack.org>
Subject: Re: [PATCH RFC 0/2] btrfs: defrag: further preparation for multi-page sector size
Date: Wed, 24 Jan 2024 16:20:16 +1030 [thread overview]
Message-ID: <a0779a67-6f9b-45e2-b07d-42cd133a5795@suse.com> (raw)
In-Reply-To: <ZbCjkrOwaMyvhRD8@casper.infradead.org>
[-- Attachment #1.1.1: Type: text/plain, Size: 2352 bytes --]
On 2024/1/24 16:13, Matthew Wilcox wrote:
> On Wed, Jan 24, 2024 at 03:57:39PM +1030, Qu Wenruo wrote:
>>
>>
>> On 2024/1/24 15:18, Matthew Wilcox wrote:
>>> On Wed, Jan 24, 2024 at 02:33:22PM +1030, Qu Wenruo wrote:
>>>> I'm pretty sure we would have some filesystems go utilizing larger folios to
>>>> implement their multi-page block size support.
>>>>
>>>> Thus in that case, can we have an interface change to make all folio
>>>> versions of filemap_*() to accept a file offset instead of page index?
>>>
>>> You're confused. There's no change needed to the filemap API to support
>>> large folios used by large block sizes. Quite possibly more of btrfs
>>> is confused, but it's really very simple. index == pos / PAGE_SIZE.
>>> That's all. Even if you have a 64kB block size device on a 4kB PAGE_SIZE
>>> machine.
>>
>> Yes, I understand that filemap API is always working on PAGE_SHIFTed index.
>
> OK, good.
>
>> The concern is, (hopefully) with more fses going to utilized large
>> folios, there would be two shifts.
>>
>> One folio shift (ilog2(blocksize)), one PAGE_SHIFT for filemap interfaces.
>
> Don't shift the file position by the folio_shift(). You want to support
> large(r) folios _and_ large blocksizes at the same time. ie 64kB might
> be the block size, but all that would mean would be that folio_shift()
> would be at least 16. It might be 17, 18 or 21 (for a THP).
Indeed, I forgot we have THP.
>
> Filesystems already have to deal with different PAGE_SIZE, SECTOR_SIZE,
> fsblock size and LBA size. Folios aren't making things any worse here
> (they're also not making anything better in this area, but I never
> claimed they would).
OK, that makes sense.
So all the folio shifts would be an fs internal deal, and we have to
handle it properly.
>
> btrfs is slightly unusual in that it defined PAGE_SIZE and fsblock size
> to be the same (and then had to deal with the consequences of arm64/x86
> interoperability later). But most filesystems have pretty good separation
> of the four concepts.
Indeed, although I also found most major fses do not support larger
block size (> PAGE_SIZE) either.
I guess subpage is simpler in the past, and hopefully with larger folio,
we can see more fses support multi-page blocksize soon.
Thanks,
Qu
[-- Attachment #1.1.2: OpenPGP public key --]
[-- Type: application/pgp-keys, Size: 7027 bytes --]
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 495 bytes --]
prev parent reply other threads:[~2024-01-24 5:50 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <cover.1706068026.git.wqu@suse.com>
2024-01-24 4:03 ` Qu Wenruo
2024-01-24 4:48 ` Matthew Wilcox
2024-01-24 5:27 ` Qu Wenruo
2024-01-24 5:43 ` Matthew Wilcox
2024-01-24 5:50 ` Qu Wenruo [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=a0779a67-6f9b-45e2-b07d-42cd133a5795@suse.com \
--to=wqu@suse.com \
--cc=linux-btrfs@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=quwenruo.btrfs@gmx.com \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox