From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id BE7F8D41D44 for ; Tue, 12 Nov 2024 01:02:01 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 4FE2A8D0013; Mon, 11 Nov 2024 20:02:01 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 484188D0001; Mon, 11 Nov 2024 20:02:01 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 3232B8D0013; Mon, 11 Nov 2024 20:02:01 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0014.hostedemail.com [216.40.44.14]) by kanga.kvack.org (Postfix) with ESMTP id 0F7EA8D0001 for ; Mon, 11 Nov 2024 20:02:01 -0500 (EST) Received: from smtpin13.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id 99E45161C14 for ; Tue, 12 Nov 2024 01:02:00 +0000 (UTC) X-FDA: 82775639100.13.98D6D70 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by imf12.hostedemail.com (Postfix) with ESMTP id 5F82140006 for ; Tue, 12 Nov 2024 01:01:40 +0000 (UTC) Authentication-Results: imf12.hostedemail.com; dkim=pass header.d=kernel.org header.s=k20201202 header.b=hwGY6DjE; dmarc=pass (policy=quarantine) header.from=kernel.org; spf=pass (imf12.hostedemail.com: domain of djwong@kernel.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=djwong@kernel.org ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1731373144; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=jUTZRssDpLg/vLnsV2aSg6JiGNJHGP/twekYzdNwpBo=; b=btVcP7ToQr6R07Cf9ViP3MHGPheMWeLCxWugnrajYCvB+SB227V5JHqxT5x3tDWa2fTKQn FItagCpAgQefDHF4WHPh0daR1jMxO9bqbePhbEJBsLY4qVcMhhjyr70iI+k/0zYVJWPQ6x JX3nr2WPNZLH8jTHmjfsaiJ6eS4Fmhk= ARC-Authentication-Results: i=1; imf12.hostedemail.com; dkim=pass header.d=kernel.org header.s=k20201202 header.b=hwGY6DjE; dmarc=pass (policy=quarantine) header.from=kernel.org; spf=pass (imf12.hostedemail.com: domain of djwong@kernel.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=djwong@kernel.org ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1731373144; a=rsa-sha256; cv=none; b=mul5A0FOjIK7OmRcTbmXe7H0vZI/DL21yCTw2hnxsl8EuFEriiGaG/EN9tV3KSqPDn3BKu 5UQSyxAmyeGQOKczeiY2vAZJ0QzfSA2MApjNhajz0awJBoqbqdljAc8w75XBUnfr2xOg8O gXHCfNQDee/qWIpy5WkTh++5oYfor7g= Received: from smtp.kernel.org (transwarp.subspace.kernel.org [100.75.92.58]) by dfw.source.kernel.org (Postfix) with ESMTP id BD8A95C5B15; Tue, 12 Nov 2024 01:01:13 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id CDA23C4CECF; Tue, 12 Nov 2024 01:01:57 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1731373317; bh=k80a+SD4Pue7a8iqtHCLrHC9JuTq32V81o15OKln5Io=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=hwGY6DjEaUcuyCloXySi6ub9344kqsHf43g6l5Kl9rHB77QnV5/1zbsGSI1QU26+K nuNbySMenT4Khhx58ELYoqw9EQ9V8Tp3LVMCRuJj2GiKVFcorzjaIUMJJD8CajunfY mdwvytRjx6mafLXqHuikx4sp0dKnKzCHLBdRjRSJ0DyC5WS1v3WirnjPZaEVvj3dcT gn5/8zRCeSbFqsUZQbwgxh+4eCPG05JWcH79PVJcuE7HGBtdpXHHz9pnfzE7PTKLJR Ra4rxMkdA85NFV2ZRPDnSvlzZ2RIb9sYnc3f3gJzyB2YyE7or6bOYzLNy/a1kDcDT6 j8j3I4P32oV3Q== Date: Mon, 11 Nov 2024 17:01:57 -0800 From: "Darrick J. Wong" To: Jens Axboe Cc: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, kirill@shutemov.name, linux-btrfs@vger.kernel.org, linux-ext4@vger.kernel.org, linux-xfs@vger.kernel.org Subject: Re: [PATCH 13/16] iomap: make buffered writes work with RWF_UNCACHED Message-ID: <20241112010157.GE9421@frogsfrogsfrogs> References: <20241111234842.2024180-1-axboe@kernel.dk> <20241111234842.2024180-14-axboe@kernel.dk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20241111234842.2024180-14-axboe@kernel.dk> X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 5F82140006 X-Stat-Signature: j1p57bsfcjj38m16yr4s1433yqykzhry X-Rspam-User: X-HE-Tag: 1731373300-895506 X-HE-Meta: U2FsdGVkX18dQM+rZbTmrP+9c39ZAl5eKsC9JHB7ceNYQtrAG7eU8oVHpqxM8hVABcWlI72kqJDeePvaZ4HmQWVCxEkbaM4Xt3HzyfjAyIOndCzLy+wi9rCMOKXEdtIimdfR4e0JTQcCkj9H942g2FaqIhHyC4QLaBWCniteSTqaEur2e2EzG42qclZDqPFgkEjoAi2HmsOlxCh3D8uKjDcXQQyTJ6B1v8nlBMfOHiYBTsAn1fNzGJU2D0+BiyIiejzJJJUnxxuPkjDYLdIH6pGrqG5OqFY9E+rJQQE7m7k2QE0YvB7ddbrlH1DempzyPXI2sHHwjihlOGJIpz7hK03+liRdYaSN9fCX2QiychSKTlkgJ1/pCzwOXPrV+A2fDFf5FGSxmmwKpuGJ1r8Z5BTkm06Id4uof+VRTvxQbncQTxzvoMJhetXE2RLjsw5gqf1rW9dAG8v6pPKgriKvl2YgSx1daTAUIGsS/86DGnkyloy8vuISjhT3e6SVU0i1g0Fu8zndEk0oqW3knUb0Y8UChqy7PSwFiP1p0NFUr+WY1oef3JOAIqCx7Kn9rQou7i2RiG6LdanuyQWRDOyHPxzi3dtmeOuXZVxOZr4SxLOR4q1MV24PrvjVbjUjdCWCgo9oHJHVyhOTll2tJIy3D9V5+/nflvCfi8q4qZBYmlE1vPubDeq3epTc1RYgoWiTjdxHJV7LopG5Cw7vwU8EVqI6hXLCQlcoXknkBIQxNXfgujdD3/oEsbdYPmYUMLzvv7HqJuwB/g9nqrJBcTW5OcduR4V4p3EBAgWxgB4WRERVcgEkhXaZpw7y3dlG2oPuPFImwsD8NpHsuTYQFK5myhB/H2jymkHFJQZqs7tKvCtqTarvkIkkXXWYLy/n7PjaDyEK5gy6Y5rLWtTesQBireilm4bWXa/OiQrQ7TkPSO1AvaqgcHqB6GP137YmDnLlbzEISlYgnOvLRSWmHS2 8bu/sEca dteB2+AOl06lTqApPU5oIJfgbCajNu6UqHlFD0IRrA+YkTsPzdP4e5++IcqiF5efgklSR/rv03031m7jRZlsV3ROVQKHZuSNh92a0ZtS6i6Eq0FaoxYSYukq6myzQ911VGqsJD2fmAMtQ9XMsvRw4DqI0l/I7v0jvbmZ03l05tYqVxx79dnHi4qEDfWJoOFbB27oMSXr5Nc3yCodjyW4Ebi8RaBDIvGAucxAjZPWPLcSONxEd/HvrrNSgIibo+3cIJ+fpH3JEOillT1O5FPOPJTqgaykoa3dk5ZkAe8KW+UIDfiY= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On Mon, Nov 11, 2024 at 04:37:40PM -0700, Jens Axboe wrote: > Add iomap buffered write support for RWF_UNCACHED. If RWF_UNCACHED is > set for a write, mark the folios being written with drop_writeback. Then > writeback completion will drop the pages. The write_iter handler simply > kicks off writeback for the pages, and writeback completion will take > care of the rest. > > This still needs the user of the iomap buffered write helpers to call > iocb_uncached_write() upon successful issue of the writes. > > Signed-off-by: Jens Axboe > --- > fs/iomap/buffered-io.c | 15 +++++++++++++-- > include/linux/iomap.h | 4 +++- > 2 files changed, 16 insertions(+), 3 deletions(-) > > diff --git a/fs/iomap/buffered-io.c b/fs/iomap/buffered-io.c > index ef0b68bccbb6..2f2a5db04a68 100644 > --- a/fs/iomap/buffered-io.c > +++ b/fs/iomap/buffered-io.c > @@ -603,6 +603,8 @@ struct folio *iomap_get_folio(struct iomap_iter *iter, loff_t pos, size_t len) > > if (iter->flags & IOMAP_NOWAIT) > fgp |= FGP_NOWAIT; > + if (iter->flags & IOMAP_UNCACHED) > + fgp |= FGP_UNCACHED; > fgp |= fgf_set_order(len); > > return __filemap_get_folio(iter->inode->i_mapping, pos >> PAGE_SHIFT, > @@ -1023,8 +1025,9 @@ ssize_t > iomap_file_buffered_write(struct kiocb *iocb, struct iov_iter *i, > const struct iomap_ops *ops, void *private) > { > + struct address_space *mapping = iocb->ki_filp->f_mapping; > struct iomap_iter iter = { > - .inode = iocb->ki_filp->f_mapping->host, > + .inode = mapping->host, > .pos = iocb->ki_pos, > .len = iov_iter_count(i), > .flags = IOMAP_WRITE, > @@ -1034,9 +1037,14 @@ iomap_file_buffered_write(struct kiocb *iocb, struct iov_iter *i, > > if (iocb->ki_flags & IOCB_NOWAIT) > iter.flags |= IOMAP_NOWAIT; > + if (iocb->ki_flags & IOCB_UNCACHED) > + iter.flags |= IOMAP_UNCACHED; > > - while ((ret = iomap_iter(&iter, ops)) > 0) > + while ((ret = iomap_iter(&iter, ops)) > 0) { > + if (iocb->ki_flags & IOCB_UNCACHED) > + iter.iomap.flags |= IOMAP_F_UNCACHED; > iter.processed = iomap_write_iter(&iter, i); > + } > > if (unlikely(iter.pos == iocb->ki_pos)) > return ret; > @@ -1770,6 +1778,9 @@ static int iomap_add_to_ioend(struct iomap_writepage_ctx *wpc, > size_t poff = offset_in_folio(folio, pos); > int error; > > + if (folio_test_uncached(folio)) > + wpc->iomap.flags |= IOMAP_F_UNCACHED; > + > if (!wpc->ioend || !iomap_can_add_to_ioend(wpc, pos)) { > new_ioend: > error = iomap_submit_ioend(wpc, 0); > diff --git a/include/linux/iomap.h b/include/linux/iomap.h > index f61407e3b121..2efc72df19a2 100644 > --- a/include/linux/iomap.h > +++ b/include/linux/iomap.h > @@ -64,6 +64,7 @@ struct vm_fault; > #define IOMAP_F_BUFFER_HEAD 0 > #endif /* CONFIG_BUFFER_HEAD */ > #define IOMAP_F_XATTR (1U << 5) > +#define IOMAP_F_UNCACHED (1U << 6) This value ^^^ is set only by the core iomap code, right? > /* > * Flags set by the core iomap code during operations: ...in which case it should be set down here. It probably ought to have a description of what it does, too: "IOMAP_F_UNCACHED is set to indicate that writes to the page cache (and hence writeback) will result in folios being evicted as soon as the updated bytes are written back to the storage." If the writeback fails, does that mean that the dirty data will /not/ be retained in the page cache? IIRC we finally got to the point where the major filesystems leave pagecache alone after writeback EIO. The rest of the mechanics looks nifty to me; there's plenty of places where this could be useful to me personally. :) --D > @@ -173,8 +174,9 @@ struct iomap_folio_ops { > #define IOMAP_NOWAIT (1 << 5) /* do not block */ > #define IOMAP_OVERWRITE_ONLY (1 << 6) /* only pure overwrites allowed */ > #define IOMAP_UNSHARE (1 << 7) /* unshare_file_range */ > +#define IOMAP_UNCACHED (1 << 8) /* uncached IO */ > #ifdef CONFIG_FS_DAX > -#define IOMAP_DAX (1 << 8) /* DAX mapping */ > +#define IOMAP_DAX (1 << 9) /* DAX mapping */ > #else > #define IOMAP_DAX 0 > #endif /* CONFIG_FS_DAX */ > -- > 2.45.2 > >