From: John Hubbard <jhubbard@nvidia.com>
To: "Matthew Wilcox (Oracle)" <willy@infradead.org>, linux-mm@kvack.org
Cc: Andrew Morton <akpm@linux-foundation.org>
Subject: Re: [PATCH 05/17] gup: Add try_get_folio()
Date: Tue, 4 Jan 2022 17:25:07 -0800 [thread overview]
Message-ID: <3ac8af50-dff6-4a0f-dba6-8b8fe5f611d4@nvidia.com> (raw)
In-Reply-To: <20220102215729.2943705-6-willy@infradead.org>
On 1/2/22 13:57, Matthew Wilcox (Oracle) wrote:
> This replaces try_get_compound_head(). It includes a small optimisation
> for the race where a folio is split between being looked up from its
> tail page and the reference count being obtained. Before, it returned
> NULL, which presumably triggered a retry under the mmap_lock, whereas
> now it will retry without the lock. Finding a frozen page will still
> return NULL.
>
> Signed-off-by: Matthew Wilcox (Oracle) <willy@infradead.org>
> ---
> mm/gup.c | 69 +++++++++++++++++++++++++++++---------------------------
> 1 file changed, 36 insertions(+), 33 deletions(-)
>
> diff --git a/mm/gup.c b/mm/gup.c
> index 2c51e9748a6a..58e5cfaaa676 100644
> --- a/mm/gup.c
> +++ b/mm/gup.c
> @@ -29,12 +29,11 @@ struct follow_page_context {
> unsigned int page_mask;
> };
>
> -static void hpage_pincount_add(struct page *page, int refs)
> +static void folio_pincount_add(struct folio *folio, int refs)
> {
> - VM_BUG_ON_PAGE(!hpage_pincount_available(page), page);
> - VM_BUG_ON_PAGE(page != compound_head(page), page);
> + VM_BUG_ON_FOLIO(!folio_pincount_available(folio), folio);
>
> - atomic_add(refs, compound_pincount_ptr(page));
> + atomic_add(refs, folio_pincount_ptr(folio));
> }
>
> static void hpage_pincount_sub(struct page *page, int refs)
> @@ -63,33 +62,35 @@ static void put_page_refs(struct page *page, int refs)
> }
>
> /*
> - * Return the compound head page with ref appropriately incremented,
> + * Return the folio with ref appropriately incremented,
> * or NULL if that failed.
> */
> -static inline struct page *try_get_compound_head(struct page *page, int refs)
> +static inline struct folio *try_get_folio(struct page *page, int refs)
> {
> - struct page *head = compound_head(page);
> + struct folio *folio;
>
> - if (WARN_ON_ONCE(page_ref_count(head) < 0))
> +retry:
Yes, this new retry looks like a solid improvement. Retrying at this low
level makes a lot of sense, given that it is racing with a very
transient sort of behavior.
> + folio = page_folio(page);
> + if (WARN_ON_ONCE(folio_ref_count(folio) < 0))
> return NULL;
> - if (unlikely(!page_cache_add_speculative(head, refs)))
> + if (unlikely(!folio_ref_try_add_rcu(folio, refs)))
I'm a little lost about the meaning and intended use of the _rcu aspects
of folio_ref_try_add_rcu() here. For example, try_get_folio() does not
require that callers are in an rcu read section, right? This is probably
just a documentation question, sorry if it's obvious--I wasn't able to
work it out on my own.
> return NULL;
>
> /*
> - * At this point we have a stable reference to the head page; but it
> - * could be that between the compound_head() lookup and the refcount
> - * increment, the compound page was split, in which case we'd end up
> - * holding a reference on a page that has nothing to do with the page
> + * At this point we have a stable reference to the folio; but it
> + * could be that between calling page_folio() and the refcount
> + * increment, the folio was split, in which case we'd end up
> + * holding a reference on a folio that has nothing to do with the page
> * we were given anymore.
> - * So now that the head page is stable, recheck that the pages still
> - * belong together.
> + * So now that the folio is stable, recheck that the page still
> + * belongs to this folio.
> */
> - if (unlikely(compound_head(page) != head)) {
> - put_page_refs(head, refs);
> - return NULL;
> + if (unlikely(page_folio(page) != folio)) {
> + folio_put_refs(folio, refs);
> + goto retry;
> }
>
> - return head;
> + return folio;
> }
>
> /**
> @@ -128,8 +129,10 @@ struct page *try_grab_compound_head(struct page *page,
> int refs, unsigned int flags)
> {
> if (flags & FOLL_GET)
> - return try_get_compound_head(page, refs);
> + return &try_get_folio(page, refs)->page;
Did you want to use folio_page() here, instead?
> else if (flags & FOLL_PIN) {
> + struct folio *folio;
> +
> /*
> * Can't do FOLL_LONGTERM + FOLL_PIN gup fast path if not in a
> * right zone, so fail and let the caller fall back to the slow
> @@ -143,29 +146,29 @@ struct page *try_grab_compound_head(struct page *page,
> * CAUTION: Don't use compound_head() on the page before this
> * point, the result won't be stable.
> */
> - page = try_get_compound_head(page, refs);
> - if (!page)
> + folio = try_get_folio(page, refs);
> + if (!folio)
> return NULL;
>
> /*
> - * When pinning a compound page of order > 1 (which is what
> + * When pinning a folio of order > 1 (which is what
> * hpage_pincount_available() checks for), use an exact count to
> - * track it, via hpage_pincount_add/_sub().
> + * track it, via folio_pincount_add/_sub().
> *
> - * However, be sure to *also* increment the normal page refcount
> - * field at least once, so that the page really is pinned.
> + * However, be sure to *also* increment the normal folio refcount
> + * field at least once, so that the folio really is pinned.
> * That's why the refcount from the earlier
> - * try_get_compound_head() is left intact.
> + * try_get_folio() is left intact.
> */
> - if (hpage_pincount_available(page))
> - hpage_pincount_add(page, refs);
> + if (folio_pincount_available(folio))
> + folio_pincount_add(folio, refs);
> else
> - page_ref_add(page, refs * (GUP_PIN_COUNTING_BIAS - 1));
> + folio_ref_add(folio,
> + refs * (GUP_PIN_COUNTING_BIAS - 1));
>
> - mod_node_page_state(page_pgdat(page), NR_FOLL_PIN_ACQUIRED,
> - refs);
> + node_stat_mod_folio(folio, NR_FOLL_PIN_ACQUIRED, refs);
>
> - return page;
> + return &folio->page;
> }
>
> WARN_ON_ONCE(1);
Just minor questions above. This all looks solid though.
thanks,
--
John Hubbard
NVIDIA
next prev parent reply other threads:[~2022-01-05 1:25 UTC|newest]
Thread overview: 72+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-01-02 21:57 [PATCH 00/17] Convert GUP to folios Matthew Wilcox (Oracle)
2022-01-02 21:57 ` [PATCH 01/17] mm: Add folio_put_refs() Matthew Wilcox (Oracle)
2022-01-04 8:00 ` Christoph Hellwig
2022-01-04 21:15 ` John Hubbard
2022-01-02 21:57 ` [PATCH 02/17] mm: Add folio_pincount_available() Matthew Wilcox (Oracle)
2022-01-04 8:01 ` Christoph Hellwig
2022-01-04 18:25 ` Matthew Wilcox
2022-01-04 21:40 ` John Hubbard
2022-01-05 5:04 ` Matthew Wilcox
2022-01-05 6:24 ` John Hubbard
2022-01-02 21:57 ` [PATCH 03/17] mm: Add folio_pincount_ptr() Matthew Wilcox (Oracle)
2022-01-04 8:02 ` Christoph Hellwig
2022-01-04 21:43 ` John Hubbard
2022-01-06 21:57 ` William Kucharski
2022-01-02 21:57 ` [PATCH 04/17] mm: Convert page_maybe_dma_pinned() to use a folio Matthew Wilcox (Oracle)
2022-01-04 8:03 ` Christoph Hellwig
2022-01-04 22:01 ` John Hubbard
2022-01-02 21:57 ` [PATCH 05/17] gup: Add try_get_folio() Matthew Wilcox (Oracle)
2022-01-04 8:18 ` Christoph Hellwig
2022-01-05 1:25 ` John Hubbard [this message]
2022-01-05 7:00 ` John Hubbard
2022-01-07 18:23 ` Jason Gunthorpe
2022-01-08 1:37 ` Matthew Wilcox
2022-01-08 2:36 ` John Hubbard
2022-01-10 15:01 ` Jason Gunthorpe
2022-01-02 21:57 ` [PATCH 06/17] mm: Remove page_cache_add_speculative() and page_cache_get_speculative() Matthew Wilcox (Oracle)
2022-01-04 8:18 ` Christoph Hellwig
2022-01-05 1:29 ` John Hubbard
2022-01-02 21:57 ` [PATCH 07/17] gup: Add gup_put_folio() Matthew Wilcox (Oracle)
2022-01-04 8:22 ` Christoph Hellwig
2022-01-05 6:52 ` John Hubbard
2022-01-06 22:05 ` William Kucharski
2022-01-02 21:57 ` [PATCH 08/17] gup: Add try_grab_folio() Matthew Wilcox (Oracle)
2022-01-04 8:24 ` Christoph Hellwig
2022-01-05 7:06 ` John Hubbard
2022-01-02 21:57 ` [PATCH 09/17] gup: Convert gup_pte_range() to use a folio Matthew Wilcox (Oracle)
2022-01-04 8:25 ` Christoph Hellwig
2022-01-05 7:36 ` John Hubbard
2022-01-05 7:52 ` Matthew Wilcox
2022-01-05 7:57 ` John Hubbard
2022-01-02 21:57 ` [PATCH 10/17] gup: Convert gup_hugepte() " Matthew Wilcox (Oracle)
2022-01-04 8:26 ` Christoph Hellwig
2022-01-05 7:46 ` John Hubbard
2022-01-02 21:57 ` [PATCH 11/17] gup: Convert gup_huge_pmd() " Matthew Wilcox (Oracle)
2022-01-04 8:26 ` Christoph Hellwig
2022-01-05 7:50 ` John Hubbard
2022-01-02 21:57 ` [PATCH 12/17] gup: Convert gup_huge_pud() " Matthew Wilcox (Oracle)
2022-01-05 7:58 ` John Hubbard
2022-01-02 21:57 ` [PATCH 13/17] gup: Convert gup_huge_pgd() " Matthew Wilcox (Oracle)
2022-01-05 7:58 ` John Hubbard
2022-01-02 21:57 ` [PATCH 14/17] gup: Convert for_each_compound_head() to gup_for_each_folio() Matthew Wilcox (Oracle)
2022-01-04 8:32 ` Christoph Hellwig
2022-01-05 8:17 ` John Hubbard
2022-01-09 4:39 ` Matthew Wilcox
2022-01-09 8:01 ` John Hubbard
2022-01-10 15:22 ` Jan Kara
2022-01-10 15:52 ` Matthew Wilcox
2022-01-10 20:36 ` Jan Kara
2022-01-10 21:10 ` Matthew Wilcox
2022-01-17 12:07 ` Jan Kara
2022-01-02 21:57 ` [PATCH 15/17] gup: Convert for_each_compound_range() to gup_for_each_folio_range() Matthew Wilcox (Oracle)
2022-01-04 8:35 ` Christoph Hellwig
2022-01-05 8:30 ` John Hubbard
2022-01-02 21:57 ` [PATCH 16/17] mm: Add isolate_lru_folio() Matthew Wilcox (Oracle)
2022-01-04 8:36 ` Christoph Hellwig
2022-01-05 8:44 ` John Hubbard
2022-01-06 0:34 ` Matthew Wilcox
2022-01-02 21:57 ` [PATCH 17/17] gup: Convert check_and_migrate_movable_pages() to use a folio Matthew Wilcox (Oracle)
2022-01-04 8:37 ` Christoph Hellwig
2022-01-05 9:00 ` John Hubbard
2022-01-06 22:12 ` [PATCH 00/17] Convert GUP to folios William Kucharski
2022-01-07 18:54 ` Jason Gunthorpe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3ac8af50-dff6-4a0f-dba6-8b8fe5f611d4@nvidia.com \
--to=jhubbard@nvidia.com \
--cc=akpm@linux-foundation.org \
--cc=linux-mm@kvack.org \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox