From: Matthew Wilcox <willy@infradead.org>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Jann Horn <jannh@google.com>, Jan Kara <jack@suse.cz>,
Kirill Shutemov <kirill@shutemov.name>,
Oleg Nesterov <oleg@redhat.com>, Christoph Hellwig <hch@lst.de>,
Linux-MM <linux-mm@kvack.org>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Mike Kravetz <mike.kravetz@oracle.com>
Subject: Re: [5.4 PATCH] mm/gup: Do not force a COW break on file-backed memory
Date: Thu, 2 Dec 2021 19:59:06 +0000 [thread overview]
Message-ID: <YaklihoYztAoKfxX@casper.infradead.org> (raw)
In-Reply-To: <CAHk-=wiHpPXjA=i6e=3Pk13frRd-RVXfSrT6=KfU2tg4Pu5MmQ@mail.gmail.com>
On Thu, Dec 02, 2021 at 10:54:48AM -0800, Linus Torvalds wrote:
> On Wed, Dec 1, 2021 at 8:11 PM Matthew Wilcox <willy@infradead.org> wrote:
> >
> > The other patch we've been kicking around (and works) is:
> >
> > static inline bool should_force_cow_break(struct vm_area_struct *vma, unsigned
> > int flags)
> > {
> > - return is_cow_mapping(vma->vm_flags) && (flags & FOLL_GET);
> > + return is_cow_mapping(vma->vm_flags) &&
> > + (!(vma->vm_flags & VM_DENYWRITE)) && (flags & FOLL_GET);
> > }
>
> That patch makes no sense to me.
>
> It may "work", but it doesn't actually do anything sensible or really
> fix the problem that I can tell.
Oh absolutely, it's semantically nonsense. The only reason it fixes the
problem is that VM_DENYWRITE VMAs are the only ones considered for the
RO_THP merging, so they're the only ones which we've seen causing a
problem.
> I suspect a real fix would be bigger and more invasive.
Darn. I was hoping you were going to say something like "The real
problem is follow_trans_huge_pmd() is complete garbage and it should
just do X, Y and Z". Or "When we force on FOLL_WRITE, we should also
force on FOLL_SPLIT_PMD".
> If the answer is not to backport all the other changes (and they were
> _really_ invasive), I think one answer may be to simply move the
> "should_force_cow_break()" down to below the point where you've looked
> up the page.
>
> Then you can actually look at "is this a file mapped page", and say
> "if so, that's ok, we can return it as-is".
>
> Otherwise, you do something like
>
> foll_flags |= FOLL_WRITE;
> free_page(page);
> goto repeat;
>
> to repeat the loop (now with FOLL_WRITE).
>
> So the patch is bigger and more involved, because you would have done
> the page lookup (for reading) and now notice "Oh, I need it for
> writing instead" so you need to undo and re-do).
>
> But at least - unlike backporting everything else - it would be
> limited to that one __get_user_pages() function.
>
> Hmm?
>
> (And you'd need to handle that follow_hugetlb_page() case too), not
> just the follow_page_mask() one)
Thanks, I'll take a look.
next prev parent reply other threads:[~2021-12-02 19:59 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-12-01 23:17 Matthew Wilcox (Oracle)
2021-12-02 3:51 ` Jann Horn
2021-12-02 4:11 ` Matthew Wilcox
2021-12-02 4:33 ` Jann Horn
2021-12-02 18:54 ` Linus Torvalds
2021-12-02 19:59 ` Matthew Wilcox [this message]
2021-12-02 22:33 ` Linus Torvalds
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YaklihoYztAoKfxX@casper.infradead.org \
--to=willy@infradead.org \
--cc=hch@lst.de \
--cc=jack@suse.cz \
--cc=jannh@google.com \
--cc=kirill@shutemov.name \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mike.kravetz@oracle.com \
--cc=oleg@redhat.com \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox