From: "Thomas Hellström (VMware)" <thomas_os@shipmail.org>
To: Dan Williams <dan.j.williams@intel.com>,
"Kirill A. Shutemov" <kirill@shutemov.name>
Cc: Matthew Wilcox <willy@infradead.org>,
linux-mm <linux-mm@kvack.org>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Thomas Hellstrom <thellstrom@vmware.com>
Subject: Re: [RFC PATCH] mm: Fix a huge pud insertion race during faulting
Date: Wed, 16 Oct 2019 07:59:15 +0200
Message-ID: <3a16a199-a4bd-5503-3146-3fb24bfb2638@shipmail.org>
In-Reply-To: <CAA9_cmcSXYB1jo1=CQ78eXVcyGWm1_TjQKd-Gmg0yAO3tObOFw@mail.gmail.com>
Hi, Dan,
On 10/16/19 3:44 AM, Dan Williams wrote:
> On Tue, Oct 15, 2019 at 3:06 AM Kirill A. Shutemov <kirill@shutemov.name> wrote:
>> On Tue, Oct 08, 2019 at 11:37:11AM +0200, Thomas Hellström (VMware) wrote:
>>> From: Thomas Hellstrom <thellstrom@vmware.com>
>>>
>>> A huge pud page can theoretically be faulted in racing with pmd_alloc()
>>> in __handle_mm_fault(). That will lead to pmd_alloc() returning an
>>> invalid pmd pointer. Fix this by adding a pud_trans_unstable() function
>>> similar to pmd_trans_unstable() and check whether the pud is really stable
>>> before using the pmd pointer.
>>>
>>> Race:
>>> Thread 1:            Thread 2:            Comment
>>> create_huge_pud()                         Fallback - not taken.
>>>                      create_huge_pud()    Taken.
>>> pmd_alloc()                               Returns an invalid pointer.
>>>
>>> Cc: Matthew Wilcox <willy@infradead.org>
>>> Fixes: a00cc7d9dd93 ("mm, x86: add support for PUD-sized transparent hugepages")
>>> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com>
>>> ---
>>> RFC: We include pud_devmap() as an unstable PUD flag. Is this correct?
>>> Do the same for pmds?
>> I *think* it is correct and we should do the same for PMD, but I may be
>> wrong.
>>
>> Dan, Matthew, could you comment on this?
> The _devmap() check in these paths near _trans_unstable() has always
> been about avoiding assumptions that the corresponding page might be
> page cache or anonymous which for dax it's neither and does not behave
> like a typical page.
The concern here is that _trans_huge() returns false for _devmap()
pages, which means that _trans_unstable() also returns false for them.

Still, I figure someone could zap the entry at any time using madvise(),
so AFAICT the entry is indeed unstable, and it is a bug not to include
_devmap() in the _trans_unstable() functions?
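
To make the concern concrete, here is a minimal user-space sketch in C.
It is not kernel code: the enum and every model_* name are hypothetical,
and treating an empty entry as unstable is an assumption modelled on how
pmd_trans_unstable() behaves. It only illustrates that a check built on
_trans_huge() alone reports a devmap entry as stable, while a check that
also tests _devmap() does not.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model of what a PUD entry can hold; not the kernel's pud_t. */
enum pud_model_kind {
	PUD_MODEL_NONE,       /* empty entry */
	PUD_MODEL_TABLE,      /* points to a PMD table */
	PUD_MODEL_TRANS_HUGE, /* huge page that is not devmap */
	PUD_MODEL_DEVMAP,     /* huge devmap (dax) mapping */
};

/* Models the behaviour described above: false for devmap entries. */
static bool model_pud_trans_huge(enum pud_model_kind k)
{
	return k == PUD_MODEL_TRANS_HUGE;
}

static bool model_pud_devmap(enum pud_model_kind k)
{
	return k == PUD_MODEL_DEVMAP;
}

/* Check built on _trans_huge() only (assumed shape, not the real helper). */
static bool model_unstable_without_devmap(enum pud_model_kind k)
{
	return k == PUD_MODEL_NONE || model_pud_trans_huge(k);
}

/* Same check with the devmap test added, as the RFC proposes for puds. */
static bool model_unstable_with_devmap(enum pud_model_kind k)
{
	return k == PUD_MODEL_NONE || model_pud_trans_huge(k) ||
	       model_pud_devmap(k);
}

/* Small self-check showing where the two variants disagree. */
static void model_check(void)
{
	/* A devmap entry slips through the first variant as "stable". */
	assert(!model_unstable_without_devmap(PUD_MODEL_DEVMAP));
	/* With the devmap test it is reported as unstable. */
	assert(model_unstable_with_devmap(PUD_MODEL_DEVMAP));
}
```

The two variants differ only for the devmap case; an entry that points
to a PMD table is reported as stable by both.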
Thanks,
Thomas