From: James Houghton <jthoughton@google.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: Khalid Aziz <khalid.aziz@oracle.com>,
Peter Xu <peterx@redhat.com>,
Vishal Moola <vishal.moola@gmail.com>,
Jane Chu <jane.chu@oracle.com>,
Muchun Song <muchun.song@linux.dev>,
linux-mm@kvack.org
Subject: Re: Unifying page table walkers
Date: Thu, 6 Jun 2024 13:23:08 -0700 [thread overview]
Message-ID: <CADrL8HXAyYhV=pKJyy5JRZDRgBed4UTSos=z2pRXAX9C0P7d2w@mail.gmail.com> (raw)
In-Reply-To: <ZmIWZWOeN2fLaJ3T@casper.infradead.org>
On Thu, Jun 6, 2024 at 1:04 PM Matthew Wilcox <willy@infradead.org> wrote:
> Right, so we ignore hugetlb_fault() and call into __handle_mm_fault().
> Once there, we'll do:
>
> vmf.pud = pud_alloc(mm, p4d, address);
> if (pud_none(*vmf.pud) &&
> thp_vma_allowable_order(vma, vm_flags,
> TVA_IN_PF | TVA_ENFORCE_SYSFS, PUD_ORDER)) {
> ret = create_huge_pud(&vmf);
>
> which will call vma->vm_ops->huge_fault(vmf, PUD_ORDER);
>
> So all we need to do is implement huge_fault in hugetlb_vm_ops. I
> don't think that's the same as creating a hugetlbfs2 because it's just
> another entry point. You can mmap() the same file both ways and it's
> all cache coherent.
That makes a lot of sense. FWIW, this sounds good to me (though I'm
curious what Peter thinks :)).
But I think you'll need to be careful to ensure that, for now anyway,
huge_fault() is always called with the exact same ptep/pmdp/pudp that
hugetlb_walk() would have returned (ignoring sharing). If you allow
PMD mapping of what would otherwise be PUD-mapped hugetlb pages right
now, you'll break the vmemmap optimization (and probably other
things).
Also I'm not sure how this will interact with arm64's hugetlb pages
implemented with contiguous PTEs/PMDs. You might have to round
`address` down to make sure you've picked the first PTE/PMD in the
group.
next prev parent reply other threads:[~2024-06-06 20:23 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-06 18:29 Matthew Wilcox
2024-06-06 19:30 ` James Houghton
2024-06-06 20:04 ` Matthew Wilcox
2024-06-06 20:23 ` James Houghton [this message]
2024-06-06 21:21 ` Matthew Wilcox
2024-06-06 23:07 ` James Houghton
2024-06-07 7:15 ` David Hildenbrand
2024-06-06 21:33 ` Peter Xu
2024-06-06 21:49 ` Peter Xu
2024-06-07 5:07 ` Oscar Salvador
2024-06-07 6:59 ` David Hildenbrand
2024-06-09 20:08 ` Matthew Wilcox
2024-06-09 20:28 ` David Hildenbrand
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CADrL8HXAyYhV=pKJyy5JRZDRgBed4UTSos=z2pRXAX9C0P7d2w@mail.gmail.com' \
--to=jthoughton@google.com \
--cc=jane.chu@oracle.com \
--cc=khalid.aziz@oracle.com \
--cc=linux-mm@kvack.org \
--cc=muchun.song@linux.dev \
--cc=peterx@redhat.com \
--cc=vishal.moola@gmail.com \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox