From: Hugh Dickins <hughd@google.com>
To: Mike Kravetz <mike.kravetz@oracle.com>
Cc: "linux-mm@kvack.org" <linux-mm@kvack.org>,
linux-kernel <linux-kernel@vger.kernel.org>,
Michal Hocko <mhocko@suse.com>,
Oscar Salvador <osalvador@suse.de>,
David Hildenbrand <david@redhat.com>,
Naoya Horiguchi <naoya.horiguchi@linux.dev>,
Hugh Dickins <hughd@google.com>, Peter Xu <peterx@redhat.com>,
Nick Piggin <npiggin@gmail.com>, Andi Kleen <ak@linux.intel.com>
Subject: Re: vma_needs_copy always true for VM_HUGETLB ?
Date: Wed, 18 May 2022 18:30:03 -0700 (PDT) [thread overview]
Message-ID: <872b743d-ac21-59a3-bd31-109229f63112@google.com> (raw)
In-Reply-To: <a7ccd8b2-c659-44c9-cb18-1496f99aa5a8@oracle.com>
On Wed, 18 May 2022, Mike Kravetz wrote:
> For most non-anonymous vmas, we do not copy page tables at fork time, but
> rather lazily populate the tables after fork via faults. The routine
> vma_needs_copy() is used to make this decision. For VM_HUGETLB vmas, it always
> returns true.
"vma_needs_copy()" is *very* recent coinage, not reached Linus yet.
>
> Anyone know/remember why? The code was added more than 15 years ago and
> my search for why hugetlb vmas were excluded came up empty.
>
> I do not see a reason why VM_HUGETLB is in this list. Initial testing did
> not reveal any problems when I removed the VM_HUGETLB check.
>
> FYI - I am looking at the performance of fork and exec (unmap) of processes
> with very large hugetlb mappings. Skipping the copy at fork time would
> certainly speed things up. Of course, there could some users who would
> notice if hugetlb page tables are not copied at fork time. However, this
> is the behavior for 'normal' mappings. I am inclined to make hugetlb be
> 'more normal'.
Good question, not obvious to me either: but I've found the answer.
The commit was of course Nick's d992895ba2b2 ("[PATCH] Lazy page table
copies in fork()") in 2.6.14; but it doesn't explain why VM_HUGETLB is
there in the test, and goes on to be copied.
I haven't re-read through the whole mail thread which led to that
commit, but I think you'll find the crucial observation comes from
Andi in https://lore.kernel.org/lkml/200508251756.07849.ak@suse.de/#t
"Actually I disabled it for hugetlbfs (... !is_huge...vma). The reason
is that lazy faulting for huge pages is still not in mainline."
and indeed, look at the 2.6.13 or 2.6.14 mm/hugetlb.c and you find
/*
* We cannot handle pagefaults against hugetlb pages at all. They cause
* handle_mm_fault() to try to instantiate regular-sized pages in the
* hugegpage VMA. do_page_fault() is supposed to trap this, so BUG is we get
* this far.
*/
static struct page *hugetlb_nopage(struct vm_area_struct *vma,
unsigned long address, int *unused)
{
BUG();
return NULL;
}
Oh, and that pretty much still exists to this day, to cover that path
to a fault; but 2.6.16 implemented hugetlb_no_page(), which is what
then actually got used to satisfy a hugetlb fault.
So the reason for fork copying VM_HUGETLB appears to have gone away
in 2.6.16.
(I haven't a clue on private hugetlb mappings and reservations and
whether anon_vma means the same on hugetlb, but you know all that.)
Hugh
next prev parent reply other threads:[~2022-05-19 1:30 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-05-18 22:36 Mike Kravetz
2022-05-19 1:30 ` Hugh Dickins [this message]
2022-05-19 3:36 ` Mike Kravetz
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=872b743d-ac21-59a3-bd31-109229f63112@google.com \
--to=hughd@google.com \
--cc=ak@linux.intel.com \
--cc=david@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.com \
--cc=mike.kravetz@oracle.com \
--cc=naoya.horiguchi@linux.dev \
--cc=npiggin@gmail.com \
--cc=osalvador@suse.de \
--cc=peterx@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox