From: Mike Rapoport <rppt@linux.vnet.ibm.com>
To: Christian Borntraeger <borntraeger@de.ibm.com>
Cc: linux-mm@kvack.org, linux-s390@vger.kernel.org,
kvm@vger.kernel.org, Janosch Frank <frankja@linux.ibm.com>,
David Hildenbrand <david@redhat.com>,
Cornelia Huck <cohuck@redhat.com>,
linux-kernel@vger.kernel.org,
Martin Schwidefsky <schwidefsky@de.ibm.com>,
Andrea Arcangeli <aarcange@redhat.com>
Subject: Re: [PATCH/RFC] mm: do not drop unused pages when userfaultd is running
Date: Fri, 29 Jun 2018 23:46:05 +0300
Message-ID: <20180629204604.GF4799@rapoport-lnx>
In-Reply-To: <a2602470-a2b8-adc5-5057-fc8f489ab949@de.ibm.com>
On Fri, Jun 29, 2018 at 08:51:23AM +0200, Christian Borntraeger wrote:
>
>
> On 06/28/2018 02:39 PM, Christian Borntraeger wrote:
> > KVM guests on s390 can notify the host of unused pages. This can result
> > in pte_unused callbacks to be true for KVM guest memory.
> >
> > If a page is unused (checked with pte_unused) we might drop this page
> > instead of paging it. This can have side-effects on userfaultd, when the
> > page in question was already migrated:
> >
> > The next access of that page will trigger a fault and a user fault
> > instead of faulting in a new and empty zero page. As QEMU does not
> > expect a userfault on an already migrated page this migration will fail.
> >
> > The most straightforward solution is to ignore the pte_unused hint if a
> > userfault context is active for this VMA.
> >
> > Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
> > Cc: Andrea Arcangeli <aarcange@redhat.com>
> > Cc: stable@vger.kernel.org
> > Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com>
> > ---
> > mm/rmap.c | 2 +-
> > 1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/mm/rmap.c b/mm/rmap.c
> > index 6db729dc4c50..3f3a72aa99f2 100644
> > --- a/mm/rmap.c
> > +++ b/mm/rmap.c
> > @@ -1481,7 +1481,7 @@ static bool try_to_unmap_one(struct page *page, struct vm_area_struct *vma,
> > set_pte_at(mm, address, pvmw.pte, pteval);
> > }
> >
> > - } else if (pte_unused(pteval)) {
> > + } else if (pte_unused(pteval) && !vma->vm_userfaultfd_ctx.ctx) {
>
> FWIW, this needs a fix for !CONFIG_USERFAULTFD.
There's userfaultfd_armed() in include/linux/userfaultfd_k.h, which also
has a stub for !CONFIG_USERFAULTFD. Just
s/!vma->vm_userfaultfd_ctx.ctx/!userfaultfd_armed(vma)/
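To illustrate why the helper sidesteps the build problem, here is a rough
userspace model, not the kernel source. The struct, the flag values and
may_discard_unused_pte() are simplified stand-ins invented for this sketch;
only the name userfaultfd_armed() and the CONFIG_USERFAULTFD split come from
the thread. The point is that the helper exists in both configurations, so
the caller never touches vm_userfaultfd_ctx directly.

```c
#include <stdbool.h>

/* Illustrative flag values; treat them as assumptions, not kernel ABI. */
#define VM_UFFD_MISSING 0x00000200UL
#define VM_UFFD_WP      0x00001000UL

/* Simplified stand-in for the kernel's struct vm_area_struct. */
struct vm_area_struct {
	unsigned long vm_flags;
};

/* Comment this out to model a !CONFIG_USERFAULTFD build. */
#define CONFIG_USERFAULTFD 1

#ifdef CONFIG_USERFAULTFD
/* Armed when the VMA is registered for any userfaultfd mode. */
static inline bool userfaultfd_armed(const struct vm_area_struct *vma)
{
	return (vma->vm_flags & (VM_UFFD_MISSING | VM_UFFD_WP)) != 0;
}
#else
/* Stub: without userfaultfd support no VMA can ever be armed. */
static inline bool userfaultfd_armed(const struct vm_area_struct *vma)
{
	(void)vma;
	return false;
}
#endif

/*
 * Hypothetical helper mirroring the patched condition in
 * try_to_unmap_one(): honour the "unused" hint only when no
 * userfaultfd is armed on the VMA.
 */
static inline bool may_discard_unused_pte(bool pte_is_unused,
					  const struct vm_area_struct *vma)
{
	return pte_is_unused && !userfaultfd_armed(vma);
}
```

With the stub in place the condition collapses to plain pte_unused() on
kernels built without userfaultfd, which matches the pre-patch behaviour.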
> Still: more opinions on the patch itself?
If the only use case for the pte_unused() hint is a guest notifying the
host about unused pages, the patch seems OK to me.
> > /*
> > * The guest indicated that the page content is of no
> > * interest anymore. Simply discard the pte, vmscan
> >
>
--
Sincerely yours,
Mike.