From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-18.9 required=3.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_PATCH,MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED,USER_AGENT_SANE_1,USER_IN_DEF_DKIM_WL autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 95D59C56201 for ; Wed, 11 Nov 2020 07:41:59 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id E61342076E for ; Wed, 11 Nov 2020 07:41:58 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b="XwLW/dm2" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org E61342076E Authentication-Results: mail.kernel.org; dmarc=fail (p=reject dis=none) header.from=google.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 1FB586B0036; Wed, 11 Nov 2020 02:41:58 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 1ABD66B005D; Wed, 11 Nov 2020 02:41:58 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 023356B0068; Wed, 11 Nov 2020 02:41:57 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0134.hostedemail.com [216.40.44.134]) by kanga.kvack.org (Postfix) with ESMTP id C49366B0036 for ; Wed, 11 Nov 2020 02:41:57 -0500 (EST) Received: from smtpin16.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with ESMTP id 6E0818249980 for ; Wed, 11 Nov 2020 07:41:57 +0000 (UTC) X-FDA: 77471343474.16.end45_1215bf2272fc Received: from filter.hostedemail.com (10.5.16.251.rfc1918.com [10.5.16.251]) by smtpin16.hostedemail.com (Postfix) with ESMTP id 4DFC3100E690C for ; Wed, 11 Nov 2020 07:41:57 +0000 (UTC) X-HE-Tag: end45_1215bf2272fc X-Filterd-Recvd-Size: 7449 Received: from mail-ot1-f67.google.com (mail-ot1-f67.google.com [209.85.210.67]) by imf46.hostedemail.com (Postfix) with ESMTP for ; Wed, 11 Nov 2020 07:41:56 +0000 (UTC) Received: by mail-ot1-f67.google.com with SMTP id j14so1332846ots.1 for ; Tue, 10 Nov 2020 23:41:56 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:from:to:cc:subject:in-reply-to:message-id:references :user-agent:mime-version; bh=mDvH6OSBbzyVCoxpj+jRSF1gIjRVsvCyNLrBtPFpWYQ=; b=XwLW/dm2fkNO4NNgVz1CcrM/+4th1lp0Jt1bmAwnUZGl5dzjKemMasbPgwGnm7sQkg /PwXw9SmI+nYdxKqcd1DcJmG+OC6tv+/684uyIJnF5bYkgBz1J4Pd7XPZo0diSb6YqZT 6i7lJbk2dzaAogV6yRKFpbSWLUwZZ26EgXioV654U6SiWxCY0kfRq9bCvmkfJk1g1Rq/ yptFOUeq1uRAUi2CdvOUt/xavBHdVg7LLBhjyZTVVa/qAuIyzosmWgKaxwLu24+W3BAi DBDDEVugLoEVZNqTylkB7ei6iYsePtTj7jhWn5yAR6P/l0GMD6S8sEHGHd8Mgeu6afRl ZA4w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:in-reply-to:message-id :references:user-agent:mime-version; bh=mDvH6OSBbzyVCoxpj+jRSF1gIjRVsvCyNLrBtPFpWYQ=; b=buDYyuW4aqnjC9NtG9C0U46dQYeztkbN//VkS87u3o6Vjq0YVcFkcP7+V9qXOyo0Pm VGLF7TmUQq17+B6jJmodFioLgeczyCmVkuk3+bHbvFqaWrUtFoqKK6OofC2R/zC9F6xN +e5IoBOryFF9z6TM6r0jJGO+oUSyLfCQOKT+ceAgIIRbo+bRcLUVKwm9ztN3NqWtuoNt xSGk6g+o7Nir/0Wj65SfjR+bjzzjcccd4UTQfa3maVcUiElZ2mhNYGKuhkynb4s4XWR6 LuWIGoiHEdbxl3pdx5HWozvtNEa/Q446Zu83HTZiDBTORAnebCutONEDZo2AYGm/12RF lpmQ== X-Gm-Message-State: AOAM5331/0ByigfnQPGO37qI0/bmDg1vxTTee0kwCAw+kAaStSwW2ilb BpOuBPMJ5snHXS9MXr4H0uAnXA== X-Google-Smtp-Source: ABdhPJwAivedMmWYwGgc1FfOiGGj3e+wOAuSvxLz21RZV1BQwY9PJXEaLLPwifr5JqEH/2C/fhIcnA== X-Received: by 2002:a9d:6647:: with SMTP id q7mr17517045otm.196.1605080515950; Tue, 10 Nov 2020 23:41:55 -0800 (PST) Received: from eggly.attlocal.net (172-10-233-147.lightspeed.sntcca.sbcglobal.net. [172.10.233.147]) by smtp.gmail.com with ESMTPSA id 2sm276688oir.40.2020.11.10.23.41.53 (version=TLS1 cipher=ECDHE-ECDSA-AES128-SHA bits=128/128); Tue, 10 Nov 2020 23:41:55 -0800 (PST) Date: Tue, 10 Nov 2020 23:41:52 -0800 (PST) From: Hugh Dickins X-X-Sender: hugh@eggly.anvils To: Alex Shi cc: Andrew Morton , mgorman@techsingularity.net, tj@kernel.org, hughd@google.com, khlebnikov@yandex-team.ru, daniel.m.jordan@oracle.com, willy@infradead.org, hannes@cmpxchg.org, lkp@intel.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, cgroups@vger.kernel.org, shakeelb@google.com, iamjoonsoo.kim@lge.com, richard.weiyang@gmail.com, kirill@shutemov.name, alexander.duyck@gmail.com, rong.a.chen@intel.com, mhocko@suse.com, vdavydov.dev@gmail.com, shy828301@gmail.com, Minchan Kim Subject: Re: [PATCH v21 06/19] mm/rmap: stop store reordering issue on page->mapping In-Reply-To: Message-ID: References: <1604566549-62481-1-git-send-email-alex.shi@linux.alibaba.com> <1604566549-62481-7-git-send-email-alex.shi@linux.alibaba.com> User-Agent: Alpine 2.11 (LSU 23 2013-08-11) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Fri, 6 Nov 2020, Alex Shi wrote: > > updated for comments change from Johannes > > > From 2fd278b1ca6c3e260ad249808b62f671d8db5a7b Mon Sep 17 00:00:00 2001 > From: Alex Shi > Date: Thu, 5 Nov 2020 11:38:24 +0800 > Subject: [PATCH v21 06/19] mm/rmap: stop store reordering issue on > page->mapping > > Hugh Dickins and Minchan Kim observed a long time issue which > discussed here, but actully the mentioned fix missed. > https://lore.kernel.org/lkml/20150504031722.GA2768@blaptop/ > The store reordering may cause problem in the scenario: > > CPU 0 CPU1 > do_anonymous_page > page_add_new_anon_rmap() > page->mapping = anon_vma + PAGE_MAPPING_ANON > lru_cache_add_inactive_or_unevictable() > spin_lock(lruvec->lock) > SetPageLRU() > spin_unlock(lruvec->lock) > /* idletacking judged it as LRU > * page so pass the page in > * page_idle_clear_pte_refs > */ > page_idle_clear_pte_refs > rmap_walk > if PageAnon(page) > > Johannes give detailed examples how the store reordering could cause > a trouble: > "The concern is the SetPageLRU may get reorder before 'page->mapping' > setting, That would make CPU 1 will observe at page->mapping after > observing PageLRU set on the page. > > 1. anon_vma + PAGE_MAPPING_ANON > > That's the in-order scenario and is fine. > > 2. NULL > > That's possible if the page->mapping store gets reordered to occur > after SetPageLRU. That's fine too because we check for it. > > 3. anon_vma without the PAGE_MAPPING_ANON bit > > That would be a problem and could lead to all kinds of undesirable > behavior including crashes and data corruption. > > Is it possible? AFAICT the compiler is allowed to tear the store to > page->mapping and I don't see anything that would prevent it. > > That said, I also don't see how the reader testing PageLRU under the > lru_lock would prevent that in the first place. AFAICT we need that > WRITE_ONCE() around the page->mapping assignment." > > Signed-off-by: Alex Shi > Cc: Johannes Weiner > Cc: Andrew Morton > Cc: Hugh Dickins Acked-by: Hugh Dickins Many thanks to Johannes for spotting my falsehood in the next patch, and to Alex for making it true with this patch. As I just remarked against the v20, I do have some more of these WRITE_ONCEs, but consider them merely theoretical: so please don't let me hold this series up. Andrew, I am hoping that Alex's v21 will appear in the next mmotm? Thanks, Hugh > Cc: Matthew Wilcox > Cc: Minchan Kim > Cc: Vladimir Davydov > Cc: linux-kernel@vger.kernel.org > Cc: linux-mm@kvack.org > --- > mm/rmap.c | 8 +++++++- > 1 file changed, 7 insertions(+), 1 deletion(-) > > diff --git a/mm/rmap.c b/mm/rmap.c > index 1b84945d655c..380c6b9956c2 100644 > --- a/mm/rmap.c > +++ b/mm/rmap.c > @@ -1054,8 +1054,14 @@ static void __page_set_anon_rmap(struct page *page, > if (!exclusive) > anon_vma = anon_vma->root; > > + /* > + * page_idle does a lockless/optimistic rmap scan on page->mapping. > + * Make sure the compiler doesn't split the stores of anon_vma and > + * the PAGE_MAPPING_ANON type identifier, otherwise the rmap code > + * could mistake the mapping for a struct address_space and crash. > + */ > anon_vma = (void *) anon_vma + PAGE_MAPPING_ANON; > - page->mapping = (struct address_space *) anon_vma; > + WRITE_ONCE(page->mapping, (struct address_space *) anon_vma); > page->index = linear_page_index(vma, address); > } > > -- > 1.8.3.1