From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.0 required=3.0 tests=DKIM_INVALID,DKIM_SIGNED, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS, USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 9C542C33CAF for ; Mon, 13 Jan 2020 11:07:25 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id F1F6D207E0 for ; Mon, 13 Jan 2020 11:07:24 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=fail reason="signature verification failed" (1024-bit key) header.d=yandex-team.ru header.i=@yandex-team.ru header.b="wp6/HAL4" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org F1F6D207E0 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=yandex-team.ru Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 794048E0005; Mon, 13 Jan 2020 06:07:24 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 744428E0003; Mon, 13 Jan 2020 06:07:24 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 633888E0005; Mon, 13 Jan 2020 06:07:24 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0219.hostedemail.com [216.40.44.219]) by kanga.kvack.org (Postfix) with ESMTP id 4A4818E0003 for ; Mon, 13 Jan 2020 06:07:24 -0500 (EST) Received: from smtpin13.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay04.hostedemail.com (Postfix) with SMTP id E2F632495 for ; Mon, 13 Jan 2020 11:07:23 +0000 (UTC) X-FDA: 76372334766.13.crate29_5d820cf4ac20a X-HE-Tag: crate29_5d820cf4ac20a X-Filterd-Recvd-Size: 7852 Received: from forwardcorp1o.mail.yandex.net (forwardcorp1o.mail.yandex.net [95.108.205.193]) by imf46.hostedemail.com (Postfix) with ESMTP for ; Mon, 13 Jan 2020 11:07:22 +0000 (UTC) Received: from mxbackcorp1o.mail.yandex.net (mxbackcorp1o.mail.yandex.net [IPv6:2a02:6b8:0:1a2d::301]) by forwardcorp1o.mail.yandex.net (Yandex) with ESMTP id 0F5642E09CF; Mon, 13 Jan 2020 14:07:20 +0300 (MSK) Received: from myt4-18a966dbd9be.qloud-c.yandex.net (myt4-18a966dbd9be.qloud-c.yandex.net [2a02:6b8:c00:12ad:0:640:18a9:66db]) by mxbackcorp1o.mail.yandex.net (mxbackcorp/Yandex) with ESMTP id Rv5G6xBeJO-7JGKlpYd; Mon, 13 Jan 2020 14:07:19 +0300 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yandex-team.ru; s=default; t=1578913640; bh=7A8U24cNSxtKxM1dDLkIYVI0pQDqGLVjnFI8h+PV7AA=; h=In-Reply-To:Message-ID:From:Date:References:To:Subject:Cc; b=wp6/HAL4CpigeLPfs2sS6vI/ouXZYOKUuCs4qkjpx1YK9bKKSljNCyFCYsrOry+Ww Dbm/62EWNJbrUHxrw08QrPjMrgJHDC9jX4377k7a5NyBdW32mH6B3SQ75p+N0Hk692 pLB3gOwGicO1nJpFVeCAlfkOS2dJ/ECPE+0s1CLM= Authentication-Results: mxbackcorp1o.mail.yandex.net; dkim=pass header.i=@yandex-team.ru Received: from dynamic-red.dhcp.yndx.net (dynamic-red.dhcp.yndx.net [2a02:6b8:0:40c:8448:fbcc:1dac:c863]) by myt4-18a966dbd9be.qloud-c.yandex.net (smtpcorp/Yandex) with ESMTPSA id jjNaPQiYGK-7JV4ER96; Mon, 13 Jan 2020 14:07:19 +0300 (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) (Client certificate not present) Subject: Re: [PATCH v2 1/2] mm/rmap: fix and simplify reusing mergeable anon_vma as parent when fork To: Wei Yang Cc: Li Xinhai , "linux-mm@kvack.org" , akpm , "linux-kernel@vger.kernel.org" , Rik van Riel , "kirill.shutemov" References: <20200108023211.GC13943@richard> <20200109025240.GA2000@richard> <20200110023029.GB16823@richard> <20200110112357351531132@gmail.com> <20200110053442.GA27846@richard> <20200111223820.GA15506@richard> <20200113003343.GA27210@richard> From: Konstantin Khlebnikov Message-ID: <1cf002fa-a3cb-bcef-57dc-ac9c09dcf2eb@yandex-team.ru> Date: Mon, 13 Jan 2020 14:07:18 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.2.2 MIME-Version: 1.0 In-Reply-To: <20200113003343.GA27210@richard> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-CA Content-Transfer-Encoding: quoted-printable X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On 13/01/2020 03.33, Wei Yang wrote: > On Sun, Jan 12, 2020 at 12:55:45PM +0300, Konstantin Khlebnikov wrote: >> >> >> On 12/01/2020 01.38, Wei Yang wrote: >>> On Fri, Jan 10, 2020 at 11:11:23AM +0300, Konstantin Khlebnikov wrote= : >>> [...] >>>>>>>> >>>>>>>> series of vma in parent with shared AV: >>>>>>>> >>>>>>>> SRC1 - AV0 >>>>>>>> SRC2 - AV0 >>>>>>>> SRC3 - AV0 >>>>>>>> ... >>>>>>>> SRCn - AV0 >>>>>>>> >>>>>>>> in child after fork >>>>>>>> >>>>>>>> DST1 - AV_OLD_1 (some old vma, picked by anon_vma_clone) plus DS= T1 is attached to same AVs as SRC1 >>>>>>>> DST2 - AV_OLD_2 (other old vma) plus DST1 is attached to same AV= s as SRC2 >>>>>>>> DST2 - AV1 prev AV parent does not match AV0, no old vma found f= or reusing -> allocate new one (child of AV0) >>>>>>>> DST3 - AV1 - DST2->AV->parent =3D=3D SRC3->AV (AV0) -> share AV = with prev >>>>>>>> DST4 - AV1 - same thing >>>>>>>> ... >>>>>>>> DSTn - AV1 >>>>>>>> >>> >>> To focus on the point, I rearranged the order a little. Suppose your = following >>> comments is explaining the above behavior. >>> >>> I've illustrated how two heuristics (reusing-old and sharing-pre= v) _could_ work together. >>> But they both are optional. >>> At cloning first vma SRC1 -> DST1 there is no prev to share anon= vma, >>> thus works common code which _could_ reuse old vma because it ha= ve to. >>> If there is no old anon-vma which have to be reused then DST1 wi= ll allocate >>> new anon-vma (AV1) and it will be used by DST2 and so on like on= your picture. >>> >>> I agree with your 3rd paragraph, but confused with 2nd. >>> >>> At cloning first vma SRC1 -> DST1, there is no prev so anon_vma_clone= () would >>> pick up a reusable anon_vma. Here you named it AV_OLD_1. This looks g= ood to >>> me. But I am not sure why you would picked up AV_OLD_2 for DST2? In p= arent, >>> SRC1 and SRC2 has the same anon_vma, AV0. So in child, DST1 and DST2 = could >>> also share the same anon_vma, AV_OLD_1. >>> >>> Sorry for my poor understanding, would you mind giving me more hint o= n this >>> change? >> >> For DST2 heuristic "share-with-prev" will not work because if prev (DS= T1) >> uses old AV (AV_OLD_1) and AV_OLD_1->parent isn't SRC2->AV (AV0). >> So DST2 could only pick another old AV or allocate new. >=20 > I know this behavior after your change, my question is why you want to = do so. Because I want to keep both heuristics. This seems most sane way of interaction between them. Unfortunately even this patch is slightly broken. Condition prev->anon_vma->parent =3D=3D pvma->anon_vma doesn't guarantee = that prev vma has the same set of anon-vmas like current vma. I.e. anon_vma_clone(vma, prev) might be not enough for keeping connectivi= ty. Building such case isn't trivial job but I see nothing that could prevent= it. >=20 >> >> My patch uses condition dst->prev->anon_vma->parent =3D=3D src->anon_v= ma rather >> than obvious src->prev->anon_vma =3D=3D src->anon_vma because in this = way it >> eliminates all unwanted corner cases and explicitly verifies that we g= oing to >> share related anon-vma. >> >=20 > This do eliminates some corner case, but as you showed child and parent= don't > share the same AV topology. To keep the same AV topology is the purpose= of my > commit. >=20 > I agree you found some bug that previous commit doesn't do it is expect= ed. But > since you change the design a little, I suggest you split this idea to = a > separate patch so that reviewer and audience in the future could unders= tand > your approach clearly. Otherwise audience would be confused and hard to= track > this change. >=20 > For example, you describe the behavior after your change. The second vm= a would > probably have a different AV from first vma. >=20 >> Heuristic "reuse-old" uses fact that VMA links and AV parent chain are= tracked >> independently: when VMA reuses old AV it still links to all related AV= even >> if VMA->AV points into some old AV in the middle of inheritance chain. >> >>> >>>>>>> >>>>>>> Yes, your code works for DST3..DSTn. They will pick up AV1 since >>>>>>> (DST2->AV->parent =3D=3D SRC3->AV). >>>>>>> >>>>>>> My question is why DST1 and DST2 has different AV? The purpose of= my patch >>>>>>> tries to make child has the same topology and parent. So the idea= l look of >>>>>>> child is: >>>>>>> >>>>>>> DST1 - AV1 >>>>>>> DST2 - AV1 >>>>>>> DST2 - AV1 >>>>>>> DST3 - AV1 >>>>>>> DST4 - AV1 >>>>>>> >>>>>>> Would you mind putting more words on DST1 and DST2? I didn't full= y understand >>>>>>> the logic here. >>>>>>> >>>>>>> Thanks >>>>>>> >>>>>> >>>>>> I think that the first version is doing the work as you expected, = but been >>>>>> revised in second version, to limits the number of users of reused= old >>>>>> anon(which=C2=A0is picked=C2=A0in anon_vma_clone() and keep the tr= ee structure. >>>>>> >>>>> >>>>> Any reason to reduce the reuse? Maybe I lost some point. >>>> >>>>> >>>>>>> -- >>>>>>> Wei Yang >>>>>>> Help you, Help me >>>>> >>> >