From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 70EE0C19F28 for ; Wed, 3 Aug 2022 14:42:24 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id C9C4B6B0072; Wed, 3 Aug 2022 10:42:23 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id C245D6B0073; Wed, 3 Aug 2022 10:42:23 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id A9EB18E0001; Wed, 3 Aug 2022 10:42:23 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id 9511C6B0072 for ; Wed, 3 Aug 2022 10:42:23 -0400 (EDT) Received: from smtpin23.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay08.hostedemail.com (Postfix) with ESMTP id 6B495141526 for ; Wed, 3 Aug 2022 14:42:23 +0000 (UTC) X-FDA: 79758546966.23.D2A229C Received: from mail-pl1-f174.google.com (mail-pl1-f174.google.com [209.85.214.174]) by imf20.hostedemail.com (Postfix) with ESMTP id DB70E1C012D for ; Wed, 3 Aug 2022 14:42:22 +0000 (UTC) Received: by mail-pl1-f174.google.com with SMTP id o3so16588898ple.5 for ; Wed, 03 Aug 2022 07:42:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:from:to:cc; bh=WbLojLDf4bnSMua5Syjpq38ZqCfDt7KDJFslMAkFjxI=; b=S8oyKUBy69riYkQuhYnwr0bc7Xvahe0VOOFXz4BtcQ0Obxpsdkj/f3ARt4Mt/OUsCb DsrJXH1aF/JgDNByaDUbxmaljW2v3JxByMH8iLWSiayTU5HbZCNMaZqiqvwzNiTOjjYT dSe2qtNn5OyY/2iIiipeDJWnj9+8KugXVn/Gh1KEZyNk2izo4jPnWiscetPUQPS9YA5a KuaMdniensvSwTajg+cFCupR8Fse6ozUzNza11V9wBVMEgrMl8EoawPzLyociiYgPofZ qs4D88AaLCVfbZBTf1QVpfkIjxi1VjVXMa7LR98wadzIA8n6LRzLHGWMpdZ6eyKnF0ic g4CA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:x-gm-message-state:from:to:cc; bh=WbLojLDf4bnSMua5Syjpq38ZqCfDt7KDJFslMAkFjxI=; b=TjqxKpHqI56Gk6iBaGgDFmRVXFAAFtuKkAAB8gnR0DET5pxFzOhxUfsvS2FWOrIzxL mRLT0LSserQ6aHCSRkVXdfGJahrvkiNPzYKcAuK2PSsh+k860+zL6R12S55VySfAb4EZ kdNaO9yFNrj78XKQvLB2YF0CaIwmnmW6EnFH3ys4jQL2N5UEnT2X6PGZF9tKBBA8gZmZ R3vksH04mKpzwWd16xOs3VkyvgPlKwCn/C8Y/Wvn+A8zqwDaH1zM4WRBeWWp6MNlR7Zt zjOkzYLcDPDBekPaNUjXG56lzy1Tf5SWYlD4dQbEnFfJ0ZFQah0QJKuES22cuph2sXMg w76A== X-Gm-Message-State: ACgBeo2YADLuKrSJMVjx+5C0EdPIT6emUB74HkOqMjE9KRdF/dKboc42 Hi9TBB9pJut3ET7d8P799EzSSQ== X-Google-Smtp-Source: AA6agR4YfMFOGAmga5CfWpVWHyya4teDhnGUJy+qkkGHA09bRr9mP40ayb7pIQHXCvX5PQKdF82pIw== X-Received: by 2002:a17:90b:350c:b0:1f5:e4b:2884 with SMTP id ls12-20020a17090b350c00b001f50e4b2884mr5338018pjb.46.1659537741705; Wed, 03 Aug 2022 07:42:21 -0700 (PDT) Received: from google.com (7.104.168.34.bc.googleusercontent.com. [34.168.104.7]) by smtp.gmail.com with ESMTPSA id u62-20020a627941000000b0052ac12e7596sm3118097pfc.114.2022.08.03.07.42.20 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 03 Aug 2022 07:42:20 -0700 (PDT) Date: Wed, 3 Aug 2022 14:42:17 +0000 From: Sean Christopherson To: Chao Peng Cc: kvm@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-api@vger.kernel.org, linux-doc@vger.kernel.org, qemu-devel@nongnu.org, linux-kselftest@vger.kernel.org, Paolo Bonzini , Jonathan Corbet , Vitaly Kuznetsov , Wanpeng Li , Jim Mattson , Joerg Roedel , Thomas Gleixner , Ingo Molnar , Borislav Petkov , x86@kernel.org, "H . Peter Anvin" , Hugh Dickins , Jeff Layton , "J . Bruce Fields" , Andrew Morton , Shuah Khan , Mike Rapoport , Steven Price , "Maciej S . Szmigiero" , Vlastimil Babka , Vishal Annapurve , Yu Zhang , "Kirill A . Shutemov" , luto@kernel.org, jun.nakajima@intel.com, dave.hansen@intel.com, ak@linux.intel.com, david@redhat.com, aarcange@redhat.com, ddutile@redhat.com, dhildenb@redhat.com, Quentin Perret , Michael Roth , mhocko@suse.com, Muchun Song Subject: Re: [PATCH v7 09/14] KVM: Extend the memslot to support fd-based private memory Message-ID: References: <20220706082016.2603916-1-chao.p.peng@linux.intel.com> <20220706082016.2603916-10-chao.p.peng@linux.intel.com> <20220803100835.GD607465@chaop.bj.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20220803100835.GD607465@chaop.bj.intel.com> ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1659537743; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=WbLojLDf4bnSMua5Syjpq38ZqCfDt7KDJFslMAkFjxI=; b=ce+Erb5b+yNeXqqAp+08hRt5hYllb+rPgDc9QtUKXVxjUqerAu93JcAy9al/PEN4YxayK6 x1+5R8f1E4KUI5iID3WTs/SZN+TZ6CVkkloJCdLG41TqgPW0z9CUw28WMElGQhPYDN0jMD ULTEFACsjwo9uQjCn8oDXSh7VDw77EE= ARC-Authentication-Results: i=1; imf20.hostedemail.com; dkim=pass header.d=google.com header.s=20210112 header.b=S8oyKUBy; spf=pass (imf20.hostedemail.com: domain of seanjc@google.com designates 209.85.214.174 as permitted sender) smtp.mailfrom=seanjc@google.com; dmarc=pass (policy=reject) header.from=google.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1659537743; a=rsa-sha256; cv=none; b=K1RVT6ERwThWn243edyBVJfn0DlcmWxPX/IHkwhmKkKDRP3G1u9d8lEEenqHa41kqJQPNh 7Y5n+rs9WnIBZVHubvCLjhp9CSq0LWzztz1Rmen7GdH/G/jWUwFvaeeuo1cXWgpelD1RDF qVJl6ZbWonZbXBolDy3Apn8vxiC/gVs= X-Stat-Signature: hkpw8zoacjp6dot5rxywme1ntaa7easi X-Rspam-User: X-Rspamd-Server: rspam03 X-Rspamd-Queue-Id: DB70E1C012D Authentication-Results: imf20.hostedemail.com; dkim=pass header.d=google.com header.s=20210112 header.b=S8oyKUBy; spf=pass (imf20.hostedemail.com: domain of seanjc@google.com designates 209.85.214.174 as permitted sender) smtp.mailfrom=seanjc@google.com; dmarc=pass (policy=reject) header.from=google.com X-HE-Tag: 1659537742-323558 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Wed, Aug 03, 2022, Chao Peng wrote: > On Fri, Jul 29, 2022 at 07:51:29PM +0000, Sean Christopherson wrote: > > On Wed, Jul 06, 2022, Chao Peng wrote: > > > @@ -1332,9 +1332,18 @@ yet and must be cleared on entry. > > > __u64 userspace_addr; /* start of the userspace allocated memory */ > > > }; > > > > > > + struct kvm_userspace_memory_region_ext { > > > + struct kvm_userspace_memory_region region; > > > + __u64 private_offset; > > > + __u32 private_fd; > > > + __u32 pad1; > > > + __u64 pad2[14]; > > > +}; > > > + > > > /* for kvm_memory_region::flags */ > > > #define KVM_MEM_LOG_DIRTY_PAGES (1UL << 0) > > > #define KVM_MEM_READONLY (1UL << 1) > > > + #define KVM_MEM_PRIVATE (1UL << 2) > > > > Very belatedly following up on prior feedback... > > > > | I think a flag is still needed, the problem is private_fd can be safely > > | accessed only when this flag is set, e.g. without this flag, we can't > > | copy_from_user these new fields since they don't exist for previous > > | kvm_userspace_memory_region callers. > > > > I forgot about that aspect of things. We don't technically need a dedicated > > PRIVATE flag to handle that, but it does seem to be the least awful soltuion. > > We could either add a generic KVM_MEM_EXTENDED_REGION or an entirely new > > ioctl(), e.g. KVM_SET_USER_MEMORY_REGION2, but in both approaches there's a decent > > chance that we'll end up needed individual "this field is valid" flags anways. > > > > E.g. if KVM requires pad1 and pad2 to be zero to carve out future extensions, > > then we're right back here if some future extension needs to treat '0' as a legal > > input. > > I had such practice (always rejecting none-zero 'pad' value when > introducing new user APIs) in other project previously, but I rarely > see that in KVM. Ya, KVM often uses flags to indicate the validity of a field specifically so that KVM doesn't misinterpret a '0' from an older userspace as an intended value.