From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-18.0 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_CR_TRAILER,INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED,USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id EBCC6C4338F for ; Wed, 11 Aug 2021 02:23:36 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 6C36860F11 for ; Wed, 11 Aug 2021 02:23:36 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.4.1 mail.kernel.org 6C36860F11 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=redhat.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=kvack.org Received: by kanga.kvack.org (Postfix) id D68296B0071; Tue, 10 Aug 2021 22:23:35 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id D18258D0003; Tue, 10 Aug 2021 22:23:35 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id C07E28D0002; Tue, 10 Aug 2021 22:23:35 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0037.hostedemail.com [216.40.44.37]) by kanga.kvack.org (Postfix) with ESMTP id A89FF6B0071 for ; Tue, 10 Aug 2021 22:23:35 -0400 (EDT) Received: from smtpin37.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with ESMTP id 3DAC78249980 for ; Wed, 11 Aug 2021 02:23:35 +0000 (UTC) X-FDA: 78461203590.37.5286630 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by imf03.hostedemail.com (Postfix) with ESMTP id B51C23001C30 for ; Wed, 11 Aug 2021 02:23:34 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1628648614; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=M7ABmyJsxGwiJCfUUfU3UN9zaJoWL+WLYQ0wCYfuKKI=; b=AIo86rrIS1m5FDFJUlNnSMUQRuCk1fQ36YzTAnV0O0aSQVk58j8O6Lbc8AxebA+j6EL4dy M6g7LUIPLnJ9P8hHfYGWvpt+90Uh/v4rUwnbWJXgj42PBpa9jVkD/P4arVGQkkVivAtvne ML7+K93xJ8Dz6CF63Fww1xIZL7jrBug= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-350-NCQm-SeDOkKnlT9w_3IBvw-1; Tue, 10 Aug 2021 22:23:33 -0400 X-MC-Unique: NCQm-SeDOkKnlT9w_3IBvw-1 Received: from smtp.corp.redhat.com (int-mx05.intmail.prod.int.phx2.redhat.com [10.5.11.15]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 40D24824F83; Wed, 11 Aug 2021 02:23:31 +0000 (UTC) Received: from localhost (ovpn-12-209.pek2.redhat.com [10.72.12.209]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 1FCE55D740; Wed, 11 Aug 2021 02:23:29 +0000 (UTC) Date: Wed, 11 Aug 2021 10:23:27 +0800 From: Baoquan He To: Tom Lendacky , rientjes@google.com Cc: Robin Murphy , Christoph Hellwig , linux-kernel@vger.kernel.org, linux-mm@kvack.org, x86@kernel.org, rppt@linux.ibm.com, brijesh.singh@amd.com, kexec@lists.infradead.org, iommu@lists.linux-foundation.org, m.szyprowski@samsung.com Subject: Re: [PATCH RFC 0/2] dma-pool: allow user to disable atomic pool Message-ID: <20210811022327.GF23322@MiWiFi-R3L-srv> References: <20210624052010.5676-1-bhe@redhat.com> <20210624092930.GA802261@MiWiFi-R3L-srv> <8b3d4e02-6e94-ad59-a480-fed8e55c009a@arm.com> <20210805065410.GA2051@MiWiFi-R3L-srv> <5df93c4e-ffdd-88c5-db4a-1a09826d047a@amd.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <5df93c4e-ffdd-88c5-db4a-1a09826d047a@amd.com> User-Agent: Mutt/1.10.1 (2018-07-13) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.15 Authentication-Results: imf03.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=AIo86rrI; spf=none (imf03.hostedemail.com: domain of bhe@redhat.com has no SPF policy when checking 170.10.133.124) smtp.mailfrom=bhe@redhat.com; dmarc=pass (policy=none) header.from=redhat.com X-Rspamd-Server: rspam05 X-Rspamd-Queue-Id: B51C23001C30 X-Stat-Signature: ntxe4a63zqqckaxyaannf49t61j99pc6 X-HE-Tag: 1628648614-842537 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On 08/10/21 at 03:52pm, Tom Lendacky wrote: > On 8/5/21 1:54 AM, Baoquan He wrote: > > On 06/24/21 at 11:47am, Robin Murphy wrote: > >> On 2021-06-24 10:29, Baoquan He wrote: > >>> On 06/24/21 at 08:40am, Christoph Hellwig wrote: > >>>> So reduce the amount allocated. But the pool is needed for proper > >>>> operation on systems with memory encryption. And please add the right > >>>> maintainer or at least mailing list for the code you're touching next > >>>> time. > >>> > >>> Oh, I thoutht it's memory issue only, should have run > >>> ./scripts/get_maintainer.pl. sorry. > >>> > >>> About reducing the amount allocated, it may not help. Because on x86_64, > >>> kdump kernel doesn't put any page of memory into buddy allocator of DMA > >>> zone. Means it will defenitely OOM for atomic_pool_dma initialization. > >>> > >>> Wondering in which case or on which device the atomic pool is needed on > >>> AMD system with mem encrytion enabled. As we can see, the OOM will > >>> happen too in kdump kernel on Intel system, even though it's not > >>> necessary. > > > > Sorry for very late response, and thank both for your comments. > > > >> > >> Hmm, I think the Kconfig reshuffle has actually left a slight wrinkle here. > >> For DMA_DIRECT_REMAP=y we can assume an atomic pool is always needed, since > >> that was the original behaviour anyway. However the implications of > >> AMD_MEM_ENCRYPT=y are different - even if support is enabled, it still > >> should only be relevant if mem_encrypt_active(), so it probably does make > >> sense to have an additional runtime gate on that. > > > >> > >> From a quick scan, use of dma_alloc_from_pool() already depends on > >> force_dma_unencrypted() so that's probably fine already, but I think we'd > >> need a bit of extra protection around dma_free_from_pool() to prevent > >> gen_pool_has_addr() dereferencing NULL if the pools are uninitialised, even > >> with your proposed patch as it is. Presumably nothing actually called > >> dma_direct_free() when you tested this? > > > > Yes, enforcing the conditional check of force_dma_unencrypted() around > > dma_free_from_pool sounds reasonable, just as we have done in > > dma_alloc_from_pool(). > > > > I have tested this patchset on normal x86_64 systems and one amd system > > with SME support, disabling atomic pool can fix the issue that there's no > > managed pages in dma zone then requesting page from dma zone will cause > > allocation failure. And even disabling atomic pool in 1st kernel didn't > > cause any problem on one AMD EPYC system which supports SME. I am not > > expert of DMA area, wondering how atomic pool is supposed to do in > > SME/SEV system. > > I think the atomic pool is used by the NVMe driver. My understanding is > that driver will do a dma_alloc_coherent() from interrupt context, so it > needs to use GFP_ATOMIC. The pool was created because dma_alloc_coherent() > would perform a set_memory_decrypted() call, which can sleep. The pool > eliminates that issue (David can correct me if I got that wrong). Thanks for the information. While from the code comment around which atomic pool is requested, on amd system it's used to satisfy decrypting memory because that can't block. Combined with your saying, it's because NVMe driver need decrypted memory on AMD system? void *dma_direct_alloc(struct device *dev, size_t size, dma_addr_t *dma_handle, gfp_t gfp, unsigned long attrs) { ...... /* * Remapping or decrypting memory may block. If either is required and * we can't block, allocate the memory from the atomic pools. */ if (IS_ENABLED(CONFIG_DMA_COHERENT_POOL) && !gfpflags_allow_blocking(gfp) && (force_dma_unencrypted(dev) || (IS_ENABLED(CONFIG_DMA_DIRECT_REMAP) && !dev_is_dma_coherent(dev)))) return dma_direct_alloc_from_pool(dev, size, dma_handle, gfp); ...... } Looking at the those related commits, the below one from David tells that atomic dma pool is used when device require non-blocking and unencrypted buffer. When I checked the system I borrowed, it's AMD EYPC and SME is enabled. And it has many pci devices, as you can see, its 'ls pci' outputs 113 lines. But disabling the three atomic pools didn't trigger any error on that AMD system. Does it mean only specific devices need this atomic pool in SME/SEV enabling case? Should we add more details in document or code comment to make clear this? commit 82fef0ad811fb5976cf36ccc3d2c3bc0195dfb72 Author: David Rientjes Date: Tue Apr 14 17:05:01 2020 -0700 x86/mm: unencrypted non-blocking DMA allocations use coherent pools When CONFIG_AMD_MEM_ENCRYPT is enabled and a device requires unencrypted DMA, all non-blocking allocations must originate from the atomic DMA coherent pools. Select CONFIG_DMA_COHERENT_POOL for CONFIG_AMD_MEM_ENCRYPT. Signed-off-by: David Rientjes Signed-off-by: Christoph Hellwig diff --git a/arch/x86/Kconfig b/arch/x86/Kconfig index 1d6104ea8af0..2bf2222819d3 100644 --- a/arch/x86/Kconfig +++ b/arch/x86/Kconfig @@ -1520,6 +1520,7 @@ config X86_CPA_STATISTICS config AMD_MEM_ENCRYPT bool "AMD Secure Memory Encryption (SME) support" depends on X86_64 && CPU_SUP_AMD + select DMA_COHERENT_POOL select DYNAMIC_PHYSICAL_MASK select ARCH_USE_MEMREMAP_PROT select ARCH_HAS_FORCE_DMA_UNENCRYPTED [~]# lscpu Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian CPU(s): 32 On-line CPU(s) list: 0-31 Thread(s) per core: 2 Core(s) per socket: 16 Socket(s): 1 NUMA node(s): 4 Vendor ID: AuthenticAMD BIOS Vendor ID: AMD CPU family: 23 Model: 1 Model name: AMD EPYC 7351P 16-Core Processor BIOS Model name: AMD EPYC 7351P 16-Core Processor Stepping: 2 CPU MHz: 2395.439 BogoMIPS: 4790.87 Virtualization: AMD-V L1d cache: 32K L1i cache: 64K L2 cache: 512K L3 cache: 8192K ...... [~]# lspci| wc -l 113 > > > > > > Besides, even though atomic pool is disabled, slub page for allocation > > of dma-kmalloc also triggers page allocation failure. So I change to > > take another way to fix them, please check v2 post. The atomic pool > > disabling an be a good to have change. > > > > Thanks > > Baoquan > > >