From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 34D70EB64D9 for ; Mon, 19 Jun 2023 07:48:13 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 9BE508D0002; Mon, 19 Jun 2023 03:48:12 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 945D68D0001; Mon, 19 Jun 2023 03:48:12 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 7E94E8D0002; Mon, 19 Jun 2023 03:48:12 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id 6BE8E8D0001 for ; Mon, 19 Jun 2023 03:48:12 -0400 (EDT) Received: from smtpin19.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay04.hostedemail.com (Postfix) with ESMTP id 3B9781A04D6 for ; Mon, 19 Jun 2023 07:48:12 +0000 (UTC) X-FDA: 80918719224.19.3445162 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by imf23.hostedemail.com (Postfix) with ESMTP id 6856D140016 for ; Mon, 19 Jun 2023 07:48:10 +0000 (UTC) Authentication-Results: imf23.hostedemail.com; dkim=pass header.d=linuxfoundation.org header.s=korg header.b=dxRKosA6; spf=pass (imf23.hostedemail.com: domain of gregkh@linuxfoundation.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=gregkh@linuxfoundation.org; dmarc=pass (policy=none) header.from=linuxfoundation.org ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1687160890; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=NgQKJYllQuNfcvSJYhSBYuswLNRYtJX7DqVxHEsSOPw=; b=jVQNurZlo9m4Y7Zr5dlgAJXxYRORbGSjqbhHSpQx70+MJUCtmHDBOmmxkYQ6PwseyJiEVt zYz8BsgAFVAnDNNyNahofJWSzMMrDEFiwkRaJe1Gh2zIBs8wHAF0RQMSI48YN9hLz0KyKP yNc3ol+xIZt6jYWhgxQs7oRH9aiuDxg= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1687160890; a=rsa-sha256; cv=none; b=3eJgu03zk9nvd4KDwiOlJkrCB0mB6sg+m5bSK4qzh2vx/vu7uUlAKjxbp+qG39jv+TUjt0 RgFunTB5kRY/4x66sXtCoWUOgb6orNr3zP1P10zFN8AN3eIP6rEsW4RetP32Qznaj/oj9P DIKOAD2BOvNHgFn1ZFpqvzz3d13hsoM= ARC-Authentication-Results: i=1; imf23.hostedemail.com; dkim=pass header.d=linuxfoundation.org header.s=korg header.b=dxRKosA6; spf=pass (imf23.hostedemail.com: domain of gregkh@linuxfoundation.org designates 139.178.84.217 as permitted sender) smtp.mailfrom=gregkh@linuxfoundation.org; dmarc=pass (policy=none) header.from=linuxfoundation.org Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 3BF2261546; Mon, 19 Jun 2023 07:48:09 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id F0D75C433C8; Mon, 19 Jun 2023 07:48:07 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1687160888; bh=+a/ZmnYkb2AqJDdYRdqv7LE893dBIQUn+SESlwMdA9k=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=dxRKosA64xqAOaG0oGVnfzu8t36uWhDRhUHHvQcfSFFa0m/RWsyanMWADslj3ldLB e0rTVaWx6PxfaO/yj8G1bxvlRe1E17c9WJP7i77YRl6LCh2YymtBd8DVKh4RuzaPLS Gt+qL6BWBVE8NThOt2UZ14SGrT+0O0arYWYNYaHs= Date: Mon, 19 Jun 2023 09:48:05 +0200 From: Greg KH To: mawupeng Cc: akpm@linux-foundation.org, david@redhat.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, stable@vger.kernel.org, richard.weiyang@linux.alibaba.com, mst@redhat.com, jasowang@redhat.com, pankaj.gupta.linux@gmail.com, mhocko@kernel.org, osalvador@suse.de Subject: Re: [PATCH stable 5.10] mm/memory_hotplug: extend offline_and_remove_memory() to handle more than one memory block Message-ID: <2023061956-unlinked-fossil-ec6a@gregkh> References: <20230619065121.1720912-1-mawupeng1@huawei.com> <2023061926-monoxide-pastor-fa3b@gregkh> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Rspamd-Queue-Id: 6856D140016 X-Rspam-User: X-Rspamd-Server: rspam11 X-Stat-Signature: cwsjojokymbeafmq3b7g7a7htyfpkqtq X-HE-Tag: 1687160890-654695 X-HE-Meta: U2FsdGVkX18SbrbNyT88PsUO5fCJ9mRJKgDvDc76/BCOSOw2AnsZD3746Dlj72EOtcFbNNv8aLV4lEwaAcuOikF2d3faWaXbRr1jK5FjvwaT4kGpCmKnNnEE7pA6mT9Ccidw/yL+DRj1ilKlJZf3mOSNJiTorRRlVZUBDRf4hHZXbApqHbvHm0MbAcBPdp1qSWN6VHCMkHsCfGWuTfDDDUDDJVuhLkFykEllziki2q+gtooXGEJkNGQy2uBwnb9IoJ/4Yd1uKqVKC6swA9Qkut3k0GtIf5c/Sru+U4/Zhuy9y7FAJKP92wBUmVoUews/HMIPQEQ1FfheB19Wr1twK6gH79o9jBHGpq9RWGrPeOaxJYhJjifoljr9t3Lvo7816G76pJdeQHjjmTHPm+VYjfWiQ7QvbiAjDv6AmgOvyJ1k6PKKP2gmZ39EFM2D8hyTSn31ML10u+uyziiqDd79qogVg/MTFIxHCxZXsMkc0zw7aDzcXLy8KVTGNngIeDpG/nx3gq6zOjeXFLzpVNYhsD7sbB8Gr/RbF6iclhe4PFeSPgbbKEoHlrX/wZYqGQ4oukOkWEeD18ynQePKg20XxOzpfQdjgs3o4StEqxJHWatp8fgwiXgwVzJ4w1f9xhZe20wROItasY4NbEU2pxDBuK8Z3gYtJGL0XoJ+92scFz3/Fs8EiwXhS681JVWPjn1YkM+PHJAfty7+IXnO/DJhBFwNnz38PisZYO9zyOXCDUeMpzRu5wbHUf1ZzekAZf+dtzKFuXYif2FP8kd2Wf3AsMIGDPT2WxJT0mOKC35Rx0mgDsfvpPYqQIw3U2BpBooebWDJ0L0iqF5ZEJsJwx+dNbrWhlktYf+B3R2ar9vgmGGdDyVwN7Zi2TPKOc72bxKVp7VRxDs4K7CKb7tY2QdF+YaLrZ3UDPoiLuLNZUKPX7ere4+cT7VxSlVg6p4mltiSBUAhpuuEZ3DayqbnXwy NkHdSmBW LZCDyvtm5k2AwZh//h9OrkyxQ+d+GdHSvKMpCIcqNd4CbOqsvuzkYTsprIZSWSXRmatyG4bNg4mg41fP9KLm9bYQOOxSbMkB9YJPm4JF/L3Ox6R/c6fSBVtsYlGv6FOb0HvAWiEZXhabsS/oEVcsNSHCtuEq85QWl4+yXKVhHWj/8nrWWU2lNWzLEClJWpa9cwutf3Vej8yej/FIH+TxzRhWe48GcVauoPa5bdcNDns8/vZMT+uZluHqzVnJd7hX+OEaG6oI/PIcQ5A9Hh5WOt2ag/eIEUshI7HVQn7ti/oHtwcY2CBz2QY6HpbjMoBuR6v74H6RVY89Fl0a7aSNqs20V3YnaRkIBaU9YEbeyj3FrpwVBPW0iy0NrEpzPvwN6KeSVVr13D24KjpglTw7GN4oW457chp80yLHwd4rQf4zqEls+GqSCF3cWzkwR6RKPPX/mTVnZLJg1bMO/aRZ75HpWZeAZsXgPvR9uS/LTq/77o59WWqwHuF2rXCVn+neFVQqofv8JK29d5uRwYp6kLeZ6OwZm82J00OKpU2xMSAO1DJpQcF0rtWWpdyBluwk8t8dc8wsGpaDdEjnMKKJCAv01FN+TUMk0BDupcLggKSalapHnX0PW/4fhgYVZbwHMVnCBQyeDv2BkQ7I= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Mon, Jun 19, 2023 at 03:22:20PM +0800, mawupeng wrote: > > > On 2023/6/19 15:16, Greg KH wrote: > > On Mon, Jun 19, 2023 at 02:51:21PM +0800, Wupeng Ma wrote: > >> From: David Hildenbrand > >> > >> commit 8dc4bb58a146655eb057247d7c9d19e73928715b upstream. > >> > >> virtio-mem soon wants to use offline_and_remove_memory() memory that > >> exceeds a single Linux memory block (memory_block_size_bytes()). Let's > >> remove that restriction. > >> > >> Let's remember the old state and try to restore that if anything goes > >> wrong. While re-onlining can, in general, fail, it's highly unlikely to > >> happen (usually only when a notifier fails to allocate memory, and these > >> are rather rare). > >> > >> This will be used by virtio-mem to offline+remove memory ranges that are > >> bigger than a single memory block - for example, with a device block > >> size of 1 GiB (e.g., gigantic pages in the hypervisor) and a Linux memory > >> block size of 128MB. > >> > >> While we could compress the state into 2 bit, using 8 bit is much > >> easier. > >> > >> This handling is similar, but different to acpi_scan_try_to_offline(): > >> > >> a) We don't try to offline twice. I am not sure if this CONFIG_MEMCG > >> optimization is still relevant - it should only apply to ZONE_NORMAL > >> (where we have no guarantees). If relevant, we can always add it. > >> > >> b) acpi_scan_try_to_offline() simply onlines all memory in case > >> something goes wrong. It doesn't restore previous online type. Let's do > >> that, so we won't overwrite what e.g., user space configured. > >> > >> Reviewed-by: Wei Yang > >> Cc: "Michael S. Tsirkin" > >> Cc: Jason Wang > >> Cc: Pankaj Gupta > >> Cc: Michal Hocko > >> Cc: Oscar Salvador > >> Cc: Wei Yang > >> Cc: Andrew Morton > >> Signed-off-by: David Hildenbrand > >> Link: https://lore.kernel.org/r/20201112133815.13332-28-david@redhat.com > >> Signed-off-by: Michael S. Tsirkin > >> Acked-by: Andrew Morton > >> Signed-off-by: Ma Wupeng > >> --- > >> mm/memory_hotplug.c | 105 +++++++++++++++++++++++++++++++++++++------- > >> 1 file changed, 89 insertions(+), 16 deletions(-) > >> > > > > Why is this needed in 5.10.y? Looks like a new feature to me, what > > problem does it solve there? > > > > thanks, > > > > greg k-h > > It do introduce a new feature. But at the same time, it fix a memleak introduced > in Commit 08b3acd7a68f ("mm/memory_hotplug: Introduce offline_and_remove_memory()" > > Our test find a memleak in init_memory_block, it is clear that mem is never > been released due to wrong refcount. Commit 08b3acd7a68f ("mm/memory_hotplug: > Introduce offline_and_remove_memory()") failed to dec refcount after > find_memory_block which fail to dec refcount to zero in remove memory > causing the leak. > > Commit 8dc4bb58a146 ("mm/memory_hotplug: extend offline_and_remove_memory() > to handle more than one memory block") introduce walk_memory_blocks to > replace find_memory_block which dec refcount by calling put_device after > find_memory_block_by_id. In the way, the memleak is fixed. > > Here is the simplified calltrace: > > kmem_cache_alloc_trace+0x664/0xed0 > init_memory_block+0x8c/0x170 > create_memory_block_devices+0xa4/0x150 > add_memory_resource+0x188/0x530 > __add_memory+0x78/0x104 > add_memory+0x6c/0xb0 Ok, thanks for the information, now queued up. greg k-h