From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 85D23C2BA18 for ; Thu, 20 Jun 2024 22:53:16 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id D42088D00F0; Thu, 20 Jun 2024 18:53:15 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id CF1678D00DB; Thu, 20 Jun 2024 18:53:15 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id BB90D8D00F0; Thu, 20 Jun 2024 18:53:15 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id 9E37F8D00DB for ; Thu, 20 Jun 2024 18:53:15 -0400 (EDT) Received: from smtpin19.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id 49E1916066C for ; Thu, 20 Jun 2024 22:53:15 +0000 (UTC) X-FDA: 82252769550.19.228C64D Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.11]) by imf20.hostedemail.com (Postfix) with ESMTP id E09311C0012 for ; Thu, 20 Jun 2024 22:53:12 +0000 (UTC) Authentication-Results: imf20.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=Ub3T5+W8; spf=none (imf20.hostedemail.com: domain of ak@linux.intel.com has no SPF policy when checking 198.175.65.11) smtp.mailfrom=ak@linux.intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1718923983; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=QifJ8FIAPHxEzhrgPgL4/QFdiCRVHJvFGU7WIkOwq9o=; b=bbx2oRGVZCc4UxFdAfVFEkSgrAABWmGVAtOPaENHZzSPp1GRtyfhSzXhPYJ13lYJ4J0/n6 zSmAsDYs0nzii/lZV5oYtiIpQ9F6bumJsWs29nq3Si6C/f9hBs0j0hRP5EzcMgjEqeo+Kl RpVKPdfccKoAgJDm8teS7SLjmQV59oE= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1718923983; a=rsa-sha256; cv=none; b=umKHkY88cZAMxyIko5uSud8OP5XNXSZXWVaUXNdLN5QNDwRkQtBLRorf4dkTwhwy2j0DXF QG1RH5m1DRvk9mYyBHJYeFimfXh7TwMn8/I1r1YRL92BBXZDTGPWtQRtT+BwuQJWG0PWg0 asIrvA6zvXowwMoAtQCiPHUX9CaDsqY= ARC-Authentication-Results: i=1; imf20.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=Ub3T5+W8; spf=none (imf20.hostedemail.com: domain of ak@linux.intel.com has no SPF policy when checking 198.175.65.11) smtp.mailfrom=ak@linux.intel.com; dmarc=pass (policy=none) header.from=intel.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1718923993; x=1750459993; h=from:to:cc:subject:in-reply-to:references:date: message-id:mime-version; bh=hDSr4XMNw/8z3vhBbwyVdj5ipvE/omFz38Ap/eQ8Ewo=; b=Ub3T5+W8m7vjiseqWB5f6ntjHCiLKFqTyAp6xxTEvewoLFbwc0aia3iS SLVwWBaVvz64a2Fk+BKUL2hgMDFAvpVzKXxXdK5y0YLvJZNqr8MfSCEPG Z7Ki00evZxFKzx2hLimRcy2f9CStqqU5E48gdBCk2L2zhPTAWz2vw2KIO 22nnNbZEtMJTmxRu5Q9hQcQ/bOQLg5FrTjUARZNTthx7Vy///ogth3LWJ n3O9v1JFrink9I5+N4wFSvoh3QJYqFrGpV9GTH43m8gqgZV3eLTWFUi/5 GTRT0u9vLAkIghkvlJFH9XUGnKd0D0tpxP6nagmHzQo65hVtWHS21lR8f Q==; X-CSE-ConnectionGUID: No6jqzEKTkOubpzCawFGKw== X-CSE-MsgGUID: wLfBdfnQTHSoDHsw6UeQ6Q== X-IronPort-AV: E=McAfee;i="6700,10204,11109"; a="26532632" X-IronPort-AV: E=Sophos;i="6.08,253,1712646000"; d="scan'208";a="26532632" Received: from orviesa001.jf.intel.com ([10.64.159.141]) by orvoesa103.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Jun 2024 15:53:11 -0700 X-CSE-ConnectionGUID: R+PsCL3ESuSVpJWXVCMXDw== X-CSE-MsgGUID: CvAYJ6PdQa2AwpCYXcSsPA== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.08,253,1712646000"; d="scan'208";a="79904515" Received: from tassilo.jf.intel.com (HELO tassilo.localdomain) ([10.54.38.190]) by orviesa001.jf.intel.com with ESMTP; 20 Jun 2024 15:53:11 -0700 Received: by tassilo.localdomain (Postfix, from userid 1000) id 1631A3003B6; Thu, 20 Jun 2024 15:53:11 -0700 (PDT) From: Andi Kleen To: Jiaqi Yan Cc: nao.horiguchi@gmail.com, linmiaohe@huawei.com, jane.chu@oracle.com, osalvador@suse.de, muchun.song@linux.dev, akpm@linux-foundation.org, shuah@kernel.org, corbet@lwn.net, rientjes@google.com, duenwen@google.com, fvdl@google.com, linux-mm@kvack.org, linux-kselftest@vger.kernel.org, linux-doc@vger.kernel.org Subject: Re: [PATCH v4 0/4] Userspace controls soft-offline pages In-Reply-To: <20240620184856.600717-1-jiaqiyan@google.com> (Jiaqi Yan's message of "Thu, 20 Jun 2024 18:48:52 +0000") References: <20240620184856.600717-1-jiaqiyan@google.com> Date: Thu, 20 Jun 2024 15:53:11 -0700 Message-ID: <87msnfusyw.fsf@linux.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) MIME-Version: 1.0 Content-Type: text/plain X-Stat-Signature: d16j6ok7iat93xd55xwxj8tgzyjco3qc X-Rspamd-Queue-Id: E09311C0012 X-Rspam-User: X-Rspamd-Server: rspam08 X-HE-Tag: 1718923992-152342 X-HE-Meta: U2FsdGVkX1/b5rHFcHOVB+OnCJXaq66qcHGQjxcXDNYUtChwimDAB8M3cg/UMJZ+XCQj/VYE+fgOEGf5Omo7C9gzQ2SqDgAceF+KI9kFfeaW30MBaaltsiPN1QzZTerh2ZXZmsHiEmEyss6uACg5i4T8Mgg88lEPi6UAon3vmN4LvY4wgJkuZd7ZH8TXlM+5NviMhSkker+6YlX7XtWEkqILF9A/LDyH6JFnLNPrLNgbi609UKI6RVcyaJ/eV3+dRGOiGvZA19lY+9LhNqHPiYx2eWWYKHG5e/GtvYyyED6rPwsUz8oF4VIkHvTIzomJu4jOE10LXpwwaUrillBc0j7teTwdaB0v9NA5b/Eco5mT/rMOS2EXfwW0JGItEkCHQ86fmgqhS1Klpl9lD4YwP8K3YjTQcw3rMRiJHu3JnxbfNW1eL6GxOJ8Y1rIt2zMlBX4nGkBEaw0ELd9If3dp6l3cll90Fq9jNMQIwrvQ4YVFZsES0Fu9cztjnMGRi2wqDMWNYx3/9s8jVlyrchnYaODAWYB5nO9maoDTmYHenhDvsy0NqwidKuWuK8A5zsff8jx3i5DlFx1+TquCpzDeEnsysFbxspovtpGAzJq1eaVt2PtoXv5WVNSl+2n67Y5IJbySKNr1vT0Qrl1Yr4X/3siY0VBL+t07YqwUVGNJr4rmo8mJRijSbkng4Tdtb0WEex9Lvrl6MbDP5J48qd+NIFjtaPN7Wn/pb7VLuenTXIOpRN1V6jIitfD+loT0ZcaROiX4pjRCp9D13DOZLtJkEyuhISx+41Jrs1y1gASS2P2DG8PT4tCm5iMeBgmboznPdnVELNvJpzRAbGnn+NcRzCIKAaNUEt/0zKRRpnvTxlzb9fdzlVr/ZIyz8K+xXdkiS3eObot5PgXsT024rRjY3eGkgpOyPRTLvwuFZDG3CyF2tr9jO3fuPJC+CXI/XJuimRJYffcDyBAHQSk9UV2 HWcpMvXx PYZ6jiDkyG5XEK6QO4R+Z+muiwoyCEKGO9AK4WBVTiQUJdb0mYhwlfsnayRBN7qsEX82qDo0v+iNjGYLfeV24l6VyLUFoJrv1lrfCNTaGR2pVe+7PNbxZldnjAiMjg6zhbOXcU+PDVlIxb+gJIjq0ocRKZFvu9XLSv11zKuodKRWwAXEGDf65LcUcBUda4jE/rd8Sv+IHW2qeVEP+Thna98XMvyOD2gUunbrPqTUynQior0g= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000002, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Jiaqi Yan writes: > Correctable memory errors are very common on servers with large > amount of memory, and are corrected by ECC, but with two > pain points to users: > 1. Correction usually happens on the fly and adds latency overhead > 2. Not-fully-proved theory states excessive correctable memory > errors can develop into uncorrectable memory error. This patchkit is amusing (or maybe sad) because it basically tries to reconstruct the original soft offline design using a user space daemon instead of doing policy badly in the kernel. You can still have it by enabling CONFIG_X86_MCELOG_LEGACY and use http://www.mcelog.org or an equivalent daemon of your chosing that listens to /dev/mcelog. -Andi