From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,HTML_MESSAGE,MAILING_LIST_MULTI, MIME_QP_LONG_LINE,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 7A760CA9EA0 for ; Fri, 18 Oct 2019 11:56:13 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 3945320700 for ; Fri, 18 Oct 2019 11:56:13 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=lca.pw header.i=@lca.pw header.b="qX8ZrCPc" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 3945320700 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=lca.pw Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id D702A8E0006; Fri, 18 Oct 2019 07:56:12 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id D1F6D8E0003; Fri, 18 Oct 2019 07:56:12 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id C0E4F8E0006; Fri, 18 Oct 2019 07:56:12 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0106.hostedemail.com [216.40.44.106]) by kanga.kvack.org (Postfix) with ESMTP id A3D208E0003 for ; Fri, 18 Oct 2019 07:56:12 -0400 (EDT) Received: from smtpin22.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with SMTP id 45CE882F9B3A for ; Fri, 18 Oct 2019 11:56:12 +0000 (UTC) X-FDA: 76056752184.22.egg32_7050968437c50 X-HE-Tag: egg32_7050968437c50 X-Filterd-Recvd-Size: 5877 Received: from mail-qt1-f171.google.com (mail-qt1-f171.google.com [209.85.160.171]) by imf15.hostedemail.com (Postfix) with ESMTP for ; Fri, 18 Oct 2019 11:56:11 +0000 (UTC) Received: by mail-qt1-f171.google.com with SMTP id n17so8658204qtr.4 for ; Fri, 18 Oct 2019 04:56:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lca.pw; s=google; h=content-transfer-encoding:from:mime-version:subject:date:message-id :references:cc:in-reply-to:to; bh=rzVYY4NIq4rjYDqdi/uuDgiVoO4ba7zvsJGcg4tzCiA=; b=qX8ZrCPcJ9gl21utDSnPHo4KN7vgOr5BGvYvVH0XPKqr1JjMDz2+Dd4B3AOcFHe7AU 7qCV5rIWcZ/I7shqZY5VQcCmWfv+/dJXANvp/77fcDaJZo9583HnBQmKfjuKhAv0ffWN JhvLOdRv12iPws9QRv2Wq8+Ppn/rw4AqiPJY+FrdkgI4EGIF5JIsYQIGGbZIcFz+/OYj hpF1Ce2VowREWySBasyaAYe2M2QxaddjMlU7r/HDknZFmwURvzS02YAaQZlPbGH0FjRH PrH2jl3qgrJl+EsDwFL0/kafznh3qOzQOa84PO5kZaSdHduZZuNphu+ViqTyKhlKhKEY VQIw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:content-transfer-encoding:from:mime-version :subject:date:message-id:references:cc:in-reply-to:to; bh=rzVYY4NIq4rjYDqdi/uuDgiVoO4ba7zvsJGcg4tzCiA=; b=PqZi/QizhrJ1/ogzAj9BH8LTKTKB3oVnqDAoW4Jqn4L5dBxixWdcvQPgkcoqZ3Mt+M Nb8nJYR5HI5+PKF4llyS3MeJdKQhubSeI35Yf871gRCLADaw2rp7+AYC8DGqM0F/kNzV 8v+qPctsfd6GqGiUiyjnrFs2OQ2xzPeG2LoA8jIfdtwIRfQSaX15cwalABnVj0KhjCpw tCxQTnkma9C3LV0gOvhqYFwSl9CbvQL5rCYrORsC1IvMKcH8hNzkFs5bzLS6/GJO0f8O RG78h5zqRZCqZ628BdiuwMVEVlwMA1owyzt2WKzyruQ1scTpxyhQQ+P2t7bFEzsKZiHj dQZw== X-Gm-Message-State: APjAAAUIlww+VR7oDMvwGLU+0D7D3AqNZKDzPrQqz5bYT7TlJk0rsY9I JlSByTSbk4ff+9tpimAgVdSnXTlyVA8mRw== X-Google-Smtp-Source: APXvYqw7EmS9df5vMB8uJF9ZvX8ZA7iKwULYjSBxZ/o5FdgoMBAlY2XegaxP5mjI3bjhHv1tskQ6Aw== X-Received: by 2002:ac8:7447:: with SMTP id h7mr7947116qtr.11.1571399770890; Fri, 18 Oct 2019 04:56:10 -0700 (PDT) Received: from [192.168.1.183] (pool-71-184-117-43.bstnma.fios.verizon.net. [71.184.117.43]) by smtp.gmail.com with ESMTPSA id c18sm2472469qkk.17.2019.10.18.04.56.09 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 18 Oct 2019 04:56:09 -0700 (PDT) Content-Type: multipart/alternative; boundary=Apple-Mail-70297ABC-A6F6-40C7-AFFD-8927912793E9 Content-Transfer-Encoding: 7bit From: Qian Cai Mime-Version: 1.0 (1.0) Subject: Re: memory offline infinite loop after soft offline Date: Fri, 18 Oct 2019 07:56:09 -0400 Message-Id: <64DC81FB-C1D2-44F2-981F-C6F766124B91@lca.pw> References: <20191018063222.GA15406@hori.linux.bs1.fc.nec.co.jp> Cc: Michal Hocko , "linux-kernel@vger.kernel.org" , "linux-mm@kvack.org" , David Hildenbrand , Mike Kravetz In-Reply-To: <20191018063222.GA15406@hori.linux.bs1.fc.nec.co.jp> To: Naoya Horiguchi X-Mailer: iPhone Mail (17A878) X-Bogosity: Ham, tests=bogofilter, spamicity=0.000554, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: --Apple-Mail-70297ABC-A6F6-40C7-AFFD-8927912793E9 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable > On Oct 18, 2019, at 2:35 AM, Naoya Horiguchi w= rote: >=20 > You're right, then I don't see how this happens. If the error hugepage was= > isolated without having PG_hwpoison set, it's unexpected and problematic. > I'm testing myself with v5.4-rc2 (simply ran move_pages12 and did hotremov= e/hotadd) > but don't reproduce the issue yet. Do we need specific kernel version/con= fig > to trigger this? This is reproducible on linux-next with the config. Not sure if it is reprod= ucible on x86. https://raw.githubusercontent.com/cailca/linux-mm/master/powerpc.config and kernel cmdline if that matters page_poison=3Don page_owner=3Don numa_balancing=3Denable \ systemd.unified_cgroup_hierarchy=3D1 debug_guardpage_minorder=3D1 \ page_alloc.shuffle=3D1 BTW, where does the code set PG_hwpoison for the head page?= --Apple-Mail-70297ABC-A6F6-40C7-AFFD-8927912793E9 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable


On Oct 18, 2019, at 2:35 AM, Naoya Horiguchi &= lt;n-horiguchi@ah.jp.nec.com> wrote:

You're right, then I don't see how th= is happens. If the error hugepage was
isolated without havin= g PG_hwpoison set, it's unexpected and problematic.
I'm test= ing myself with v5.4-rc2 (simply ran move_pages12 and did hotremove/hotadd)<= /span>
but don't reproduce the issue yet.  Do we need specific= kernel version/config
to trigger this?

This is reproducible on linux-next with the config. Not sure i= f it is reproducible on x86.


=
and kernel cmdline if that matters

= page_poison=3Don page_owner=3Don numa_balancing=3Denable \
systemd= .unified_cgroup_hierarchy=3D1 debug_guardpage_minorder=3D1 \
page_= alloc.shuffle=3D1

BTW, where does the code se= t PG_hwpoison for the head page?
= --Apple-Mail-70297ABC-A6F6-40C7-AFFD-8927912793E9--