Subject: Re: Known and unfixed active data loss bug in MM + XFS with large folios since Dec 2021 (any kernel from 6.1 upwards)
From: Christian Theune
Date: Tue, 1 Oct 2024 09:54:56 +0200
To: Chris Mason
Cc: Linus Torvalds, Dave Chinner, Matthew Wilcox, Jens Axboe, linux-mm@kvack.org, linux-xfs@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Daniel Dao, regressions@lists.linux.dev, regressions@leemhuis.info

> On 1. Oct 2024, at 02:56, Chris Mason wrote:
>
> I've attached a minimal version of a script we use here to show all the
> D state processes, it might help explain things.
> The only problem is
> you have to actually ssh to the box and run it when you're stuck.

Thanks, I’ll dig into this next week when I’m back from vacation.

I can set up alerts when this happens and hope that I’ll be fast enough, as the situation does seem to resolve itself at some point. It’s happened quite a bit in the fleet, so I guess I should be able to catch it.

Christian

-- 
Christian Theune · ct@flyingcircus.io · +49 345 219401 0
Flying Circus Internet Operations GmbH · https://flyingcircus.io
Leipziger Str. 70/71 · 06108 Halle (Saale) · Germany
HR Stendal HRB 21169 · Managing directors: Christian Theune, Christian Zagrodnick
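
The script attached to the quoted message is not reproduced in this mail. As a rough illustration of the idea only, and not that script, the sketch below lists processes in uninterruptible sleep (state D) by reading /proc/<pid>/stat on Linux. The function names parse_stat and d_state_tasks are made up for this example, and it looks at thread group leaders only, not at the individual threads under /proc/<pid>/task.

```python
import os


def parse_stat(text):
    """Split the contents of /proc/<pid>/stat into (pid, comm, state)."""
    # comm is wrapped in parentheses and may itself contain spaces or
    # parentheses, so split on the first '(' and the last ')'.
    lparen = text.index("(")
    rparen = text.rindex(")")
    pid = int(text[:lparen].strip())
    comm = text[lparen + 1:rparen]
    state = text[rparen + 1:].split()[0]
    return pid, comm, state


def d_state_tasks(proc_root="/proc"):
    """Return a sorted list of (pid, comm) for processes in state D."""
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "self" or "meminfo"
        try:
            with open(os.path.join(proc_root, entry, "stat")) as fh:
                pid, comm, state = parse_stat(fh.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited in the meantime, or stat was unreadable
        if state == "D":
            found.append((pid, comm))
    return sorted(found)


if __name__ == "__main__":
    for pid, comm in d_state_tasks():
        print(pid, comm)
```

Since the stall reportedly clears on its own, something like this would have to be triggered by the alert itself rather than run by hand after logging in.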