From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 16F93C4167B for ; Mon, 30 Oct 2023 14:09:09 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 845ED6B01FB; Mon, 30 Oct 2023 10:09:09 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 7CD856B0200; Mon, 30 Oct 2023 10:09:09 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 66E1A6B0201; Mon, 30 Oct 2023 10:09:09 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0014.hostedemail.com [216.40.44.14]) by kanga.kvack.org (Postfix) with ESMTP id 548386B01FB for ; Mon, 30 Oct 2023 10:09:09 -0400 (EDT) Received: from smtpin25.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay03.hostedemail.com (Postfix) with ESMTP id E15EFA04E1 for ; Mon, 30 Oct 2023 14:09:07 +0000 (UTC) X-FDA: 81402309534.25.6D9906F Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by imf23.hostedemail.com (Postfix) with ESMTP id ED4C1140022 for ; Mon, 30 Oct 2023 14:09:05 +0000 (UTC) Authentication-Results: imf23.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=cCXY2F4C; dmarc=pass (policy=none) header.from=redhat.com; spf=pass (imf23.hostedemail.com: domain of mpatocka@redhat.com designates 170.10.129.124 as permitted sender) smtp.mailfrom=mpatocka@redhat.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1698674946; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=dOFRZkTKK9Z0bMd9xnhKPQO2WNoW7oCCTYa8Hwz84I8=; b=UNDHgtIgeOfscPSxyGIQJDw0YBZFcVTg1Su+Qsb9/wgaxou7PCNUwWN3MUsyapRQ5J1s9Z 4xov1wvT1NgI3eszklaj75ww53g7bEnDIxNCVkgIstQEpzEmKXuusLaX8J7kbPmIwpAoDi Puk+s/IEs6QMC3pxMrufekYOh/JdjKQ= ARC-Authentication-Results: i=1; imf23.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=cCXY2F4C; dmarc=pass (policy=none) header.from=redhat.com; spf=pass (imf23.hostedemail.com: domain of mpatocka@redhat.com designates 170.10.129.124 as permitted sender) smtp.mailfrom=mpatocka@redhat.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1698674946; a=rsa-sha256; cv=none; b=YvY7v6L4rKc/OLkpc3B2I8ItJzZxdW3UYOAkTWkTsnR/LTP56nLiVLxNlNkNoWELF70v7k K0UrABP3LsuGESrWE36m7cGjeguCgpE11psZX0u0fA7Zvtr91R/dlZsNivVy/9lUygSzVm gW1Qq4L1doa+D+XpyS62Lko4uTGUZuc= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1698674945; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=dOFRZkTKK9Z0bMd9xnhKPQO2WNoW7oCCTYa8Hwz84I8=; b=cCXY2F4ChMAgawXxVoSJ9zmzBFy3qK3ZLKyDUIbzsiHW7f+9PquXIJM26Wh8TSCwfIVUVi 7LlMlHyxB9KL88gRsE8znHgUxrV3akOJvb/qadLBVBBjk0KULRiOFqwBRSzeeIgwTmVGRJ j8zzxQycKuUjzko+m0le5aBdS//2+m4= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-627-j40fAQMlM6aMhm1JptNM3g-1; Mon, 30 Oct 2023 10:08:57 -0400 X-MC-Unique: j40fAQMlM6aMhm1JptNM3g-1 Received: from smtp.corp.redhat.com (int-mx07.intmail.prod.int.rdu2.redhat.com [10.11.54.7]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 3AC63185A781; Mon, 30 Oct 2023 14:08:56 +0000 (UTC) Received: from file1-rdu.file-001.prod.rdu2.dc.redhat.com (unknown [10.11.5.21]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 2A2621C060AE; Mon, 30 Oct 2023 14:08:56 +0000 (UTC) Received: by file1-rdu.file-001.prod.rdu2.dc.redhat.com (Postfix, from userid 12668) id 1763530C72AB; Mon, 30 Oct 2023 14:08:56 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by file1-rdu.file-001.prod.rdu2.dc.redhat.com (Postfix) with ESMTP id 148423D99F; Mon, 30 Oct 2023 15:08:56 +0100 (CET) Date: Mon, 30 Oct 2023 15:08:56 +0100 (CET) From: Mikulas Patocka To: =?ISO-8859-15?Q?Marek_Marczykowski-G=F3recki?= cc: Jan Kara , Vlastimil Babka , Andrew Morton , Matthew Wilcox , Michal Hocko , stable@vger.kernel.org, regressions@lists.linux.dev, Alasdair Kergon , Mike Snitzer , dm-devel@lists.linux.dev, linux-mm@kvack.org Subject: Re: Intermittent storage (dm-crypt?) freeze - regression 6.4->6.5 In-Reply-To: Message-ID: References: <89320668-67a2-2a41-e577-a2f561e3dfdd@suse.cz> <818a23f2-c242-1c51-232d-d479c3bcbb6@redhat.com> <18a38935-3031-1f35-bc36-40406e2e6fd2@suse.cz> <3514c87f-c87f-f91f-ca90-1616428f6317@redhat.com> <1a47fa28-3968-51df-5b0b-a19c675cc289@suse.cz> <20231030122513.6gds75hxd65gu747@quack3> MIME-Version: 1.0 Content-Type: multipart/mixed; BOUNDARY="185210117-1250732461-1698674809=:1161929" Content-ID: X-Scanned-By: MIMEDefang 3.4.1 on 10.11.54.7 X-Rspam-User: X-Stat-Signature: f8syytdtakgeupwne4tdkf98ok5zn8d5 X-Rspamd-Server: rspam07 X-Rspamd-Queue-Id: ED4C1140022 X-HE-Tag: 1698674945-799202 X-HE-Meta: U2FsdGVkX18pXkyepYIsA2p1jr31bjSwY6/bI96zopeu/G0g26f9bO8Do5AnLJJYBdwpibH+r/LSC1cXYl9380vRghgerdmhMZB4nHgTyDq6valDUGcNnaO0eviSygAl34bAe9iIi7SjD+LdUGNNZ7SygHj3IVM7ulMfooioijahSmOb28mT0tHW/dqNrEgDqT1J/1V3SK6gIl4Bm4RTkkN9w1e/jsVV7fDAta1HwwkKjHxorZePC+WadCgE6PHzeA0Q+/UIHnzsX5q5747JN6sIeuwnoxEu0epn9ChBe1mvWRa3YRaShQOli/jtLmHI0Q4YLbshub1eDXGNsUcooDyDcoXyPYm2UZgCZQnfbPaJBJYOhR2VGtXLXLP6LocEbufXL03tRzfJ3NROLLKd8GrM9e92D3WJL9Rwxi6It1y7vqG/RSdnTrvpg6/ubhjKgubTESfUPGVtVwhZp0ii3F/0eLqZv587IdyCVO79qS0KMwNzQEWLclIG0GHpXmrOcRrrf+8NfY/QwJDGieD6TvNoyCtTxddHrQZY8EBQVuwZn/Vt3OUx7P9aZnRwhyBs7nmn+kkmH1xBUvfIuaAMtBV1TIOuconFoN8iw60yuJV/VRDfxpXzBJHzzQHAypB2+ML5ucQfk+ffSWMB/lXw+PK76cRTWlEBI3QzGzHem788U3tRjiqXb7+9RpVJ19OEiYxnKI7AZ4DS42znMFhkvbvg7FYVz8Bqmw7Z4ZgVjF6sIrjHgkWK/xY94if9tbMolpVcklWBPwzccd6oBpiO7Q+HhiStmtQZ17E3comBJE4uBIGpTJzXJuMsVBfUJ6SK7ezkA+yIFOpAbROvWgGGnPrRVkD2bpW/w9zJBkk5kIhZW/fZ+1/zy3IxSJfvK7zy+bau1GLhyxzIWAAKLBu4SO4SgTRSvEb4ZsSNqfsj2j0iNr6eGKPBr0bpgwNJmMMRIUODtLwYI3MWE1EG8AU m1EsJJVL 8eB+Memnls0d91sHsdUVqgozJ24wLofdGO0bJZLHm1BC9qvQQf1uMnVw/Vxek4u7PBAWn4gjGsf0Rzgykt55yv8SdVyjh6RpINkn7DKLT8QcpzU9Qo84MzbJMAoutd927EXSXKVvgEPFV/ByVNy1ywo8cpkmhTHZlg28XWGygC3skVVWUCkAh6j7fDd2x6Q/cTCeZkGdquJQwlTx99VMYgPvS5SkXOYbOO1iszm2SHCOWwQxks3BLV1FqBN8etnQm8Zzw5n7dsBF2M3N+Ei6iQwDDAA== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. --185210117-1250732461-1698674809=:1161929 Content-Type: text/plain; CHARSET=ISO-8859-15 Content-Transfer-Encoding: 8BIT Content-ID: <2e223827-e6b-47b8-d026-66c8d92466bd@redhat.com> On Mon, 30 Oct 2023, Marek Marczykowski-Górecki wrote: > > Well, it would be possible that larger pages in a bio would trip e.g. bio > > splitting due to maximum segment size the disk supports (which can be e.g. > > 0xffff) and that upsets something somewhere. But this is pure > > speculation. We definitely need more debug data to be able to tell more. > > I can collect more info, but I need some guidance how :) Some patch > adding extra debug messages? > Note I collect those via serial console (writing to disk doesn't work > when it freezes), and that has some limits in the amount of data I can > extract especially when printed quickly. For example sysrq-t is too much. > Or maybe there is some trick to it, like increasing log_bug_len? If you can do more tests, I would suggest this: We already know that it works with order 3 and doesn't work with order 4. So, in the file include/linux/mmzone.h, change PAGE_ALLOC_COSTLY_ORDER from 3 to 4 and in the file drivers/md/dm-crypt.c leave "unsigned int order = PAGE_ALLOC_COSTLY_ORDER" there. Does it deadlock or not? So, that we can see whether the deadlock depends on PAGE_ALLOC_COSTLY_ORDER or whether it is just a coincidence. Mikulas --185210117-1250732461-1698674809=:1161929--