From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.1 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id CE7BCC433B4 for ; Tue, 18 May 2021 14:06:02 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 5051F61185 for ; Tue, 18 May 2021 14:06:02 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 5051F61185 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=redhat.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id D16578E0032; Tue, 18 May 2021 10:06:01 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id CC64C8E002F; Tue, 18 May 2021 10:06:01 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id B19398E0032; Tue, 18 May 2021 10:06:01 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0248.hostedemail.com [216.40.44.248]) by kanga.kvack.org (Postfix) with ESMTP id 7C8D08E002F for ; Tue, 18 May 2021 10:06:01 -0400 (EDT) Received: from smtpin38.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay01.hostedemail.com (Postfix) with ESMTP id 197E8180AD838 for ; Tue, 18 May 2021 14:06:01 +0000 (UTC) X-FDA: 78154525722.38.580FF71 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [216.205.24.124]) by imf23.hostedemail.com (Postfix) with ESMTP id 4F2A0A000185 for ; Tue, 18 May 2021 14:05:58 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1621346759; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=1VWiRGZdAQGUDTFZZ3enI5ayz3J4+zWlHkjYyjU21MY=; b=OIHHBfODG49ZsRgm6HvRvr6KL10KFdbXHzJOivLPAlbuH5oFy63rrLfRnbErdYjQVegjQx Lnji02Lzi2odU9Gok8f962Un2ONPEK+JdpwEb1gKYtqwnxNq9TuO7MtagfAtRcXkvGdhKl jW8hSvERTrHVI6baAIiV2WilKzJF7Sk= Received: from mail-wm1-f72.google.com (mail-wm1-f72.google.com [209.85.128.72]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-29-XVPTKmzDOguo9ZPh4EsVOA-1; Tue, 18 May 2021 10:05:57 -0400 X-MC-Unique: XVPTKmzDOguo9ZPh4EsVOA-1 Received: by mail-wm1-f72.google.com with SMTP id v2-20020a7bcb420000b0290146b609814dso314705wmj.0 for ; Tue, 18 May 2021 07:05:57 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=hwF9LeDBndbc4TKf08dL36rjBtWuzIAPQdyuiQwz9l0=; b=NWt1hmPXbdCx3WqgBpTyiz+O2GWBiJTJHPueLAMP7qHcUg/k+38iBmBc7hswVUUQ2V 2KIY7plBJdhYNQ+EyTyFh4Cxe13km8vIDjO0Q7SP+jKNMyymBRbS87RxsjevWvjvB2Vx v75BUadVK5t/fQTwNSvgEyZ3I1io7AyVUxorggvCifBphu8Hha7R+tplI24yzqu65SUt lfrfSUO9kKtSVmi6IBDMO07ZQI7fmB8Ydv3GvcwZjcmcZXhTFQ6jEplLeluhbZwBeA2D Mo8t/5MfKETp7vyljil7Wj0bROR9ZtuLf3gdB/A3k3/b3Bri+PaVzcmrEqAqwdqtkSsU rHRQ== X-Gm-Message-State: AOAM531r/f4C+s8vNauDxugZvqgn6m6ZdbEBUGtRlFUT/Xatx+K8qetw hKnl3yjCi4hwe5thNSn+opz+D6iOhZmd/mCplpffplfd3/9aMWRBoyCSYpDKeMjY6bcYB+fDbRU tkb7hxGN7Vw== X-Received: by 2002:a5d:4b08:: with SMTP id v8mr7352463wrq.122.1621346756526; Tue, 18 May 2021 07:05:56 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyThmqOZjeIy3KQ28wEzYYlXG8qZbWV69joGSKqboCqYIqvYg8Pr78tyqaPsFc5dn36E6BTsQ== X-Received: by 2002:a5d:4b08:: with SMTP id v8mr7352437wrq.122.1621346756332; Tue, 18 May 2021 07:05:56 -0700 (PDT) Received: from localhost (cpc111743-lutn13-2-0-cust979.9-3.cable.virginm.net. [82.17.115.212]) by smtp.gmail.com with ESMTPSA id y14sm2729448wmj.37.2021.05.18.07.05.55 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 18 May 2021 07:05:55 -0700 (PDT) Date: Tue, 18 May 2021 15:05:54 +0100 From: Aaron Tomlin To: Michal Hocko Cc: linux-mm@kvack.org, akpm@linux-foundation.org, linux-kernel@vger.kernel.org, Vlastimil Babka Subject: Re: [PATCH] mm/page_alloc: try oom if reclaim is unable to make forward progress Message-ID: <20210518140554.dwan66i4ttmzw4hj@ava.usersys.com> X-PGP-Key: http://pgp.mit.edu/pks/lookup?search=atomlin%40redhat.com X-PGP-Fingerprint: 7906 84EB FA8A 9638 8D1E 6E9B E2DE 9658 19CC 77D6 References: <20210315165837.789593-1-atomlin@redhat.com> <20210319172901.cror2u53b7caws3a@ava.usersys.com> <20210325210159.r565fvfitoqeuykp@ava.usersys.com> <20210326112254.jy5jkiwtgj3pqkt2@ava.usersys.com> MIME-Version: 1.0 In-Reply-To: X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="4oj4tq5xtvq4w6dd" Content-Disposition: inline Authentication-Results: imf23.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=OIHHBfOD; dmarc=pass (policy=none) header.from=redhat.com; spf=none (imf23.hostedemail.com: domain of atomlin@redhat.com has no SPF policy when checking 216.205.24.124) smtp.mailfrom=atomlin@redhat.com X-Rspamd-Server: rspam05 X-Rspamd-Queue-Id: 4F2A0A000185 X-Stat-Signature: sgju36a94h8bmee95eb5cuf7cimh9bbs X-HE-Tag: 1621346758-811188 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: --4oj4tq5xtvq4w6dd Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Michal, On Fri 2021-03-26 16:36 +0100, Michal Hocko wrote: > OK, I kinda expected this would be not easily reproducible. Unfortunately, I'm still waiting for feedback on this. > We should be focusing on the compaction retry logic and see whether we > can have some "run away" scenarios there. Seeing so many retries without > compaction bailing out sounds like a bug in that retry logic. I suspect so. This is indeed a case of excessive reclaim/compaction retries (i.e. the last known value stored in the no_progress_loops variable was 31,611,688). What might be particularly unique about this situation is that a fatal signal was found pending. In this context, if I understand correctly, it does not make sense to retry compaction when the last known compact result was skipped and a fatal signal is pending. Looking at try_to_compact_pages(), indeed COMPACT_SKIPPED can be returned; albeit, not every zone, on the zone list, would be considered in the case a fatal signal is found to be pending. Yet, in should_compact_retry(), given the last known compaction result, each zone, on the zone list, can be considered/or checked (see compaction_zonelist_suitable()). If a zone e.g. was found to succeed then reclaim/compaction would be tried again (notwithstanding the above). --=20 Aaron Tomlin --4oj4tq5xtvq4w6dd Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAABCAAdFiEEeQaE6/qKljiNHm6b4t6WWBnMd9YFAmCjybAACgkQ4t6WWBnM d9a+MBAAmH76h+vl0MTKp+BW4YxqXL9f0lULDieShMJQys2khOOQSuRcnFIsuCM3 fg2AuDFB8SgKY++gws6+ardG9i6jtNRFEczsK0ZFwwvUzxo5TOZ78EE8/PaMeAcv z5o2pAGdSBARUGgBzNVzur60uGekXjiq2nG3shuU1BtgUiki1M3xXfzHFofvbrM7 LPzIdnXUxmGVsMORH0raYkSdZNbP5xY2lKHCoUu/NpkVsqcRENNRwULgFIiDKexJ tRG1eOjDuJ0R7j5jTOIlsDtIuSdg2YAUEE1YNqt+05LvF7RjuRcKedlaelirw3Wv cuNTxFw0YBj8xLxcIBPzayzF0w480OAZg9jkC8RbRI/1scd5tp3GiEVPawiwFcHH GLg9FP2Y7qRDCymy2kurpW5xHnAj5gX0WvrApBXmTUR1eowwy27iRu39lvqN4k8P LAXSpk+YmQm231YuSv4qn9wGoIrzBwTWISMZL8vUW0oJ+TikSqEzenptOBm2HSQd U++Jyv8+ChbiiZan+spabF8RsOzKip0afwMNqzaOfklJ8DMy7PiCw6AAwI304LGj Bua6ba3UzY2/NXjJRqnNNWBUg2i50nm9YsuS4h5sLbWwaKimG6TAHnayeoBUdCgP 45lMlrofia3baPxitQfsPbc51N37T/y1uQLZU4ui42EhCKB8bw0= =onqo -----END PGP SIGNATURE----- --4oj4tq5xtvq4w6dd--