From: Nhat Pham <nphamcs@gmail.com>
To: Builder <yshuiv7@gmail.com>
Cc: pedro.falcato@gmail.com, chengming.zhou@linux.dev,
christian@heusel.eu, hannes@cmpxchg.org, linux-mm@kvack.org,
regressions@lists.linux.dev, yosryahmed@google.com
Subject: Re: zswap_writeback_entry crashes in 6.9.5
Date: Tue, 2 Jul 2024 08:28:00 -0700 [thread overview]
Message-ID: <CAKEwX=OCgzPKzCN_bvK9Y6qy1uHVDrHqbCDaYZH1iDpoWFdmgg@mail.gmail.com> (raw)
In-Reply-To: <20240702003343.2756828-1-builder@example.org>
On Mon, Jul 1, 2024 at 5:33 PM Builder <yshuiv7@gmail.com> wrote:
>
> On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
> >
> > Hi everyone,
>
> Hi,
>
> I think I have hit this problem a well. I actually reported this on RedHat's
> bug tracker a while back, along with a couple of stack traces:
>
> https://bugzilla.redhat.com/show_bug.cgi?id=2275252
>
> Reverting the commit I mentioned there seems to make this problem go away for
> me. This is a long shot, but I am curious if it will also fix the problem for
> you.
>
> (Also inserting myself into this thread so I will get updates.)
>
> Regards,
> Yuxuan Shui
This looks like a different issue. The hang-up is one task waiting for
the mutex lock (&acomp_ctx->mutex), whose holder is the other task
that crashes. Looking at that trace in particular, the line that
triggers the BUG_ON call (mm/zswap.c:1395):
BUG_ON(crypto_wait_req(crypto_acomp_decompress(acomp_ctx->req),
&acomp_ctx->wait));
is the compressor failing to decompress the data. This looks like some
sort of memory corruption, and could happen for a lot of reasons - a
zswap bug, a backend allocator bug, a compression library bug, or a
hardware issue that corrupts memory.
If it only happens on 6.8.9 (and not 6.8.5), then it's likely some
changes in between, but I'd be very surprised if the bug somehow comes
from the patch you reverted. If you look at the patch's content, all
it does is essentially handling the case where the shrinker receives a
NULL memcg, by using an alternative source of stats. It could
potentially reveal the problems previously hidden, but definitely not
the cause of those problems itself. I'd recommend that you send a
separate bug report with the build config, steps to reproduce, and
more information about your setup overall (what backend allocator are
you using for zswap - it should be zsmalloc btw, what compression
algorithm you are using, etc.)
prev parent reply other threads:[~2024-07-02 15:28 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-30 17:58 Pedro Falcato
2024-07-01 1:07 ` Nhat Pham
2024-07-19 18:57 ` Pedro Falcato
2024-07-19 20:47 ` Nhat Pham
2024-07-01 3:44 ` Chengming Zhou
2024-07-02 0:33 ` Builder
2024-07-02 15:28 ` Nhat Pham [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAKEwX=OCgzPKzCN_bvK9Y6qy1uHVDrHqbCDaYZH1iDpoWFdmgg@mail.gmail.com' \
--to=nphamcs@gmail.com \
--cc=chengming.zhou@linux.dev \
--cc=christian@heusel.eu \
--cc=hannes@cmpxchg.org \
--cc=linux-mm@kvack.org \
--cc=pedro.falcato@gmail.com \
--cc=regressions@lists.linux.dev \
--cc=yosryahmed@google.com \
--cc=yshuiv7@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox