linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Mel Gorman <mgorman@techsingularity.net>
To: Qian Cai <cai@lca.pw>
Cc: Daniel Jordan <daniel.m.jordan@oracle.com>,
	Mikhail Gavrilov <mikhail.v.gavrilov@gmail.com>,
	linux-mm@kvack.org, vbabka@suse.cz
Subject: Re: kernel BUG at include/linux/mm.h:1020!
Date: Wed, 20 Mar 2019 14:20:21 +0000	[thread overview]
Message-ID: <20190320142021.k4z6njs2kacdip3k@techsingularity.net> (raw)
In-Reply-To: <1553022891.26196.7.camel@lca.pw>

On Tue, Mar 19, 2019 at 03:14:51PM -0400, Qian Cai wrote:
> On Sun, 2019-03-17 at 15:22 +0000, Mel Gorman wrote:
> > On Fri, Mar 15, 2019 at 04:58:27PM -0400, Daniel Jordan wrote:
> > > On Tue, Mar 12, 2019 at 10:55:27PM +0500, Mikhail Gavrilov wrote:
> > > > Hi folks.
> > > > I am observed kernel panic after updated to git commit 610cd4eadec4.
> > > > I am did not make git bisect because this crashes occurs spontaneously
> > > > and I not have exactly instruction how reproduce it.
> > > > 
> > > > Hope backtrace below could help understand how fix it:
> > > > 
> > > > page:ffffef46607ce000 is uninitialized and poisoned
> > > > raw: ffffffffffffffff ffffffffffffffff ffffffffffffffff ffffffffffffffff
> > > > raw: ffffffffffffffff ffffffffffffffff ffffffffffffffff ffffffffffffffff
> > > > page dumped because: VM_BUG_ON_PAGE(PagePoisoned(p))
> > > > ------------[ cut here ]------------
> > > > kernel BUG at include/linux/mm.h:1020!
> > > > invalid opcode: 0000 [#1] SMP NOPTI
> > > > CPU: 1 PID: 118 Comm: kswapd0 Tainted: G         C
> > > > 5.1.0-0.rc0.git4.1.fc31.x86_64 #1
> > > > Hardware name: System manufacturer System Product Name/ROG STRIX
> > > > X470-I GAMING, BIOS 1201 12/07/2018
> > > > RIP: 0010:__reset_isolation_pfn+0x244/0x2b0
> > > 
> > > This is new code, from e332f741a8dd1 ("mm, compaction: be selective about
> > > what
> > > pageblocks to clear skip hints"), so I added some folks.
> > > 
> > 
> > I'm travelling at the moment and only online intermittently but I think
> > it's worth noting that the check being tripped is during a call to
> > page_zone() that also happened before the patch was merged too. I don't
> > think it's a new check as such. I haven't been able to isolate a source
> > of corruption in the series yet and suspected in at least one case that
> > there is another source of corruption that is causing unrelated
> > subsystems to trip over.
> > 
> 
> So reverting this patch on the top of the mainline fixed the memory corruption
> for me or at least make it way much harder to reproduce.
> 
> dbe2d4e4f12e ("mm, compaction: round-robin the order while searching the free
> lists for a target")
> 

Ok, thanks for that. I'm just about to fly and didn't reexamine the
patch in detail. I'll review again and see if there are cases where
order goes negative which would lead to improper accesses when I get
back online properly. It's possible that next_search_order() is ending
up with negative values because of assumptions made about the value of
cc->order.


  parent reply	other threads:[~2019-03-20 14:20 UTC|newest]

Thread overview: 33+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-03-12 17:55 Mikhail Gavrilov
2019-03-15 20:58 ` Daniel Jordan
2019-03-15 21:34   ` Qian Cai
2019-03-17 15:22   ` Mel Gorman
2019-03-19 19:14     ` Qian Cai
2019-03-19 19:27       ` Pavel Tatashin
2019-03-19 19:35         ` Qian Cai
2019-03-19 23:13           ` Pavel Tatashin
2019-03-19 23:26             ` Qian Cai
2019-03-20 14:20       ` Mel Gorman [this message]
2019-03-20 21:50   ` Mikhail Gavrilov
2019-03-21  5:39     ` Mikhail Gavrilov
2019-03-21 13:21       ` Qian Cai
2019-03-21 15:08         ` Mikhail Gavrilov
2019-03-21 15:48           ` Qian Cai
2019-03-21 18:57             ` Mikhail Gavrilov
2019-03-21 19:14               ` Qian Cai
2019-03-22  3:41                 ` Mikhail Gavrilov
2019-03-22 13:43                   ` Qian Cai
2019-03-22 11:15       ` Mel Gorman
2019-03-23  4:40         ` Mikhail Gavrilov
2019-03-25 10:58           ` Mel Gorman
2019-03-25 16:06             ` Mikhail Gavrilov
2019-03-25 20:31               ` Mel Gorman
2019-03-26  4:03                 ` Mikhail Gavrilov
2019-03-26 12:03                   ` Mel Gorman
2019-03-27  3:57                     ` Mikhail Gavrilov
2019-03-27  8:54                       ` Mel Gorman
2019-03-22  7:39 ` Oscar Salvador
2019-03-22  7:54   ` Mikhail Gavrilov
2019-03-22  8:55     ` Oscar Salvador
2019-03-22  8:56       ` Oscar Salvador
2019-03-22 17:49         ` Mikhail Gavrilov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20190320142021.k4z6njs2kacdip3k@techsingularity.net \
    --to=mgorman@techsingularity.net \
    --cc=cai@lca.pw \
    --cc=daniel.m.jordan@oracle.com \
    --cc=linux-mm@kvack.org \
    --cc=mikhail.v.gavrilov@gmail.com \
    --cc=vbabka@suse.cz \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox