From: Newsmails <newsmails@netcourrier.com>
To: Vlastimil Babka <vbabka@suse.cz>,
Andrew Morton <akpm@linux-foundation.org>
Cc: bugzilla-daemon@bugzilla.kernel.org, linux-mm@kvack.org,
Song Liu <songliubraving@fb.com>,
"Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
Johannes Weiner <hannes@cmpxchg.org>,
Oleg Nesterov <oleg@redhat.com>, Pavel Machek <pavel@ucw.cz>
Subject: Re: [Bug 210031] New: unable to handle page fault for address - EIP: khugepaged
Date: Mon, 9 Nov 2020 12:47:26 +0100 [thread overview]
Message-ID: <505ed020-6dba-46d3-9b49-4d97900c07dc@netcourrier.com> (raw)
In-Reply-To: <38cfc9be-e5ec-8ed2-87ce-4e877d5bf952@suse.cz>
On 11/4/20 1:45 PM, Vlastimil Babka wrote:
> On 11/4/20 1:23 PM, Newsmails wrote:
>>
>>
>> On 11/4/20 1:14 PM, Vlastimil Babka wrote:
>>> On 11/4/20 1:17 AM, Andrew Morton wrote:
>>>> (switched to email. Please respond via emailed reply-to-all, not
>>>> via the
>>>> bugzilla web interface).
>>>>
>>>>
>>>> On Tue, 03 Nov 2020 20:00:58 +0000
>>>> bugzilla-daemon@bugzilla.kernel.org wrote:
>>>>
>>>>> https://bugzilla.kernel.org/show_bug.cgi?id=210031
>>>>>
>>>>> Bug ID: 210031
>>>>> Summary: unable to handle page fault for address - EIP:
>>>>> khugepaged
>>>>> Product: Memory Management
>>>>> Version: 2.5
>>>>> Kernel Version: 5.9.1
>>>>> Hardware: All
>>>>> OS: Linux
>>>>> Tree: Mainline
>>>>> Status: NEW
>>>>> Severity: normal
>>>>> Priority: P1
>>>>> Component: Other
>>>>> Assignee: akpm@linux-foundation.org
>>>>> Reporter: newsmails@netcourrier.com
>>>>> Regression: No
>>>>
>>>> Thanks. That's a strange looking trace. I'll optimistically cc some
>>>> people who have been working in that area lately.
>>>>
>>>> What caused this kernel to be tainted?
>>>>
>>>>
>>>>> laptop Skylake i915 Distribution : slackware 14.2 32 bits
>>>>>
>>>>> Oct 23 17:38:22 linuxp kernel: [141330.499234] BUG: unable to
>>>>> handle page fault
>>>>> for address: 021d202d
>>>>> Oct 23 17:38:22 linuxp kernel: [141330.499245] #PF: supervisor
>>>>> read access in
>>>>> kernel mode
>>>>> Oct 23 17:38:22 linuxp kernel: [141330.499250] #PF:
>>>>> error_code(0x0000) -
>>>>> not-present page
>>>>> Oct 23 17:38:22 linuxp kernel: [141330.499265] Oops: 0000 [#2] SMP
>>>>> PTI
>>>
>>> #2 means this is not the first oops. Do you have the very first?
>>>
>> Yes sorry.
>> It was a resume too as you will see with the time.
>> For oct 23 17:38 it is a resume too i think : i think that I hibernated
>> and i forgot to look at something so i resumed.
>
> It's always 021d202d (3 times) and always where a vma might be
> accessed (a /proc file, khugepaged(), acct_collect()) so I would
> assume a struct vma was corrupted in the hibernate/resume process.
> Could be also firmware related AFAIK and there's I taint flag which
> means some buggy firmware workaround is in effect.
It seem's that you are right concerning the buggy firmware. Here are
lines at start in syslog :
[Firmware Bug]: TSC_DEADLINE disabled due to Errata; please update
microcode to version: 0xb2 (or later)
MDS CPU bug present and SMT on, data leak possible.
TAA CPU bug present and SMT on, data leak possible.
>
>> Oct 23 13:22:10 linuxp dhcpcd[18199]: dhcpcd not running
>> Oct 23 15:55:49 linuxp dhcpcd[27045]: dhcpcd not running
>> Oct 23 15:55:49 linuxp dhcpcd[27053]: dhcpcd not running
>> Oct 23 15:55:50 linuxp dhcpcd[27061]: dhcpcd not running
>> Oct 23 15:55:50 linuxp dhcpcd[27067]: dhcpcd not running
>> Oct 23 17:31:13 linuxp kernel: [140897.356150] iwlwifi 0000:03:00.0:
>> RF_KILL bit toggled to enable radio.
>> Oct 23 17:31:16 linuxp kernel: [140900.724013] Bluetooth: hci0:
>> unexpected event for opcode 0xfc2f
>> Oct 23 17:31:39 linuxp kernel: [140928.245135] BUG: unable to handle
>> page fault for address: 021d2001
>> Oct 23 17:31:39 linuxp kernel: [140928.245147] #PF: supervisor read
>> access in kernel mode
>> Oct 23 17:31:39 linuxp kernel: [140928.245152] #PF: error_code(0x0000) -
>> not-present page
>> Oct 23 17:31:39 linuxp kernel: [140928.245169] Oops: 0000 [#1] SMP PTI
>> Oct 23 17:31:39 linuxp kernel: [140928.245179] CPU: 1 PID: 2302 Comm:
>> Breakpad Server Tainted: G I 5.9.1 #1
>> Oct 23 17:31:39 linuxp kernel: [140928.245184] Hardware name:
>> Notebook W65_W67RZ/W65_W67RZ, BIOS 1.05.06
>> 02/22/2016
>> Oct 23 17:31:39 linuxp kernel: [140928.245197] EIP: m_next+0x1c/0x44
>> Oct 23 17:31:39 linuxp kernel: [140928.245205] Code: 24 08 d4 e6 c1 e8
>> 1a 77 e2 ff eb d6 cc cc 3e 8d 74 26 00 55 89 e5 57 56 53 8b 40 44 8b 58
>> 0c 39 da 74 24 8b 42 08 85 c0 74 0e <8b> 30 31 ff 89 31 89 79 04 5b 5e
>> 5f 5d c3 be ff ff ff ff 31 ff 85
>> Oct 23 17:31:39 linuxp kernel: [140928.245212] EAX: 021d2001 EBX:
>> 00000000 ECX: eeb9e56c EDX: eca0f000
>> Oct 23 17:31:39 linuxp kernel: [140928.245216] ESI: 00000000 EDI:
>> c128cad0 EBP: e59bff0c ESP: e59bff00
>> Oct 23 17:31:39 linuxp kernel: [140928.245221] DS: 007b ES: 007b FS:
>> 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010202
>> Oct 23 17:31:39 linuxp kernel: [140928.245224] CR0: 80050033 CR2:
>> 021d2001 CR3: 2606c000 CR4: 003506f0
>> Oct 23 17:31:39 linuxp kernel: [140928.245228] DR0: 00000000 DR1:
>> 00000000 DR2: 00000000 DR3: 00000000
>> Oct 23 17:31:39 linuxp kernel: [140928.245231] DR6: fffe0ff0 DR7:
>> 00000400
>> Oct 23 17:31:39 linuxp kernel: [140928.245233] Call Trace:
>> Oct 23 17:31:39 linuxp kernel: [140928.245242] ?
>> quota_send_warning+0x220/0x220
>> Oct 23 17:31:39 linuxp kernel: [140928.245248] seq_read+0x2bc/0x3e1
>> Oct 23 17:31:39 linuxp kernel: [140928.245254] ?
>> quota_send_warning+0x220/0x220
>> Oct 23 17:31:39 linuxp kernel: [140928.245260] ?
>> seq_open_private+0x17/0x17
>> Oct 23 17:31:39 linuxp kernel: [140928.245266] vfs_read+0x85/0x17f
>> Oct 23 17:31:39 linuxp kernel: [140928.245272] ? mutex_lock+0x10/0x33
>> Oct 23 17:31:39 linuxp kernel: [140928.245277] ksys_read+0x51/0xb6
>> Oct 23 17:31:39 linuxp kernel: [140928.245283] __ia32_sys_read+0x15/0x17
>> Oct 23 17:31:39 linuxp kernel: [140928.245289]
>> do_int80_syscall_32+0x2c/0x39
>> Oct 23 17:31:39 linuxp kernel: [140928.245295] entry_INT80_32+0xf7/0xf7
>> Oct 23 17:31:39 linuxp kernel: [140928.245299] EIP: 0xafc787c8
>> Oct 23 17:31:39 linuxp kernel: [140928.245304] Code: 00 00 c6 47 04 01
>> 8b 47 08 85 c0 75 b6 80 7f 04 00 75 5c 8b 37 ba 00 04 00 00 8d 4c 07 0c
>> 29 c2 b8 03 00 00 00 53 89 f3 cd 80 <5b> 89 c6 3d 01 f0 ff ff 73 32 85
>> f6 78 37 74 c8 01 77 08 8b 47 08
>> Oct 23 17:31:39 linuxp kernel: [140928.245308] EAX: ffffffda EBX:
>> 00000040 ECX: a3c194fc EDX: 000003d8
>> Oct 23 17:31:39 linuxp kernel: [140928.245311] ESI: 00000040 EDI:
>> a3c194c8 EBP: 995fece8 ESP: 995feccc
>> Oct 23 17:31:39 linuxp kernel: [140928.245316] DS: 007b ES: 007b FS:
>> 0000 GS: 0033 SS: 007b EFLAGS: 00000216
>> Oct 23 17:31:39 linuxp kernel: [140928.245320] Modules linked in:
>> appletalk psnap llc ipv6 fuse uvcvideo videobuf2_vmalloc
>> videobuf2_memops btusb videobuf2_v4l2 hid_generic btrtl btbcm
>> videobuf2_common btintel videodev bluetooth mc usbhid hid ecdh_generic
>> ecc rtsx_pci_sdmmc joydev mmc_core snd_hda_codec_hdmi
>> snd_hda_codec_realtek snd_hda_codec_generic i2c_dev ledtrig_audio
>> coretemp i915 hwmon iwlmvm r8169 i2c_algo_bit x86_pkg_temp_thermal
>> mac80211 drm_kms_helper intel_powerclamp rtsx_pci realtek drm kvm_intel
>> mdio_devres mfd_core libphy kvm intel_gtt irqbypass crc32_pclmul iwlwifi
>> snd_hda_intel agpgart psmouse evdev crc32c_intel serio_raw fb_sys_fops
>> cfg80211 snd_intel_dspcfg snd_hda_codec rfkill snd_hda_core syscopyarea
>> wmi thermal snd_hwdep battery snd_pcm sysfillrect sysimgblt snd_timer
>> xhci_pci i2c_i801 button xhci_hcd snd i2c_smbus intel_pch_thermal mei_me
>> soundcore video mei i2c_core acpi_pad ac loop
>> Oct 23 17:31:39 linuxp kernel: [140928.245396] CR2: 00000000021d2001
>> Oct 23 17:31:39 linuxp kernel: [140928.245402] ---[ end trace
>> c79bfd2669dd9a26 ]---
>> Oct 23 17:31:39 linuxp kernel: [140928.245408] EIP: m_next+0x1c/0x44
>> Oct 23 17:31:39 linuxp kernel: [140928.245412] Code: 24 08 d4 e6 c1 e8
>> 1a 77 e2 ff eb d6 cc cc 3e 8d 74 26 00 55 89 e5 57 56 53 8b 40 44 8b 58
>> 0c 39 da 74 24 8b 42 08 85 c0 74 0e <8b> 30 31 ff 89 31 89 79 04 5b 5e
>> 5f 5d c3 be ff ff ff ff 31 ff 85
>> Oct 23 17:31:39 linuxp kernel: [140928.245417] EAX: 021d2001 EBX:
>> 00000000 ECX: eeb9e56c EDX: eca0f000
>> Oct 23 17:31:39 linuxp kernel: [140928.245420] ESI: 00000000 EDI:
>> c128cad0 EBP: e59bff0c ESP: e59bff00
>> Oct 23 17:31:39 linuxp kernel: [140928.245424] DS: 007b ES: 007b FS:
>> 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010202
>> Oct 23 17:31:39 linuxp kernel: [140928.245427] CR0: 80050033 CR2:
>> 021d2001 CR3: 2606c000 CR4: 003506f0
>> Oct 23 17:31:39 linuxp kernel: [140928.245431] DR0: 00000000 DR1:
>> 00000000 DR2: 00000000 DR3: 00000000
>> Oct 23 17:31:39 linuxp kernel: [140928.245434] DR6: fffe0ff0 DR7:
>> 00000400
>> Oct 23 17:32:27 linuxp dhcpcd[27294]: dhcpcd not running
>> Oct 23 17:32:30 linuxp dhcpcd[27305]: dhcpcd not running
>> Oct 23 17:32:33 linuxp dhcpcd[27314]: dhcpcd not running
>> Oct 23 17:32:35 linuxp dhcpcd[27325]: dhcpcd not running
>> Oct 23 17:32:36 linuxp dhcpcd[27331]: dhcpcd not running
>> Oct 23 17:38:22 linuxp kernel: [141330.499234] BUG: unable to handle
>> page fault for address: 021d202d
>> Oct 23 17:38:22 linuxp kernel: [141330.499245] #PF: supervisor read
>> access in kernel mode
>> Oct 23 17:38:22 linuxp kernel: [141330.499250] #PF: error_code(0x0000) -
>> not-present page
>> Oct 23 17:38:22 linuxp kernel: [141330.499265] Oops: 0000 [#2] SMP PTI
>> Oct 23 17:38:22 linuxp kernel: [141330.499274] CPU: 0 PID: 37 Comm:
>> khugepaged Tainted: G D I 5.9.1 #1
>> Oct 23 17:38:22 linuxp kernel: [141330.499278] Hardware name:
>> Notebook W65_W67RZ/W65_W67RZ, BIOS 1.05.06
>> 02/22/2016
>> Oct 23 17:38:22 linuxp kernel: [141330.499289] EIP:
>> khugepaged+0x599/0x2226
>>
>>
>>
>
prev parent reply other threads:[~2020-11-09 11:47 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <bug-210031-27@https.bugzilla.kernel.org/>
2020-11-04 0:17 ` Andrew Morton
2020-11-04 9:03 ` Newsmails
2020-11-04 12:14 ` Vlastimil Babka
2020-11-04 12:23 ` Newsmails
2020-11-04 12:45 ` Vlastimil Babka
2020-11-09 11:47 ` Newsmails [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=505ed020-6dba-46d3-9b49-4d97900c07dc@netcourrier.com \
--to=newsmails@netcourrier.com \
--cc=akpm@linux-foundation.org \
--cc=bugzilla-daemon@bugzilla.kernel.org \
--cc=hannes@cmpxchg.org \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-mm@kvack.org \
--cc=oleg@redhat.com \
--cc=pavel@ucw.cz \
--cc=songliubraving@fb.com \
--cc=vbabka@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox