From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.3 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id B32E8ECE588 for ; Tue, 15 Oct 2019 15:22:05 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 750B020640 for ; Tue, 15 Oct 2019 15:22:05 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 750B020640 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=canonical.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 272808E0006; Tue, 15 Oct 2019 11:22:05 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 222C18E0001; Tue, 15 Oct 2019 11:22:05 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 111DD8E0006; Tue, 15 Oct 2019 11:22:05 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0107.hostedemail.com [216.40.44.107]) by kanga.kvack.org (Postfix) with ESMTP id E32118E0001 for ; Tue, 15 Oct 2019 11:22:04 -0400 (EDT) Received: from smtpin12.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay02.hostedemail.com (Postfix) with SMTP id 8760B5DD5 for ; Tue, 15 Oct 2019 15:22:04 +0000 (UTC) X-FDA: 76046384568.12.story97_866e7a72ea805 X-HE-Tag: story97_866e7a72ea805 X-Filterd-Recvd-Size: 6697 Received: from youngberry.canonical.com (youngberry.canonical.com [91.189.89.112]) by imf35.hostedemail.com (Postfix) with ESMTP for ; Tue, 15 Oct 2019 15:22:03 +0000 (UTC) Received: from mail-pl1-f200.google.com ([209.85.214.200]) by youngberry.canonical.com with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.86_2) (envelope-from ) id 1iKOeA-0001Ep-0i for linux-mm@kvack.org; Tue, 15 Oct 2019 15:22:02 +0000 Received: by mail-pl1-f200.google.com with SMTP id f8so12265941plj.10 for ; Tue, 15 Oct 2019 08:22:01 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:openpgp:autocrypt :message-id:date:user-agent:mime-version:in-reply-to :content-language:content-transfer-encoding; bh=4uWEMUtO+/0Vvy5QnKlN0NMPVmJvd5IxagN1D1K9RuM=; b=rZ7ZLsZCgEPRBhJ30HNvxZ2UVVIT5sEKGlOt848xU8ZzeLGSHRXsW9985A+MMHA8X5 p+lPDo7NEuN6Vu0dFbNruxySEcsqne9NaDdWjKtk6yTyY0j9hAUoX3BUXNud38nkzEGe 6YGt/tP/nydeRQrNKNyG77EJJty1Vl0RR+6VkoyGpf7206IUiCdsJ2AOyIcXsmOXW3c9 0rqQEuTEQ1UdsM/nYs7nbGJ+c9a0gAoMPcsxSgkcrmcxFxqVex0UzzzqDm3DvK9usPtj 9brrvhEbDVm2c5ohborvP/Go1V0ddEIurto64/pIDq3nnR4TvEHyqA9nVHBTU3RDJzij VDSQ== X-Gm-Message-State: APjAAAXzSXalt3j/b4X2QXcGl5ZtqW9M75al8J1xxrgiLvhRwgKKjME/ Yg3GvOnTyZK3l9chU+mZkHXOB5bnBeA63AUGx4J2CYXS/Lm7ahGdOpLpQrflNavosjBScJxpZAG AwcU7HH4txlDQCHs9mZ38k29h9AJQ X-Received: by 2002:a63:3c41:: with SMTP id i1mr4452944pgn.287.1571152920616; Tue, 15 Oct 2019 08:22:00 -0700 (PDT) X-Google-Smtp-Source: APXvYqzqIwZ8/hi+1TI04wvVCUT6kOctE1TYvzNEHURTZj+QItUndJsJWTqG0o6Iq6T0rUfS9prSFw== X-Received: by 2002:a63:3c41:: with SMTP id i1mr4452910pgn.287.1571152920254; Tue, 15 Oct 2019 08:22:00 -0700 (PDT) Received: from [192.168.1.200] (201-92-249-168.dsl.telesp.net.br. [201.92.249.168]) by smtp.gmail.com with ESMTPSA id r21sm28603670pfc.27.2019.10.15.08.21.51 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 15 Oct 2019 08:21:59 -0700 (PDT) Subject: Re: Advice on oops - memory trap on non-memory access instruction (invalid CR2?) To: Thomas Gleixner Cc: kvm@vger.kernel.org, linux-acpi@vger.kernel.org, linux-mm@kvack.org, platform-driver-x86@vger.kernel.org, x86@kernel.org, iommu@lists.linux-foundation.org, "Guilherme G. Piccoli" , gavin.guo@canonical.com, halves@canonical.com, ioanna-maria.alifieraki@canonical.com, jay.vosburgh@canonical.com, mfo@canonical.com References: <66eeae28-bfd3-c7a0-011c-801981b74243@canonical.com> From: "Guilherme G. Piccoli" Openpgp: preference=signencrypt Autocrypt: addr=gpiccoli@canonical.com; prefer-encrypt=mutual; keydata= mQENBFpVBxcBCADPNKmu2iNKLepiv8+Ssx7+fVR8lrL7cvakMNFPXsXk+f0Bgq9NazNKWJIn Qxpa1iEWTZcLS8ikjatHMECJJqWlt2YcjU5MGbH1mZh+bT3RxrJRhxONz5e5YILyNp7jX+Vh 30rhj3J0vdrlIhPS8/bAt5tvTb3ceWEic9mWZMsosPavsKVcLIO6iZFlzXVu2WJ9cov8eQM/ irIgzvmFEcRyiQ4K+XUhuA0ccGwgvoJv4/GWVPJFHfMX9+dat0Ev8HQEbN/mko/bUS4Wprdv 7HR5tP9efSLucnsVzay0O6niZ61e5c97oUa9bdqHyApkCnGgKCpg7OZqLMM9Y3EcdMIJABEB AAG0LUd1aWxoZXJtZSBHLiBQaWNjb2xpIDxncGljY29saUBjYW5vbmljYWwuY29tPokBNwQT AQgAIQUCWmClvQIbAwULCQgHAgYVCAkKCwIEFgIDAQIeAQIXgAAKCRDOR5EF9K/7Gza3B/9d 5yczvEwvlh6ksYq+juyuElLvNwMFuyMPsvMfP38UslU8S3lf+ETukN1S8XVdeq9yscwtsRW/ 4YoUwHinJGRovqy8gFlm3SAtjfdqysgJqUJwBmOtcsHkmvFXJmPPGVoH9rMCUr9s6VDPox8f q2W5M7XE9YpsfchS/0fMn+DenhQpV3W6pbLtuDvH/81GKrhxO8whSEkByZbbc+mqRhUSTdN3 iMpRL0sULKPVYbVMbQEAnfJJ1LDkPqlTikAgt3peP7AaSpGs1e3pFzSEEW1VD2jIUmmDku0D LmTHRl4t9KpbU/H2/OPZkrm7809QovJGRAxjLLPcYOAP7DUeltveuQENBFpVBxcBCADbxD6J aNw/KgiSsbx5Sv8nNqO1ObTjhDR1wJw+02Bar9DGuFvx5/qs3ArSZkl8qX0X9Vhptk8rYnkn pfcrtPBYLoux8zmrGPA5vRgK2ItvSc0WN31YR/6nqnMfeC4CumFa/yLl26uzHJa5RYYQ47jg kZPehpc7IqEQ5IKy6cCKjgAkuvM1rDP1kWQ9noVhTUFr2SYVTT/WBHqUWorjhu57/OREo+Tl nxI1KrnmW0DbF52tYoHLt85dK10HQrV35OEFXuz0QPSNrYJT0CZHpUprkUxrupDgkM+2F5LI bIcaIQ4uDMWRyHpDbczQtmTke0x41AeIND3GUc+PQ4hWGp9XABEBAAGJAR8EGAEIAAkFAlpV BxcCGwwACgkQzkeRBfSv+xv1wwgAj39/45O3eHN5pK0XMyiRF4ihH9p1+8JVfBoSQw7AJ6oU 1Hoa+sZnlag/l2GTjC8dfEGNoZd3aRxqfkTrpu2TcfT6jIAsxGjnu+fUCoRNZzmjvRziw3T8 egSPz+GbNXrTXB8g/nc9mqHPPprOiVHDSK8aGoBqkQAPZDjUtRwVx112wtaQwArT2+bDbb/Y Yh6gTrYoRYHo6FuQl5YsHop/fmTahpTx11IMjuh6IJQ+lvdpdfYJ6hmAZ9kiVszDF6pGFVkY kHWtnE2Aa5qkxnA2HoFpqFifNWn5TyvJFpyqwVhVI8XYtXyVHub/WbXLWQwSJA4OHmqU8gDl X18zwLgdiQ== Message-ID: <331f83c2-1d52-dfdb-1006-e910ff20c3a5@canonical.com> Date: Tue, 15 Oct 2019 12:21:45 -0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: quoted-printable X-Bogosity: Ham, tests=bogofilter, spamicity=0.000352, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On 14/10/2019 11:10, Thomas Gleixner wrote: > On Mon, 14 Oct 2019, Guilherme G. Piccoli wrote: >> Modules linked in: <...> >> CPU: 40 PID: 78274 Comm: qemu-system-x86 Tainted: P W OE >=20 > Tainted: P - Proprietary module loaded ... >=20 > Try again without that module Thanks Thomas, for the prompt response. This is some ScaleIO stuff, I guess it's part of customer setup, and I agree would be better to not have this kind of module loaded. Anyway, the analysis of oops show a quite odd situation that we'd like to at least have a strong clue before saying the scaleio stuff is the culprit. >=20 > Tainted: W - Warning issued before >=20 > Are you sure that that warning is harmless and unrelated? >=20 Sorry I didn't mention that before, the warn is: [5946866.593060] WARNING: CPU: 42 PID: 173056 at /build/linux-lts-xenial-80t3lB/linux-lts-xenial-4.4.0/arch/x86/events/int= el/core.c:1868 intel_pmu_handle_irq+0x2d4/0x470() [5946866.593061] perfevents: irq loop stuck! It happened ~700 days before the oops (yeah, the uptime is quite large, about 900 days when the oops happened heh). >> 4.4.0-45-generic #66~14.04.1-Ubuntu >=20 > Does the same problem happen with a not so dead kernel? CR2 handling go= t > quite some updates/fixes since then. Unfortunately we don't have ways to test that for now, but your comment is quite interesting - we can take a look in the CR2 fixes since v4.4. But what do you think about having a #PF while the instruction pointed in the oops Code section (and the RIP address) is not a memory-related in= sn? Thanks, Guilherme >=20 > Thanks, >=20 > tglx >=20 >=20