From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from mail-io1-f72.google.com (mail-io1-f72.google.com [209.85.166.72]) by kanga.kvack.org (Postfix) with ESMTP id E038A8E0001 for ; Wed, 19 Dec 2018 16:15:09 -0500 (EST)
Received: by mail-io1-f72.google.com with SMTP id i11so125247iog.2 for ; Wed, 19 Dec 2018 13:15:09 -0800 (PST)
Received: from mail-sor-f41.google.com (mail-sor-f41.google.com. [209.85.220.41]) by mx.google.com with SMTPS id 5sor11843362itu.14.2018.12.19.13.15.08 for (Google Transport Security); Wed, 19 Dec 2018 13:15:08 -0800 (PST)
MIME-Version: 1.0
References: <24702c72-cc06-1b54-0ab9-6d2409362c27@amd.com> <3ffe451b-1f17-23a5-985b-28d26fbaf7da@amd.com> <09781f6e-5ea3-ccfd-1aa2-79941b089863@amd.com>
In-Reply-To:
From: Mikhail Gavrilov
Date: Thu, 20 Dec 2018 02:14:57 +0500
Message-ID:
Subject: Re: After Vega 56/64 GPU hang I unable reboot system
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Sender: owner-linux-mm@kvack.org
List-ID:
To: "StDenis, Tom"
Cc: "Grodzovsky, Andrey" , "Wentland, Harry" , "Deucher, Alexander" , "Koenig, Christian" , "linux-mm@kvack.org" , "amd-gfx@lists.freedesktop.org"

On Thu, 20 Dec 2018 at 01:56, StDenis, Tom wrote:
>
> Sorry missed the gfx ring in the reply.
>
> Um what kernel version?

4.20.0-0.rc6

> Is this the latest umr?

yes, master branch, commit 546c30a71f7b87f97f2a96eab184c3973b014711

> Maybe capture a trace of umr to see what is happening.

Cannot seek to MMIO address: Bad file descriptor
[ERROR]: Could not open ring debugfs file

Program received signal SIGSEGV, Segmentation fault.
umr_pm4_decode_ring (asic=asic@entry=0x1c08a50, ringname=, no_halt=no_halt@entry=1) at /home/mikhail/packaging-work/umr/src/lib/umr_read_pm4_stream.c:333
333 ringdata[0] %= ringsize;
(gdb) thread apply all bt full

Thread 1 (Thread 0x7ffff7a22740 (LWP 7844)):
#0  umr_pm4_decode_ring (asic=asic@entry=0x1c08a50, ringname=, no_halt=no_halt@entry=1) at /home/mikhail/packaging-work/umr/src/lib/umr_read_pm4_stream.c:333
        ps =
        ringdata = 0x0
        ringsize = 8191
#1  0x00000000004b4ac6 in umr_print_waves (asic=asic@entry=0x1c08a50) at /home/mikhail/packaging-work/umr/src/app/print_waves.c:52
        x =
        y =
        shift =
        thread =
        pgm_addr =
        shader_addr =
        wd =
        owd =
        first = 1
        col = 0
        shader = 0x0
        stream =
#2  0x0000000000496952 in main (argc=, argv=) at /home/mikhail/packaging-work/umr/src/app/main.c:285
        i = 3
        j =
        k =
        l =
        asic = 0x1c08a50
        blockname =
        str =
        str2 =
        asicname = "\000\000\000\000\004", '\000' , "F;\226\000\000\000\000\000\000\000\000\000\004", '\000' , "\a", '\000' , "\004", '\000' , "\027\362\321\000\000\000\000\000\000\000\000\000\004", '\000' , "\004", '\000' , "\004", '\000' , "\004", '\000' ...
        ipname = '\000' , "F;\226", '\000' , "l-option", '\000' , "\006\000\000\000\000\000\000\200", '\000' , "\027\362\321", '\000' , "\037", '\000' ...
        regname = "\000\000\000\000\000 ", '\000' , "\017\004", '\000' , " ", '\000' , "\220\377\377\377\377\377\377\377", '\000' , "\031", '\000' , "\a\000\000\000\000\000\000\000\037\000\000\000\000\000\000\000\003\000\000\000\000\000\000\000\030\220\275\001\000\000\000\000P\000\000\000\000\000\000\000\220\377\377\377\377\377\377\377\000\000\000\000\000\000\000\000\003\000\000\000w\000\000\000[\000\000\000\060", '\000' , "n\000\000\000|", '\000' ...
        req = {tv_sec = 0, tv_nsec = 7310868735956184161}
(gdb)

> It works just fine on my raven1.
>

$ inxi -bM
System:    Host: localhost.localdomain Kernel: 4.20.0-0.rc6.git2.3.fc30.x86_64 x86_64 bits: 64 Desktop: Gnome 3.31.2
           Distro: Fedora release 30 (Rawhide)
Machine:   Type: Desktop Mobo: ASUSTeK model: ROG STRIX X470-I GAMING v: Rev 1.xx serial: UEFI: American Megatrends
           v: 1103 date: 11/16/2018
CPU:       8-Core: AMD Ryzen 7 2700X type: MT MCP speed: 2086 MHz min/max: 2200/3700 MHz
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Vega 10 XL/XT [Radeon RX Vega 56/64] driver: amdgpu v: kernel
           Display: wayland server: Fedora Project X.org 1.20.3 driver: amdgpu resolution: 3840x2160~60Hz
           OpenGL: renderer: Radeon RX Vega (VEGA10 DRM 3.27.0 4.20.0-0.rc6.git2.3.fc30.x86_64 LLVM 7.0.0)
           v: 4.5 Mesa 18.3.0
Network:   Device-1: Intel I211 Gigabit Network driver: igb
           Device-2: Realtek RTL8822BE 802.11a/b/g/n/ac WiFi adapter driver: r8822be
Drives:    Local Storage: total: 11.35 TiB used: 7.54 TiB (66.4%)
Info:      Processes: 435 Uptime: 22m Memory: 31.35 GiB used: 19.69 GiB (62.8%) Shell: bash inxi: 3.0.29

--
Best Regards,
Mike Gavrilov.
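
For reference: in the backtrace above, frame #0 shows ringdata = 0x0 at umr_read_pm4_stream.c:333, right after the "Could not open ring debugfs file" error, so the crash looks like a NULL ring buffer being dereferenced by `ringdata[0] %= ringsize`. Below is a minimal sketch of the kind of guard that would avoid the segfault. It is not umr's actual code: the function name `wrap_ring_pointer` and its return convention are hypothetical, and only the `ringdata[0] %= ringsize` statement is taken from the trace.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/*
 * Hypothetical helper (not part of umr): wrap the pointer stored in the
 * first word of a ring buffer so that it stays below ringsize.
 *
 * Returns 0 on success, or -1 when there is nothing to decode, for
 * example because the ring debugfs file could not be opened and the
 * caller was handed a NULL buffer.
 */
static int wrap_ring_pointer(uint32_t *ringdata, uint32_t ringsize)
{
    /* Guard against the NULL buffer seen in frame #0 of the trace,
     * and against a zero size, which would make the modulo undefined. */
    if (ringdata == NULL || ringsize == 0) {
        fprintf(stderr, "[ERROR]: no ring data available, skipping decode\n");
        return -1;
    }

    /* Same operation as line 333 in the trace, now safe to execute. */
    ringdata[0] %= ringsize;
    return 0;
}
```

With a check like this the caller could report the error and exit cleanly instead of crashing when the debugfs file is missing.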