[Dec16 23:16] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -2!
[ +0.000908] gmc_v9_0_process_interrupt: 24 callbacks suppressed
[ +0.000010] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000007] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f0000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00640C51
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x6
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x1
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x1
[ +0.000007] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f1000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
[ +0.000007] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f2000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
[ +0.000006] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f3000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000002] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
[ +0.000004] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f4000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
[ +0.000007] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f5000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
[ +0.000004] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f6000 from client 27
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
[ +0.000007] amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32769, for process X pid 1073 thread X:cs0 pid 1095)
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001018f7000 from client 27
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
[ +0.000002] amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ +0.000000] amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
[ +0.000001] amdgpu 0000:04:00.0: amdgpu: RW: 0x0
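The per-fault summary lines (MORE_FAULTS, WALKER_ERROR and so on) appear to be fields unpacked from the VM_L2_PROTECTION_FAULT_STATUS word. As a rough sketch, the first status value in the log can be split by hand. The bit positions below are an assumption: they are not stated anywhere in the log and were chosen because they reproduce the values the driver printed for 0x00640C51. FIELDS and decode_fault_status are hypothetical names used only for illustration.

```python
# Assumed (shift, width) layout of VM_L2_PROTECTION_FAULT_STATUS.
# Not taken from the log; only checked against the fields printed next to 0x00640C51.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "CID": (9, 9),    # printed by the driver as "Faulty UTCL2 client ID"
    "RW": (18, 1),
    "VMID": (20, 4),  # should match vmid:6 in the "retry page fault" line
}


def decode_fault_status(status):
    """Extract each assumed bit field from the 32-bit status word."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


decoded = decode_fault_status(0x00640C51)
for name, value in decoded.items():
    print(f"{name}: {value:#x}")
```

Under this layout, the later faults with status 0x00000000 decode to all-zero fields, which is also what the log shows for them.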
This is with Linux kernel 5.10.7 and vulkan disabled.
Jan 20 18:34:41 tongfang-amd kernel: amdgpu_cs_ioctl: 32 callbacks suppressed
Jan 20 18:34:41 tongfang-amd kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -2!
Jan 20 18:35:04 tongfang-amd kernel: GpuWatchdog[5798]: segfault at 0 ip 00007f937fc31fa6 sp 00007f9377719030 error 6 in libcef.so[7f937b924000+75cd000]
Jan 20 18:35:04 tongfang-amd kernel: Code: 89 de e8 cd 2f 63 ff 80 7d cf 00 79 09 48 8b 7d b8 e8 0e 1f 5f fe 41 8b 84 24 e0 00 00 00 89 45 b8 48 8d 7d b8 e8 9a 2e cf fb <c7> 04 25 00 00 00 00 37 13 00 00 48 83 c4 38 5b 41 5c 41 5d 41 5e
Jan 20 18:35:04 tongfang-amd kernel: traps: Chrome_IOThread[5714] trap int3 ip:7f7d000ebff4 sp:7f7ce579f0d0 error:0 in libcef.so[7f7cfd87b000+75cd000]
Jan 20 18:35:11 tongfang-amd kernel: GpuWatchdog[3689]: segfault at 0 ip 000055734be3c107 sp 00007f729aa49430 error 6 in signal-desktop[557348c5b000+53d6000]
Jan 20 18:35:11 tongfang-amd kernel: Code: 7d b7 00 79 09 48 8b 7d a0 e8 35 52 d3 fe 8b 83 00 01 00 00 85 c0 0f 84 91 00 00 00 48 8b 03 48 89 df be 01 00 00 00 ff 50 68 <c7> 04 25 00 00 00 00 37 13 00 00 c6 05 17 bc 6f 02 01 80 7d 87 00
Jan 20 18:35:12 tongfang-amd kernel: traps: Chrome_IOThread[3495] trap int3 ip:55d7f536ecd3 sp:7f118e520520 error:0 in signal-desktop[55d7f34eb000+53d6000]
Full stack trace:
Jan 20 18:34:21 tongfang-amd kernel: Freezing user space processes ... (elapsed 0.002 seconds) done.
Jan 20 18:34:21 tongfang-amd kernel: OOM killer disabled.
Jan 20 18:34:21 tongfang-amd kernel: Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
Jan 20 18:34:21 tongfang-amd kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Jan 20 18:34:21 tongfang-amd kernel: [drm] free PSP TMR buffer
Jan 20 18:34:21 tongfang-amd kernel: ACPI: EC: interrupt blocked
Jan 20 18:34:21 tongfang-amd kernel: xhci_hcd 0000:04:00.4: refused to change power state from D0 to D3hot
Jan 20 18:34:21 tongfang-amd kernel: ACPI: Preparing to enter system sleep state S3
Jan 20 18:34:21 tongfang-amd kernel: ACPI: EC: event blocked
Jan 20 18:34:21 tongfang-amd kernel: ACPI: EC: EC stopped
Jan 20 18:34:21 tongfang-amd kernel: PM: Saving platform NVS memory
Jan 20 18:34:21 tongfang-amd kernel: Disabling non-boot CPUs ...
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 1 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 2 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 3 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 4 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 5 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 6 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 7 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 8 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 9 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 10 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 11 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 12 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 13 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 14 is now offline
Jan 20 18:34:21 tongfang-amd kernel: smpboot: CPU 15 is now offline
Jan 20 18:34:21 tongfang-amd kernel: ACPI: Low-level resume complete
Jan 20 18:34:21 tongfang-amd kernel: ACPI: EC: EC started
Jan 20 18:34:21 tongfang-amd kernel: PM: Restoring platform NVS memory
Jan 20 18:34:21 tongfang-amd kernel: LVT offset 0 assigned for vector 0x400
Jan 20 18:34:21 tongfang-amd kernel: Enabling non-boot CPUs ...
Jan 20 18:34:21 tongfang-amd kernel: x86: Booting SMP configuration:
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 1 APIC 0x1
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU1: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P001: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU1 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 2 APIC 0x2
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU2: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P002: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU2 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 3 APIC 0x3
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU3: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P003: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU3 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 4 APIC 0x4
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU4: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P004: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU4 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 5 APIC 0x5
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU5: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P005: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU5 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 6 APIC 0x6
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU6: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P006: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU6 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 7 APIC 0x7
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU7: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P007: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU7 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 8 APIC 0x8
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU8: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P008: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU8 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 9 APIC 0x9
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU9: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P009: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU9 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 10 APIC 0xa
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU10: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P00A: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU10 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 11 APIC 0xb
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU11: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P00B: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU11 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 12 APIC 0xc
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU12: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P00C: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU12 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 13 APIC 0xd
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU13: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P00D: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU13 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 14 APIC 0xe
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU14: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P00E: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU14 is up
Jan 20 18:34:21 tongfang-amd kernel: smpboot: Booting Node 0 Processor 15 APIC 0xf
Jan 20 18:34:21 tongfang-amd kernel: microcode: CPU15: patch_level=0x08600103
Jan 20 18:34:21 tongfang-amd kernel: ACPI: \_SB_.PLTF.P00F: Found 3 idle states
Jan 20 18:34:21 tongfang-amd kernel: CPU15 is up
Jan 20 18:34:21 tongfang-amd kernel: ACPI: Waking up from system sleep state S3
Jan 20 18:34:21 tongfang-amd kernel: ACPI: EC: interrupt unblocked
Jan 20 18:34:21 tongfang-amd kernel: ACPI: EC: event unblocked
Jan 20 18:34:21 tongfang-amd kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F400900000).
Jan 20 18:34:21 tongfang-amd kernel: [drm] PSP is resuming...
Jan 20 18:34:21 tongfang-amd kernel: [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
Jan 20 18:34:21 tongfang-amd kernel: nvme nvme0: Shutdown timeout set to 8 seconds
Jan 20 18:34:21 tongfang-amd kernel: nvme nvme0: 16/0/0 default/read/poll queues
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: dpm has been disabled
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
Jan 20 18:34:21 tongfang-amd kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jan 20 18:34:21 tongfang-amd kernel: [drm] DMUB hardware initialized: version=0x01000000
Jan 20 18:34:21 tongfang-amd kernel: ------------[ cut here ]------------
Jan 20 18:34:21 tongfang-amd kernel: WARNING: CPU: 3 PID: 22239 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:2029 dm_resume+0x4bb/0x520 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: Modules linked in: xt_nat xt_tcpudp rfcomm xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c br_netfilter af_packet overlay cmac algif_hash algif_skcipher af_alg bnep 8021q snd_hda_codec_realtek btusb snd_hda_codec_generic btrtl ledtrig_audio btbcm btintel uvcvideo snd_hda_codec_hdmi videobuf2_vmalloc videobuf2_memops bluetooth snd_usb_audio iwlmvm snd_hda_intel videobuf2_v4l2 snd_intel_dspcfg sch_fq_codel videobuf2_common rtsx_usb_sdmmc snd_hda_codec snd_usbmidi_lib rtsx_usb_ms mmc_core videodev memstick snd_hda_core snd_rawmidi snd_pcm_oss joydev snd_seq_device ecdh_generic snd_rn_pci_acp3x snd_hwdep mac80211 mousedev snd_mixer_oss rtsx_usb ecc mc usbhid snd_pci_acp3x libarc4 snd_pcm snd_timer hid_multitouch iwlwifi snd soundcore hid_generic sha256_ssse3 sha256_generic edac_mce_amd cfg80211 edac_core asus_wmi nls_iso8859_1 sparse_keymap wmi_bmof
Jan 20 18:34:21 tongfang-amd kernel: crc32_pclmul nls_cp437 ghash_clmulni_intel aesni_intel r8169 vfat libaes deflate crypto_simd fat sp5100_tco cryptd watchdog input_leds glue_helper realtek led_class mdio_devres efi_pstore evdev mac_hid rapl serio_raw libphy pstore rfkill k10temp i2c_piix4 loop tun wmi tap video macvlan battery i2c_hid veth hid tpm_crb bridge tpm_tis tpm_tis_core tpm tiny_power_button stp llc rng_core vboxnetflt(O) acpi_cpufreq thermal i2c_designware_platform vboxnetadp(O) pinctrl_amd i2c_designware_core button ac vboxdrv(O) kvm_amd kvm irqbypass efivarfs ip_tables x_tables autofs4 ext4 crc32c_generic crc16 mbcache jbd2 xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata atkbd libps2 usbcore nvme scsi_mod nvme_core t10_pi crc32c_intel crc_t10dif crct10dif_generic crct10dif_pclmul usb_common crct10dif_common i8042 rtc_cmos serio dm_mod amdgpu iommu_v2 gpu_sched ttm i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm i2c_core backlight agpgart
Jan 20 18:34:21 tongfang-amd kernel: CPU: 3 PID: 22239 Comm: kworker/u32:18 Tainted: G W O 5.10.7 #1-NixOS
Jan 20 18:34:21 tongfang-amd kernel: Hardware name: Standard Standard/PF5NU1G, BIOS N.1.06PCS00 06/17/2020
Jan 20 18:34:21 tongfang-amd kernel: Workqueue: events_unbound async_run_entry_fn
Jan 20 18:34:21 tongfang-amd kernel: RIP: 0010:dm_resume+0x4bb/0x520 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: Code: 89 df 48 c7 83 c0 55 01 00 00 00 00 00 e8 3d 0f 00 00 48 8d bb f8 3b 01 00 e8 d1 b5 14 c1 e9 28 fe ff ff 0f 0b e9 c0 fd ff ff <0f> 0b e9 59 fd ff ff 48 8d bb d8 75 00 00 e8 62 fe f8 ff 85 c0 0f
Jan 20 18:34:21 tongfang-amd kernel: RSP: 0018:ffffb6a5026dbd38 EFLAGS: 00010206
Jan 20 18:34:21 tongfang-amd kernel: RAX: 0000000000000004 RBX: ffff98f245fa0000 RCX: 0000000000100008
Jan 20 18:34:21 tongfang-amd kernel: RDX: ffff98f3240f2f00 RSI: ffff98f313fda500 RDI: ffff98f3a29a7800
Jan 20 18:34:21 tongfang-amd kernel: RBP: 0000000000000001 R08: 0000000000000000 R09: ffffffffc0696b00
Jan 20 18:34:21 tongfang-amd kernel: R10: ffff98f628f15800 R11: 0000000000000001 R12: 0000000000000000
Jan 20 18:34:21 tongfang-amd kernel: R13: ffff98f245fa0010 R14: ffff98f7426f5400 R15: 0000000000000000
Jan 20 18:34:21 tongfang-amd kernel: FS: 0000000000000000(0000) GS:ffff99010f6c0000(0000) knlGS:0000000000000000
Jan 20 18:34:21 tongfang-amd kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 20 18:34:21 tongfang-amd kernel: CR2: 0000000000000000 CR3: 000000055de0a000 CR4: 0000000000350ee0
Jan 20 18:34:21 tongfang-amd kernel: Call Trace:
Jan 20 18:34:21 tongfang-amd kernel: amdgpu_device_ip_resume_phase2+0x52/0xb0 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: amdgpu_device_resume+0x7b/0x330 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: ? pci_pm_restore+0xe0/0xe0
Jan 20 18:34:21 tongfang-amd kernel: dpm_run_callback+0x4c/0x120
Jan 20 18:34:21 tongfang-amd kernel: device_resume+0x8b/0x190
Jan 20 18:34:21 tongfang-amd kernel: async_resume+0x19/0x30
Jan 20 18:34:21 tongfang-amd kernel: async_run_entry_fn+0x37/0x140
Jan 20 18:34:21 tongfang-amd kernel: process_one_work+0x1df/0x370
Jan 20 18:34:21 tongfang-amd kernel: worker_thread+0x50/0x400
Jan 20 18:34:21 tongfang-amd kernel: ? process_one_work+0x370/0x370
Jan 20 18:34:21 tongfang-amd kernel: kthread+0x11b/0x140
Jan 20 18:34:21 tongfang-amd kernel: ? __kthread_bind_mask+0x60/0x60
Jan 20 18:34:21 tongfang-amd kernel: ret_from_fork+0x22/0x30
Jan 20 18:34:21 tongfang-amd kernel: ---[ end trace 1d0e3d3340295edb ]---
Jan 20 18:34:21 tongfang-amd kernel: ------------[ cut here ]------------
Jan 20 18:34:21 tongfang-amd kernel: WARNING: CPU: 3 PID: 22239 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:2038 dm_resume+0x4b4/0x520 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: Modules linked in: xt_nat xt_tcpudp rfcomm xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c br_netfilter af_packet overlay cmac algif_hash algif_skcipher af_alg bnep 8021q snd_hda_codec_realtek btusb snd_hda_codec_generic btrtl ledtrig_audio btbcm btintel uvcvideo snd_hda_codec_hdmi videobuf2_vmalloc videobuf2_memops bluetooth snd_usb_audio iwlmvm snd_hda_intel videobuf2_v4l2 snd_intel_dspcfg sch_fq_codel videobuf2_common rtsx_usb_sdmmc snd_hda_codec snd_usbmidi_lib rtsx_usb_ms mmc_core videodev memstick snd_hda_core snd_rawmidi snd_pcm_oss joydev snd_seq_device ecdh_generic snd_rn_pci_acp3x snd_hwdep mac80211 mousedev snd_mixer_oss rtsx_usb ecc mc usbhid snd_pci_acp3x libarc4 snd_pcm snd_timer hid_multitouch iwlwifi snd soundcore hid_generic sha256_ssse3 sha256_generic edac_mce_amd cfg80211 edac_core asus_wmi nls_iso8859_1 sparse_keymap wmi_bmof
Jan 20 18:34:21 tongfang-amd kernel: crc32_pclmul nls_cp437 ghash_clmulni_intel aesni_intel r8169 vfat libaes deflate crypto_simd fat sp5100_tco cryptd watchdog input_leds glue_helper realtek led_class mdio_devres efi_pstore evdev mac_hid rapl serio_raw libphy pstore rfkill k10temp i2c_piix4 loop tun wmi tap video macvlan battery i2c_hid veth hid tpm_crb bridge tpm_tis tpm_tis_core tpm tiny_power_button stp llc rng_core vboxnetflt(O) acpi_cpufreq thermal i2c_designware_platform vboxnetadp(O) pinctrl_amd i2c_designware_core button ac vboxdrv(O) kvm_amd kvm irqbypass efivarfs ip_tables x_tables autofs4 ext4 crc32c_generic crc16 mbcache jbd2 xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata atkbd libps2 usbcore nvme scsi_mod nvme_core t10_pi crc32c_intel crc_t10dif crct10dif_generic crct10dif_pclmul usb_common crct10dif_common i8042 rtc_cmos serio dm_mod amdgpu iommu_v2 gpu_sched ttm i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm i2c_core backlight agpgart
Jan 20 18:34:21 tongfang-amd kernel: CPU: 3 PID: 22239 Comm: kworker/u32:18 Tainted: G W O 5.10.7 #1-NixOS
Jan 20 18:34:21 tongfang-amd kernel: Hardware name: Standard Standard/PF5NU1G, BIOS N.1.06PCS00 06/17/2020
Jan 20 18:34:21 tongfang-amd kernel: Workqueue: events_unbound async_run_entry_fn
Jan 20 18:34:21 tongfang-amd kernel: RIP: 0010:dm_resume+0x4b4/0x520 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: Code: 00 e8 30 d9 0e 00 48 89 df 48 c7 83 c0 55 01 00 00 00 00 00 e8 3d 0f 00 00 48 8d bb f8 3b 01 00 e8 d1 b5 14 c1 e9 28 fe ff ff <0f> 0b e9 c0 fd ff ff 0f 0b e9 59 fd ff ff 48 8d bb d8 75 00 00 e8
Jan 20 18:34:21 tongfang-amd kernel: RSP: 0018:ffffb6a5026dbd38 EFLAGS: 00010206
Jan 20 18:34:21 tongfang-amd kernel: RAX: 0000000000000004 RBX: ffff98f245fa0000 RCX: 0000000000100008
Jan 20 18:34:21 tongfang-amd kernel: RDX: ffff98f3240f2f00 RSI: ffff98f313fda500 RDI: ffff98fb0ea39400
Jan 20 18:34:21 tongfang-amd kernel: RBP: ffff98f46a880c00 R08: 0000000000000000 R09: ffffffffc0696b00
Jan 20 18:34:21 tongfang-amd kernel: R10: ffff98f628f15800 R11: 0000000000000001 R12: 0000000000000002
Jan 20 18:34:21 tongfang-amd kernel: R13: ffff98f245fa0010 R14: ffff98f7426f5800 R15: 0000000000000000
Jan 20 18:34:21 tongfang-amd kernel: FS: 0000000000000000(0000) GS:ffff99010f6c0000(0000) knlGS:0000000000000000
Jan 20 18:34:21 tongfang-amd kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 20 18:34:21 tongfang-amd kernel: CR2: 0000000000000000 CR3: 000000055de0a000 CR4: 0000000000350ee0
Jan 20 18:34:21 tongfang-amd kernel: Call Trace:
Jan 20 18:34:21 tongfang-amd kernel: amdgpu_device_ip_resume_phase2+0x52/0xb0 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: amdgpu_device_resume+0x7b/0x330 [amdgpu]
Jan 20 18:34:21 tongfang-amd kernel: ? pci_pm_restore+0xe0/0xe0
Jan 20 18:34:21 tongfang-amd kernel: dpm_run_callback+0x4c/0x120
Jan 20 18:34:21 tongfang-amd kernel: device_resume+0x8b/0x190
Jan 20 18:34:21 tongfang-amd kernel: async_resume+0x19/0x30
Jan 20 18:34:21 tongfang-amd kernel: async_run_entry_fn+0x37/0x140
Jan 20 18:34:21 tongfang-amd kernel: process_one_work+0x1df/0x370
Jan 20 18:34:21 tongfang-amd kernel: worker_thread+0x50/0x400
Jan 20 18:34:21 tongfang-amd kernel: ? process_one_work+0x370/0x370
Jan 20 18:34:21 tongfang-amd kernel: kthread+0x11b/0x140
Jan 20 18:34:21 tongfang-amd kernel: ? __kthread_bind_mask+0x60/0x60
Jan 20 18:34:21 tongfang-amd kernel: ret_from_fork+0x22/0x30
Jan 20 18:34:21 tongfang-amd kernel: ---[ end trace 1d0e3d3340295edc ]---
Jan 20 18:34:21 tongfang-amd kernel: usb 3-2.3: reset high-speed USB device number 6 using xhci_hcd
Jan 20 18:34:21 tongfang-amd kernel: usb 1-3: reset high-speed USB device number 2 using xhci_hcd
Jan 20 18:34:21 tongfang-amd kernel: ata2: SATA link down (SStatus 0 SControl 300)
Jan 20 18:34:21 tongfang-amd kernel: ata1: SATA link down (SStatus 0 SControl 300)
Jan 20 18:34:21 tongfang-amd kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Jan 20 18:34:21 tongfang-amd kernel: [drm] JPEG decode initialized successfully.
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
Jan 20 18:34:21 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
Jan 20 18:34:21 tongfang-amd kernel: usb 1-4.3: reset high-speed USB device number 4 using xhci_hcd
Jan 20 18:34:21 tongfang-amd kernel: usb 1-4.4: reset full-speed USB device number 5 using xhci_hcd
Jan 20 18:34:21 tongfang-amd kernel: OOM killer enabled.
Jan 20 18:34:21 tongfang-amd kernel: Restarting tasks ... done.
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: Bootloader revision 0.3 build 0 week 24 2017
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: Device revision is 1
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: Secure boot is enabled
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: OTP lock is enabled
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: API lock is enabled
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: Debug lock is disabled
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: Minimum firmware build 1 week 10 2014
Jan 20 18:34:21 tongfang-amd kernel: Bluetooth: hci0: Found device firmware: intel/ibt-20-1-3.sfi
Jan 20 18:34:21 tongfang-amd kernel: PM: suspend exit
Jan 20 18:34:21 tongfang-amd kernel: Generic FE-GE Realtek PHY r8169-200:00: attached PHY driver [Generic FE-GE Realtek PHY] (mii_bus:phy_addr=r8169-200:00, irq=IGNORE)
Jan 20 18:34:22 tongfang-amd kernel: r8169 0000:02:00.0 eno1: Link is Down
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Waiting for firmware download to complete
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Firmware loaded in 1520629 usecs
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Waiting for device to boot
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Device booted in 15860 usecs
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Found Intel DDC parameters: intel/ibt-20-1-3.ddc
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Applying Intel DDC parameters completed
Jan 20 18:34:23 tongfang-amd kernel: Bluetooth: hci0: Firmware revision 0.0 build 127 week 48 2020
Jan 20 18:34:26 tongfang-amd kernel: r8169 0000:02:00.0 eno1: Link is Up - 1Gbps/Full - flow control rx/tx
Jan 20 18:34:26 tongfang-amd kernel: IPv6: ADDRCONF(NETDEV_CHANGE): eno1: link becomes ready
Jan 20 18:34:41 tongfang-amd kernel: amdgpu_cs_ioctl: 32 callbacks suppressed
Jan 20 18:34:41 tongfang-amd kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -2!
Jan 20 18:35:04 tongfang-amd kernel: GpuWatchdog[5798]: segfault at 0 ip 00007f937fc31fa6 sp 00007f9377719030 error 6 in libcef.so[7f937b924000+75cd000]
Jan 20 18:35:04 tongfang-amd kernel: Code: 89 de e8 cd 2f 63 ff 80 7d cf 00 79 09 48 8b 7d b8 e8 0e 1f 5f fe 41 8b 84 24 e0 00 00 00 89 45 b8 48 8d 7d b8 e8 9a 2e cf fb <c7> 04 25 00 00 00 00 37 13 00 00 48 83 c4 38 5b 41 5c 41 5d 41 5e
Jan 20 18:35:04 tongfang-amd kernel: traps: Chrome_IOThread[5714] trap int3 ip:7f7d000ebff4 sp:7f7ce579f0d0 error:0 in libcef.so[7f7cfd87b000+75cd000]
Jan 20 18:35:11 tongfang-amd kernel: GpuWatchdog[3689]: segfault at 0 ip 000055734be3c107 sp 00007f729aa49430 error 6 in signal-desktop[557348c5b000+53d6000]
Jan 20 18:35:11 tongfang-amd kernel: Code: 7d b7 00 79 09 48 8b 7d a0 e8 35 52 d3 fe 8b 83 00 01 00 00 85 c0 0f 84 91 00 00 00 48 8b 03 48 89 df be 01 00 00 00 ff 50 68 <c7> 04 25 00 00 00 00 37 13 00 00 c6 05 17 bc 6f 02 01 80 7d 87 00
Jan 20 18:35:12 tongfang-amd kernel: traps: Chrome_IOThread[3495] trap int3 ip:55d7f536ecd3 sp:7f118e520520 error:0 in signal-desktop[55d7f34eb000+53d6000]
Another one from today on Linux 5.10.7
...
Jan 24 09:59:15 tongfang-amd kernel: IPv6: ADDRCONF(NETDEV_CHANGE): eno1: link becomes ready
Jan 24 12:16:03 tongfang-amd kernel: amdgpu_cs_ioctl: 79 callbacks suppressed
Jan 24 12:16:03 tongfang-amd kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -2!
This is on Linux 5.10.9.
Jan 27 13:11:36 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:11:36 tongfang-amd kernel: retire_capture_urb: 79 callbacks suppressed
Jan 27 13:11:44 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:11:57 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:11:57 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:11:57 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:11:57 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:12:09 tongfang-amd kernel: xhci_hcd 0000:04:00.4: ERROR unknown event type 37
Jan 27 13:12:09 tongfang-amd kernel: retire_capture_urb: 17 callbacks suppressed
Jan 27 13:12:39 tongfang-amd kernel: amdgpu_cs_ioctl: 1 callbacks suppressed
Jan 27 13:12:39 tongfang-amd kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -2!
Jan 27 13:12:59 tongfang-amd kernel: GpuWatchdog[11742]: segfault at 0 ip 00007f48720a2fa6 sp 00007f4869b8a030 error 6 in libcef.so[7f486dd95000+75cd000]
Jan 27 13:12:59 tongfang-amd kernel: Code: 89 de e8 cd 2f 63 ff 80 7d cf 00 79 09 48 8b 7d b8 e8 0e 1f 5f fe 41 8b 84 24 e0 00 00 00 89 45 b8 48 8d 7d b8 e8 9a 2e cf fb <c7> 04 25 00 00 00 00 37 13 00 00 48 83 c4 38 5b 41 5c 41 5d 41 5e
Jan 27 13:13:05 tongfang-amd kernel: GpuWatchdog[1228]: segfault at 0 ip 000055c760a18107 sp 00007f07ffa1c430 error 6 in signal-desktop[55c75d837000+53d6000]
Jan 27 13:13:05 tongfang-amd kernel: Code: 7d b7 00 79 09 48 8b 7d a0 e8 35 52 d3 fe 8b 83 00 01 00 00 85 c0 0f 84 91 00 00 00 48 8b 03 48 89 df be 01 00 00 00 ff 50 68 <c7> 04 25 00 00 00 00 37 13 00 00 c6 05 17 bc 6f 02 01 80 7d 87 00
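The "error 6" in the segfault lines is the x86 page-fault error code. A small sketch of how its low three bits break down (decode_pf_error is an illustrative name, not a function from any library):

```python
def decode_pf_error(code):
    """Decode the low bits of the x86 page-fault error code printed in 'segfault at ... error N'."""
    return {
        "protection_violation": bool(code & 0x1),  # False: the page was not present
        "write": bool(code & 0x2),                 # False: the access was a read
        "user_mode": bool(code & 0x4),             # False: the fault happened in kernel mode
    }


info = decode_pf_error(6)
print(info)
```

For error 6 this gives a user-mode write to a page that is not mapped, which fits "segfault at 0": a write through a null pointer. The marked instruction bytes (c7 04 25 00 00 00 00 37 13 00 00) decode to a store of 0x1337 to address 0, so this may well be the GPU watchdog crashing its process on purpose after the GPU stopped responding, rather than an independent bug in those applications.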
I've also hit the same error twice so far on Gentoo with recent kernels (over roughly the last two months). The latest one occurred today. Since you've reported this issue to some bug trackers, I guess my feedback might be useful too.
It looks like a generic driver issue, since my GPU is different from yours (previous generation).
Kernel: 5.10.15-gentoo
GPU: AMD Radeon RX 480
In my case, I don't use suspend, and it triggers quite rarely (so perhaps suspend increases the chance of triggering the bug?). The mouse cursor still moves, but other than that the desktop locks up completely. Kernel hotkeys such as Ctrl+Alt+F1 also stop working (the equivalent command chvt 1 also hangs indefinitely), which prevents attempting any fix from the frozen machine directly. But since I have ssh on the machine, I am still able to interact with it. The hanging chvt seems to indicate that a kernel ioctl hangs completely, which also suggests that not only userspace is broken: something in the kernel seems to lock up too.
Also, no page faults are logged on my installation when the desktop freezes.
There is a way out of the lock-up, at the cost of killing all running graphical processes. Not great, but if you have something running on the machine in an ssh session or tmux that you want to keep running, it might be better than a hard reboot. Over ssh, you can run:
loginctl terminate-session <your GUI session ID>
This will try to gracefully terminate the userspace session. As soon as it has finished, you can restart the X Window System and the machine should unfreeze.
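To find the session ID to pass to loginctl terminate-session, something along these lines could be run over ssh. This is only a sketch: it assumes a systemd-logind system where loginctl supports list-sessions --no-legend and show-session -p Type --value, and it treats sessions of type x11 or wayland as the GUI ones. The helper names are made up for illustration.

```python
import subprocess


def _loginctl(args):
    """Run loginctl with the given arguments and return its stdout as text."""
    result = subprocess.run(
        ["loginctl", *args], capture_output=True, text=True, check=True
    )
    return result.stdout


def graphical_sessions(run=_loginctl):
    """Return the IDs of sessions whose Type is x11 or wayland.

    `run` is injectable so the parsing can be exercised without loginctl.
    """
    listing = run(["list-sessions", "--no-legend"])
    # The session ID is the first whitespace-separated column of each row.
    ids = [line.split()[0] for line in listing.splitlines() if line.strip()]
    return [
        sid
        for sid in ids
        if run(["show-session", sid, "-p", "Type", "--value"]).strip()
        in ("x11", "wayland")
    ]
```

Calling graphical_sessions() on the frozen machine should list the candidate IDs; which one to terminate is still a manual decision.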
Actually, it's possible (though I have no proof yet) that this is caused by a single program running in the GUI session, and that killing it would unfreeze the desktop. So perhaps GUI processes could be killed one by one over SSH to see whether the desktop unfreezes (I will try this on the next freeze).
A couple of years ago there was a bug in Mesa git master where GUI Java programs caused the desktop to freeze. Killing the Java process was sufficient to unfreeze the desktop without terminating the entire GUI session. Now that I think about it, during both freezes I had a JetBrains IDE (which is Java-based) running, though it might be a coincidence and unrelated to this particular bug.
@tmp6154 I reported the issue here and I've been pointed to a Mesa bug that might be related. There was a PR (https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9006) allegedly fixing the issue.
I'm now running Linux Kernel 5.11.0 with a Mesa build from master that includes this fix, but since I've been traveling I had to turn off the laptop a couple of times and couldn't test it properly. I will start testing it this Sunday.
@gvolpe Oh, that's great! I will try the fixed versions too (both Mesa and the newer kernel) and test them.
I bumped into a different issue while testing Linux Kernel 5.11.0 and a Mesa version with that fix.
Mar 02 10:09:32 tongfang-amd kernel: amdgpu 0000:04:00.0: amdgpu: 000000002db3ee17 pin failed
Mar 02 10:09:32 tongfang-amd kernel: [drm:dm_plane_helper_prepare_fb [amdgpu]] *ERROR* Failed to pin framebuffer with error -12
I reported it here: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4381
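The numbers at the end of these driver messages (-12 in the pin failure, -2 in the earlier "Failed to initialize parser" lines) look like negated errno values, which is the usual kernel convention. Assuming that reading is right, a quick way to put names on them (kernel_error_name is a hypothetical helper):

```python
import errno


def kernel_error_name(code):
    """Map a negative kernel return code such as -12 to its errno symbol."""
    return errno.errorcode.get(abs(code), "unknown")


for code in (-2, -12):
    print(code, kernel_error_name(code))
```

On Linux this maps -2 to ENOENT and -12 to ENOMEM, so the framebuffer pin failure would be an out-of-memory condition and not the same error as the parser failure.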
This is with Linux kernel 5.10.4 and vulkan disabled.