Bug 194843 - [amdgpu] oops [drm:gfx_v8_0_priv_reg_irq] *ERROR* Illegal register access in command stream
Summary: [amdgpu] oops [drm:gfx_v8_0_priv_reg_irq] *ERROR* Illegal register access in ...
Status: RESOLVED INVALID
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel) (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: drivers_video-dri
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-03-10 20:34 UTC by Johannes Hirte
Modified: 2017-06-21 19:04 UTC (History)
1 user (show)

See Also:
Kernel Version: 4.11.0-rc1
Subsystem:
Regression: No
Bisected commit-id:


Attachments
dmesg-4.10.0 (68.95 KB, text/plain)
2017-03-10 21:05 UTC, Johannes Hirte
Details
Xorg.0.log (29.75 KB, text/plain)
2017-03-10 21:05 UTC, Johannes Hirte
Details

Description Johannes Hirte 2017-03-10 20:34:49 UTC
With kernel 4.11-rc1 I get occasional systems hangs, where only Magic Sys-Req keys help. I'm sure I've hit those hangs first when testing drm-next-4.12-wip branch, but never got some good logs. I've caught the oops for the first time in the log:

Mar  6 15:11:17 probook kernel: WARNING: CPU: 0 PID: 861 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id+0x73f/0x760
Mar  6 15:11:17 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb irqbypass iwlmvm btrtl aesni_intel crypto_si
md cryptd mac80211 glue_helper btbcm btintel fam15h_power snd_hda_codec_conexant snd_hda_codec_generic bluetooth snd_hda_codec_hdmi k10temp iwlwifi i2c_piix4 snd_hda_intel snd_hda_codec snd_hwdep cfg80211 snd_hda_core sn
d_pcm rfkill rtsx_pci mfd_core snd_timer snd soundcore wmi i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linea
r raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar  6 15:11:17 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4te
ch xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar  6 15:11:17 probook kernel: CPU: 0 PID: 861 Comm: gfx Tainted: G        W       4.11.0-rc1 #104
Mar  6 15:11:17 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar  6 15:11:17 probook kernel: Call Trace:
Mar  6 15:11:17 probook kernel:  dump_stack+0x4f/0x73
Mar  6 15:11:17 probook kernel:  __warn+0xc6/0xe0
Mar  6 15:11:17 probook kernel:  warn_slowpath_null+0x18/0x20
Mar  6 15:11:17 probook kernel:  amdgpu_vm_grab_id+0x73f/0x760
Mar  6 15:11:17 probook kernel:  ? dma_fence_wait_timeout+0x110/0x110
Mar  6 15:11:17 probook kernel:  amdgpu_job_dependency+0x5a/0x90
Mar  6 15:11:17 probook kernel:  amd_sched_main+0xa6/0x4c0
Mar  6 15:11:17 probook kernel:  ? wake_atomic_t_function+0x50/0x50
Mar  6 15:11:17 probook kernel:  kthread+0xfc/0x130
Mar  6 15:11:17 probook kernel:  ? amd_sched_process_job+0xe0/0xe0
Mar  6 15:11:17 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar  6 15:11:17 probook kernel:  ? umh_complete+0x40/0x40
Mar  6 15:11:17 probook kernel:  ? call_usermodehelper_exec_async+0x137/0x140
Mar  6 15:11:17 probook kernel:  ret_from_fork+0x29/0x40
Mar  6 15:11:17 probook kernel: ---[ end trace efcce4a47ec23c92 ]---
Mar  6 15:11:17 probook kernel: ------------[ cut here ]------------
Mar  6 15:11:17 probook kernel: WARNING: CPU: 0 PID: 861 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id+0x73f/0x760
Mar  6 15:11:17 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb irqbypass iwlmvm btrtl aesni_intel crypto_si
md cryptd mac80211 glue_helper btbcm btintel fam15h_power snd_hda_codec_conexant snd_hda_codec_generic bluetooth snd_hda_codec_hdmi k10temp iwlwifi i2c_piix4 snd_hda_intel snd_hda_codec snd_hwdep cfg80211 snd_hda_core sn
d_pcm rfkill rtsx_pci mfd_core snd_timer snd soundcore wmi i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linea
r raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar  6 15:11:17 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4te
ch xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar  6 15:11:17 probook kernel: CPU: 0 PID: 861 Comm: gfx Tainted: G        W       4.11.0-rc1 #104
Mar  6 15:11:17 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar  6 15:11:17 probook kernel: Call Trace:
Mar  6 15:11:17 probook kernel:  dump_stack+0x4f/0x73
Mar  6 15:11:17 probook kernel:  __warn+0xc6/0xe0
Mar  6 15:11:17 probook kernel:  warn_slowpath_null+0x18/0x20
Mar  6 15:11:17 probook kernel:  amdgpu_vm_grab_id+0x73f/0x760
Mar  6 15:11:17 probook kernel:  ? dma_fence_wait_timeout+0x110/0x110
Mar  6 15:11:17 probook kernel:  amdgpu_job_dependency+0x5a/0x90
Mar  6 15:11:17 probook kernel:  amd_sched_main+0xa6/0x4c0
Mar  6 15:11:17 probook kernel:  ? wake_atomic_t_function+0x50/0x50
Mar  6 15:11:17 probook kernel:  kthread+0xfc/0x130
Mar  6 15:11:17 probook kernel:  ? amd_sched_process_job+0xe0/0xe0
Mar  6 15:11:17 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar  6 15:11:17 probook kernel:  ? umh_complete+0x40/0x40
Mar  6 15:11:17 probook kernel:  ? call_usermodehelper_exec_async+0x137/0x140
Mar  6 15:11:17 probook kernel:  ret_from_fork+0x29/0x40
Mar  6 15:11:17 probook kernel: ---[ end trace efcce4a47ec23c93 ]---
Mar  6 15:11:17 probook kernel: [drm:gfx_v8_0_priv_reg_irq] *ERROR* Illegal register access in command stream
Mar  6 15:11:17 probook kernel: [drm] IP block:gfx_v8_0 is hung!
Mar  6 15:11:17 probook kernel: BUG: unable to handle kernel NULL pointer dereference at 000000000000001c
Mar  6 15:11:17 probook kernel: IP: kthread_park+0x7/0x90
Mar  6 15:11:17 probook kernel: PGD 0
Mar  6 15:11:17 probook kernel: 
Mar  6 15:11:17 probook kernel: Oops: 0000 [#1] PREEMPT SMP
Mar  6 15:11:17 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb irqbypass iwlmvm btrtl aesni_intel crypto_si
md cryptd mac80211 glue_helper btbcm btintel fam15h_power snd_hda_codec_conexant snd_hda_codec_generic bluetooth snd_hda_codec_hdmi k10temp iwlwifi i2c_piix4 snd_hda_intel snd_hda_codec snd_hwdep cfg80211 snd_hda_core sn
d_pcm rfkill rtsx_pci mfd_core snd_timer snd soundcore wmi i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linea
r raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar  6 15:11:17 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4te
ch xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar  6 15:11:17 probook kernel: CPU: 0 PID: 13409 Comm: kworker/0:0 Tainted: G        W       4.11.0-rc1 #104
Mar  6 15:11:17 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar  6 15:11:17 probook kernel: Workqueue: events amdgpu_irq_reset_work_func
Mar  6 15:11:17 probook kernel: task: ffff88004139ea00 task.stack: ffff88003034c000
Mar  6 15:11:17 probook kernel: RIP: 0010:kthread_park+0x7/0x90
Mar  6 15:11:17 probook kernel: RSP: 0018:ffff88003034fda8 EFLAGS: 00010286
Mar  6 15:11:17 probook kernel: RAX: 0000000000000000 RBX: ffff88013551aa30 RCX: 0000000000000001
Mar  6 15:11:17 probook kernel: RDX: 0000000080000001 RSI: 0000000000000004 RDI: 0000000000000000
Mar  6 15:11:17 probook kernel: RBP: ffff88003034fdb8 R08: ffff88013a000000 R09: 0000000000000000
Mar  6 15:11:17 probook kernel: R10: 0000000000000040 R11: ffff88013a000028 R12: ffff880135518000
Mar  6 15:11:17 probook kernel: R13: ffff88013551aab0 R14: ffff88013551c550 R15: ffff88013551aa30
Mar  6 15:11:17 probook kernel: FS:  0000000000000000(0000) GS:ffff88013ec00000(0000) knlGS:0000000000000000
Mar  6 15:11:17 probook kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  6 15:11:17 probook kernel: CR2: 000000000000001c CR3: 0000000134cbf000 CR4: 00000000001406f0
Mar  6 15:11:17 probook kernel: Call Trace:
Mar  6 15:11:17 probook kernel:  amdgpu_gpu_reset+0x86/0x620
Mar  6 15:11:17 probook kernel:  ? kfree+0x16f/0x180
Mar  6 15:11:17 probook kernel:  ? amdgpu_job_free_cb+0x42/0x70
Mar  6 15:11:17 probook kernel:  amdgpu_irq_reset_work_func+0xd/0x10
Mar  6 15:11:17 probook kernel:  process_one_work+0x1ee/0x4a0
Mar  6 15:11:17 probook kernel:  worker_thread+0x43/0x4e0
Mar  6 15:11:17 probook kernel:  kthread+0xfc/0x130
Mar  6 15:11:17 probook kernel:  ? process_one_work+0x4a0/0x4a0
Mar  6 15:11:17 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar  6 15:11:17 probook kernel:  ret_from_fork+0x29/0x40
Mar  6 15:11:17 probook kernel: Code: 76 ac 81 e8 6c 1c fe ff 48 8b bb 40 05 00 00 e8 50 ff ff ff 5b 5d c3 0f 1f 00 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 41 54 53 <8b> 47 1c 49 89 fc a9 00 00 20 00 74 58 a8 04 49 8b 9c 24 40 05 
Mar  6 15:11:17 probook kernel: RIP: kthread_park+0x7/0x90 RSP: ffff88003034fda8
Mar  6 15:11:17 probook kernel: CR2: 000000000000001c
Mar  6 15:11:17 probook kernel: ---[ end trace efcce4a47ec23c94 ]---
Mar  6 15:11:32 probook kwin_x11[4127]: Freeze in OpenGL initialization detected
Mar  6 15:13:11 probook systemd-logind[3809]: Lid closed.
Mar  6 15:13:24 probook systemd-logind[3809]: Lid opened.
Mar  6 15:13:37 probook systemd-logind[3809]: Power key pressed.

Today I had another hang without an oops but AMD-Vi message in the log:

Mar 10 21:09:30 probook kernel: ------------[ cut here ]------------
Mar 10 21:09:30 probook kernel: WARNING: CPU: 3 PID: 873 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id+0x73f/0x760
Mar 10 21:09:30 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb btrtl irqbypass btbcm btintel iwlmvm aesni_i
ntel mac80211 crypto_simd cryptd glue_helper bluetooth iwlwifi fam15h_power k10temp cfg80211 snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 snd_hda_codec_hdmi rfkill snd_hda_intel rtsx_pci snd_hda_codec mfd_core 
snd_hwdep wmi snd_hda_core snd_pcm snd_timer snd soundcore i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linea
r raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar 10 21:09:30 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4te
ch xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar 10 21:09:30 probook kernel: CPU: 3 PID: 873 Comm: sdma0 Not tainted 4.11.0-rc1 #104
Mar 10 21:09:30 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 10 21:09:30 probook kernel: Call Trace:
Mar 10 21:09:30 probook kernel:  dump_stack+0x4f/0x73
Mar 10 21:09:30 probook kernel:  __warn+0xc6/0xe0
Mar 10 21:09:30 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 10 21:09:30 probook kernel:  amdgpu_vm_grab_id+0x73f/0x760
Mar 10 21:09:30 probook kernel:  ? dma_fence_wait_timeout+0x110/0x110
Mar 10 21:09:30 probook kernel:  amdgpu_job_dependency+0x5a/0x90
Mar 10 21:09:30 probook kernel:  amd_sched_main+0xa6/0x4c0
Mar 10 21:09:30 probook kernel:  ? wake_atomic_t_function+0x50/0x50
Mar 10 21:09:30 probook kernel:  kthread+0xfc/0x130
Mar 10 21:09:30 probook kernel:  ? amd_sched_process_job+0xe0/0xe0
Mar 10 21:09:30 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar 10 21:09:30 probook kernel:  ret_from_fork+0x29/0x40
Mar 10 21:09:30 probook kernel: ---[ end trace 2c00b1592d9c3307 ]---
Mar 10 21:09:59 probook kernel: ------------[ cut here ]------------
Mar 10 21:09:59 probook kernel: WARNING: CPU: 2 PID: 864 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id+0x73f/0x760
Mar 10 21:09:59 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb btrtl irqbypass btbcm btintel iwlmvm aesni_intel mac80211 crypto_simd cryptd glue_helper bluetooth iwlwifi fam15h_power k10temp cfg80211 snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 snd_hda_codec_hdmi rfkill snd_hda_intel rtsx_pci snd_hda_codec mfd_core snd_hwdep wmi snd_hda_core snd_pcm snd_timer snd soundcore i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linear raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar 10 21:09:59 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4tech xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar 10 21:09:59 probook kernel: CPU: 2 PID: 864 Comm: gfx Tainted: G        W       4.11.0-rc1 #104
Mar 10 21:09:59 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 10 21:09:59 probook kernel: Call Trace:
Mar 10 21:09:59 probook kernel:  dump_stack+0x4f/0x73
Mar 10 21:09:59 probook kernel:  __warn+0xc6/0xe0
Mar 10 21:09:59 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 10 21:09:59 probook kernel:  amdgpu_vm_grab_id+0x73f/0x760
Mar 10 21:09:59 probook kernel:  ? dma_fence_wait_timeout+0x110/0x110
Mar 10 21:09:59 probook kernel:  amdgpu_job_dependency+0x5a/0x90
Mar 10 21:09:59 probook kernel:  amd_sched_main+0xa6/0x4c0
Mar 10 21:09:59 probook kernel:  ? wake_atomic_t_function+0x50/0x50
Mar 10 21:09:59 probook kernel:  kthread+0xfc/0x130
Mar 10 21:09:59 probook kernel:  ? amd_sched_process_job+0xe0/0xe0
Mar 10 21:09:59 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar 10 21:09:59 probook kernel:  ? umh_complete+0x40/0x40
Mar 10 21:09:59 probook kernel:  ? call_usermodehelper_exec_async+0x137/0x140
Mar 10 21:09:59 probook kernel:  ret_from_fork+0x29/0x40
Mar 10 21:09:59 probook kernel: ---[ end trace 2c00b1592d9c3308 ]---
Mar 10 21:09:59 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:09:59 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:09:59 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:09:59 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:00 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:00 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:00 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:00 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459800]
Mar 10 21:10:00 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:00 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:00 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:00 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:01 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:01 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:01 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:01 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x00000001394598c0]
Mar 10 21:10:01 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:01 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:01 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:01 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:02 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:02 probook crond[12952]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons)
Mar 10 21:10:02 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:02 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:02 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459980]
Mar 10 21:10:02 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:02 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:02 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:03 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:03 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:03 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:03 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459a40]
Mar 10 21:10:03 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:03 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:03 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:04 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:04 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:04 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:04 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459b00]
Mar 10 21:10:04 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:04 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:04 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:05 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:05 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:05 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:05 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459bc0]
Mar 10 21:10:05 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:05 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:05 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:06 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:06 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:06 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:06 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459c80]
Mar 10 21:10:06 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:06 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:06 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:07 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:07 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:07 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:07 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:07 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459d40]
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:08 probook kernel: ------------[ cut here ]------------
Mar 10 21:10:08 probook kernel: WARNING: CPU: 2 PID: 0 at drivers/iommu/amd_iommu.c:1256 __domain_flush_pages+0x1d2/0x200
Mar 10 21:10:08 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb btrtl irqbypass btbcm btintel iwlmvm aesni_intel mac80211 crypto_simd cryptd glue_helper bluetooth iwlwifi fam15h_power k10temp cfg80211 snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 snd_hda_codec_hdmi rfkill snd_hda_intel rtsx_pci snd_hda_codec mfd_core snd_hwdep wmi snd_hda_core snd_pcm snd_timer snd soundcore i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linear raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar 10 21:10:08 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4tech xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar 10 21:10:08 probook kernel: CPU: 2 PID: 0 Comm: swapper/2 Tainted: G        W       4.11.0-rc1 #104
Mar 10 21:10:08 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 10 21:10:08 probook kernel: Call Trace:
Mar 10 21:10:08 probook kernel:  <IRQ>
Mar 10 21:10:08 probook kernel:  dump_stack+0x4f/0x73
Mar 10 21:10:08 probook kernel:  __warn+0xc6/0xe0
Mar 10 21:10:08 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 10 21:10:08 probook kernel:  __domain_flush_pages+0x1d2/0x200
Mar 10 21:10:08 probook kernel:  __queue_flush+0x46/0xc0
Mar 10 21:10:08 probook kernel:  ? queue_flush_all+0x90/0x90
Mar 10 21:10:08 probook kernel:  queue_flush_all+0x70/0x90
Mar 10 21:10:08 probook kernel:  queue_flush_timeout+0x13/0x20
Mar 10 21:10:08 probook kernel:  call_timer_fn+0x30/0x160
Mar 10 21:10:08 probook kernel:  ? queue_flush_all+0x90/0x90
Mar 10 21:10:08 probook kernel:  run_timer_softirq+0x1e8/0x450
Mar 10 21:10:08 probook kernel:  ? lapic_next_event+0x18/0x20
Mar 10 21:10:08 probook kernel:  ? clockevents_program_event+0x7a/0x120
Mar 10 21:10:08 probook kernel:  __do_softirq+0x104/0x2d0
Mar 10 21:10:08 probook kernel:  irq_exit+0xa0/0xb0
Mar 10 21:10:08 probook kernel:  smp_apic_timer_interrupt+0x38/0x50
Mar 10 21:10:08 probook kernel:  apic_timer_interrupt+0x86/0x90
Mar 10 21:10:08 probook kernel: RIP: 0010:acpi_safe_halt+0x16/0x19
Mar 10 21:10:08 probook kernel: RSP: 0018:ffff880139fa3e10 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff10
Mar 10 21:10:08 probook kernel: RAX: 0000000000000000 RBX: 0000000000000001 RCX: 000000000000001f
Mar 10 21:10:08 probook kernel: RDX: ffff88013ed00001 RSI: ffffffff81b287c5 RDI: ffff88013994ec64
Mar 10 21:10:08 probook kernel: RBP: ffff880139fa3e10 R08: 00000000000003d2 R09: 0000000000000018
Mar 10 21:10:08 probook kernel: R10: 0000000000000312 R11: 00000000000003a7 R12: ffff88013994ec64
Mar 10 21:10:08 probook kernel: R13: ffff88013994ec00 R14: 0000000000000001 R15: ffff8801394b3c00
Mar 10 21:10:08 probook kernel:  </IRQ>
Mar 10 21:10:08 probook kernel:  acpi_idle_do_entry+0x1b/0x37
Mar 10 21:10:08 probook kernel:  acpi_idle_enter+0x1be/0x1e4
Mar 10 21:10:08 probook kernel:  cpuidle_enter_state+0xed/0x2e0
Mar 10 21:10:08 probook kernel:  cpuidle_enter+0x12/0x20
Mar 10 21:10:08 probook kernel:  call_cpuidle+0x1e/0x30
Mar 10 21:10:08 probook kernel:  do_idle+0x183/0x1e0
Mar 10 21:10:08 probook kernel:  cpu_startup_entry+0x18/0x20
Mar 10 21:10:08 probook kernel:  start_secondary+0xf0/0x100
Mar 10 21:10:08 probook kernel:  start_cpu+0x14/0x14
Mar 10 21:10:08 probook kernel: ---[ end trace 2c00b1592d9c3309 ]---
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:08 probook kernel: ------------[ cut here ]------------
Mar 10 21:10:08 probook kernel: WARNING: CPU: 2 PID: 0 at drivers/iommu/amd_iommu.c:1256 __domain_flush_pages+0x1d2/0x200
Mar 10 21:10:08 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb btrtl irqbypass btbcm btintel iwlmvm aesni_intel mac80211 crypto_simd cryptd glue_helper bluetooth iwlwifi fam15h_power k10temp cfg80211 snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 snd_hda_codec_hdmi rfkill snd_hda_intel rtsx_pci snd_hda_codec mfd_core snd_hwdep wmi snd_hda_core snd_pcm snd_timer snd soundcore i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linear raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar 10 21:10:08 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4tech xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar 10 21:10:08 probook kernel: CPU: 2 PID: 0 Comm: swapper/2 Tainted: G        W       4.11.0-rc1 #104
Mar 10 21:10:08 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 10 21:10:08 probook kernel: Call Trace:
Mar 10 21:10:08 probook kernel:  <IRQ>
Mar 10 21:10:08 probook kernel:  dump_stack+0x4f/0x73
Mar 10 21:10:08 probook kernel:  __warn+0xc6/0xe0
Mar 10 21:10:08 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 10 21:10:08 probook kernel:  __domain_flush_pages+0x1d2/0x200
Mar 10 21:10:08 probook kernel:  __queue_flush+0x46/0xc0
Mar 10 21:10:08 probook kernel:  ? queue_flush_all+0x90/0x90
Mar 10 21:10:08 probook kernel:  queue_flush_all+0x70/0x90
Mar 10 21:10:08 probook kernel:  queue_flush_timeout+0x13/0x20
Mar 10 21:10:08 probook kernel:  call_timer_fn+0x30/0x160
Mar 10 21:10:08 probook kernel:  ? queue_flush_all+0x90/0x90
Mar 10 21:10:08 probook kernel:  run_timer_softirq+0x1e8/0x450
Mar 10 21:10:08 probook kernel:  ? lapic_next_event+0x18/0x20
Mar 10 21:10:08 probook kernel:  ? clockevents_program_event+0x7a/0x120
Mar 10 21:10:08 probook kernel:  __do_softirq+0x104/0x2d0
Mar 10 21:10:08 probook kernel:  irq_exit+0xa0/0xb0
Mar 10 21:10:08 probook kernel:  smp_apic_timer_interrupt+0x38/0x50
Mar 10 21:10:08 probook kernel:  apic_timer_interrupt+0x86/0x90
Mar 10 21:10:08 probook kernel: RIP: 0010:acpi_safe_halt+0x16/0x19
Mar 10 21:10:08 probook kernel: RSP: 0018:ffff880139fa3e10 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff10
Mar 10 21:10:08 probook kernel: RAX: 0000000000000000 RBX: 0000000000000001 RCX: 000000000000001f
Mar 10 21:10:08 probook kernel: RDX: ffff88013ed00001 RSI: ffffffff81b287c5 RDI: ffff88013994ec64
Mar 10 21:10:08 probook kernel: RBP: ffff880139fa3e10 R08: 00000000000003d2 R09: 0000000000000018
Mar 10 21:10:08 probook kernel: R10: 0000000000000312 R11: 00000000000003a7 R12: ffff88013994ec64
Mar 10 21:10:08 probook kernel: R13: ffff88013994ec00 R14: 0000000000000001 R15: ffff8801394b3c00
Mar 10 21:10:08 probook kernel:  </IRQ>
Mar 10 21:10:08 probook kernel:  acpi_idle_do_entry+0x1b/0x37
Mar 10 21:10:08 probook kernel:  acpi_idle_enter+0x1be/0x1e4
Mar 10 21:10:08 probook kernel:  cpuidle_enter_state+0xed/0x2e0
Mar 10 21:10:08 probook kernel:  cpuidle_enter+0x12/0x20
Mar 10 21:10:08 probook kernel:  call_cpuidle+0x1e/0x30
Mar 10 21:10:08 probook kernel:  do_idle+0x183/0x1e0
Mar 10 21:10:08 probook kernel:  cpu_startup_entry+0x18/0x20
Mar 10 21:10:08 probook kernel:  start_secondary+0xf0/0x100
Mar 10 21:10:08 probook kernel:  start_cpu+0x14/0x14
Mar 10 21:10:08 probook kernel: ---[ end trace 2c00b1592d9c330a ]---
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:08 probook kernel: ------------[ cut here ]------------
Mar 10 21:10:08 probook kernel: WARNING: CPU: 2 PID: 0 at drivers/iommu/amd_iommu.c:1256 __domain_flush_pages+0x1d2/0x200
Mar 10 21:10:08 probook kernel: Modules linked in: algif_hash algif_skcipher af_alg cmac rfcomm uhid bnep rtsx_pci_sdmmc mmc_core rtsx_pci_ms memstick hp_wmi kvm_amd kvm btusb btrtl irqbypass btbcm btintel iwlmvm aesni_intel mac80211 crypto_simd cryptd glue_helper bluetooth iwlwifi fam15h_power k10temp cfg80211 snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 snd_hda_codec_hdmi rfkill snd_hda_intel rtsx_pci snd_hda_codec mfd_core snd_hwdep wmi snd_hda_core snd_pcm snd_timer snd soundcore i2c_designware_platform i2c_designware_core hp_wireless efivarfs autofs4 xts aes_x86_64 sha512_generic macvlan r8169 mii fuse overlay xfs ext4 jbd2 mbcache linear raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx libcrc32c raid1 raid0 md_mod dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log
Mar 10 21:10:08 probook kernel:  dm_mod hid_sunplus hid_sony hid_samsung hid_pl hid_petalynx hid_monterey hid_microsoft hid_logitech hid_gyration hid_ezkey hid_cypress hid_chicony hid_cherry hid_belkin hid_apple hid_a4tech xhci_pci xhci_hcd ohci_pci ohci_hcd usb_storage ehci_pci ehci_hcd
Mar 10 21:10:08 probook kernel: CPU: 2 PID: 0 Comm: swapper/2 Tainted: G        W       4.11.0-rc1 #104
Mar 10 21:10:08 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 10 21:10:08 probook kernel: Call Trace:
Mar 10 21:10:08 probook kernel:  <IRQ>
Mar 10 21:10:08 probook kernel:  dump_stack+0x4f/0x73
Mar 10 21:10:08 probook kernel:  __warn+0xc6/0xe0
Mar 10 21:10:08 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 10 21:10:08 probook kernel:  __domain_flush_pages+0x1d2/0x200
Mar 10 21:10:08 probook kernel:  __queue_flush+0x46/0xc0
Mar 10 21:10:08 probook kernel:  ? queue_flush_all+0x90/0x90
Mar 10 21:10:08 probook kernel:  queue_flush_all+0x70/0x90
Mar 10 21:10:08 probook kernel:  queue_flush_timeout+0x13/0x20
Mar 10 21:10:08 probook kernel:  call_timer_fn+0x30/0x160
Mar 10 21:10:08 probook kernel:  ? queue_flush_all+0x90/0x90
Mar 10 21:10:08 probook kernel:  run_timer_softirq+0x1e8/0x450
Mar 10 21:10:08 probook kernel:  ? lapic_next_event+0x18/0x20
Mar 10 21:10:08 probook kernel:  ? clockevents_program_event+0x7a/0x120
Mar 10 21:10:08 probook kernel:  __do_softirq+0x104/0x2d0
Mar 10 21:10:08 probook kernel:  irq_exit+0xa0/0xb0
Mar 10 21:10:08 probook kernel:  smp_apic_timer_interrupt+0x38/0x50
Mar 10 21:10:08 probook kernel:  apic_timer_interrupt+0x86/0x90
Mar 10 21:10:08 probook kernel: RIP: 0010:acpi_safe_halt+0x16/0x19
Mar 10 21:10:08 probook kernel: RSP: 0018:ffff880139fa3e10 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff10
Mar 10 21:10:08 probook kernel: RAX: 0000000000000000 RBX: 0000000000000001 RCX: 000000000000001f
Mar 10 21:10:08 probook kernel: RDX: ffff88013ed00001 RSI: ffffffff81b287c5 RDI: ffff88013994ec64
Mar 10 21:10:08 probook kernel: RBP: ffff880139fa3e10 R08: 00000000000003d2 R09: 0000000000000018
Mar 10 21:10:08 probook kernel: R10: 0000000000000312 R11: 00000000000003a7 R12: ffff88013994ec64
Mar 10 21:10:08 probook kernel: R13: ffff88013994ec00 R14: 0000000000000001 R15: ffff8801394b3c00
Mar 10 21:10:08 probook kernel:  </IRQ>
Mar 10 21:10:08 probook kernel:  acpi_idle_do_entry+0x1b/0x37
Mar 10 21:10:08 probook kernel:  acpi_idle_enter+0x1be/0x1e4
Mar 10 21:10:08 probook kernel:  cpuidle_enter_state+0xed/0x2e0
Mar 10 21:10:08 probook kernel:  cpuidle_enter+0x12/0x20
Mar 10 21:10:08 probook kernel:  call_cpuidle+0x1e/0x30
Mar 10 21:10:08 probook kernel:  do_idle+0x183/0x1e0
Mar 10 21:10:08 probook kernel:  cpu_startup_entry+0x18/0x20
Mar 10 21:10:08 probook kernel:  start_secondary+0xf0/0x100
Mar 10 21:10:08 probook kernel:  start_cpu+0x14/0x14
Mar 10 21:10:08 probook kernel: ---[ end trace 2c00b1592d9c330b ]---
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:08 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:08 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459e00]
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:08 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:09 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:09 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459f10]
Mar 10 21:10:09 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:09 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf000 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf040 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf080 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf0c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf100 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf140 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf180 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf1c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf200 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf240 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf280 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf2c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf340 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf300 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf380 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf400 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf3c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf440 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf480 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf4c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf500 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf540 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf580 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf5c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf600 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf640 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf680 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf6c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf700 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf740 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf780 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf7c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf800 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf840 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf880 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf8c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf900 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf940 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf980 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bf9c0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfa00 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfa40 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfa80 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfac0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfb00 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfb40 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfb80 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfbc0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfc00 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfc40 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfc80 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfcc0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfd00 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfd40 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfd80 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfdc0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfe00 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfe40 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfe80 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bfec0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bff00 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bff40 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bff80 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:09 probook kernel: IO_PAGE_FAULT device=00:11.0 domain=0x0007 address=0x00000000f28bffc0 flags=0x0050]
Mar 10 21:10:09 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:10 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:10 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:10 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:10 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139459fd0]
Mar 10 21:10:10 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:10 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:10 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:11 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:11 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:11 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:11 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139458090]
Mar 10 21:10:11 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:11 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:11 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:12 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:12 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:12 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:12 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139458150]
Mar 10 21:10:12 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:12 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:12 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:13 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:13 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:13 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:13 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139458210]
Mar 10 21:10:13 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:13 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:13 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:14 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:14 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:14 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:14 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x00000001394582d0]
Mar 10 21:10:14 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:14 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:14 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:15 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:15 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:15 probook kernel: AMD-Vi: Event logged [
Mar 10 21:10:15 probook kernel: IOTLB_INV_TIMEOUT device=00:01.0 address=0x0000000139458390]
Mar 10 21:10:15 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:15 probook kernel: AMD-Vi: Completion-Wait loop timed out
Mar 10 21:10:15 probook kernel: AMD-Vi: Completion-Wait loop timed out

Sadly I don't have a workflow for reproducing this, so I can't bisect.
Comment 1 Alex Deucher 2017-03-10 20:40:10 UTC
what chip is this?  Please attach your xorg log and dmesg output.
Comment 2 Johannes Hirte 2017-03-10 21:05:35 UTC
Created attachment 255173 [details]
dmesg-4.10.0
Comment 3 Johannes Hirte 2017-03-10 21:05:57 UTC
Created attachment 255175 [details]
Xorg.0.log
Comment 4 Johannes Hirte 2017-03-10 21:08:08 UTC
it's a Carrizo, A10-8700B R6

Requested logs attached, both running kernel 4.10.0 at moment. Do you need them from 4.11-rc1?
Comment 5 Johannes Hirte 2017-03-12 11:26:23 UTC
Ok, it's not 4.11 specific. Now I had a system hang with 4.10.0 and found in the logs after reboot only this:

Mar 12 12:12:48 probook kernel: ------------[ cut here ]------------
Mar 12 12:12:48 probook kernel: WARNING: CPU: 1 PID: 872 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id+0x7ef/0x810
Mar 12 12:12:48 probook kernel: Modules linked in: uas usb_storage cmac rfcomm uhid bnep btusb btrtl btbcm btintel bluetooth hp_wmi kvm_amd kvm iwlmvm irqbypass mac80211 aesni_intel aes_x86_64 crypto_simd cryptd glue_hel
per fam15h_power snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 k10temp snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm iwlwifi rtsx_pci_ms snd_timer cfg80211 memstick snd rfkill r81
69 soundcore mii wmi i2c_designware_platform i2c_designware_core hp_wireless rtsx_pci_sdmmc mmc_core ehci_pci ehci_hcd xhci_pci xhci_hcd rtsx_pci mfd_core efivarfs autofs4
Mar 12 12:12:48 probook kernel: CPU: 1 PID: 872 Comm: sdma0 Not tainted 4.10.0 #91
Mar 12 12:12:48 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 12 12:12:48 probook kernel: Call Trace:
Mar 12 12:12:48 probook kernel:  dump_stack+0x4f/0x73
Mar 12 12:12:48 probook kernel:  __warn+0xc6/0xe0
Mar 12 12:12:48 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 12 12:12:48 probook kernel:  amdgpu_vm_grab_id+0x7ef/0x810
Mar 12 12:12:48 probook kernel:  ? dma_fence_wait_timeout+0x110/0x110
Mar 12 12:12:48 probook kernel:  amdgpu_job_dependency+0x5a/0x90
Mar 12 12:12:48 probook kernel:  amd_sched_main+0x9e/0x500
Mar 12 12:12:48 probook kernel:  ? wake_atomic_t_function+0x50/0x50
Mar 12 12:12:48 probook kernel:  kthread+0xfc/0x130
Mar 12 12:12:48 probook kernel:  ? amd_sched_process_job+0xe0/0xe0
Mar 12 12:12:48 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar 12 12:12:48 probook kernel:  ret_from_fork+0x29/0x40
Mar 12 12:12:48 probook kernel: ---[ end trace 4591763eee9b4ab4 ]---
Mar 12 12:12:48 probook kernel: ------------[ cut here ]------------
Mar 12 12:12:48 probook kernel: WARNING: CPU: 2 PID: 863 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id+0x7ef/0x810
Mar 12 12:12:48 probook kernel: Modules linked in: uas usb_storage cmac rfcomm uhid bnep btusb btrtl btbcm btintel bluetooth hp_wmi kvm_amd kvm iwlmvm irqbypass mac80211 aesni_intel aes_x86_64 crypto_simd cryptd glue_hel
per fam15h_power snd_hda_codec_conexant snd_hda_codec_generic i2c_piix4 k10temp snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm iwlwifi rtsx_pci_ms snd_timer cfg80211 memstick snd rfkill r81
69 soundcore mii wmi i2c_designware_platform i2c_designware_core hp_wireless rtsx_pci_sdmmc mmc_core ehci_pci ehci_hcd xhci_pci xhci_hcd rtsx_pci mfd_core efivarfs autofs4
Mar 12 12:12:48 probook kernel: CPU: 2 PID: 863 Comm: gfx Tainted: G        W       4.10.0 #91
Mar 12 12:12:48 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Mar 12 12:12:48 probook kernel: Call Trace:
Mar 12 12:12:48 probook kernel:  dump_stack+0x4f/0x73
Mar 12 12:12:48 probook kernel:  __warn+0xc6/0xe0
Mar 12 12:12:48 probook kernel:  warn_slowpath_null+0x18/0x20
Mar 12 12:12:48 probook kernel:  amdgpu_vm_grab_id+0x7ef/0x810
Mar 12 12:12:48 probook kernel:  ? dma_fence_wait_timeout+0x110/0x110
Mar 12 12:12:48 probook kernel:  amdgpu_job_dependency+0x5a/0x90
Mar 12 12:12:48 probook kernel:  amd_sched_main+0x9e/0x500
Mar 12 12:12:48 probook kernel:  ? wake_atomic_t_function+0x50/0x50
Mar 12 12:12:48 probook kernel:  kthread+0xfc/0x130
Mar 12 12:12:48 probook kernel:  ? amd_sched_process_job+0xe0/0xe0
Mar 12 12:12:48 probook kernel:  ? kthread_create_on_node+0x40/0x40
Mar 12 12:12:48 probook kernel:  ? umh_complete+0x40/0x40
Mar 12 12:12:48 probook kernel:  ? call_usermodehelper_exec_async+0x137/0x140
Mar 12 12:12:48 probook kernel:  ret_from_fork+0x29/0x40
Mar 12 12:12:48 probook kernel: ---[ end trace 4591763eee9b4ab5 ]---
Comment 6 Johannes Hirte 2017-04-18 19:40:50 UTC
Some more observation: It seems the hangs happen much more often/frequent with kernel 4.11 than with 4.10. Where 4.10 kernels running usually several days, I have a hang with 4.11 within a day.

Additionally I've found some of the 

WARNING: CPU: 1 PID: 872 at ./include/linux/dma-fence.h:349 amdgpu_vm_grab_id 

entries in the logs without a hang at this time. As far as I've seen this was always with a 4.10 kernel.
Comment 7 Michel Dänzer 2017-04-19 03:23:08 UTC
I wonder if there might be memory corruption going on, in which case enabling CONFIG_KASAN for the kernel build might give more clues.
Comment 8 Johannes Hirte 2017-04-24 12:49:56 UTC
(In reply to Michel Dänzer from comment #7)
> I wonder if there might be memory corruption going on, in which case
> enabling CONFIG_KASAN for the kernel build might give more clues.

I was testing the last days with KASAN enabled and didn't hit one hang or other BUG message in the logs. Today I've upgraded the RAM from one 4G module to two 8G modules and now the first hit directly after boot:

[  104.834811] wlp2s0: authenticate with 02:a0:f9:37:8e:a6
[  104.838674] ==================================================================
[  104.838715] BUG: KASAN: global-out-of-bounds in iwl_mvm_mac_ctxt_cmd_common+0x14b5/0x1610 [iwlmvm] at addr ffffffffa0d4a336
[  104.838724] Read of size 2 by task wpa_supplicant/4039
[  104.838739] Address belongs to variable iwl_drv_exit+0xf66f/0x339 [iwlwifi]
[  104.838750] CPU: 2 PID: 4039 Comm: wpa_supplicant Not tainted 4.11.0-rc7-kasan-00001-g73080f5e1d5b #171
[  104.838755] Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
[  104.838760] Call Trace:
[  104.838772]  dump_stack+0x4f/0x66
[  104.838781]  kasan_report+0x4da/0x510
[  104.838798]  ? iwl_mvm_mac_ctxt_cmd_common+0x14b5/0x1610 [iwlmvm]
[  104.838805]  ? update_curr+0x14b/0x490
[  104.838812]  ? wake_atomic_t_function+0x2b0/0x2b0
[  104.838819]  __asan_report_load2_noabort+0x14/0x20
[  104.838835]  iwl_mvm_mac_ctxt_cmd_common+0x14b5/0x1610 [iwlmvm]
[  104.838854]  ? iwl_mvm_channel_switch_noa_notif+0x40f/0x410 [iwlmvm]
[  104.838870]  ? iwl_mvm_mac_ctxt_send_beacon+0xcb0/0xcb0 [iwlmvm]
[  104.838885]  ? iwl_mvm_send_cmd_pdu+0x91/0xb0 [iwlmvm]
[  104.838901]  ? iwl_mvm_send_cmd+0x160/0x160 [iwlmvm]
[  104.838917]  iwl_mvm_mac_ctxt_cmd_sta+0xd1/0xe70 [iwlmvm]
[  104.838933]  ? iwl_mvm_mac_ctxt_cmd_common+0x1610/0x1610 [iwlmvm]
[  104.838949]  ? iwl_mvm_phy_ctxt_apply.constprop.3+0x31f/0x5d0 [iwlmvm]
[  104.838966]  ? iwl_mvm_ref_taken+0x150/0x150 [iwlmvm]
[  104.838982]  iwl_mvm_mac_ctx_send+0x68/0x110 [iwlmvm]
[  104.838996]  iwl_mvm_mac_ctxt_changed+0x68/0x180 [iwlmvm]
[  104.839011]  iwl_mvm_bss_info_changed+0x2f8/0xec0 [iwlmvm]
[  104.839043]  ieee80211_bss_info_change_notify+0x177/0x4c0 [mac80211]
[  104.839070]  ? __ieee80211_recalc_txpower+0x111/0x320 [mac80211]
[  104.839097]  ieee80211_assign_vif_chanctx+0x7ce/0xf80 [mac80211]
[  104.839123]  ieee80211_vif_use_channel+0x3ad/0x780 [mac80211]
[  104.839149]  ieee80211_prep_connection+0x55b/0x1cf0 [mac80211]
[  104.839174]  ? ieee80211_handle_bss_capability+0x220/0x220 [mac80211]
[  104.839182]  ? __kmalloc+0x126/0x220
[  104.839207]  ieee80211_mgd_auth+0x69d/0xdd0 [mac80211]
[  104.839232]  ? ieee80211_mlme_notify_scan_completed+0x1c0/0x1c0 [mac80211]
[  104.839261]  ieee80211_auth+0x13/0x20 [mac80211]
[  104.839291]  cfg80211_mlme_auth+0x2a7/0x6b0 [cfg80211]
[  104.839298]  ? unwind_get_return_address+0x1e0/0x1e0
[  104.839319]  ? cfg80211_rx_mgmt+0x710/0x710 [cfg80211]
[  104.839342]  ? parse_station_flags.isra.36+0x490/0x490 [cfg80211]
[  104.839363]  nl80211_authenticate+0x8f7/0xfe0 [cfg80211]
[  104.839385]  ? nl80211_parse_key+0xe70/0xe70 [cfg80211]
[  104.839406]  ? nl80211_pre_doit+0xcd/0x560 [cfg80211]
[  104.839414]  ? nla_parse+0xde/0x210
[  104.839422]  genl_family_rcv_msg+0x5c8/0x10f0
[  104.839429]  ? __alloc_skb+0x31f/0x560
[  104.839435]  ? genl_rcv+0x40/0x40
[  104.839443]  ? try_to_wake_up+0xb8/0x1080
[  104.839450]  ? alloc_skb_with_frags+0x8d/0x4c0
[  104.839458]  genl_rcv_msg+0x9b/0x120
[  104.839465]  netlink_rcv_skb+0x23b/0x340
[  104.839471]  ? genl_family_rcv_msg+0x10f0/0x10f0
[  104.839477]  genl_rcv+0x23/0x40
[  104.839483]  netlink_unicast+0x438/0x620
[  104.839489]  ? netlink_attachskb+0x640/0x640
[  104.839497]  netlink_sendmsg+0x86f/0xb60
[  104.839503]  ? netlink_broadcast+0x10/0x10
[  104.839510]  ? netlink_broadcast+0x10/0x10
[  104.839516]  sock_sendmsg+0xb5/0xf0
[  104.839522]  ___sys_sendmsg+0x6a2/0x8c0
[  104.839529]  ? ___sys_recvmsg+0x333/0x590
[  104.839535]  ? SYSC_sendto+0x300/0x300
[  104.839541]  ? sock_sendmsg+0xb5/0xf0
[  104.839547]  ? sock_write_iter+0x1e0/0x3b0
[  104.839553]  ? _raw_spin_unlock_irq+0x39/0x60
[  104.839559]  ? sock_sendmsg+0xf0/0xf0
[  104.839567]  ? __vfs_write+0x299/0x620
[  104.839573]  ? vfs_dedupe_get_page.isra.20+0x1d0/0x1d0
[  104.839580]  ? __fdget+0xe/0x10
[  104.839587]  __sys_sendmsg+0xc1/0x140
[  104.839592]  ? __sys_sendmsg+0xc1/0x140
[  104.839598]  ? SyS_shutdown+0x170/0x170
[  104.839605]  ? vfs_write+0x305/0x490
[  104.839613]  ? exit_to_usermode_loop+0x75/0xf0
[  104.839620]  SyS_sendmsg+0xd/0x20
[  104.839626]  entry_SYSCALL_64_fastpath+0x13/0x94
[  104.839632] RIP: 0033:0x7fb02a23fad7
[  104.839637] RSP: 002b:00007ffdd3d73b28 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
[  104.839645] RAX: ffffffffffffffda RBX: 000000000185daf0 RCX: 00007fb02a23fad7
[  104.839649] RDX: 0000000000000000 RSI: 00007ffdd3d73b80 RDI: 0000000000000006
[  104.839654] RBP: 00007fb02a4e6ae0 R08: 0000000000000000 R09: 00000000000000a6
[  104.839658] R10: 0000000001867d90 R11: 0000000000000246 R12: 0000000000000000
[  104.839663] R13: 0000000000000003 R14: 0000000000000011 R15: 000000000185d8c0
[  104.839669] Memory state around the buggy address:
[  104.839676]  ffffffffa0d4a200: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[  104.839682]  ffffffffa0d4a280: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[  104.839687] >ffffffffa0d4a300: 00 00 00 00 00 00 fa fa fa fa fa fa 00 00 00 00
[  104.839691]                                      ^
[  104.839696]  ffffffffa0d4a380: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[  104.839702]  ffffffffa0d4a400: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[  104.839705] ==================================================================
[  104.839709] Disabling lock debugging due to kernel taint
[  104.843536] wlp2s0: send auth to 02:a0:f9:37:8e:a6 (try 1/3)
[  104.849308] wlp2s0: authenticated
Comment 9 Johannes Hirte 2017-04-24 14:23:05 UTC
(In reply to Johannes Hirte from comment #8)
> (In reply to Michel Dänzer from comment #7)
> > I wonder if there might be memory corruption going on, in which case
> > enabling CONFIG_KASAN for the kernel build might give more clues.
> 
> I was testing the last days with KASAN enabled and didn't hit one hang or
> other BUG message in the logs. 

I have to correct this. Found in the logs three use-after-free from 

find_cpio_data

The most detailed was this one:

Apr 23 11:55:16 probook kernel: smpboot: Booting Node 0 Processor 1 APIC 0x11
Apr 23 11:55:16 probook kernel: ==================================================================
Apr 23 11:55:16 probook kernel: BUG: KASAN: use-after-free in find_cpio_data+0x4d8/0x570 at addr ffff880037991000
Apr 23 11:55:16 probook kernel: Read of size 1 by task swapper/1/0
Apr 23 11:55:16 probook kernel: CPU: 1 PID: 0 Comm: swapper/1 Not tainted 4.11.0-rc7-00006-g3e06d0af3e4b #164
Apr 23 11:55:16 probook kernel: Hardware name: HP HP ProBook 645 G2/80FE, BIOS N77 Ver. 01.07 11/01/2016
Apr 23 11:55:16 probook kernel: Call Trace:
Apr 23 11:55:16 probook kernel:  dump_stack+0x4f/0x66
Apr 23 11:55:16 probook kernel:  kasan_object_err+0x1c/0x70
Apr 23 11:55:16 probook kernel:  kasan_report+0x252/0x510
Apr 23 11:55:16 probook kernel:  ? find_cpio_data+0x4d8/0x570
Apr 23 11:55:16 probook kernel:  ? put_dec+0xb0/0xb0
Apr 23 11:55:16 probook kernel:  __asan_report_load1_noabort+0x14/0x20
Apr 23 11:55:16 probook kernel:  find_cpio_data+0x4d8/0x570
Apr 23 11:55:16 probook kernel:  ? dump_stack+0x66/0x66
Apr 23 11:55:16 probook kernel:  ? snprintf+0x87/0xb0
Apr 23 11:55:16 probook kernel:  ? vsprintf+0x20/0x20
Apr 23 11:55:16 probook kernel:  find_microcode_in_initrd+0x229/0x3c0
Apr 23 11:55:16 probook kernel:  ? get_builtin_firmware+0x5e/0x120
Apr 23 11:55:16 probook kernel:  __load_ucode_amd+0x11c/0x240
Apr 23 11:55:16 probook kernel:  ? clockevents_program_event+0x1a2/0x2c0
Apr 23 11:55:16 probook kernel:  ? apply_microcode_amd+0x3d0/0x3d0
Apr 23 11:55:16 probook kernel:  ? pick_next_task_fair+0x7a3/0xfe0
Apr 23 11:55:16 probook kernel:  ? pick_next_task_fair+0x7a3/0xfe0
Apr 23 11:55:16 probook kernel:  load_ucode_amd_ap+0x90/0x100
Apr 23 11:55:16 probook kernel:  ? load_ucode_amd_ap+0x90/0x100
Apr 23 11:55:16 probook kernel:  ? __load_ucode_amd+0x240/0x240
Apr 23 11:55:16 probook kernel:  ? flat_send_IPI_mask+0x2b/0x40
Apr 23 11:55:16 probook kernel:  ? sched_clock_cpu+0x1b/0x1e0
Apr 23 11:55:16 probook kernel:  ? default_send_IPI_single+0x77/0xa0
Apr 23 11:55:16 probook kernel:  load_ucode_ap+0x80/0x90
Apr 23 11:55:16 probook kernel:  cpu_init+0x7dc/0xd40
Apr 23 11:55:16 probook kernel:  ? smp_call_function_single+0xf7/0x340
Apr 23 11:55:16 probook kernel:  ? syscall_init+0x140/0x140
Apr 23 11:55:16 probook kernel:  ? debug_smp_processor_id+0x17/0x20
Apr 23 11:55:16 probook kernel:  ? native_play_dead+0xf2/0x120
Apr 23 11:55:16 probook kernel:  ? arch_cpu_idle_dead+0x28/0x40
Apr 23 11:55:16 probook kernel:  ? do_idle+0x206/0x2d0
Apr 23 11:55:16 probook kernel:  start_secondary+0x12/0x2c0
Apr 23 11:55:16 probook kernel:  ? start_secondary+0x12/0x2c0
Apr 23 11:55:16 probook kernel:  start_cpu+0x14/0x14
Apr 23 11:55:16 probook kernel: Object at ffff880037990f00, in cache kmalloc-512 size: 512
Apr 23 11:55:16 probook kernel: Allocated:
Apr 23 11:55:16 probook kernel: PID = 4012
Apr 23 11:55:16 probook kernel:  save_stack_trace+0x16/0x20
Apr 23 11:55:16 probook kernel:  save_stack+0x46/0xd0
Apr 23 11:55:16 probook kernel:  kasan_kmalloc+0xad/0xe0
Apr 23 11:55:16 probook kernel:  kasan_slab_alloc+0x12/0x20
Apr 23 11:55:16 probook kernel:  __kmalloc_node_track_caller+0xfe/0x290
Apr 23 11:55:16 probook kernel:  __kmalloc_reserve.isra.36+0x2c/0xc0
Apr 23 11:55:16 probook kernel:  __alloc_skb+0xd0/0x560
Apr 23 11:55:16 probook kernel:  alloc_skb_with_frags+0x8d/0x4c0
Apr 23 11:55:16 probook kernel:  sock_alloc_send_pskb+0x587/0x6f0
Apr 23 11:55:16 probook kernel:  unix_stream_sendmsg+0x57d/0x880
Apr 23 11:55:16 probook kernel:  sock_sendmsg+0xb5/0xf0
Apr 23 11:55:16 probook kernel:  sock_write_iter+0x1e0/0x3b0
Apr 23 11:55:16 probook kernel:  __do_readv_writev+0x2b7/0x350
Apr 23 11:55:16 probook kernel:  do_readv_writev+0x79/0xb0
Apr 23 11:55:16 probook kernel:  vfs_writev+0x37/0x50
Apr 23 11:55:16 probook kernel:  do_writev+0x4d/0xd0
Apr 23 11:55:16 probook kernel:  SyS_writev+0xb/0x10
Apr 23 11:55:16 probook kernel:  entry_SYSCALL_64_fastpath+0x13/0x94
Apr 23 11:55:16 probook kernel: Freed:
Apr 23 11:55:16 probook kernel: PID = 4281
Apr 23 11:55:16 probook kernel:  save_stack_trace+0x16/0x20
Apr 23 11:55:16 probook kernel:  save_stack+0x46/0xd0
Apr 23 11:55:16 probook kernel:  kasan_slab_free+0x73/0xc0
Apr 23 11:55:16 probook kernel:  kfree+0x91/0x1c0
Apr 23 11:55:16 probook kernel:  skb_free_head+0x6a/0x90
Apr 23 11:55:16 probook kernel:  skb_release_data+0x279/0x330
Apr 23 11:55:16 probook kernel:  skb_release_all+0x3d/0x50
Apr 23 11:55:16 probook kernel:  consume_skb+0x62/0x180
Apr 23 11:55:16 probook kernel:  unix_stream_read_generic+0x1493/0x1b50
Apr 23 11:55:16 probook kernel:  unix_stream_recvmsg+0x8a/0xa0
Apr 23 11:55:16 probook kernel:  sock_recvmsg+0xc2/0x100
Apr 23 11:55:16 probook kernel:  ___sys_recvmsg+0x227/0x590
Apr 23 11:55:16 probook kernel:  __sys_recvmsg+0xbe/0x140
Apr 23 11:55:16 probook kernel:  SyS_recvmsg+0xd/0x20
Apr 23 11:55:16 probook kernel:  entry_SYSCALL_64_fastpath+0x13/0x94
Apr 23 11:55:16 probook kernel: Memory state around the buggy address:
Apr 23 11:55:16 probook kernel:  ffff880037990f00: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
Apr 23 11:55:16 probook kernel:  ffff880037990f80: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
Apr 23 11:55:16 probook kernel: >ffff880037991000: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
Apr 23 11:55:16 probook kernel:                    ^
Apr 23 11:55:16 probook kernel:  ffff880037991080: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
Apr 23 11:55:16 probook kernel:  ffff880037991100: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
Apr 23 11:55:16 probook kernel: ==================================================================
Apr 23 11:55:16 probook kernel: Disabling lock debugging due to kernel taint


THe other two entries don't have the Allocated/Freed part.
Comment 10 Johannes Hirte 2017-06-21 19:04:13 UTC
(In reply to Michel Dänzer from comment #7)
> I wonder if there might be memory corruption going on, in which case
> enabling CONFIG_KASAN for the kernel build might give more clues.

You're right, KASAN pointet me at two other bugs:

https://bugzilla.kernel.org/show_bug.cgi?id=195677
https://bugzilla.kernel.org/show_bug.cgi?id=196145

After eliminating this, no more problems with amdgpu happened. Closing this report as invalid.

Note You need to log in before you can comment on or make changes to this bug.