Bug 201833 - WARNING: CPU: 2 PID: 1275 at drivers/gpu/drm/nouveau/nvif/vmm.c:71 nvif_vmm_put+0x6d/0x80
Summary: WARNING: CPU: 2 PID: 1275 at drivers/gpu/drm/nouveau/nvif/vmm.c:71 nvif_vmm_p...
Status: NEW
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel) (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: platform_ia-64
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-12-02 10:59 UTC by Marc B.
Modified: 2020-05-06 20:22 UTC (History)
3 users (show)

See Also:
Kernel Version: 4.19.6
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Marc B. 2018-12-02 10:59:48 UTC
[2018-12-02 09:26:38] warning kern 04 kernel : WARNING: CPU: 2 PID: 1275 at drivers/gpu/drm/nouveau/nvif/vmm.c:71 nvif_vmm_put+0x6d/0x80
[2018-12-02 09:26:38] warning kern 04 kernel : Modules linked in: l2tp_netlink l2tp_core tun acpi_call(O) ctr ccm ipt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo br_netfilter bridge stp llc overlay xt_hl xt_limit xt_addrtype nf_nat_ftp nf_conntrack_ftp af_packet nf_tables nfnetlink nf_log_ipv4 ipt_REJECT nf_reject_ipv4 nf_log_ipv6 nf_log_common ip6t_REJECT nf_reject_ipv6 xt_state xt_conntrack xt_tcpudp xt_LOG configfs algif_skcipher af_alg joydev btusb btintel bluetooth jitterentropy_rng drbg ansi_cprng usbhid ecdh_generic vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) loop fan wireguard(O) ip6_udp_tunnel udp_tunnel sbs sbshc ip6table_nat nf_nat_ipv6 ip6table_filter ip6_tables iptable_nat nf_nat_ipv4 nf_nat nf_conntrack iwlmvm nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter mac80211 ip_tables msr snd_hda_codec_hdmi x_tables
[2018-12-02 09:26:38] warning kern 04 kernel :  ipv6 iwlwifi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal snd_hda_intel xhci_pci coretemp snd_hda_codec xhci_hcd crc32_pclmul snd_hwdep crc32c_intel usbcore snd_hda_core pcbc cfg80211 snd_pcm usb_common thinkpad_acpi tpm_tis snd_timer intel_pch_thermal i2c_i801 aesni_intel snd tpm_tis_core tpm soundcore evdev rfkill battery ac thermal button efivarfs
[2018-12-02 09:26:38] warning kern 04 kernel : CPU: 2 PID: 1275 Comm: kworker/2:0 Tainted: G           O      4.19.6loc64 #1
[2018-12-02 09:26:38] warning kern 04 kernel : Hardware name: LENOVO 20EN0006GE/20EN0006GE, BIOS N1EET80W (1.53 ) 09/14/2018
[2018-12-02 09:26:38] warning kern 04 kernel : Workqueue: events nouveau_cli_work
[2018-12-02 09:26:38] warning kern 04 kernel : RIP: 0010:nvif_vmm_put+0x6d/0x80
[2018-12-02 09:26:38] warning kern 04 kernel : Code: 00 00 48 8d 55 e0 be 02 00 00 00 48 c7 45 e0 00 00 00 00 48 89 45 e8 e8 91 e5 ff ff 85 c0 75 0a 48 c7 43 08 00 00 00 00 eb b7 <0f> 0b eb f2 e8 4a bc b4 ff 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44
[2018-12-02 09:26:38] warning kern 04 kernel : RSP: 0018:ffffa51b12a3bdb8 EFLAGS: 00010282
[2018-12-02 09:26:38] warning kern 04 kernel : RAX: 00000000fffffffe RBX: ffffa51b12a3bde8 RCX: 0000000000000000
[2018-12-02 09:26:38] warning kern 04 kernel : RDX: 0000000000000010 RSI: ffffa51b12a3bd28 RDI: ffffa51b12a3bdc8
[2018-12-02 09:26:38] warning kern 04 kernel : RBP: ffffa51b12a3bdd8 R08: ffffffff825dd000 R09: 0000000000000000
[2018-12-02 09:26:38] warning kern 04 kernel : R10: 8080808080808080 R11: 0000000000000030 R12: ffffa51b12a3be20
[2018-12-02 09:26:38] warning kern 04 kernel : R13: dead000000000200 R14: dead000000000100 R15: ffff99552e832a50
[2018-12-02 09:26:38] warning kern 04 kernel : FS:  0000000000000000(0000) GS:ffff99553ba80000(0000) knlGS:0000000000000000
[2018-12-02 09:26:38] warning kern 04 kernel : CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[2018-12-02 09:26:38] warning kern 04 kernel : CR2: 00007f0999617000 CR3: 000000035120a005 CR4: 00000000003606e0
[2018-12-02 09:26:38] warning kern 04 kernel : DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[2018-12-02 09:26:38] warning kern 04 kernel : DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[2018-12-02 09:26:38] warning kern 04 kernel : Call Trace:
[2018-12-02 09:26:38] warning kern 04 kernel :  nouveau_vma_del+0x74/0xc0
[2018-12-02 09:26:38] warning kern 04 kernel :  nouveau_gem_object_delete_work+0x3a/0x60
[2018-12-02 09:26:38] warning kern 04 kernel :  nouveau_cli_work+0xc3/0x100
[2018-12-02 09:26:38] warning kern 04 kernel :  ? __schedule+0x253/0x840
[2018-12-02 09:26:38] warning kern 04 kernel :  process_one_work+0x1f7/0x420
[2018-12-02 09:26:38] warning kern 04 kernel :  worker_thread+0x34/0x3f0
[2018-12-02 09:26:38] warning kern 04 kernel :  kthread+0x121/0x140
[2018-12-02 09:26:38] warning kern 04 kernel :  ? process_one_work+0x420/0x420
[2018-12-02 09:26:38] warning kern 04 kernel :  ? kthread_park+0x90/0x90
[2018-12-02 09:26:38] warning kern 04 kernel :  ret_from_fork+0x35/0x40
[2018-12-02 09:26:38] warning kern 04 kernel : ---[ end trace 4ebaae2d2fa9e291 ]---



Hardware is 



01:00.0 VGA compatible controller: NVIDIA Corporation GM107GLM [Quadro M2000M] (rev a2) (prog-if 00 [VGA controller])
        Subsystem: Lenovo GM107GLM [Quadro M2000M]
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Interrupt: pin A routed to IRQ 121
        Region 0: Memory at c2000000 (32-bit, non-prefetchable) [size=16M]
        Region 1: Memory at b0000000 (64-bit, prefetchable) [size=256M]
        Region 3: Memory at c0000000 (64-bit, prefetchable) [size=32M]
        Region 5: I/O ports at 4000 [size=128]
        Expansion ROM at c3080000 [disabled] [size=512K]
        Capabilities: [60] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
                Address: 00000000fee00238  Data: 0000
        Capabilities: [78] Express (v2) Endpoint, MSI 00
                DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s unlimited, L1 <64us
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset- SlotPowerLimit 75.000W
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 256 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend+
                LnkCap: Port #0, Speed 8GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <512ns, L1 <4us
                        ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp+
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 2.5GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
                DevCap2: Completion Timeout: Range AB, TimeoutDis+, LTR+, OBFF Via message
                         AtomicOpsCap: 32bit- 64bit- 128bitCAS-
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR+, OBFF Disabled
                         AtomicOpsCtl: ReqEn-
                LnkCtl2: Target Link Speed: 8GT/s, EnterCompliance- SpeedDis-
                         Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
                LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete+, EqualizationPhase1+
                         EqualizationPhase2+, EqualizationPhase3+, LinkEqualizationRequest-
        Kernel driver in use: nouveau
Comment 1 Dominik Mierzejewski 2018-12-30 21:27:10 UTC
I think I've just seen the same:

Dec 30 21:45:51 kernel: nouveau 0000:01:00.0: DRM: Dropped ACPI reprobe event due to RPM error: -115
Dec 30 21:56:22 kernel: WARNING: CPU: 3 PID: 470 at drivers/gpu/drm/nouveau/nvif/vmm.c:71 nvif_vmm_put+0x6a/0x80 [nouveau]
Dec 30 21:56:22 kernel: Modules linked in: rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache rfcomm ccm bnep sunrpc fuse intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel crct10dif_pclmul crc32_pclmul ghash_clmulni_intel intel_cstate intel_uncore intel_rapl_perf snd_hda_codec_hdmi snd_hda_codec_realtek dell_laptop iTCO_wdt uvcvideo videobuf2_vmalloc iTCO_vendor_support snd_hda_codec_generic videobuf2_memops videobuf2_v4l2 videobuf2_common snd_hda_intel videodev dell_wmi dell_smbios dcdbas sparse_keymap snd_hda_codec joydev dell_wmi_descriptor media btusb btrtl btbcm arc4 btintel snd_hda_core bluetooth iwldvm snd_hwdep mac80211 snd_seq wmi_bmof iwlwifi snd_seq_device ecdh_generic snd_pcm cfg80211 mei_me i2c_i801 rfkill snd_timer mei lpc_ich snd soundcore dell_smo8800 pcc_cpufreq
Dec 30 21:56:22 kernel:  i915 nouveau kvmgt mdev vfio kvm hid_logitech_hidpp irqbypass mxm_wmi ttm i2c_algo_bit drm_kms_helper crc32c_intel serio_raw drm r8169 hid_logitech_dj wmi video
Dec 30 21:56:22 kernel: CPU: 3 PID: 470 Comm: kworker/3:3 Not tainted 4.19.10-300.fc29.x86_64 #1
Dec 30 21:56:22 kernel: Hardware name: Dell Inc.          Dell System XPS L502X/0YR8NN, BIOS A06 07/20/2011
Dec 30 21:56:22 kernel: Workqueue: events nouveau_cli_work [nouveau]
Dec 30 21:56:22 kernel: RIP: 0010:nvif_vmm_put+0x6a/0x80 [nouveau]
Dec 30 21:56:22 kernel: Code: 00 00 48 89 e2 be 02 00 00 00 48 c7 04 24 00 00 00 00 48 89 44 24 08 e8 24 e6 ff ff 85 c0 75 0a 48 c7 43 08 00 00 00 00 eb b7 <0f> 0b eb f2 e8 8d 93 a3 ce 66 66 2e 0f 1f 84 00 00 00 00 00 66 90
Dec 30 21:56:22 kernel: RSP: 0018:ffffb21e40aafde8 EFLAGS: 00010282
Dec 30 21:56:22 kernel: RAX: 00000000fffffffe RBX: ffffb21e40aafe10 RCX: 0000000000000000
Dec 30 21:56:22 kernel: RDX: 0000000000000010 RSI: ffffb21e40aafd48 RDI: ffffb21e40aafde8
Dec 30 21:56:22 kernel: RBP: ffffb21e40aafe40 R08: 0000000000000000 R09: 0000000000b80000
Dec 30 21:56:22 kernel: R10: 0000000000000000 R11: 0000000000b80000 R12: ffff9c9f664a8660
Dec 30 21:56:22 kernel: R13: ffff9c9eec74c800 R14: dead000000000100 R15: ffff9c9e7655f670
Dec 30 21:56:22 kernel: FS:  0000000000000000(0000) GS:ffff9c9f7a2c0000(0000) knlGS:0000000000000000
Dec 30 21:56:22 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 30 21:56:22 kernel: CR2: 00007fabbdc62e60 CR3: 000000006e20a003 CR4: 00000000000606e0
Dec 30 21:56:22 kernel: Call Trace:
Dec 30 21:56:22 kernel:  nouveau_vma_del+0x70/0xc0 [nouveau]
Dec 30 21:56:22 kernel:  nouveau_gem_object_delete_work+0x36/0x60 [nouveau]
Dec 30 21:56:22 kernel:  nouveau_cli_work+0xc7/0x100 [nouveau]
Dec 30 21:56:22 kernel:  process_one_work+0x1a1/0x3a0
Dec 30 21:56:22 kernel:  worker_thread+0x30/0x380
Dec 30 21:56:22 kernel:  ? pwq_unbound_release_workfn+0xd0/0xd0
Dec 30 21:56:22 kernel:  kthread+0x112/0x130
Dec 30 21:56:22 kernel:  ? kthread_create_worker_on_cpu+0x70/0x70
Dec 30 21:56:22 kernel:  ret_from_fork+0x35/0x40
Dec 30 21:56:22 kernel: ---[ end trace 39fd8320774c0eb9 ]---

Hardware is different:
01:00.0 VGA compatible controller: NVIDIA Corporation GF108M [GeForce GT 525M] (rev a1) (prog-if 00 [VGA controller])
	Subsystem: Dell Device 04b6
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0, Cache Line Size: 64 bytes
	Interrupt: pin A routed to IRQ 33
	Region 0: Memory at f0000000 (32-bit, non-prefetchable) [size=16M]
	Region 1: Memory at c0000000 (64-bit, prefetchable) [size=256M]
	Region 3: Memory at d0000000 (64-bit, prefetchable) [size=32M]
	Region 5: I/O ports at 3000 [size=128]
	Expansion ROM at f1000000 [disabled] [size=512K]
	Capabilities: [60] Power Management version 3
		Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
		Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
	Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
		Address: 00000000fee00378  Data: 0000
	Capabilities: [78] Express (v2) Endpoint, MSI 00
		DevCap:	MaxPayload 128 bytes, PhantFunc 0, Latency L0s unlimited, L1 <64us
			ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset- SlotPowerLimit 75.000W
		DevCtl:	CorrErr- NonFatalErr- FatalErr- UnsupReq-
			RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop-
			MaxPayload 128 bytes, MaxReadReq 512 bytes
		DevSta:	CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr- TransPend-
		LnkCap:	Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <256ns, L1 <4us
			ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp-
		LnkCtl:	ASPM L1 Enabled; RCB 64 bytes Disabled- CommClk+
			ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
		LnkSta:	Speed 2.5GT/s (ok), Width x16 (ok)
			TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
		DevCap2: Completion Timeout: Not Supported, TimeoutDis+, LTR-, OBFF Not Supported
			 AtomicOpsCap: 32bit- 64bit- 128bitCAS-
		DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
			 AtomicOpsCtl: ReqEn-
		LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-
			 Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
			 Compliance De-emphasis: -6dB
		LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
			 EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
	Capabilities: [b4] Vendor Specific Information: Len=14 <?>
	Capabilities: [100 v1] Virtual Channel
		Caps:	LPEVC=0 RefClk=100ns PATEntryBits=1
		Arb:	Fixed- WRR32- WRR64- WRR128-
		Ctrl:	ArbSelect=Fixed
		Status:	InProgress-
		VC0:	Caps:	PATOffset=00 MaxTimeSlots=1 RejSnoopTrans-
			Arb:	Fixed- WRR32- WRR64- WRR128- TWRR128- WRR256-
			Ctrl:	Enable+ ID=0 ArbSelect=Fixed TC/VC=ff
			Status:	NegoPending- InProgress-
	Capabilities: [128 v1] Power Budgeting <?>
	Capabilities: [600 v1] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
	Kernel driver in use: nouveau
	Kernel modules: nouveau
Comment 2 Dominik Mierzejewski 2018-12-31 23:39:11 UTC
Downstream bug report on Fedora rawhide: https://bugzilla.redhat.com/show_bug.cgi?id=1661953
Comment 3 Ortwin Glück 2019-10-29 17:28:59 UTC
one from my HP laptop today:
[Tue Oct 29 16:58:57 2019] ------------[ cut here ]------------
[Tue Oct 29 16:58:57 2019] WARNING: CPU: 1 PID: 18599 at drivers/gpu/drm/nouveau/nvif/vmm.c:71 nvif_vmm_put.cold.0+0xc/0x19
[Tue Oct 29 16:58:57 2019] CPU: 1 PID: 18599 Comm: kworker/1:1 Not tainted 5.3.7 #7
[Tue Oct 29 16:58:57 2019] Hardware name: Hewlett-Packard HP ZBook 15/1909, BIOS L70 Ver. 01.21 08/13/2014
[Tue Oct 29 16:58:57 2019] Workqueue: events nouveau_cli_work
[Tue Oct 29 16:58:57 2019] RIP: 0010:nvif_vmm_put.cold.0+0xc/0x19
[Tue Oct 29 16:58:57 2019] Code: 41 5e 41 5f c3 49 8b 7d 38 eb c4 bd f4 ff ff ff eb bd bd f4 ff ff ff eb cb e8 39 28 a1 ff 48 c7 c7 28 96 53 93 e8 34 a5 a5 ff <0f> 0b e9 e5 fc ff ff 90 90 90 90 90 90 48 83 bf a0 00 00 00 00 74
[Tue Oct 29 16:58:57 2019] RSP: 0018:ffffa9ac4426fde8 EFLAGS: 00010246
[Tue Oct 29 16:58:57 2019] RAX: 0000000000000024 RBX: ffffa9ac4426fe10 RCX: 0000000000000000
[Tue Oct 29 16:58:57 2019] RDX: 0000000000000000 RSI: ffff8a06cea56488 RDI: ffff8a06cea56488
[Tue Oct 29 16:58:57 2019] RBP: ffffa9ac4426fe40 R08: 00000000000003ec R09: 0000000000000001
[Tue Oct 29 16:58:57 2019] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8a06bd064ab8
[Tue Oct 29 16:58:57 2019] R13: ffff8a05fc778ea0 R14: dead000000000100 R15: ffff8a05fc778610
[Tue Oct 29 16:58:57 2019] FS:  0000000000000000(0000) GS:ffff8a06cea40000(0000) knlGS:0000000000000000
[Tue Oct 29 16:58:57 2019] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Tue Oct 29 16:58:57 2019] CR2: 00007fbca8cd7000 CR3: 000000026580a005 CR4: 00000000001606e0
[Tue Oct 29 16:58:57 2019] Call Trace:
[Tue Oct 29 16:58:57 2019]  nouveau_vma_del+0x6b/0xc0
[Tue Oct 29 16:58:57 2019]  nouveau_gem_object_delete_work+0x31/0x60
[Tue Oct 29 16:58:57 2019]  nouveau_cli_work+0xda/0x100
[Tue Oct 29 16:58:57 2019]  process_one_work+0x16b/0x2b0
[Tue Oct 29 16:58:57 2019]  ? process_one_work+0x2b0/0x2b0
[Tue Oct 29 16:58:57 2019]  worker_thread+0x2b/0x380
[Tue Oct 29 16:58:57 2019]  ? process_one_work+0x2b0/0x2b0
[Tue Oct 29 16:58:57 2019]  kthread+0x109/0x120
[Tue Oct 29 16:58:57 2019]  ? __kthread_create_on_node+0x1a0/0x1a0
[Tue Oct 29 16:58:57 2019]  ret_from_fork+0x35/0x40
[Tue Oct 29 16:58:57 2019] ---[ end trace bf2fc0097662bb9c ]---

Note You need to log in before you can comment on or make changes to this bug.