Bug 198603
Summary: | [nouveau] Card overheating with HDMI → DVI output plugged-in, WARNING in dmesg | ||
---|---|---|---|
Product: | Drivers | Reporter: | Bruno Pagani (bruno.n.pagani) |
Component: | Video(DRI - non Intel) | Assignee: | drivers_video-dri |
Status: | NEW --- | ||
Severity: | normal | CC: | bruno.n.pagani |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 4.14 | Subsystem: | |
Regression: | No | Bisected commit-id: | |
Attachments: | dmesg output from the boot |
Description
Bruno Pagani
2018-01-28 21:24:11 UTC
Some update: overheating and the error message seems not correlated anymore. The error message is happening every time I unplug (even after only 5s), and I have yet to see overheating going as hot as it was before, but this might be quite subjective. I’ll keep running with two screens from now on, so will see. At the very least there should still be investigation going on for this WARNING. I’ve took a look at my former kernel logs when overheating was linked to an error on unplug, and the output looked like this: ``` [ 7039.070478] ------------[ cut here ]------------ [ 7039.070493] WARNING: CPU: 0 PID: 1652 at drivers/gpu/drm/nouveau/include/nvkm/subdev/i2c.h:169 nouveau_dp_detect+0x9b/0x320 [nouveau] [ 7039.070493] Modules linked in: nouveau ttm fuse mousedev hp_wmi joydev mxm_wmi sparse_keymap iTCO_wdt iTCO_vendor_support i2c_designware_platform i2c_designware_core intel_rapl x86_pkg_temp_thermal intel_powerclamp kvm irqbypass intel_cstate intel_rapl_perf input_leds psmouse snd_hda_codec_conexant snd_hda_codec_generic arc4 nls_iso8859_1 nls_cp437 snd_hda_intel vfat fat snd_hda_codec snd_hda_core snd_hwdep snd_pcm e1000e snd_timer iwlmvm snd soundcore ptp pps_core i2c_i801 mac80211 iwlwifi rtsx_pci_ms memstick cfg80211 rfkill idma64 intel_lpss_pci intel_pch_thermal shpchp processor_thermal_device intel_soc_dts_iosf tpm_infineon thermal battery int3403_thermal int340x_thermal_zone wmi hp_accel intel_lpss_acpi lis3lv02d intel_lpss input_polldev led_class int3400_thermal evdev acpi_thermal_rel mac_hid [ 7039.070517] hp_wireless tpm_tis tpm_tis_core acpi_pad ac tpm sch_fq_codel coretemp msr ip_tables x_tables btrfs xor raid6_pq algif_skcipher af_alg dm_crypt dm_mod dax rtsx_pci_sdmmc mmc_core serio_raw atkbd libps2 crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd ahci xhci_pci libahci nvme xhci_hcd libata nvme_core rtsx_pci scsi_mod usbcore usb_common i8042 serio i915 video button intel_gtt i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm crc32c_intel [last unloaded: bbswitch] [ 7039.070538] CPU: 0 PID: 1652 Comm: kworker/0:0 Tainted: G W O 4.12.10-1-ARCH #1 [ 7039.070539] Hardware name: HP HP ZBook Studio G3/80D4, BIOS N82 Ver. 01.16 04/14/2017 [ 7039.070549] Workqueue: events nouveau_display_hpd_work [nouveau] [ 7039.070550] task: ffff88d259c40000 task.stack: ffff9bfb03ffc000 [ 7039.070559] RIP: 0010:nouveau_dp_detect+0x9b/0x320 [nouveau] [ 7039.070560] RSP: 0018:ffff9bfb03fffc88 EFLAGS: 00010293 [ 7039.070561] RAX: 0000000000000000 RBX: ffff88d232a05400 RCX: 0000000000000000 [ 7039.070561] RDX: 0000000000000008 RSI: ffff9bfb1100e4e4 RDI: 0000000001009007 [ 7039.070562] RBP: ffff9bfb03fffcd0 R08: ffff9bfb03fffc98 R09: ffff9bfb03fffc92 [ 7039.070562] R10: 0000000000000000 R11: 0000000000000010 R12: ffff88d234f1f800 [ 7039.070563] R13: ffff88d234f18800 R14: ffff88d21908f000 R15: ffff88d232a05418 [ 7039.070563] FS: 0000000000000000(0000) GS:ffff88d27f400000(0000) knlGS:0000000000000000 [ 7039.070564] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 7039.070564] CR2: 00002bfdda997140 CR3: 000000064ea09000 CR4: 00000000003406f0 [ 7039.070565] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 7039.070565] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [ 7039.070566] Call Trace: [ 7039.070569] ? radix_tree_lookup+0xd/0x10 [ 7039.070579] nouveau_connector_detect+0x2d1/0x4c0 [nouveau] [ 7039.070582] drm_helper_probe_detect_ctx+0xc2/0xe0 [drm_kms_helper] [ 7039.070583] ? drm_helper_probe_detect_ctx+0xc2/0xe0 [drm_kms_helper] [ 7039.070585] drm_helper_hpd_irq_event+0xa2/0x120 [drm_kms_helper] [ 7039.070594] nouveau_display_hpd_work+0x2e/0x70 [nouveau] [ 7039.070596] process_one_work+0x1de/0x430 [ 7039.070597] worker_thread+0x47/0x3f0 [ 7039.070598] kthread+0x125/0x140 [ 7039.070599] ? process_one_work+0x430/0x430 [ 7039.070600] ? kthread_create_on_node+0x70/0x70 [ 7039.070602] ret_from_fork+0x25/0x30 [ 7039.070603] Code: c3 4c 8d 4d c2 4c 8d 45 c8 31 c9 ba 09 00 00 00 be 01 00 00 00 4c 89 e7 e8 53 4e f8 ff 85 c0 0f 85 8a 00 00 00 80 7d c2 08 74 02 <0f> ff 4c 89 e7 e8 3b 4c f8 ff 44 0f b6 45 c9 0f b6 4d ca 8b 15 [ 7039.070620] ---[ end trace e2badeedc3409171 ]--- [ 7039.070646] nouveau 0000:01:00.0: DRM: DDC responded, but no EDID for DP-3 ``` It’s quite similar to me, but maybe not… So I got overheating again, but the dmesg output was the same as in OP. The old log should probably be disregarded. My hope is that fixing the cause of this WARNING will also fix the overheating, since I don’t get anything specific to the overheating anymore. Created attachment 273913 [details]
dmesg output from the boot
I’m attaching the full dmesg output.
At 43.x is when I plug the output.
Starting from 94.x is unplugging.
|