Bug 198997 - Nvidia driver broken
Summary: Nvidia driver broken
Status: NEW
Alias: None
Product: IO/Storage
Classification: Unclassified
Component: Other
Hardware: All Linux
Importance: P1 blocking
Assignee: io_other
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-03-04 23:54 UTC by mm0hgw
Modified: 2018-09-06 19:26 UTC
CC List: 7 users

See Also:
Kernel Version: 4.16
Subsystem:
Regression: No
Bisected commit-id:


Description mm0hgw 2018-03-04 23:54:22 UTC
https://devtalk.nvidia.com/default/topic/1030082/linux/kernel-4-16-rc1-breaks-latest-drivers-unknown-symbol-swiotlb_map_sg_attrs-/post/5242836/?offset=3#5242845


https://github.com/torvalds/linux/commit/4bd89ed39b2ab8dc4ac4b6c59b07d420b0213bec

This commit removes several symbol exports on the presumption that nobody is using them.

The NVIDIA proprietary driver compiles without error but cannot be loaded, because it relies on at least one of these symbols (swiotlb_map_sg_attrs, per the linked thread).
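
One way to check whether a given kernel build still exports swiotlb_map_sg_attrs is to look the name up in that build's Module.symvers. The following is only a minimal sketch, assuming the tab-separated layout used by kernels of this era (CRC, symbol, module, export type). The helper name `exported_symbols`, the symbol `example_helper`, and the sample lines are made up for illustration and are not taken from a real build.

```python
def exported_symbols(symvers_text):
    """Map each exported symbol name to its export type.

    Assumes the tab-separated Module.symvers layout:
    CRC <TAB> symbol <TAB> module <TAB> export type
    """
    symbols = {}
    for line in symvers_text.splitlines():
        fields = line.split("\t")
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        _crc, name, _module, export_type = fields[:4]
        symbols[name] = export_type
    return symbols


# Hypothetical sample content; the CRC values are placeholders.
SAMPLE = (
    "0x00000000\tprintk\tvmlinux\tEXPORT_SYMBOL\n"
    "0x00000000\texample_helper\tvmlinux\tEXPORT_SYMBOL_GPL\n"
)

if __name__ == "__main__":
    table = exported_symbols(SAMPLE)
    for wanted in ("swiotlb_map_sg_attrs", "printk"):
        # A symbol missing from the table is one a module cannot link against.
        print(f"{wanted}: {table.get(wanted, 'not exported')}")
```

On a real system the text would come from the Module.symvers file in the kernel build tree rather than from the inline sample.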
Comment 1 mm0hgw 2018-03-05 18:42:47 UTC
https://devtalk.nvidia.com/default/topic/1030082/linux/kernel-4-16-rc1-breaks-latest-drivers-unknown-symbol-swiotlb_map_sg_attrs-/post/5243056/#5243056

User towo reverted that commit and linked the logs from the subsequent load failure.

[code]
towo:Defiant> journalctl -b -1 | grep nvid
Mär 05 19:00:10 Defiant kernel: nvidia: loading out-of-tree module taints kernel.
Mär 05 19:00:10 Defiant kernel: nvidia: module license 'NVIDIA' taints kernel.
Mär 05 19:00:11 Defiant kernel: caller _nv001170rm+0xe3/0x1d0 [nvidia] mapping multiple BARs
Mär 05 19:00:11 Defiant kernel: caller _nv029921rm+0x57/0x90 [nvidia] mapping multiple BARs
Mär 05 19:00:12 Defiant kernel: usercopy: Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
Mär 05 19:00:12 Defiant kernel: Modules linked in: fuse vmw_vsock_vmci_transport vsock vmw_vmci snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mxm_wmi intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc joydev input_leds evdev xpad snd_hda_intel aesni_intel aes_x86_64 eeprom snd_hda_codec crypto_simd glue_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_timer snd sg mei_wdt soundcore shpchp lpc_ich video button wmi nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf ipmi_msghandler kyber_iosched kvm_intel kvm irqbypass coretemp parport_pc ppdev lp parport ip_tables x_tables ipv6 crc_ccitt autofs4 ext4 crc16 mbcache jbd2 fscrypto hid_logitech_hidpp hid_logitech_dj hid_generic usbhid sr_mod cdrom sd_mod crc32c_intel
Mär 05 19:00:12 Defiant kernel:  os_memcpy_to_user+0x1c/0x40 [nvidia]
Mär 05 19:00:12 Defiant kernel:  _nv009373rm+0xbf/0xe0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv028042rm+0x79/0x90 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv028042rm+0x55/0x90 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv013687rm+0xee/0x100 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv015333rm+0x154/0x270 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv008306rm+0x134/0x1a0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv008285rm+0x29c/0x2b0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv001073rm+0xe/0x20 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv007315rm+0xd8/0x100 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv001172rm+0x627/0x830 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? rm_ioctl+0x73/0x100 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_ioctl+0x4f0/0x6f0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_frontend_unlocked_ioctl+0x39/0x40 [nvidia]
Mär 05 19:00:12 Defiant kernel: Modules linked in: fuse vmw_vsock_vmci_transport vsock vmw_vmci snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mxm_wmi intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc joydev input_leds evdev xpad snd_hda_intel aesni_intel aes_x86_64 eeprom snd_hda_codec crypto_simd glue_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_timer snd sg mei_wdt soundcore shpchp lpc_ich video button wmi nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf ipmi_msghandler kyber_iosched kvm_intel kvm irqbypass coretemp parport_pc ppdev lp parport ip_tables x_tables ipv6 crc_ccitt autofs4 ext4 crc16 mbcache jbd2 fscrypto hid_logitech_hidpp hid_logitech_dj hid_generic usbhid sr_mod cdrom sd_mod crc32c_intel
Mär 05 19:00:12 Defiant kernel: RIP: 0010:_nv007213rm+0x25/0x90 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv025869rm+0x13/0x50 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv035572rm+0x144/0x1e0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? rm_free_unused_clients+0x4f/0xe0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? os_pci_read_dword+0xd/0x20 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nv_check_pci_config_space+0x258/0x2e0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_close+0xb5/0x350 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_frontend_close+0x25/0x40 [nvidia]
Mär 05 19:00:12 Defiant kernel: RIP: _nv007213rm+0x25/0x90 [nvidia] RSP: ffffc9000262fd30
Mär 05 19:00:12 Defiant kernel: Modules linked in: fuse vmw_vsock_vmci_transport vsock vmw_vmci snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mxm_wmi intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc joydev input_leds evdev xpad snd_hda_intel aesni_intel aes_x86_64 eeprom snd_hda_codec crypto_simd glue_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_timer snd sg mei_wdt soundcore shpchp lpc_ich video button wmi nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf ipmi_msghandler kyber_iosched kvm_intel kvm irqbypass coretemp parport_pc ppdev lp parport ip_tables x_tables ipv6 crc_ccitt autofs4 ext4 crc16 mbcache jbd2 fscrypto hid_logitech_hidpp hid_logitech_dj hid_generic usbhid sr_mod cdrom sd_mod crc32c_intel
[/code]
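
The usercopy line in the log above carries the slab cache name, the offset, and the size of the rejected copy. As a rough sketch, those fields can be pulled out of such a line with a regular expression. The function name `parse_usercopy` is hypothetical, and the pattern only covers the message wording shown in this log.

```python
import re

# Matches the wording of the usercopy report seen in the log above.
USERCOPY_RE = re.compile(
    r"usercopy: Kernel memory exposure attempt detected "
    r"from SLUB object '(?P<cache>[^']+)' "
    r"\(offset (?P<offset>\d+), size (?P<size>\d+)\)"
)


def parse_usercopy(line):
    """Return cache name, offset and size from a usercopy report, or None."""
    match = USERCOPY_RE.search(line)
    if match is None:
        return None
    return {
        "cache": match.group("cache"),
        "offset": int(match.group("offset")),
        "size": int(match.group("size")),
    }


# The message text from the log, without the timestamp prefix.
LINE = (
    "kernel: usercopy: Kernel memory exposure attempt detected "
    "from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!"
)

if __name__ == "__main__":
    print(parse_usercopy(LINE))
```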
Comment 2 loqs 2018-03-05 22:57:34 UTC
https://bbs.archlinux.org/viewtopic.php?id=235072
After reverting or working around the swiotlb_map_sg_attrs export removal, the second issue only produces a warning for me.
