Bug 198997

Summary: Nvidia driver broken
Product: IO/Storage Reporter: mm0hgw
Component: OtherAssignee: io_other
Status: NEW ---    
Severity: blocking CC: davidmenhur, fhlfibh, fred.blaise, jbuchert+kbugs, joshua613, kernel_bugzilla, steven
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 4.16 Tree: Mainline
Regression: No

Description mm0hgw 2018-03-04 23:54:22 UTC
https://devtalk.nvidia.com/default/topic/1030082/linux/kernel-4-16-rc1-breaks-latest-drivers-unknown-symbol-swiotlb_map_sg_attrs-/post/5242836/?offset=3#5242845


https://github.com/torvalds/linux/commit/4bd89ed39b2ab8dc4ac4b6c59b07d420b0213bec

In this commit, it is presumed nobody is using these symbols. 

The NVIDIA proprietary driver, while it compiles without error, can't be loaded as it relies on at least one of these symbols.
Comment 1 mm0hgw 2018-03-05 18:42:47 UTC
https://devtalk.nvidia.com/default/topic/1030082/linux/kernel-4-16-rc1-breaks-latest-drivers-unknown-symbol-swiotlb_map_sg_attrs-/post/5243056/#5243056

User towo reverted that commit, and linked the logs from the subsequent load failure.

[code]
towo:Defiant> journalctl -b -1 | grep nvid
Mär 05 19:00:10 Defiant kernel: nvidia: loading out-of-tree module taints kernel.
Mär 05 19:00:10 Defiant kernel: nvidia: module license 'NVIDIA' taints kernel.
Mär 05 19:00:11 Defiant kernel: caller _nv001170rm+0xe3/0x1d0 [nvidia] mapping multiple BARs
Mär 05 19:00:11 Defiant kernel: caller _nv029921rm+0x57/0x90 [nvidia] mapping multiple BARs
Mär 05 19:00:12 Defiant kernel: usercopy: Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
Mär 05 19:00:12 Defiant kernel: Modules linked in: fuse vmw_vsock_vmci_transport vsock vmw_vmci snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mxm_wmi intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc joydev input_leds evdev xpad snd_hda_intel aesni_intel aes_x86_64 eeprom snd_hda_codec crypto_simd glue_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_timer snd sg mei_wdt soundcore shpchp lpc_ich video button wmi nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf ipmi_msghandler kyber_iosched kvm_intel kvm irqbypass coretemp parport_pc ppdev lp parport ip_tables x_tables ipv6 crc_ccitt autofs4 ext4 crc16 mbcache jbd2 fscrypto hid_logitech_hidpp hid_logitech_dj hid_generic usbhid sr_mod cdrom sd_mod crc32c_intel
Mär 05 19:00:12 Defiant kernel:  os_memcpy_to_user+0x1c/0x40 [nvidia]
Mär 05 19:00:12 Defiant kernel:  _nv009373rm+0xbf/0xe0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv028042rm+0x79/0x90 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv028042rm+0x55/0x90 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv013687rm+0xee/0x100 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv015333rm+0x154/0x270 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv008306rm+0x134/0x1a0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv008285rm+0x29c/0x2b0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv001073rm+0xe/0x20 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv007315rm+0xd8/0x100 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv001172rm+0x627/0x830 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? rm_ioctl+0x73/0x100 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_ioctl+0x4f0/0x6f0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_frontend_unlocked_ioctl+0x39/0x40 [nvidia]
Mär 05 19:00:12 Defiant kernel: Modules linked in: fuse vmw_vsock_vmci_transport vsock vmw_vmci snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mxm_wmi intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc joydev input_leds evdev xpad snd_hda_intel aesni_intel aes_x86_64 eeprom snd_hda_codec crypto_simd glue_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_timer snd sg mei_wdt soundcore shpchp lpc_ich video button wmi nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf ipmi_msghandler kyber_iosched kvm_intel kvm irqbypass coretemp parport_pc ppdev lp parport ip_tables x_tables ipv6 crc_ccitt autofs4 ext4 crc16 mbcache jbd2 fscrypto hid_logitech_hidpp hid_logitech_dj hid_generic usbhid sr_mod cdrom sd_mod crc32c_intel
Mär 05 19:00:12 Defiant kernel: RIP: 0010:_nv007213rm+0x25/0x90 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv025869rm+0x13/0x50 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? _nv035572rm+0x144/0x1e0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? rm_free_unused_clients+0x4f/0xe0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? os_pci_read_dword+0xd/0x20 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nv_check_pci_config_space+0x258/0x2e0 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_close+0xb5/0x350 [nvidia]
Mär 05 19:00:12 Defiant kernel:  ? nvidia_frontend_close+0x25/0x40 [nvidia]
Mär 05 19:00:12 Defiant kernel: RIP: _nv007213rm+0x25/0x90 [nvidia] RSP: ffffc9000262fd30
Mär 05 19:00:12 Defiant kernel: Modules linked in: fuse vmw_vsock_vmci_transport vsock vmw_vmci snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mxm_wmi intel_rapl x86_pkg_temp_thermal snd_hda_codec_realtek intel_powerclamp snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc joydev input_leds evdev xpad snd_hda_intel aesni_intel aes_x86_64 eeprom snd_hda_codec crypto_simd glue_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_timer snd sg mei_wdt soundcore shpchp lpc_ich video button wmi nvidia_drm(PO) drm nvidia_modeset(PO) nvidia(PO) ipmi_devintf ipmi_msghandler kyber_iosched kvm_intel kvm irqbypass coretemp parport_pc ppdev lp parport ip_tables x_tables ipv6 crc_ccitt autofs4 ext4 crc16 mbcache jbd2 fscrypto hid_logitech_hidpp hid_logitech_dj hid_generic usbhid sr_mod cdrom sd_mod crc32c_intel
[/code]
Comment 2 loqs 2018-03-05 22:57:34 UTC
https://bbs.archlinux.org/viewtopic.php?id=235072
Reverting or working around the swiotlb_map_sg_attrs export removal
the second issue only produces a warning for me.