Bug 200799

Summary: from 4.9.X aftewards - after 1Gbit traffics, later kernels - crash on load
Product: Networking Reporter: Nikolay Dimitrov (hell)
Component: IPV4Assignee: Stephen Hemminger (stephen)
Status: RESOLVED INVALID    
Severity: normal    
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 4.X Subsystem:
Regression: No Bisected commit-id:

Description Nikolay Dimitrov 2018-08-12 21:43:00 UTC
[  184.265160] ------------[ cut here ]------------
[  184.265167] WARNING: CPU: 15 PID: 0 at kernel/locking/lockdep.c:722 register_lock_class+0x524/0x580
[  184.265169] Modules linked in: xt_IMQ xt_ratelimit(O) arptable_filter arp_tables iptable_filter xt_addrtype xt_nat xt_tcpudp iptable_nat nf_nat_ipv4 xt_set iptable_mangle ip_set_hash_net ip_set nfnetlink binfmt_misc cpufreq_conservative cpufreq_powersave cpufreq_userspace 8021q garp mrp stp intel_rapl llc sb_edac x86_pkg_temp_thermal intel_powerclamp kvm irqbypass mxm_wmi crct10dif_pclmul crc32_pclmul mgag200 ghash_clmulni_intel cryptd ttm intel_cstate drm_kms_helper intel_uncore lpc_ich pcspkr intel_rapl_perf joydev sg drm mfd_core shpchp evdev wmi button ipmi_si ipmi_devintf ipmi_msghandler pppoe e100 8139cp 8139too r8169 mii sb1000 tg3 libphy e1000e e1000 ifb nf_nat_ftp nf_conntrack_ftp nf_nat_pptp nf_nat_proto_gre nf_conntrack_pptp nf_conntrack_proto_gre nf_nat pptp gre pppox arc4 ecb ppp_mppe
[  184.265226]  ppp_generic slhc coretemp nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 fscrypto raid10 hid_generic uas usbhid usb_storage hid raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic raid0 multipath linear raid1 md_mod sd_mod ahci ixgbe libahci crc32c_intel isci libsas ehci_pci libata ehci_hcd scsi_transport_sas i2c_i801 usbcore scsi_mod igb i2c_algo_bit dca mdio ptp pps_core
[  184.265266] CPU: 15 PID: 0 Comm: swapper/15 Tainted: G           O    4.14.60 #1
[  184.265267] Hardware name: Supermicro X9DR3-F/X9DR3-F, BIOS 3.2a 07/09/2015
[  184.265269] task: ffff98e05844e180 task.stack: ffffb9f583318000
[  184.265271] RIP: 0010:register_lock_class+0x524/0x580
[  184.265272] RSP: 0018:ffff98e05e403978 EFLAGS: 00010087
[  184.265274] RAX: ffffffffb3a8bfa2 RBX: 0000000000000000 RCX: 0000000000000000
[  184.265275] RDX: ffffffffb4388fc0 RSI: 0000000000000ca9 RDI: 0000000000000000
[  184.265276] RBP: ffff98e05e403ad0 R08: 0000000000000000 R09: 0000000000000000
[  184.265277] R10: 0000000000000000 R11: ffff98e05844e180 R12: 0000000000000000
[  184.265279] R13: 0000000000000000 R14: ffff98e050a6cc98 R15: ffffffffb4a8c750
[  184.265280] FS:  0000000000000000(0000) GS:ffff98e05e400000(0000) knlGS:0000000000000000
[  184.265282] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  184.265283] CR2: 00007fb86deb02c0 CR3: 0000000538212006 CR4: 00000000001606e0
[  184.265284] Call Trace:
[  184.265288]  <IRQ>
[  184.265294]  __lock_acquire+0xcd/0x17e0
[  184.265299]  ? __imq_nf_queue+0x3d1/0x920
[  184.265303]  ? skb_save_cb+0x1a/0x70
[  184.265305]  ? lock_acquire+0xea/0x1e0
[  184.265307]  lock_acquire+0xea/0x1e0
[  184.265310]  ? __imq_nf_queue+0x3d1/0x920
[  184.265314]  _raw_spin_lock+0x29/0x60
[  184.265316]  ? __imq_nf_queue+0x3d1/0x920
[  184.265319]  __imq_nf_queue+0x3d1/0x920
[  184.265322]  imq_nf_queue+0x252/0x3e0
[  184.265326]  nf_queue+0x132/0x340
[  184.265329]  nf_hook_slow+0xa4/0xf0
[  184.265333]  ip_output+0x14e/0x250
[  184.265336]  ? ip_fragment.constprop.54+0x80/0x80
[  184.265339]  ip_forward+0x3d8/0x650
[  184.265342]  ? ip_frag_mem+0x10/0x10
[  184.265345]  ip_rcv+0x2a8/0x4f0
[  184.265348]  ? inet_del_offload+0x40/0x40
[  184.265352]  __netif_receive_skb_core+0x875/0xe10
[  184.265355]  ? lock_acquire+0xea/0x1e0
[  184.265357]  ? process_backlog+0xd0/0x240
[  184.265360]  ? process_backlog+0xd0/0x240
[  184.265362]  ? process_backlog+0x76/0x240
[  184.265365]  ? process_backlog+0x116/0x240
[  184.265367]  process_backlog+0x76/0x240
[  184.265370]  net_rx_action+0x28d/0x4b0
[  184.265374]  __do_softirq+0xd1/0x465
[  184.265378]  irq_exit+0xc1/0xd0
[  184.265380]  call_function_single_interrupt+0x87/0x90
[  184.265382]  </IRQ>
[  184.265385] RIP: 0010:cpuidle_enter_state+0xac/0x370
[  184.265387] RSP: 0018:ffffb9f58331bea0 EFLAGS: 00000206 ORIG_RAX: ffffffffffffff04
[  184.265389] RAX: ffff98e05844e180 RBX: 0000002ae70e6706 RCX: 000000000000001f
[  184.265390] RDX: 0000000000000001 RSI: 0000000000000001 RDI: ffff98e05844e180
[  184.265391] RBP: ffffd9f57ec09ee8 R08: 000000000000008d R09: 0000000000000000
[  184.265392] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
[  184.265393] R13: 0000000000000001 R14: 0000000000000001 R15: 0000002ae70d7ab1
[  184.265399]  do_idle+0xec/0x1d0
[  184.265402]  cpu_startup_entry+0x6f/0x80
[  184.265405]  start_secondary+0x1b3/0x200
[  184.265409]  secondary_startup_64+0xa5/0xb0
[  184.265412] Code: 65 48 8b 04 25 40 5e 01 00 48 c7 c7 e0 8e f9 b4 83 a8 ac 08 00 00 01 c6 07 00 0f 1f 40 00 e9 ab fe ff ff 49 89 c7 e9 3c fe ff ff <0f> 0b e9 8f fe ff ff 48 8d 74 24 30 4c 89 f7 e8 48 80 04 00 84 
[  184.265451] ---[ end trace 40120af66f8b5fa7 ]---
Comment 1 Nikolay Dimitrov 2018-08-12 21:50:20 UTC
I have several scripts that bring to my squeduele. I have 1.6.2 iptables, ipset v6, kernel 4.14.59. 
module IMQ only is doing not onlyh oops, but hangs the entire system( which needs hard reset).
Comment 2 Stephen Hemminger 2018-08-14 17:49:09 UTC
IMQ is not part of the mainline kernel.org code.
It is known buggy.

The kernel.org bugzilla is only for kernels that contain no "out of tree" code.