Bug 32132 - rt73usb making the kernel panic.
Summary: rt73usb making the kernel panic.
Status: CLOSED CODE_FIX
Alias: None
Product: Drivers
Classification: Unclassified
Component: network-wireless (show other bugs)
Hardware: x86-64 Linux
: P1 high
Assignee: Stanislaw Gruszka
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-03-29 11:05 UTC by Hellyna Ng
Modified: 2012-04-01 10:23 UTC (History)
5 users (show)

See Also:
Kernel Version: 2.6.37.5-1
Subsystem:
Regression: No
Bisected commit-id:


Attachments
my rmmod crash (920.06 KB, image/jpeg)
2011-05-29 20:31 UTC, Stanislaw Gruszka
Details

Description Hellyna Ng 2011-03-29 11:05:58 UTC
As stated. But the crash doesn't come immediately.

I used to blacklist the rt2500lib module which was in conflict with rt73usb in earlier kernel versions... not about now.

How do I confirm it's this card? I used another card which runs the ath9k_htc driver and it runs fine for many many hours, but when I insert and use the (TP-LINK TL-WN321G) rt73usb driver it crashes after sometime, like 30mins - 2hrs. 

The kernel panic message traceback shows rt7x00lib but i have no idea how to capture that output :(. kernel.log{.1} shows nothing too.
Comment 1 John W. Linville 2011-03-29 13:54:29 UTC
Perhaps you could use netconsole to capture the backtrace?  Or if nothing else, attach a digital photo of it?
Comment 2 Stanislaw Gruszka 2011-03-29 15:01:55 UTC
Is this regression? Does things work in 2.6.37.4 or some other kernel version you had tried before?
Comment 3 Alejandro Lorenzo 2011-05-09 22:22:39 UTC
I am seeing something similar here. Kernel is 2.6.38.5 and it is quite reproducible.

i attach a link to a photo of the kernel hang.

http://ballfire.ath.cx/~alex/kernel_oop.jpg

Also, i can reproduce it quite well by running this abusive script:

#!/bin/bash

while (true); do

ifdown wlan0
rmmod rt73usb
rmmod rt2x00usb
modprobe rt73usb
modprobe rt2x00usb

ifup wlan0
done;


At first, it start to fail to localte a wireless DHCP server, and after a pair of new failing attemps to get an IP, the removal of the modules trigger a kernel crash
Comment 4 Stanislaw Gruszka 2011-05-29 20:31:25 UTC
Created attachment 59992 [details]
my rmmod crash
Comment 5 John W. Linville 2011-06-28 18:26:27 UTC
Failing to delete a timer during the rmmod cleanup?
Comment 6 Ivo van Doorn 2011-06-28 20:06:58 UTC
Not sure, but isn't this fixed by the patch "rt2x00: fix rmmod crash"
from Stanislaw Gruszka?

http://comments.gmane.org/gmane.linux.kernel.wireless.general/71197

It blames 1c0bcf89d85cc97a0d9ce4cd909351a81fa4fdde for the culprit of this bug, but I thought that commit has gone into 3.0 only...
Comment 7 Ivo van Doorn 2011-06-28 20:10:36 UTC
Not sure, but isn't this fixed by the patch "rt2x00: fix rmmod crash"
from Stanislaw Gruszka?

http://comments.gmane.org/gmane.linux.kernel.wireless.general/71197

It blames 1c0bcf89d85cc97a0d9ce4cd909351a81fa4fdde for the culprit of this bug, but I thought that commit has gone into 3.0 only...
Comment 8 John W. Linville 2011-06-28 20:28:09 UTC
Hellyna, can you test v3.0-rc5 or later?
Comment 9 Stanislaw Gruszka 2011-06-29 10:54:24 UTC
Comment 3 problem should be fixed by 7f503fc49f144bb509dbd33daf3426df3f176e6b
"rt2x00: fix possible memory corruption in case of invalid rxdesc.size". Follow up commit bf4c02d5e772903be5bf8952bac730a2956d0619 "rt2x00: reset usb devices at probe" should prevent some other problems when reloading modules. Both commits are from wireless-testing tree. Perhaps they should be posted to 3.0, but I was a bit worried about possible regression, and do not consider these module reload issues super critical. Not sure if they fix original, crash after few hours problem reported in comment 0.

Anyway it's better to test wireless-testing to see if issue is gone.
Comment 10 Stanislaw Gruszka 2012-04-01 10:22:13 UTC
This bug must be fixed now.

Note You need to log in before you can comment on or make changes to this bug.