Bug 20972 - mv643xx_eth causes oops then panic on removal
Status: RESOLVED OBSOLETE
Alias: None
Product: Drivers
Classification: Unclassified
Component: Network
Hardware: All Linux
Importance: P1 normal
Assignee: drivers_network@kernel-bugs.osdl.org
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2010-10-23 07:18 UTC by Robby Workman
Modified: 2013-12-10 22:21 UTC
CC List: 2 users

See Also:
Kernel Version: 2.6.36
Subsystem:
Regression: No
Bisected commit-id:


Attachments
crashlog (3.62 KB, text/plain)
2010-10-23 07:18 UTC, Robby Workman
New trace (16.21 KB, text/plain)
2010-10-24 22:04 UTC, Robby Workman

Description Robby Workman 2010-10-23 07:18:44 UTC
Created attachment 34492
crashlog

While tracking down a different bug in this driver, I discovered that unloading the mv643xx_eth module triggers a kernel panic on the GuruPlug Server Plus.

This is with the ArmedSlack (Slackware for ARM) -current installer, but it also occurs on the installed system. For whatever reason, I'm only able to get a trace from the installer.

I'm going to file a separate bug for it, but the original issue that led me here is a regression relative to 2.6.35 and earlier (I know 2.6.35 was fine): no traffic leaves the ethernet ports, even though they configure correctly and /sys/class/net/<if>/carrier shows '1'.  More on that in the correct bug report though :)
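
[Editorial sketch, not part of the original report.] The symptom described above (carrier reads '1' but nothing leaves the port) can be checked by reading the carrier attribute alongside a transmit counter. The following is a minimal sketch in C, assuming the usual sysfs layout under /sys/class/net; the helper names read_sysfs_long and iface_attr are hypothetical and exist only for illustration.

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <stdlib.h>

/*
 * Read one decimal integer from a sysfs-style text file.
 * Returns 0 on success, -1 if the file cannot be opened, read, or parsed.
 * (Hypothetical helper, not taken from the bug report.)
 */
static int read_sysfs_long(const char *path, long *out)
{
    char buf[64];
    char *end;
    long value;
    FILE *f;

    assert(path != NULL && out != NULL);

    f = fopen(path, "r");
    if (f == NULL)
        return -1;
    if (fgets(buf, sizeof(buf), f) == NULL) {
        fclose(f);
        return -1;
    }
    fclose(f);

    errno = 0;
    value = strtol(buf, &end, 10);
    if (errno != 0 || end == buf)
        return -1;

    *out = value;
    return 0;
}

/*
 * Build "<root>/<ifname>/<attr>" and read it as an integer.
 * The root is a parameter so the helper can be pointed at a test directory.
 */
static int iface_attr(const char *root, const char *ifname,
                      const char *attr, long *out)
{
    char path[256];
    int n = snprintf(path, sizeof(path), "%s/%s/%s", root, ifname, attr);

    if (n < 0 || (size_t)n >= sizeof(path))
        return -1;
    return read_sysfs_long(path, out);
}
```

One possible use is to call iface_attr("/sys/class/net", "eth0", "carrier", &v), then sample "statistics/tx_packets" before and after sending a few pings. A carrier of 1 with a transmit counter that does not move would match the behaviour described; the interface name "eth0" is an assumption.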
Comment 1 Lennert Buytenhek 2010-10-23 09:57:33 UTC
This doesn't seem like a mv643xx_eth-specific issue at first sight.

Does your distro apply any patches to mv643xx_eth?

Also, which staging driver have you loaded?
Comment 2 Robby Workman 2010-10-24 02:30:43 UTC
Crap, I didn't notice the taint -- it's the xgifb driver.
As for patches, none are applied to 2.6.36; it's vanilla.
I'll make sure this can be reproduced without that driver loaded, and if so, I'll post a new trace (assuming I can get one again).
Comment 3 Robby Workman 2010-10-24 22:04:09 UTC
Created attachment 34852
New trace

Well, the xgifb driver is included in the initramfs, so blacklisting it will require a bit more than I thought.  Here's another trace with more lines (attached).
Comment 4 Robby Workman 2010-10-24 22:05:59 UTC
Oh, and the kernel now includes one patch to fix the network issue I was originally having when I discovered this bug - here's the git commit:
http://git.kernel.org/?p=linux/kernel/git/next/linux-next.git;a=commit;h=be8c648051048bc66fbca590d00f3e8543ec32af
Comment 5 Lennert Buytenhek 2010-10-24 22:57:53 UTC
On Sun, Oct 24, 2010 at 10:04:10PM +0000, bugzilla-daemon@bugzilla.kernel.org wrote:

> Well, the xgifb driver is included in the initramfs, so blacklisting
> it will require a bit more than I thought.  Here's another trace with
> more lines (attached).

It looks like there might be a timer that is getting freed without
being deleted first.  (If I'm reading the log right.)

mv643xx_eth uses two timers (the MIB counter one and the RX OOM one),
but both are deleted by mv643xx_eth_stop.

Is there any chance you could build a custom kernel with a load of
memory debugging options enabled?  I'm sure that there are many smart
people that can diagnose the problem from just this trace, but I am
not one of them. :-|
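
[Editorial sketch, not part of the original comment.] The suspected failure mode in comment 5, a timer whose memory is freed while it is still queued, can be illustrated in userspace. The code below is a simplified simulation under stated assumptions, not the mv643xx_eth code or the kernel timer API: every sim_* name and both timer field names are hypothetical. It also shows the kind of check a free-time debugging option could perform, namely refusing to release memory that still contains an armed timer.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Userspace stand-in for a kernel timer: a node on a global pending list. */
struct sim_timer {
    struct sim_timer *prev, *next;
    bool pending;
};

/* Circular list head holding every armed timer (hypothetical). */
static struct sim_timer pending_head = { &pending_head, &pending_head, false };

/* Arm a timer by linking it into the pending list. */
static void sim_add_timer(struct sim_timer *t)
{
    assert(t != NULL);
    if (t->pending)
        return;
    t->next = pending_head.next;
    t->prev = &pending_head;
    pending_head.next->prev = t;
    pending_head.next = t;
    t->pending = true;
}

/* Disarm a timer by unlinking it; safe to call on an idle timer. */
static void sim_del_timer(struct sim_timer *t)
{
    assert(t != NULL);
    if (!t->pending)
        return;
    t->prev->next = t->next;
    t->next->prev = t->prev;
    t->prev = t->next = NULL;
    t->pending = false;
}

/* Hypothetical device embedding two timers, like the two the comment names. */
struct sim_dev {
    struct sim_timer mib_timer;     /* assumed name */
    struct sim_timer rx_oom_timer;  /* assumed name */
};

static struct sim_dev *sim_dev_open(void)
{
    struct sim_dev *d = calloc(1, sizeof(*d));

    if (d == NULL)
        return NULL;
    sim_add_timer(&d->mib_timer);
    sim_add_timer(&d->rx_oom_timer);
    return d;
}

/* The "stop" path: delete both timers before the memory may go away. */
static void sim_dev_stop(struct sim_dev *d)
{
    sim_del_timer(&d->mib_timer);
    sim_del_timer(&d->rx_oom_timer);
}

/* Count armed timers located inside the byte range [mem, mem + len). */
static int sim_armed_timers_in(const void *mem, size_t len)
{
    uintptr_t lo = (uintptr_t)mem, hi = lo + len;
    int count = 0;

    for (struct sim_timer *t = pending_head.next; t != &pending_head; t = t->next) {
        uintptr_t p = (uintptr_t)t;
        if (p >= lo && p < hi)
            count++;
    }
    return count;
}

/* Free the device only if no armed timer would be left dangling. */
static bool sim_dev_free(struct sim_dev *d)
{
    if (sim_armed_timers_in(d, sizeof(*d)) != 0)
        return false;   /* freeing now would leave stale list entries */
    free(d);
    return true;
}
```

In this sketch, calling sim_dev_free() straight after sim_dev_open() is refused because both timers are still on the pending list; calling sim_dev_stop() first lets the free succeed. If some removal path skipped the equivalent of the stop step, the pending list would keep pointers into freed memory, which is one way the kind of trace discussed here could arise. Whether that is what actually happened in this bug is not established by the thread.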
