Bug 7215
Summary: | PCMCIA network card causes either X or kernel to freeze | ||
---|---|---|---|
Product: | Drivers | Reporter: | Eric (ericabel) |
Component: | PCMCIA | Assignee: | Dominik Brodowski (linux) |
Status: | RESOLVED INSUFFICIENT_DATA | ||
Severity: | normal | CC: | akpm, linux, protasnb, stefan |
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.18 | Subsystem: | |
Regression: | Yes | Bisected commit-id: |
Description
Eric
2006-09-27 10:17:16 UTC
It could be that the machine has oopsed, only we don't know about it because you're stuck in X. Are you able to set up a serial console? netconsole is easier, but if it's a networking problem then that might not give us any info either. Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > > > > ------- Additional Comments From akpm@osdl.org 2006-09-27 10:45 ------- > It could be that the machine has oopsed, only we don't know about > it because you're stuck in X. > > Are you able to set up a serial console? > > netconsole is easier, but if it's a networking problem then that > might not give us any info either. > I can still connect through the ethernet port, which is unaffected by this problem. I will try that, and see if I can get some more clues. Anything specific I should look for when the problem occurs? > > > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > On Wed, 27 Sep 2006 13:07:00 -0700 bugme-daemon@bugzilla.kernel.org wrote: > I can still connect through the ethernet port, which is unaffected by this > problem. You can? That's interesting. When you do so, is there nothing interesting in the `dmesg' output? This means that there's no point in setting up a serial console or anything. If you can still connect to the machine, then what do you mean by "it freezes"? Just that the X interface is wedged up? If so, run `top' and `ps aux', see if some process is stuck spinning in a loop or something. Could it just be that the keyboard/mouse have gone bad? What happens if you do `sudo killall X' when logged in over the network? Does the X server terminate? Can it be restarted? etcetera.... Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > > > > ------- Additional Comments From akpm@osdl.org 2006-09-27 13:04 ------- > On Wed, 27 Sep 2006 13:07:00 -0700 > bugme-daemon@bugzilla.kernel.org wrote: > >> I can still connect through the ethernet port, which is unaffected by this >> problem. > > You can? That's interesting. Scratch that...I can't. I meant, in theory, I can connect through the ethernet port, but I just tried, and the netconsole I was working on went dead when the computer froze. > When you do so, is there nothing interesting in the `dmesg' output? Even without doing this, I thought dmesg just printed the contents of /var/log/messages, which has a time stamp, and I have tried looking at the log file right before it died (to see the last time stamp), then the next series of messages are associated with the new boot after restarting (starting with the message "restart". Basically there's nothing I can see no messages which seem to occur around the time the system freezes up, much less anything which indicates a problem (in syslog either). This is why the problem is so perplexing to me. > > This means that there's no point in setting up a serial console or anything. I guess there is...but it'll be a while. I'll need to get a serial cable...haven't had need for one of those in years. > > If you can still connect to the machine, then what do you mean by "it > freezes"? Just that the X interface is wedged up? At this point, I can say that I have no interaction with the machine, either with the mouse or keyboard. > > If so, run `top' and `ps aux', see if some process is stuck spinning in a > loop or something. > > Could it just be that the keyboard/mouse have gone bad? No, the computer will run forever if I use only the ethernet connection, it's only when I connect wirelessly that the problem manifests itself. > > What happens if you do `sudo killall X' when logged in over the network? > Does the X server terminate? Can it be restarted? Don't know, haven't been able to interact with the computer once it's frozen up yet. > > etcetera.... > > > > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > OK, I've ascertained now that it is the kernel which seizes up...nothing works. The serial terminal freezes up when the computer freezes...and I still can't find any trace of error messages. Eric Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > > > > ------- Additional Comments From ericabel@mit.edu 2006-09-27 15:36 ------- > Quoting bugme-daemon@bugzilla.kernel.org: > >> http://bugzilla.kernel.org/show_bug.cgi?id=7215 >> >> >> >> >> >> ------- Additional Comments From akpm@osdl.org 2006-09-27 13:04 ------- >> On Wed, 27 Sep 2006 13:07:00 -0700 >> bugme-daemon@bugzilla.kernel.org wrote: >> >>> I can still connect through the ethernet port, which is unaffected by this >>> problem. >> >> You can? That's interesting. > > Scratch that...I can't. I meant, in theory, I can connect through the > ethernet > port, but I just tried, and the netconsole I was working on went dead > when the > computer froze. > >> When you do so, is there nothing interesting in the `dmesg' output? > > Even without doing this, I thought dmesg just printed the contents of > /var/log/messages, which has a time stamp, and I have tried looking > at the log > file right before it died (to see the last time stamp), then the next > series of > messages are associated with the new boot after restarting (starting with the > message "restart". Basically there's nothing I can see no messages > which seem > to occur around the time the system freezes up, much less anything which > indicates a problem (in syslog either). This is why the problem is so > perplexing to me. > >> >> This means that there's no point in setting up a serial console or anything. > > I guess there is...but it'll be a while. I'll need to get a serial > cable...haven't had need for one of those in years. > >> >> If you can still connect to the machine, then what do you mean by "it >> freezes"? Just that the X interface is wedged up? > > At this point, I can say that I have no interaction with the machine, either > with the mouse or keyboard. > >> >> If so, run `top' and `ps aux', see if some process is stuck spinning in a >> loop or something. >> >> Could it just be that the keyboard/mouse have gone bad? > > No, the computer will run forever if I use only the ethernet connection, it's > only when I connect wirelessly that the problem manifests itself. > >> >> What happens if you do `sudo killall X' when logged in over the network? >> Does the X server terminate? Can it be restarted? > > Don't know, haven't been able to interact with the computer once it's > frozen up > yet. > >> >> etcetera.... >> >> >> >> ------- You are receiving this mail because: ------- >> You reported the bug, or are watching the reporter. >> > > > > > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > Hello, I hope you haven't closed out this bug report because of the time lag since my last email, but I just wanted to let you know that the problem still persists, and I am out of ideas as to what to check. It seems to be a kernel problem, since when it freezes, the serial terminal becomes unresponsive, in addition any running sub processes also hang. I do think that the error occurs before any error data can be logged, because I still can't find any trace of an error in any log file. Unfortunately going back to the 2.6.8 kernel isn't so easy for me at this point, so I am dealing with the repeated system shutdowns every time this happens. Any ideas? Erid Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > > > > ------- Additional Comments From ericabel@mit.edu 2006-10-16 08:36 ------- > OK, I've ascertained now that it is the kernel which seizes > up...nothing works. The serial terminal freezes up when the computer > freezes...and I still can't > find any trace of error messages. > > Eric > > Quoting bugme-daemon@bugzilla.kernel.org: > >> http://bugzilla.kernel.org/show_bug.cgi?id=7215 >> >> >> >> >> >> ------- Additional Comments From ericabel@mit.edu 2006-09-27 15:36 ------- >> Quoting bugme-daemon@bugzilla.kernel.org: >> >>> http://bugzilla.kernel.org/show_bug.cgi?id=7215 >>> >>> >>> >>> >>> >>> ------- Additional Comments From akpm@osdl.org 2006-09-27 13:04 ------- >>> On Wed, 27 Sep 2006 13:07:00 -0700 >>> bugme-daemon@bugzilla.kernel.org wrote: >>> >>>> I can still connect through the ethernet port, which is unaffected by this >>>> problem. >>> >>> You can? That's interesting. >> >> Scratch that...I can't. I meant, in theory, I can connect through the >> ethernet >> port, but I just tried, and the netconsole I was working on went dead >> when the >> computer froze. >> >>> When you do so, is there nothing interesting in the `dmesg' output? >> >> Even without doing this, I thought dmesg just printed the contents of >> /var/log/messages, which has a time stamp, and I have tried looking >> at the log >> file right before it died (to see the last time stamp), then the next >> series of >> messages are associated with the new boot after restarting (starting >> with the >> message "restart". Basically there's nothing I can see no messages >> which seem >> to occur around the time the system freezes up, much less anything which >> indicates a problem (in syslog either). This is why the problem is so >> perplexing to me. >> >>> >>> This means that there's no point in setting up a serial console or >>> anything. >> >> I guess there is...but it'll be a while. I'll need to get a serial >> cable...haven't had need for one of those in years. >> >>> >>> If you can still connect to the machine, then what do you mean by "it >>> freezes"? Just that the X interface is wedged up? >> >> At this point, I can say that I have no interaction with the machine, either >> with the mouse or keyboard. >> >>> >>> If so, run `top' and `ps aux', see if some process is stuck spinning in a >>> loop or something. >>> >>> Could it just be that the keyboard/mouse have gone bad? >> >> No, the computer will run forever if I use only the ethernet >> connection, it's >> only when I connect wirelessly that the problem manifests itself. >> >>> >>> What happens if you do `sudo killall X' when logged in over the network? >>> Does the X server terminate? Can it be restarted? >> >> Don't know, haven't been able to interact with the computer once it's >> frozen up >> yet. >> >>> >>> etcetera.... >>> >>> >>> >>> ------- You are receiving this mail because: ------- >>> You reported the bug, or are watching the reporter. >>> >> >> >> >> >> ------- You are receiving this mail because: ------- >> You reported the bug, or are watching the reporter. >> > > > > > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > Eric, As I understand, your laptop freezes, and after reboot you don't see anything logged in /var/log/messages, /var/lof/Xorg.log, and without X in text mode your keyboard and mouse and the whole system work fine? You also mentioned that you cant go back to the 2.6.8 because of system shutdowns - can you explain more on this. For more information, please run "dmesg -n 7" to increase verbosity of system messages and after next freeze/reboot collect the log files and attach to the bugzilla. You can also increase verbosity of Xorg: do man on Xorg and see about logverbose and verbose parameters. Thanks. Subject: Re: PCMCIA network card causes either X or kernel to freeze Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > protasnb@gmail.com changed: > > What |Removed |Added > ---------------------------------------------------------------------------- > CC| |protasnb@gmail.com > > > > > ------- Comment #7 from protasnb@gmail.com 2007-07-04 19:55 ------- > Eric, > As I understand, your laptop freezes, and after reboot you don't see anything > logged in /var/log/messages, /var/lof/Xorg.log, and without X in text > mode your > keyboard and mouse and the whole system work fine? Basically, I have not been able to reproduce the problem without booting X. The system has not yet frozen in console mode. I have since reverted back to 2.6.8, and have not upgraded kernels since, due to this issue. One work around would be to use the pcmcia-cs packages with a later kernel, but for now the system is working fine with the 2.6.8 configuration. However, on very rare occasions, when I am utilizing excessive bandwidth on my wireless network card even in 2.6.8 I have experienced the same sort of system freeze-up, which I only mention because this is evidence that the problem has nothing to do with the pcmcia driver compiled into the later kernel versions, but something more fundamental at the hardware level? Anyway, I'm still not sure if X is the problem or not. I will try the dmesg -n 7 and reproduce the system freeze and see if the log catches anything. Thanks for the response, Eric > You also mentioned that you cant go back to the 2.6.8 because of system > shutdowns - can you explain more on this. > For more information, please run "dmesg -n 7" to increase verbosity of system > messages and after next freeze/reboot collect the log files and attach to the > bugzilla. You can also increase verbosity of Xorg: do man on Xorg and > see about > logverbose and verbose parameters. > Thanks. > > > -- > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > We can't make much progress without seeing the kernel printks. There's probabyl something there, only we're not seeing it. Options would be: - When X is running, do the alt-shift-F1 thing to get back to the VGA console, and try to get X (which is still running) to freeze in this state. Maybe you'll see some messages - Get that serial console working! - have you tried getting netconsole working over the wired ethernet? If we _can_ get those messages coming out then we can go further and investigate the NMI watchdog, whcih can be useful in locating random hangs. But until we can see those messages there isn't much point in setting that up. Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > akpm@osdl.org changed: > > What |Removed |Added > ---------------------------------------------------------------------------- > Regression|0 |1 > > > > > ------- Comment #9 from akpm@osdl.org 2007-08-02 17:26 ------- > We can't make much progress without seeing the kernel printks. There's > probabyl something there, only we're not seeing it. Options would be: > > - When X is running, do the alt-shift-F1 thing to get back to the VGA > console, > and try to get X (which is still running) to freeze in this state. Maybe > you'll > see some messages I'll give this a try. > > - Get that serial console working! > > - have you tried getting netconsole working over the wired ethernet? Both of these freeze up along with X (if X is indeed causing the problem) > > If we _can_ get those messages coming out then we can go further and > investigate the NMI watchdog, whcih can be useful in locating random hangs. > But until we can see those messages there isn't much point in setting > that up. I understand. > > > -- > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > Eric, any progress on debugging? Do you need any assistance with debug procedures? If you have any ideas that haven't been already suggested, it would be greatly appreciated. So far I have been unable to recover any error messages associated with the system freeze up...remote terminal (ssh) freezes up, as does a serial terminal. I tried writing a perl script which prints the time in a text file, then I'll freeze the system, and check the text file, and indeed, the script stops with the system freeze. This is the most information I have been able to accumulate. Eric Quoting bugme-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > > > > ------- Comment #11 from protasnb@gmail.com 2007-11-07 21:34 ------- > Eric, any progress on debugging? Do you need any assistance with debug > procedures? > > > -- > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email > ------- You are receiving this mail because: ------- > You reported the bug, or are watching the reporter. > How about MNI watchdog? You can enable it by "nmi_watchdog=<1 or 2>" on the boot line. 1 or 2 depends if you have APIC or IO-APIC based NMI. I can tell more precisely if you attach /proc/interrupts and dmesg. Eric, any updates on this? How is it working with recent kernel? Seems I have exactly the same problem on a TP23 with Debian 4.0, 2.6.18.
Description matches, I can provoke the error by more network traffic via the card.
If debugging help still needed, I can jump in.
Regards,
Stefan
> Eric, any updates on this? How is it working with recent kernel?
>
I can suggest git bisect since you have good working version. You can try to identify what breaks it going from 2.6.8 to 2.6.9 say. See http://www.kernel.org/doc/local/git-quick.html It is a bit elaborate process, but it should show a kernel change that impaired your laptop. Could you post the output of "lspcmcia -vvv" please? You can probably close this out. The computer this applies to is dead. Eric Quoting bugzilla-daemon@bugzilla.kernel.org: > http://bugzilla.kernel.org/show_bug.cgi?id=7215 > > > Dominik Brodowski <linux@brodo.de> changed: > > What |Removed |Added > ---------------------------------------------------------------------------- > CC| |linux@brodo.de > > > > > --- Comment #17 from Dominik Brodowski <linux@brodo.de> 2009-10-17 > 12:19:58 --- > Could you post the output of "lspcmcia -vvv" please? > > -- > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email > ------- You are receiving this mail because: ------- > You reported the bug. > closing this bug, as no more data is available |