Bug 5046
Summary: | Switching elevator on-the-fly causes NULL pointer dereference oops | ||
---|---|---|---|
Product: | IO/Storage | Reporter: | Kerin Millar (kfm) |
Component: | Block Layer | Assignee: | Jens Axboe (axboe) |
Status: | CLOSED CODE_FIX | ||
Severity: | high | CC: | akpm, cp |
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.13-rc6 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Attachments: | dmesg-with-oops.log, dotconfig, lspci-vvv, fdisk-l, mount, mdstat |
Description
Kerin Millar
2005-08-11 10:40:28 UTC
Created attachment 5604
dmesg-with-oops.log
Complete dmesg log (oops visible at tail end).
Created attachment 5605
dotconfig
The .config file used for the 2.6.13-rc6 sources.
Created attachment 5606
lspci-vvv
Output from the lspci -vvv command.
Created attachment 5607
fdisk-l
Output from the fdisk -l command for /dev/sda, /dev/sdb, /dev/sdc and /dev/sdd.
Created attachment 5608
mount
Active mounts (correct at time of oops).
Created attachment 5609
mdstat
Output of /proc/mdstat.
The other thing I want to say at this point is that, although the machine is reasonably high powered and runs a lot of software, it is seldom put under any significant load. Certainly, there would have been practically zero load and minimal I/O activity at the time that this oops was triggered.

Corrections: The move was from deadline -> cfq -> deadline -> cfq -> deadline, so the switch was made twice within the timeframe. I apologise for not being 100% clear, but it may be that the oops occurred on the final switch to cfq (prior to trying to switch back to deadline). I know only that the oops occurred on either the final _or_ the penultimate switch. Also, I meant to say that the machine *didn't* come up after trying to remotely reboot.

Jens, I've tracked down the cause of this. It affects both the AS and the Deadline schedulers. The problem is that e->elevator_data is NULL when entering the *_exit_queue functions. Fixing it is beyond my realm, but hopefully this helps you some.

Well, surely this is now resolved as of the 2.6.14.2 release?
http://www.kernel.org/git/?p=linux/kernel/git/chrisw/stable-queue.git;a=blob;h=1e59c36347ca8d3321abbcfa56e242c274dc6a3b;hb=1876efb4444c275f3aabbc79541edac189e57d91;f=queue/cfq-io-sched-fix.patch
In view of this, I would request closure.

Yep indeed, 2.6.15-rc1 has it as well.
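
For readers unfamiliar with the failure mode reported above (e->elevator_data being NULL on entry to the *_exit_queue functions), here is a minimal user-space sketch of the pattern. It is not kernel code and not the fix that was merged. The struct and function names (sketch_elevator, sketch_sched_data, sketch_init_queue, sketch_exit_queue) are hypothetical, and only the elevator_data field comes from the report. The sketch assumes the exit path may be reached when the private data was never attached or has already been torn down, and shows a defensive guard for that case.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical user-space stand-in for an elevator queue.
 * Only the elevator_data field named in the report is modelled. */
struct sketch_elevator {
    void *elevator_data;   /* scheduler-private data; NULL when absent */
};

/* Hypothetical private data for a deadline-like scheduler. */
struct sketch_sched_data {
    int queued;
};

/* Allocate private data and attach it to the elevator.
 * Returns 0 on success, -1 if allocation fails. */
int sketch_init_queue(struct sketch_elevator *e)
{
    struct sketch_sched_data *sd = calloc(1, sizeof(*sd));

    if (sd == NULL)
        return -1;
    e->elevator_data = sd;
    return 0;
}

/* Tear down the private data.
 * Returns 1 if data was freed, 0 if there was nothing to free.
 * Without the NULL check, a field access through sd would be the
 * user-space analogue of the reported dereference. */
int sketch_exit_queue(struct sketch_elevator *e)
{
    struct sketch_sched_data *sd;

    assert(e != NULL);          /* the elevator itself must be valid */
    sd = e->elevator_data;
    if (sd == NULL)
        return 0;               /* already torn down or never set up */

    sd->queued = 0;             /* would fault here if sd were NULL */
    free(sd);
    e->elevator_data = NULL;    /* make a repeated exit harmless */
    return 1;
}
```

Calling sketch_exit_queue twice in a row on the same elevator, as might happen across repeated scheduler switches, returns 1 the first time and 0 the second instead of faulting. Whether the real fix guards the exit path or instead corrects the switch ordering so the pointer is never NULL there is not stated in this report.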