Hi, I am running a benchmark against DB2 on a cluster with several ext3 (SSD) and GPFS file systems. After the benchmark had been running for about two days, I noticed that one of the nodes had rebooted because of an error while touching a file. The file was being touched by one of the HA monitors of Tivoli System Automation. I immediately tried to reproduce the error by touching the file again manually. The first few touches succeeded, then out of the blue I got a "Permission denied" error. I kept touching the file right after that and got no further errors. Neither CPU nor I/O utilization was at its peak at the time. The error occurs randomly, so it is difficult to reproduce. It has happened on both a GPFS file system and an ext3 file system. I don't have much information on the internals of this problem. Do you have suggestions on how to instrument, debug, or monitor the error so that I can move the investigation forward and add more information to this bug report? Thanks, Phil
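One low-effort way to gather more data is to automate the manual touch test and record the exact errno and a timestamp on every failure, so that failures can later be matched against system logs. The following is only a minimal sketch under stated assumptions: the function names (touch, touch_loop), the attempt count, and the delay are hypothetical and do not come from this report or from Tivoli System Automation. It assumes a Linux system with Python 3 and approximates what touch(1) does with an open followed by a timestamp update.

```python
import errno
import os
import time


def touch(path):
    """Create the file if it is missing, then update its timestamps.

    This approximates touch(1); it is not the HA monitor's actual code.
    """
    fd = os.open(path, os.O_WRONLY | os.O_CREAT, 0o644)
    os.close(fd)
    os.utime(path, None)  # set atime and mtime to the current time


def touch_loop(path, attempts=1000, delay=0.01):
    """Touch `path` repeatedly and return a list describing each failure."""
    failures = []
    for attempt in range(attempts):
        try:
            touch(path)
        except OSError as exc:
            # Record enough context to correlate with dmesg or syslog later.
            failures.append({
                "attempt": attempt,
                "time": time.time(),
                "errno": exc.errno,
                "errno_name": errno.errorcode.get(exc.errno, "UNKNOWN"),
                "euid": os.geteuid(),
            })
        time.sleep(delay)
    return failures


if __name__ == "__main__":
    import sys

    # Usage (hypothetical): python3 touch_loop.py /path/to/monitored/file
    for failure in touch_loop(sys.argv[1]):
        print(failure)
```

A "Permission denied" message corresponds to EACCES, so an entry with errno_name set to "EACCES" would confirm that the failure comes from the open or timestamp call itself rather than from something else in the monitor. Running the loop on both the ext3 and the GPFS mount would also show whether the failure rate differs between them.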
Can you reproduce this on an upstream Linux system without GPFS, or does it need GPFS added?
I was never able to reproduce it again. I reproduced it two or three times, then it stopped happening. But as I said, it happens on an ext3 file system as well. I'm not sure whether it would still happen on ext3 if I didn't have GPFS compiled.
Thanks. Given that it happened with third-party code loaded and is not reproducible, I'm not sure there is anything more we can do.