[plug] re: server freezing

Andrew Barbara andrew at mailerdirect.net
Wed Jul 9 23:03:31 WST 2003


I have a server that has 2 Xeons and shows up as 4 CPUs in the cpu list, 
the server was running RH7.3 and has just been upgraded to RH9.0, we have 
no issues with locking or anything.  The differences are that this is an 
Intel MB and Adaptec RAID controller.

Andrew

On Wed, 9 Jul 2003, Craig Ringer wrote:

> > [root at gfpmsql root]# /sbin/lspci 
> > 00:00.0 Host bridge: ServerWorks: Unknown device 0012 (rev 13)
> 
> Try updating your pci.ids database from pciids.sourceforge.net . It 
> doesn't really change much, but it makes lspci output much easier to 
> work with and identify.
> 
> >>>[root at gfpmsql root]# cat /proc/interrupts 
> > 
> >            CPU0       CPU1       CPU2       CPU3       
> >   0:   16506895   16507517   16507378   16507241    IO-APIC-edge  timer
> >   1:          1          1          1          1    IO-APIC-edge 
> > keyboard
> >   2:          0          0          0          0          XT-PIC 
> > cascade
> >   8:          1          0          0          0    IO-APIC-edge  rtc
> >  11:          0          0          0          0   IO-APIC-level 
> > usb-ohci
> >  12:         12          8          8         13    IO-APIC-edge  PS/2
> > Mouse
> >  14:          1          0          0          1    IO-APIC-edge  ide0
> >  18:       8502       8519       8504       8532   IO-APIC-level  eth0
> >  20:          4          4          4          4   IO-APIC-level 
> > aic7xxx
> >  22:       5211       5230       5251       5209   IO-APIC-level  ips
> >  29:     332445     332628     332689     332663   IO-APIC-level  eth1
> > NMI:          0          0          0          0 
> > LOC:   66029161   66029191   66029191   66029191 
> > ERR:          0
> > MIS:          0
> 
> All looks reasonable. I can tell right off that you don't use your 
> onboard SCSI or ATA interfaces :-) More importantly, nothing is trying 
> to share an interrupt or anything silly like that.
> 
> >>>[root at gfpmsql root]# uname -a
> > 
> > Linux gfpmsql 2.4.18-14smp #1 SMP Wed Sep 4 12:34:47 EDT 2002 i686 i686
> > i386 GNU/Linux
> 
> Try updating to 2.4.20, especially as you have a fairly recent chipset 
> (newer than your kernel and distro at least). Sometimes chipset-specific 
> tweaks and fixes get in. It shouldn't be needed, but sometimes it is.
> 
> > While gathering this information the server froze, no errors nor
> > messages.
> 
> That's just damnn strange. As I said before, you should probably look at 
> talking to its IPMI interface to ask it if if thinks there's anything 
> wrong. Unfortunately, I think this requires kernel patches to 2.4.x .
> 
> The only other thing I can suggest now is to try running on a 
> uniprocessor kernel - just to see. It's remotely possible that you're 
> encountering a problem with a driver that has locking problems or 
> something similar. Depends, really, on if the server can take the 
> workload on only one processor with HyperThreading disabled.
> 
> Craig Ringer
> 
> 

-- 




More information about the plug mailing list