[ofa-general] Need help diagnosing a problem....

Mike Heinz michael.heinz at qlogic.com
Thu May 8 06:54:24 PDT 2008


I was smoke testing a small cluster when one of the nodes posted this:
 
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: Internal error detected:
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[00]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[01]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[02]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[03]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[04]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[05]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[06]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[07]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[08]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[09]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0a]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0b]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0c]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0d]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0e]: ffffffff
May  7 16:47:00 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0f]: ffffffff

At this point, all further IB traffic on that node failed, and the node
silently hung during shutdown.
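
For anyone triaging a similar dump, here is a minimal sketch that pulls the
buf[] words out of syslog text and reports whether every word reads back as
ffffffff, as it does in the log above. The function names parse_dump and
all_ones are hypothetical and not part of any OFED tool. An all-ones readback
is often, though not necessarily, a sign that the device stopped answering
register reads; the log by itself does not confirm that, so treat it as an
assumption to check.

```python
import re

# Matches one word of the dump, e.g. "buf[0a]: ffffffff".
# It ignores the syslog prefix, so it works whether or not the
# mail client wrapped each log line in two.
BUF_RE = re.compile(r"buf\[([0-9a-f]{2})\]:\s*([0-9a-f]{8})")


def parse_dump(log_text):
    """Return {index: value} for every buf[..] word found in log_text."""
    return {int(idx, 16): int(val, 16) for idx, val in BUF_RE.findall(log_text)}


def all_ones(words):
    """True if the dump is non-empty and every word is 0xffffffff."""
    return bool(words) and all(val == 0xFFFFFFFF for val in words.values())
```

Feeding the contents of /var/log/messages (or wherever the kernel log lands on
your distribution) to parse_dump and then calling all_ones on the result gives
a quick yes/no before digging further.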
 
Any suggestions as to what I should look at?
 
--
Michael Heinz
Principal Engineer, Qlogic Corporation
King of Prussia, Pennsylvania
 

