[ofa-general] Problem with ConnectX HBA

Roland Fehrenbacher rf at q-leap.de
Mon Jan 28 08:09:42 PST 2008


Hi,

when running MPI codes, we have the following error messages coming
from some of our servers running 2.6.22.16 with kernel modules from
ofa_kernel-1.2.5.4:

mlx4_core 0000:08:00.0: SW2HW_MPT failed (-16)

The communication on the corresponding machines is completely blocked,
and ibstat is just hanging.

Any idea what could be wrong? Just for additional info: When running
the kernel with the original 2.6.22 drivers, I had these kind of error
messages at a much higher rate.

Thanks,

Roland



More information about the general mailing list