[ofa-general] Problem with ConnectX HBA
Roland Fehrenbacher
rf at q-leap.de
Mon Jan 28 08:09:42 PST 2008
Hi,
When running MPI codes, we see the following error message on some
of our servers running kernel 2.6.22.16 with the kernel modules from
ofa_kernel-1.2.5.4:
mlx4_core 0000:08:00.0: SW2HW_MPT failed (-16)
Communication on the affected machines is completely blocked,
and ibstat just hangs.
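
For reference, a minimal sketch for decoding the number in the log line above. It assumes the value in parentheses is a negated Linux errno, which is the usual kernel convention but is not confirmed here for this particular message; the helper name decode_failure is made up for illustration. On Linux, errno 16 is EBUSY.

```python
import errno
import os
import re

# The log line as reported above.
LOG_LINE = "mlx4_core 0000:08:00.0: SW2HW_MPT failed (-16)"


def decode_failure(line):
    """Return (command, errno name, errno text) for a '<CMD> failed (<n>)' line.

    Hypothetical helper: assumes <n> is a negated errno value.
    Returns None if the line does not match that pattern.
    """
    match = re.search(r"(\w+) failed \((-?\d+)\)", line)
    if match is None:
        return None
    command = match.group(1)
    code = abs(int(match.group(2)))
    name = errno.errorcode.get(code, "unknown")
    return command, name, os.strerror(code)


if __name__ == "__main__":
    print(decode_failure(LOG_LINE))
```

If that assumption holds, the driver is reporting that the SW2HW_MPT command came back busy, which only narrows down the symptom and does not explain the cause.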
Any idea what could be wrong? For additional info: when running
the kernel with the original 2.6.22 drivers, I saw this kind of error
message at a much higher rate.
Thanks,
Roland