[ofa-general] MPI IB Errors

Jeff Squyres jsquyres at cisco.com
Wed Aug 22 14:39:43 PDT 2007


On Aug 22, 2007, at 5:32 PM, John Leidel wrote:

> Yeah, I'm sure the OFED release is the same.  I'm running ROCKS  
> 4.2.1, so all the node images are identical regarding the package  
> selection.  There could possibly be [well, probably] a difference  
> in the firmware releases of the HCAs and switches from the older  
> machines and the latest delivery.

Try running ibv_devinfo on your new nodes, which should show you the  
HCA(s) on your host.  I suspect that it will fail with a similar  
error (but am not 100% sure -- I'm the MPI guy, not the verbs guy :-) ).

If this is the case, then you've got a bigger issue that your IB  
drivers are not loading.  This will need to be fixed before you  
investigate the firmware level on your HCAs across the cluster.

(other IB stack experts feel free to chime in...)

-- 
Jeff Squyres
Cisco Systems




More information about the general mailing list