[ofa-general] saquery hangs/timeouts

Troy Benjegerdes troy at scl.ameslab.gov
Fri Nov 9 21:30:27 PST 2007


What reasons could cause the following:

[root at sm1 infiniband]# saquery -d
Nov 09 22:30:00 692541 [C94EDFB0] -> osm_vendor_bind: Binding to port 
0x2c9021b701236
Nov 09 22:30:04 779705 [41001940] -> umad_receiver: ERR 5409: send 
completed with error (method=0x12 attr=0x11 trans_id=0x6400000001) -- 
dropping
Nov 09 22:30:04 779716 [41001940] -> umad_receiver: ERR 5410: class 0x3 
LID 0x12
Query SA failed: IB_TIMEOUT

This occurs on a machine which has had both a mthca and mlx4 card, and 
an almost identical machine with another mlx4 card works just fine.

The only real difference I can tell is that the machine that works 
previously had OFED-1.3 alpha 1 installed, and the one that does not 
work has not had OFED-1.3 installed. I also get the hang on my debian 
systems that I built the kernel, libibverbs, libmthca, etc myself.

The debian system gets the following behavior:

bash-3.1# /opt/sc07/sbin/saquery -d
Nov 09 22:36:14 336986 [F7EDD6C0] -> osm_vendor_bind: Binding to port 
0x2c90300001dd1
NodeRecord dump:
                lid.....................0xA1
                reserved................0x0
                base_version............0x



More information about the general mailing list