[ofa-general] saquery hangs/timeouts
Troy Benjegerdes
troy at scl.ameslab.gov
Fri Nov 9 21:30:27 PST 2007
What reasons could cause the following:
[root at sm1 infiniband]# saquery -d
Nov 09 22:30:00 692541 [C94EDFB0] -> osm_vendor_bind: Binding to port
0x2c9021b701236
Nov 09 22:30:04 779705 [41001940] -> umad_receiver: ERR 5409: send
completed with error (method=0x12 attr=0x11 trans_id=0x6400000001) --
dropping
Nov 09 22:30:04 779716 [41001940] -> umad_receiver: ERR 5410: class 0x3
LID 0x12
Query SA failed: IB_TIMEOUT
This occurs on a machine which has had both a mthca and mlx4 card, and
an almost identical machine with another mlx4 card works just fine.
The only real difference I can tell is that the machine that works
previously had OFED-1.3 alpha 1 installed, and the one that does not
work has not had OFED-1.3 installed. I also get the hang on my debian
systems that I built the kernel, libibverbs, libmthca, etc myself.
The debian system gets the following behavior:
bash-3.1# /opt/sc07/sbin/saquery -d
Nov 09 22:36:14 336986 [F7EDD6C0] -> osm_vendor_bind: Binding to port
0x2c90300001dd1
NodeRecord dump:
lid.....................0xA1
reserved................0x0
base_version............0x
More information about the general
mailing list