[ofa-general] Re: saquery hangs/timeouts
Troy Benjegerdes
troy at scl.ameslab.gov
Fri Nov 9 21:38:54 PST 2007
saquery apparently very much dislikes having two ports active at once.
If I pull the cable off the second port it works.
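
To see how many ports are up before running saquery, the port state files under sysfs can be read directly. This is only a minimal sketch: it assumes the usual /sys/class/infiniband/<device>/ports/<n>/state layout, where an active port reads "4: ACTIVE", and the function name active_ib_ports is made up for illustration.

```python
import os


def active_ib_ports(sysfs_root="/sys/class/infiniband"):
    """Return (device, port, state) tuples for every port reporting ACTIVE.

    Assumes each port exposes a 'state' file such as '4: ACTIVE' or '1: DOWN'.
    """
    found = []
    if not os.path.isdir(sysfs_root):
        return found
    for dev in sorted(os.listdir(sysfs_root)):
        ports_dir = os.path.join(sysfs_root, dev, "ports")
        if not os.path.isdir(ports_dir):
            continue
        for port in sorted(os.listdir(ports_dir)):
            state_file = os.path.join(ports_dir, port, "state")
            try:
                with open(state_file) as fh:
                    state = fh.read().strip()
            except OSError:
                # Skip ports whose state cannot be read.
                continue
            if state.endswith("ACTIVE"):
                found.append((dev, port, state))
    return found


if __name__ == "__main__":
    ports = active_ib_ports()
    for dev, port, state in ports:
        print(f"{dev} port {port}: {state}")
    if len(ports) > 1:
        print("More than one port is ACTIVE.")
```

If this lists two active ports on the failing machine and one on the working machine, that would match the cable-pull result above.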
Troy Benjegerdes wrote:
> What reasons could cause the following:
>
> [root at sm1 infiniband]# saquery -d
> Nov 09 22:30:00 692541 [C94EDFB0] -> osm_vendor_bind: Binding to port 0x2c9021b701236
> Nov 09 22:30:04 779705 [41001940] -> umad_receiver: ERR 5409: send completed with error (method=0x12 attr=0x11 trans_id=0x6400000001) -- dropping
> Nov 09 22:30:04 779716 [41001940] -> umad_receiver: ERR 5410: class 0x3 LID 0x12
> Query SA failed: IB_TIMEOUT
>
> This occurs on a machine which has had both an mthca and an mlx4 card,
> while an almost identical machine with another mlx4 card works just fine.
>
> The only real difference I can tell is that the machine that works
> previously had OFED-1.3 alpha 1 installed, and the one that does not
> work has never had OFED-1.3 installed. I also get the hang on my Debian
> systems, where I built the kernel, libibverbs, libmthca, etc. myself.
>
> The Debian system gets the following behavior:
>
> bash-3.1# /opt/sc07/sbin/saquery -d
> Nov 09 22:36:14 336986 [F7EDD6C0] -> osm_vendor_bind: Binding to port 0x2c90300001dd1
> NodeRecord dump:
> lid.....................0xA1
> reserved................0x0
> base_version............0x
>