[Users] OpenSM error message rosetta stone?

Ira Weiny weiny2 at llnl.gov
Tue Feb 19 09:10:54 PST 2013


On Tue, 19 Feb 2013 10:52:16 -0600
Narayan Desai <narayan.desai at gmail.com> wrote:

> Is there a good guide to decoding opensm error logs?
> 
> i'm specifically seeing this:
> Feb 19 10:50:26 667041 [21C62700] 0x01 -> sm_mad_ctrl_send_err_cb: ERR
> 3120 Timeout while getting attribute 0x15 (PortInfo); Possible mis-set
> mkey?
> Feb 19 10:50:26 667057 [21C62700] 0x01 -> log_send_error: ERR 5411: DR
> SMP Send completed with error (IB_TIMEOUT) -- dropping
>                         Method 0x1, Attr 0x15, TID 0x1b684f
> 
> a lot.

What version of OpenSM is this?  Jim Foraker here at LLNL worked on the mkey support and we just went through fixing an issue similar to the above but I can't remember the details off the top of my head.

> 
> Also, the timestamp is clear enough, but what do the next 3 fields
> (667*, [21C6*, and 0x01 mean?

667* -- milisecond time stamp
21C* -- thread id
0x01 -- log level

Ira

> thanks.
>  -nld
> _______________________________________________
> Users mailing list
> Users at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/users


-- 
Ira Weiny
Member of Technical Staff
Lawrence Livermore National Lab
925-423-8008
weiny2 at llnl.gov



More information about the Users mailing list