[Users] OpenSM error message rosetta stone?
Ira Weiny
weiny2 at llnl.gov
Tue Feb 19 09:10:54 PST 2013
On Tue, 19 Feb 2013 10:52:16 -0600
Narayan Desai <narayan.desai at gmail.com> wrote:
> Is there a good guide to decoding opensm error logs?
>
> i'm specifically seeing this:
> Feb 19 10:50:26 667041 [21C62700] 0x01 -> sm_mad_ctrl_send_err_cb: ERR
> 3120 Timeout while getting attribute 0x15 (PortInfo); Possible mis-set
> mkey?
> Feb 19 10:50:26 667057 [21C62700] 0x01 -> log_send_error: ERR 5411: DR
> SMP Send completed with error (IB_TIMEOUT) -- dropping
> Method 0x1, Attr 0x15, TID 0x1b684f
>
> a lot.
What version of OpenSM is this? Jim Foraker here at LLNL worked on the mkey support and we just went through fixing an issue similar to the above but I can't remember the details off the top of my head.
>
> Also, the timestamp is clear enough, but what do the next 3 fields
> (667*, [21C6*, and 0x01 mean?
667* -- milisecond time stamp
21C* -- thread id
0x01 -- log level
Ira
> thanks.
> -nld
> _______________________________________________
> Users mailing list
> Users at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/users
--
Ira Weiny
Member of Technical Staff
Lawrence Livermore National Lab
925-423-8008
weiny2 at llnl.gov
More information about the Users
mailing list