[openib-general] opensm errors with ehca

Troy Benjegerdes hozer at hozed.org
Sun Oct 30 15:55:04 PST 2005


The firmware on the IBM eHCA causes opensm to spit out these kinds of
errors all the time..

Is there a way we can either not send P_KeyTable requests to any eHCA
guids, or figure out what (if anything) is broken in their firmware?

Is this a spec violation, or just ambiguities in implementation?

Oct 30 17:49:46 053820 [43005960] -> umad_receiver: ERR 5409: send
completed wit
h error (method=0x1 attr=0x16 trans_id=0x158c) -- dropping.
Oct 30 17:49:46 053830 [43005960] -> umad_receiver: ERR 5411: DR SMP hop
ptr 0 h
op count 2 DR SLID 0x0 DR DLID 0x0
Oct 30 17:49:46 053839 [43005960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MA
D completed in error (IB_TIMEOUT).
Oct 30 17:49:46 053861 [43005960] -> SMP dump:
                                base_ver................0x1
                                mgmt_class..............0x81
                                class_ver...............0x1
                                method..................0x1 (SubnGet)
                                D bit...................0x0
                                status..................0x0
                                hop_ptr.................0x0
                                hop_count...............0x2
                                trans_id................0x158c
                                attr_id.................0x16 (P_KeyTable)
                                resv....................0x0
                                attr_mod................0x260000
                                m_key...................0x0000000000000000
                                dr_slid.................0xFFFF
                                dr_dlid.................0xFFFF

                                Initial path: [0][1][16]
                                Return path:  [0][0][0]
                                Reserved:     [0][0][0][0][0][0][0]




More information about the general mailing list