<div dir="ltr"><div>Hi Tim,</div><div><br></div><div>Attribute ID 0xff17 is in the vendor-specific range for SM attributes and is not supported by (at least) the upstream ibsim.</div><div><br></div><div>I think you are using MLNX OpenSM, rather than upstream or OFED OpenSM, with the upstream ibsim. I'm not sure whether MLNX ibsim supports the additional vendor-specific SM attributes.</div><div><br></div><div>Can you work with upstream or OFED OpenSM, or only with MLNX OpenSM? If only MLNX OpenSM, I'll try to find out whether the MLNX OFED ibsim supports the additional attributes needed to run it.</div><div><br></div><div>-- Hal</div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Feb 15, 2018 at 11:50 AM, Tim Miller <span dir="ltr">&lt;<a href="mailto:btmiller@helix.nih.gov" target="_blank">btmiller@helix.nih.gov</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">I am attempting to use ibsim to test some possible configuration changes in our routing, but I am running into some difficulties. I can get the simulator started, but opensm fails to discover the fabric in the simulated environment. It discovers the switch to which the host running opensm is connected, but it can't discover any further than that. In the opensm log, I see:<br>
<br>
Feb 14 16:31:50 047307 [AD332700] 0x04 -> ni_rcv_process_new: Discovered new Switch node,<br>
<wbr> GUID 0x7cfe900300b49890, TID 0x1239<br>
Feb 14 16:31:50 047821 [AD533700] 0x04 -> nd_rcv_process_nd: Node 0x7cfe900300b49890<br>
<wbr> Description = SwitchIB Mellanox Technologies<br>
Feb 14 16:31:50 047847 [B5974700] 0x01 -> log_send_error: ERR 5411: DR SMP Send completed with error (IB_TIMEOUT) -- dropping<br>
Method 0x1, Attr 0xFF17, TID 0x123b<br>
Feb 14 16:31:50 047866 [B5974700] 0x01 -> Received SMP on a 1 hop path: Initial path = 0,1, Return path = 0,0<br>
Feb 14 16:31:50 047893 [B5974700] 0x01 -> sm_mad_ctrl_send_err_cb: ERR 3113: MAD completed in error (IB_TIMEOUT): SubnGet(GeneralInfo), attr_mod 0x4, TID 0x123b<br>
Feb 14 16:31:50 047913 [B5974700] 0x04 -> osm_hm_set_by_physp: Remote port of 0x7cfe900300b49890[0] couldn't be found<br>
Feb 14 16:31:50 047921 [B5974700] 0x01 -> sm_mad_ctrl_send_err_cb: ERR 3120: Timeout while getting attribute 0xFF17 (GeneralInfo); Possible mis-set mkey?<br>
Feb 14 16:31:50 047927 [B5974700] 0x01 -> sm_mad_ctrl_send_err_cb: Error during initialization: got General Info time out from node 0x7cfe900300b49890<br>
<br>
And in the simulator console, I see messages of the form:<br>
<br>
ibwarn: [32331] process_packet: no one to handle pkt: class 0x81, attr 0xff17<br>
<br>
Looking at the output of the "dump" command from within the console, it shows that all ports are in Init/LinkUp, except for the SMA port, which is in state Active/LinkUp.<br>
<br>
Does anyone have any idea what I might be doing wrong here?<br>
<br>
Thanks,<br>
Tim<span class="HOEnZb"><font color="#888888"><br>
<br>
-- <br>
Tim Miller<br>
NIH HPC systems staff<br>
<a href="tel:301-827-5261" target="_blank" value="+13018275261">301-827-5261</a><br>
<a href="https://hpc.nih.gov" target="_blank" rel="noreferrer">https://hpc.nih.gov</a><br>
<br>
</font></span></blockquote></div><br></div>
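<div>As a rough illustration of the diagnosis above, the ibsim console warning can be parsed to check whether an unhandled attribute falls in the vendor-specific range. This is only a sketch: the function names are hypothetical, and the 0xFF00-0xFFFF bounds are an assumption about where the vendor-specific SM attribute range lies (0xff17 falls inside it, as noted in the reply).</div>

```python
import re

# Assumption: SM attribute IDs 0xFF00-0xFFFF form the vendor-specific range.
VENDOR_ATTR_MIN = 0xFF00
VENDOR_ATTR_MAX = 0xFFFF

# Management class values for subnet management MADs.
SM_CLASSES = {0x01: "LID-routed SM", 0x81: "directed-route SM"}

# Matches the ibsim console warning quoted in the thread.
_WARN_RE = re.compile(
    r"no one to handle pkt: class (0x[0-9a-fA-F]+), attr (0x[0-9a-fA-F]+)"
)


def parse_unhandled(line):
    """Return (mgmt_class, attr_id) from an ibsim warning, or None."""
    m = _WARN_RE.search(line)
    if m is None:
        return None
    return int(m.group(1), 16), int(m.group(2), 16)


def is_vendor_specific(attr_id):
    """True if attr_id lies in the assumed vendor-specific range."""
    return VENDOR_ATTR_MIN <= attr_id <= VENDOR_ATTR_MAX


def describe(line):
    """Summarize an ibsim warning line in one human-readable sentence."""
    parsed = parse_unhandled(line)
    if parsed is None:
        return None
    mgmt_class, attr_id = parsed
    class_name = SM_CLASSES.get(mgmt_class, "unknown class")
    kind = "vendor-specific" if is_vendor_specific(attr_id) else "standard"
    return f"{class_name} (0x{mgmt_class:02x}), {kind} attribute 0x{attr_id:04x}"


if __name__ == "__main__":
    warning = ("ibwarn: [32331] process_packet: no one to handle pkt: "
               "class 0x81, attr 0xff17")
    print(describe(warning))
```

<div>Run against the warning from the simulator console, this reports a directed-route SM packet carrying a vendor-specific attribute, which matches the explanation that the simulator has no handler for it.</div>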