<div dir="ltr"><div>Are you sure you're using MLNX ibsim with MLNX OpenSM ? It looks like MLNX ibsim supports 0xff17 to me so the message "process_packet: no one to handle pkt: class 0x81, attr 0xff17" shouldn't come out.</div><div><br></div><div>Can you send me your ibnetdiscover file that is used as input to ibsim ? Maybe the real problem is:</div><div>osm_hm_set_by_physp: Remote port of 0x7cfe900300b49890[0] couldn't be found</div><div>That looks like remote port to some switch port 0 which looks odd to me as switch port 0 has no peer port and it shouldn't be looking for one. Which MLNX OpenSM version ?<br></div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Feb 15, 2018 at 3:50 PM, Tim Miller <span dir="ltr"><<a href="mailto:btmiller@helix.nih.gov" target="_blank">btmiller@helix.nih.gov</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Hal,<br>
<br>
Thanks for looking into this. You're indeed correct that I'm using an MLNX OFED ibsim (and opensm for that matter). I could try running both from a vanilla OpenFabrics release and see if I have any better luck; let me go ahead and try that...<br>
<br>
Regards,<br>
Tim<span><br>
<br>
On 02/15/2018 03:39 PM, Hal Rosenstock wrote:<br>
</span><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;padding-left:1ex;border-left-color:rgb(204,204,204);border-left-width:1px;border-left-style:solid"><span>
Just checked. MLNX OFED ibsim supports 0xff17 attribute.<br>
<br></span><span>
On Thu, Feb 15, 2018 at 3:36 PM, Hal Rosenstock <<a href="mailto:hal.rosenstock@gmail.com" target="_blank">hal.rosenstock@gmail.com</a> <mailto:<a href="mailto:hal.rosenstock@gmail.com" target="_blank">hal.rosenstock@gmail.c<wbr>om</a>>> wrote:<br>
<br>
Hi Tim,<br>
<br>
Attribute ID 0xff17 is in the vendor specific range for SM<br>
attributes and not supported with (at least) the upstream ibsim.<br>
<br>
I think you are using MLNX OpenSM rather than upstream or OFED<br>
OpenSM with the upstream ibsim. I'm not sure if MLNX ibsim<br>
supports the additional vendor specific SM attributes or not.<br>
<br>
Can you work with some upstream or OFED OpenSM or only MLNX OpenSM<br>
? If not, I try to find out whether using the MLNX OFED ibsim<br>
supports the additional attributes for running MLNX OpenSM.<br>
<br>
-- Hal<br>
<br>
<br>
On Thu, Feb 15, 2018 at 11:50 AM, Tim Miller<br></span><div><div class="h5">
<<a href="mailto:btmiller@helix.nih.gov" target="_blank">btmiller@helix.nih.gov</a> <mailto:<a href="mailto:btmiller@helix.nih.gov" target="_blank">btmiller@helix.nih.gov</a><wbr>>> wrote:<br>
<br>
I am attempting to use ibsim to test some possible<br>
configuration changes in our routing, but I am running into<br>
some difficulties. I can get the simulator started, but opensm<br>
fails to discover the fabric in the simulated environment. It<br>
discovers the switch to which the host running opensm is<br>
connected, but it can't discover any further than that. In the<br>
opensm log, I see:<br>
<br>
Feb 14 16:31:50 047307 [AD332700] 0x04 -> ni_rcv_process_new:<br>
Discovered new Switch node,<br>
GUID 0x7cfe900300b49890, TID 0x1239<br>
Feb 14 16:31:50 047821 [AD533700] 0x04 -> nd_rcv_process_nd:<br>
Node 0x7cfe900300b49890<br>
Description = SwitchIB Mellanox Technologies<br>
Feb 14 16:31:50 047847 [B5974700] 0x01 -> log_send_error: ERR<br>
5411: DR SMP Send completed with error (IB_TIMEOUT) -- dropping<br>
Method 0x1, Attr 0xFF17, TID 0x123b<br>
Feb 14 16:31:50 047866 [B5974700] 0x01 -> Received SMP on a 1<br>
hop path: Initial path = 0,1, Return path = 0,0<br>
Feb 14 16:31:50 047893 [B5974700] 0x01 -><br>
sm_mad_ctrl_send_err_cb: ERR 3113: MAD completed in error<br>
(IB_TIMEOUT): SubnGet(GeneralInfo), attr_mod 0x4, TID 0x123b<br>
Feb 14 16:31:50 047913 [B5974700] 0x04 -> osm_hm_set_by_physp:<br>
Remote port of 0x7cfe900300b49890[0] couldn't be found<br>
Feb 14 16:31:50 047921 [B5974700] 0x01 -><br>
sm_mad_ctrl_send_err_cb: ERR 3120: Timeout while getting<br>
attribute 0xFF17 (GeneralInfo); Possible mis-set mkey?<br>
Feb 14 16:31:50 047927 [B5974700] 0x01 -><br>
sm_mad_ctrl_send_err_cb: Error during initialization: got<br>
General Info time out from node 0x7cfe900300b49890<br>
<br>
And in the simulator console, I see messages of the form.<br>
<br>
ibwarn: [32331] process_packet: no one to handle pkt: class<br>
0x81, attr 0xff17<br>
<br>
Looking at the output of the "dump" command from within the<br>
console, it shows that all ports are in Init/LinkUp, except<br>
for the SMA port, which is in state Active/LinkUp.<br>
<br>
Does anyone have any idea what I might be doing wrong here?<br>
<br>
Thanks,<br>
Tim<br>
<br>
-- Tim Miller<br>
NIH HPC systems staff<br></div></div>
<a href="tel:301-827-5261" target="_blank" value="+13018275261">301-827-5261</a> <tel:<a href="tel:301-827-5261" target="_blank" value="+13018275261">301-827-5261</a>><span><br>
<a href="https://hpc.nih.gov" target="_blank" rel="noreferrer">https://hpc.nih.gov</a><br>
<br>
______________________________<wbr>_________________<br>
Users mailing list<br></span>
<a href="mailto:Users@lists.openfabrics.org" target="_blank">Users@lists.openfabrics.org</a> <mailto:<a href="mailto:Users@lists.openfabrics.org" target="_blank">Users@lists.openfabric<wbr>s.org</a>><br>
<a href="http://lists.openfabrics.org/mailman/listinfo/users" target="_blank" rel="noreferrer">http://lists.openfabrics.org/m<wbr>ailman/listinfo/users</a><br>
<<a href="http://lists.openfabrics.org/mailman/listinfo/users" target="_blank" rel="noreferrer">http://lists.openfabrics.org/<wbr>mailman/listinfo/users</a>><br>
<br>
<br>
<br>
</blockquote><div class="HOEnZb"><div class="h5">
<br>
-- <br>
Tim Miller<br>
NIH HPC systems staff<br>
<a href="tel:301-827-5261" target="_blank" value="+13018275261">301-827-5261</a><br>
<a href="https://hpc.nih.gov" target="_blank" rel="noreferrer">https://hpc.nih.gov</a><br>
<br>
</div></div></blockquote></div><br></div>