[ofa-general] OpenSM "Dead end on path to LID"

Yevgeny Kliteynik kliteyn at dev.mellanox.co.il
Fri Jul 18 08:06:41 PDT 2008


Hi Nathan,

Nathan Dauchy wrote:
> 
> Looking through osm.log a bit more, I also found a handful of errors
> like these:
> 
> Jul 17 01:31:29 345329 [46E0A940] 0x01 ->
> __osm_state_mgr_light_sweep_start: ERR 0108: Unknown remote side for
> node 0x000002c900000048(MT47396 Infiniscale-III Mellanox Technologies)
> port 14. Adding to light sweep sampling list
> Jul 17 01:31:29 345340 [46E0A940] 0x01 -> Directed Path Dump of 4 hop path:
>                                 Path = 0,1,20,7,15
> Jul 17 01:31:29 345381 [46E0A940] 0x01 ->
> __osm_state_mgr_light_sweep_start: ERR 0108: Unknown remote side for
> node 0x000002c900000049(MT47396 Infiniscale-III Mellanox Technologies)
> port 15. Adding to light sweep sampling list
> Jul 17 01:31:29 345390 [46E0A940] 0x01 -> Directed Path Dump of 3 hop path:
>                                 Path = 0,1,22,11
> 
> Does that indicate a problem as well?

This explains why ibdiagnet couldn't query port counters.
OpenSM couldn't discover what's behind these ports, so it
didn't configure routing tables for the undiscovered nodes.
Ibdiagnet could discover them. It queries port counters by
their LIDs, but switches don't have these LIDs in the
routing tables.

 > Unknown remote side for node 0x000002c900000049(MT47396
 > Infiniscale-III Mellanox Technologies) port 15

What is the remote side of this port? HCA? Switch?
If it's HCA, does its host run some heavy application?

I understand you already increased transaction time.
Please try limiting SMPs on the wire - in opensm.conf
file, set max_wire_smps to 1 (you probably have 4).
You can also run opensm with '-maxsmps 1' command line
argument.

-- Yevgeny


> Thanks,
> Nathan
> _______________________________________________
> general mailing list
> general at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
> 
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
> 




More information about the general mailing list