[ofa-general] IB network problem - "Unknown remote side for node"

Sasha Khapyorsky sashak at voltaire.com
Tue Jun 17 09:09:21 PDT 2008


On 11:07 Tue 17 Jun     , Michael Di Domenico wrote:
> Can anyone tell me what i likely causing this?  The SM seems to be in a loop
> entering/exiting,

I don't know why it is looping, are you use -o (run once) option?

> and the "Unknown remote side" comes up with a different
> list of port each time it cycles.  I though it was bad cables, but since it
> keeps changing, that seems unlikely.
> 
> Thanks
> - Michael
> 
> 
> Jun 17 04:05:51 681669 [AAF0D060] -> OpenSM Rev:openib-2.0.5 OpenIB svn 9905
> Jun 17 04:05:51 681723 [AAF0D060] -> OpenSM Rev:openib-2.0.5 OpenIB svn 9905
> Jun 17 04:05:51 686837 [AAF0D060] -> osm_vendor_bind: Binding to port
> 0x2c9030000792e
> Jun 17 04:05:51 689709 [AAF0D060] -> osm_vendor_bind: Binding to port
> 0x2c9030000792e
> Jun 17 04:05:52 785232 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff00510d port 4. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785273 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][13]
> Jun 17 04:05:52 785281 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff00510d port 6. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785290 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][13]
> Jun 17 04:05:52 785296 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff00510d port 8. Adding to light sweep
> sampling list

But it is always after that node Path = [0][2][3], so likely this part
of subnet is slow in discovery or have a cabling problems.

Also note that you are using two years old version of OpenSM.

Sasha

> Jun 17 04:05:52 785305 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][13]
> Jun 17 04:05:52 785312 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff00510d port 10. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785319 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][13]
> Jun 17 04:05:52 785325 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff00510d port 12. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785334 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][13]
> Jun 17 04:05:52 785352 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff005118 port 8. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785359 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][13][4]
> Jun 17 04:05:52 785424 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff00507a port 8. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785431 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][13][6]
> Jun 17 04:05:52 785444 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff005094 port 8. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785454 [46409940] -> Directed Path Dump of 2 hop path:
>                 Path = [0][2][3]
> Jun 17 04:05:52 785467 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff005095 port 10. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785475 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][8][2]
> Jun 17 04:05:52 785501 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050a0 port 8. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785509 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][13][8]
> Jun 17 04:05:52 785536 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050af port 8. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785545 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][13][C]
> Jun 17 04:05:52 785553 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050b0 port 8. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785561 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][13][A]
> Jun 17 04:05:52 785602 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050c1 port 4. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785631 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][D]
> Jun 17 04:05:52 785645 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050c4 port 4. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785651 [46409940] -> Directed Path Dump of 3 hop path:
>                 Path = [0][2][3][8]
> Jun 17 04:05:52 785662 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050c6 port 4. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785671 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][D][4]
> Jun 17 04:05:52 785686 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050d2 port 2. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785694 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][8]
> Jun 17 04:05:52 785702 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050d2 port 12. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785710 [46409940] -> Directed Path Dump of 4 hop path:
>                 Path = [0][2][3][11][8]
> Jun 17 04:05:52 785741 [46409940] -> osm_drop_mgr_process: ERR 0108: Unknown
> remote side for node 0x000b8cffff0050fa port 10. Adding to light sweep
> sampling list
> Jun 17 04:05:52 785748 [46409940] -> Directed Path Dump of 5 hop path:
>                 Path = [0][2][3][11][8][C]
> Jun 17 04:05:52 785774 [46409940] -> Entering MASTER state
> Jun 17 04:05:52 786126 [46409940] -> osm_report_notice: Reporting Generic
> Notice type:3 num:66 from LID:0x0000
> GID:0x000000000000e80f,0x0002c9030000792e
> Jun 17 04:05:52 786254 [46409940] -> osm_report_notice: Reporting Generic
> Notice type:3 num:66 from LID:0x0000
> GID:0x000000000000e80f,0x0002c9030000792e
> Jun 17 04:05:56 691253 [AAF0D060] -> Exiting SM

> _______________________________________________
> general mailing list
> general at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
> 
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general



More information about the general mailing list