[openib-general] opensm
Batwara, Ashish
Ashish.Batwara at lsi.com
Tue Dec 19 17:12:18 PST 2006
Logs from the end of the osm.log:
Dec 19 15:48:26 984523 [43204960] -> SUBNET UP
Dec 19 15:48:36 985477 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b1d) --
dropping
Dec 19 15:48:36 985538 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:48:36 985560 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:48:36 985643 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b1d
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:48:36 985728 [42803960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:48:36 985754 [42803960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:48:36 986161 [42803960] -> SUBNET UP
Dec 19 15:48:46 986814 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b22) --
dropping
Dec 19 15:48:46 986868 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:48:46 986895 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:48:46 986935 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b22
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:48:46 987025 [41401960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:48:46 987050 [41401960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:48:46 987459 [41401960] -> SUBNET UP
Dec 19 15:48:56 988475 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b27) --
dropping
Dec 19 15:48:56 988536 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:48:56 988562 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:48:56 988601 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b27
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:48:56 988681 [41E02960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:48:56 988706 [41E02960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:48:56 989146 [41E02960] -> SUBNET UP
Dec 19 15:49:06 990152 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b2c) --
dropping
Dec 19 15:49:06 990209 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:49:06 990231 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:49:06 990292 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b2c
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:49:06 990375 [43204960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:49:06 990399 [43204960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:49:06 990815 [43204960] -> SUBNET UP
Dec 19 15:49:16 991042 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b31) --
dropping
Dec 19 15:49:16 991095 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:49:16 991122 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:49:16 991174 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b31
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:49:16 991281 [41401960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:49:16 991306 [41401960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:49:16 991719 [41401960] -> SUBNET UP
Dec 19 15:49:26 992226 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b36) --
dropping
Dec 19 15:49:26 992280 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:49:26 992306 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:49:26 992347 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b36
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:49:26 992442 [42803960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:49:26 992468 [42803960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:49:26 993031 [42803960] -> SUBNET UP
Dec 19 15:49:36 995288 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b3b) --
dropping
Dec 19 15:49:36 995341 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:49:36 995360 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:49:36 995428 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b3b
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:49:36 995515 [43204960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:49:36 995538 [43204960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:49:36 996077 [43204960] -> SUBNET UP
Dec 19 15:49:46 995190 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b40) --
dropping
Dec 19 15:49:46 995243 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:49:46 995265 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:49:46 995308 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b40
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:49:46 995383 [42803960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:49:46 995407 [42803960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:49:46 995960 [42803960] -> SUBNET UP
Dec 19 15:49:56 997558 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b45) --
dropping
Dec 19 15:49:56 997609 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:49:56 997624 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:49:56 997663 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b45
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:49:56 997780 [43204960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:49:56 997805 [43204960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:49:56 998216 [43204960] -> SUBNET UP
Dec 19 15:50:06 999247 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b4a) --
dropping
Dec 19 15:50:06 999296 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:50:06 999311 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:50:06 999351 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b4a
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:50:06 999425 [42803960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:50:06 999487 [42803960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:50:06 999996 [42803960] -> SUBNET UP
Dec 19 15:50:17 003083 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b4f) --
dropping
Dec 19 15:50:17 003139 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:50:17 003159 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:50:17 003217 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b4f
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:50:17 003297 [41401960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:50:17 003360 [41401960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:50:17 003779 [41401960] -> SUBNET UP
Dec 19 15:50:27 002576 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b54) --
dropping
Dec 19 15:50:27 002663 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:50:27 002683 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:50:27 002744 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b54
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:50:27 002837 [41E02960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:50:27 002891 [41E02960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:50:27 003312 [41E02960] -> SUBNET UP
Dec 19 15:50:37 004082 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b59) --
dropping
Dec 19 15:50:37 004139 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:50:37 004162 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:50:37 004205 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b59
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:50:37 004290 [42803960] -> osm_drop_mgr_process: ERR 0108:
Unknown remote side for node 0x0002c9020022cce0 port 2. Adding to light
sweep sampling list
Dec 19 15:50:37 004315 [42803960] -> Directed Path Dump of 0 hop path:
Path = [0]
Dec 19 15:50:37 004730 [42803960] -> SUBNET UP
Dec 19 15:50:46 205115 [42803960] -> SM port is down
Dec 19 15:50:56 206763 [42803960] -> SM port is down
Dec 19 15:50:56 206903 [42803960] -> __osm_sm_state_mgr_signal_error:
ERR 3207: Invalid signal OSM_SM_SIGNAL_DISCOVER in state
IB_SMINFO_STATE_DISCOVERING
Dec 19 15:51:06 209285 [42803960] -> SM port is down
Dec 19 15:51:06 209448 [42803960] -> __osm_sm_state_mgr_signal_error:
ERR 3207: Invalid signal OSM_SM_SIGNAL_DISCOVER in state
IB_SMINFO_STATE_DISCOVERING
Dec 19 15:51:16 209877 [41E02960] -> SM port is down
Dec 19 15:51:16 210032 [41E02960] -> __osm_sm_state_mgr_signal_error:
ERR 3207: Invalid signal OSM_SM_SIGNAL_DISCOVER in state
IB_SMINFO_STATE_DISCOVERING
Dec 19 15:51:26 210935 [41401960] -> SM port is down
Dec 19 15:51:26 211100 [41401960] -> __osm_sm_state_mgr_signal_error:
ERR 3207: Invalid signal OSM_SM_SIGNAL_DISCOVER in state
IB_SMINFO_STATE_DISCOVERING
Dec 19 15:51:36 214582 [41E02960] -> Entering MASTER state
Dec 19 15:51:36 228305 [42803960] -> SUBNET UP
Dec 19 15:51:36 992447 [41E02960] -> __osm_trap_rcv_process_request:
Received Generic Notice type:0x04 num:144 Producer:1 from LID:0x0009
TID:0x0000000000000003
Dec 19 15:51:36 992663 [41E02960] -> osm_report_notice: Reporting
Generic Notice type:4 num:144 from LID:0x0009
GID:0xfe80000000000000,0x0002c9020022cd26
Dec 19 15:51:36 994495 [41401960] -> SUBNET UP
Dec 19 15:51:47 014297 [45007960] -> umad_receiver: ERR 5409: send
completed with error (method=0x1 attr=0x11 trans_id=0x2500001b89) --
dropping
Dec 19 15:51:47 014371 [45007960] -> umad_receiver: ERR 5411: DR SMP
Dec 19 15:51:47 014386 [45007960] -> __osm_sm_mad_ctrl_send_err_cb: ERR
3113: MAD completed in error (IB_TIMEOUT)
Dec 19 15:51:47 014426 [45007960] -> SMP dump:
base_ver................0x1
mgmt_class..............0x81
class_ver...............0x1
method..................0x1 (SubnGet)
D bit...................0x0
status..................0x0
hop_ptr.................0x0
hop_count...............0x1
trans_id................0x1b89
attr_id.................0x11 (NodeInfo)
resv....................0x0
attr_mod................0x0
m_key...................0x0000000000000000
dr_slid.................0xFFFF
dr_dlid.................0xFFFF
Initial path: [0][2]
Return path: [0][0]
Reserved: [0][0][0][0][0][0][0]
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00
Dec 19 15:51:47 014531 [41E02960] -> osm_report_notice: Reporting
Generic Notice type:3 num:65 from LID:0x0001
GID:0xfe80000000000000,0x0002c9020022cce2
Dec 19 15:51:47 014552 [41E02960] -> Removed port with
GUID:0x0002c9020022cd26 LID range [0x9,0x9] of node:Native Infiniband
Storage - LSI Logic, Engenio Storage Group
Dec 19 15:51:47 014570 [41E02960] -> osm_report_notice: Reporting
Generic Notice type:3 num:65 from LID:0x0001
GID:0xfe80000000000000,0x0002c9020022cce2
Dec 19 15:51:47 014586 [41E02960] -> Removed port with
GUID:0x0002c9020022cce2 LID range [0x1,0x1] of node:p49 HCA-1
Dec 19 15:51:47 014658 [41E02960] -> __osm_lid_mgr_process_our_sm_node:
ERR 0308: Can't acquire SM's port object, GUID = 0x0002c9020022cce2
Dec 19 15:51:47 015001 [41E02960] -> SUBNET UP
Dec 19 15:51:51 371737 [41401960] -> osm_pr_rcv_process: ERR 1F16:
Cannot find requester physical port
Dec 19 15:51:56 216932 [41401960] -> osm_report_notice: Reporting
Generic Notice type:3 num:64 from LID:0x0001
GID:0xfe80000000000000,0x0002c9020022cce2
Dec 19 15:51:56 217034 [41401960] -> Discovered new port with
GUID:0x0002c9020022cce2 LID range [0x1,0x1] of node:p49 HCA-1
Dec 19 15:51:56 217045 [41401960] -> osm_report_notice: Reporting
Generic Notice type:3 num:64 from LID:0x0001
GID:0xfe80000000000000,0x0002c9020022cce2
Dec 19 15:51:56 217122 [41401960] -> Discovered new port with
GUID:0x0002c9020022cd26 LID range [0x9,0x9] of node:Native Infiniband
Storage - LSI Logic, Engenio Storage Group
Dec 19 15:51:56 217432 [41401960] -> SUBNET UP
Dec 19 15:52:06 217884 [43204960] -> SUBNET UP
Dec 19 15:52:16 222523 [42803960] -> SUBNET UP
Dec 19 15:52:26 221109 [42803960] -> SUBNET UP
Dec 19 15:52:36 222369 [42803960] -> SUBNET UP
Dec 19 15:52:46 224523 [41401960] -> SUBNET UP
Dec 19 15:52:52 902536 [95AB6160] -> Exiting SM
Dec 19 15:54:17 354494 [95AB6160] -> OpenSM Rev:openib-2.0.5 OpenIB svn
Exported revision
Dec 19 17:09:20 792650 [95AB6160] -> OpenSM Rev:openib-2.0.5 OpenIB svn
Exported revision
-----Original Message-----
From: Batwara, Ashish
Sent: Tuesday, December 19, 2006 5:22 PM
To: 'Hal Rosenstock'
Cc: Eitan Zahavi; ishai at mellanox.co.il; openib-general at openib.org
Subject: RE: [openib-general] opensm
Hi,
Please look towards the end of the attached file.
Thanks
Ashish
-----Original Message-----
From: Hal Rosenstock [mailto:halr at voltaire.com]
Sent: Tuesday, December 19, 2006 5:06 PM
To: Batwara, Ashish
Cc: Eitan Zahavi; ishai at mellanox.co.il; openib-general at openib.org
Subject: Re: [openib-general] opensm
Ashish,
On Tue, 2006-12-19 at 17:43, Batwara, Ashish wrote:
> Hi,
>
> Here is the info that you have asked. I am seeing the Subnet manager
> is up now having the port active. But server is not able to discover
> the target. I am seeing the error "Got failed path rec status -110" on
> Linux console.
That means the request for an SA PathRecord from the initiator to the
target failed (-110 is ETIMEDOUT). Are you sure the target is up
(ACTIVE) on the subnet ? If it is, can you send the opensm log ?
-- Hal
> Below are the output of different commands. I am using following to
> discover the target:
>
>
>
> /etc/init.d/opensmd start
>
> /etc/init.d/openibd start
>
> modprobe ib_srp
>
> echo
>
id_ext=200300A0B811C847,ioc_guid=00a0b8020022cd27,dgid=fe800000000000000
002c9020022cd26,pkey=ffff,service_id=200300a0b811c847 >
/sys/class/infiniband_srp/srp-mthca0-2/add_target
>
>
>
>
>
> [root at p49 ~]# ibv_devinfo
>
> hca_id: mthca0
>
> fw_ver: 5.1.400
>
> node_guid: 0002:c902:0022:cce0
>
> sys_image_guid: 0002:c902:0022:cce3
>
> vendor_id: 0x02c9
>
> vendor_part_id: 25218
>
> hw_ver: 0xA0
>
> board_id: MT_0370130002
>
> phys_port_cnt: 2
>
> port: 1
>
> state: PORT_DOWN (1)
>
> max_mtu: 2048 (4)
>
> active_mtu: 512 (2)
>
> sm_lid: 0
>
> port_lid: 0
>
> port_lmc: 0x00
>
>
>
> port: 2
>
> state: PORT_ACTIVE (4)
>
> max_mtu: 2048 (4)
>
> active_mtu: 2048 (4)
>
> sm_lid: 1
>
> port_lid: 1
>
> port_lmc: 0x00
> hca_id: mthca1
>
> fw_ver: 5.1.400
>
> node_guid: 0002:c902:0022:cd2c
>
> sys_image_guid: 0002:c902:0022:cd2f
>
> vendor_id: 0x02c9
>
> vendor_part_id: 25218
>
> hw_ver: 0xA0
>
> board_id: MT_0370130002
>
> phys_port_cnt: 2
>
> port: 1
>
> state: PORT_DOWN (1)
>
> max_mtu: 2048 (4)
>
> active_mtu: 512 (2)
>
> sm_lid: 0
>
> port_lid: 0
>
> port_lmc: 0x00
>
>
>
> port: 2
>
> state: PORT_DOWN (1)
>
> max_mtu: 2048 (4)
>
> active_mtu: 512 (2)
>
> sm_lid: 0
>
> port_lid: 0
>
> port_lmc: 0x00
>
>
>
>
>
> [root at p49 ~]# uname -a
>
> Linux p49.ks.lsil.com 2.6.9-42.0.3.ELsmp #1 SMP Mon Sep 25 17:24:31
> EDT 2006 x86_64 x86_64 x86_64 GNU/Linux
>
>
>
> [root at p49 ~]# cat /etc/infiniband/info
>
> #!/bin/bash
>
>
>
> echo prefix=/usr/local/ofed
>
> echo Kernel=2.6.9-42.0.3.ELsmp
>
> echo
>
> echo "Configure options: --with-dapl --with-ipoibtools --with-libibcm
> --with-libibcommon --with-libibmad --with-libibumad --with-libibverbs
> --with-libipathverbs --with-libmthca --with-opensm --with-librdmacm
> --with-libsdp --with-openib-diags --with-srptools --with-mstflint
> --with-perftest --with-tvflash --with-ipath_inf-mod --with-ipoib-mod
> --with-mthca-mod --with-sdp-mod --with-srp-mod --with-core-mod
> --with-user_mad-mod --with-user_access-mod --with-addr_trans-mod"
>
> echo
>
>
>
> OFED Version: OFED-1.1
>
> Thanks
>
> Ashish
>
> -----Original Message-----
> From: Eitan Zahavi [mailto:eitan at mellanox.co.il]
> Sent: Tuesday, December 19, 2006 5:18 AM
> To: Batwara, Ashish
> Cc: ishai at mellanox.co.il; openib-general at openib.org
> Subject: Re: [openib-general] opensm
>
>
>
> Hi Ashish,
>
>
>
> SRP people say they have no such error message.
>
> OpenSM does. So I take it back.
>
>
>
> Ashish,
>
> Please provide more into:
>
>
>
> 1. ibv_devinfo
>
> 2. Version of code you are using
>
> 3. Command line you use for starting opensm
>
> 4. /var/log/osm.log
>
>
>
> Thanks and sorry for the confusion.
>
>
>
> EZ
>
>
>
> Eitan Zahavi wrote:
>
> > This is not an OpenSM issue.
>
> > Forwarded to the SRP people.
>
> >
>
> > EZ
>
> > Batwara, Ashish wrote:
>
> >
>
> >> Hi,
>
> >> I am trying to run opensm on Linux server. It has two HCAs
> (4-ports) and
>
> >> connected to IB Switch. ibnodes command displays the information
> about
>
> >> the Switch ports and HCA ports.
>
> >> When I start opensm, I see in /var/log/messages "Starting
> srp_daemon"
>
> >> for all the 4 ports and immediately after I see "failed srp_daemon"
> for
>
> >> all the ports and the displays "SM Port is down".
>
> >>
>
> >> I tried several times and even rebooted the server few times but no
>
> >> luck.
>
> >>
>
> >> Does anybody know what this problem is?
>
> >>
>
> >> Thanks
>
> >> Ashish
>
> >>
>
> >> _______________________________________________
>
> >> openib-general mailing list
>
> >> openib-general at openib.org
>
> >> http://openib.org/mailman/listinfo/openib-general
>
> >>
>
> >> To unsubscribe, please visit
> http://openib.org/mailman/listinfo/openib-general
>
> >>
>
> >>
>
> >
>
> >
>
> > _______________________________________________
>
> > openib-general mailing list
>
> > openib-general at openib.org
>
> > http://openib.org/mailman/listinfo/openib-general
>
> >
>
> > To unsubscribe, please visit
> http://openib.org/mailman/listinfo/openib-general
>
> >
>
>
>
>
>
> ______________________________________________________________________
>
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
>
> To unsubscribe, please visit
http://openib.org/mailman/listinfo/openib-general
More information about the general
mailing list