[Users] IPoIB not working on Windows 2008 r2 - need help
Orion Poplawski
orion at cora.nwra.com
Fri Jun 7 15:52:52 PDT 2013
On 06/07/2013 02:23 PM, Hal Rosenstock wrote:
> Also, if you turn on log verbosity on OpenSM temporarily and send me the log
> for that run, I could see what is going on with in terms of trying to set the
> non default subnet prefix with the Windows node. Given the log you sent, I can
> only imagine that the SMA on the Windows node is ack'ing the PortInfo set
> which sets the subnet prefix but not really acting on it properly.
> -- Hal
Full log is at http://sw.cora.nwra.com/test/opensm.debug.log.gz
I had fontdb shutdown when I started opensm - then booted it up.
This seems to be when it first comes up (lid 0, prefix 0xfe80::0)
Jun 07 14:56:58 088453 [193D0700] 0x10 -> osm_pi_rcv_process: [
Jun 07 14:56:58 088465 [193D0700] 0x08 -> PortInfo dump:
port number..............1
node_guid................0x0005ad00000c5cec
port_guid................0x0005ad00000c5ced
m_key....................0x0000000000000000
subnet_prefix............0xfe80000000000000
base_lid.................0
master_sm_base_lid.......0
capability_mask..........0x2500A68
diag_code................0x0
m_key_lease_period.......0x0
local_port_num...........1
link_width_enabled.......0x3
link_width_supported.....0x3
link_width_active........0x2
link_speed_supported.....0x3
port_state...............INIT
state_info2..............0x52
m_key_protect_bits.......0x0
lmc......................0x0
link_speed...............0x13
mtu_smsl.................0x20
vl_cap_init_type.........0x30
vl_high_limit............0x0
vl_arb_high_cap..........0x8
vl_arb_low_cap...........0x8
init_rep_mtu_cap.........0x4
vl_stall_life............0xFF
vl_enforce...............0x30
m_key_violations.........0x0
p_key_violations.........0x0
q_key_violations.........0x0
guid_cap.................0x20
client_reregister........0x0
mcast_pkey_trap_suppr....0x0
subnet_timeout...........0x0
resp_time_value..........0x10
error_threshold..........0xF0
max_credit_hint..........0x0
link_round_trip_latency..0x0
capability_mask2.........0x0
link_speed_ext_active....0x0
link_speed_ext_supported.0x0
link_speed_ext_enabled...0x0
Jun 07 14:56:58 088495 [193D0700] 0x08 -> Capability Mask:
IB_PORT_CAP_HAS_TRAP
IB_PORT_CAP_HAS_AUTO_MIG
IB_PORT_CAP_HAS_SL_MAP
IB_PORT_CAP_HAS_LED_INFO
IB_PORT_CAP_HAS_SYS_IMG_GUID
IB_PORT_CAP_HAS_VEND_CLS
IB_PORT_CAP_HAS_CAP_NTC
IB_PORT_CAP_HAS_CLIENT_REREG
Jun 07 14:56:58 088499 [193D0700] 0x04 -> osm_pi_rcv_process: Discovered port
num 1 with GUID 0x5ad00000c5ced for parent node GUID 0x5ad00000c5cec, TID 0x130e
Then later, sm seems to have assigned a lid.
Jun 07 14:56:58 090679 [161CB700] 0x08 -> PortInfo dump:
port number..............1
node_guid................0x0005ad00000c5cec
port_guid................0x0005ad00000c5ced
m_key....................0x0000000000000000
subnet_prefix............0xfe80000000000001
base_lid.................16
master_sm_base_lid.......1
capability_mask..........0x2500A68
diag_code................0x0
m_key_lease_period.......0x0
local_port_num...........1
link_width_enabled.......0x3
link_width_supported.....0x3
link_width_active........0x2
link_speed_supported.....0x3
port_state...............INIT
state_info2..............0x52
m_key_protect_bits.......0x0
lmc......................0x0
link_speed...............0x13
mtu_smsl.................0x40
vl_cap_init_type.........0x30
vl_high_limit............0x0
vl_arb_high_cap..........0x8
vl_arb_low_cap...........0x8
init_rep_mtu_cap.........0x4
vl_stall_life............0xFF
vl_enforce...............0x30
m_key_violations.........0x0
p_key_violations.........0x0
q_key_violations.........0x0
guid_cap.................0x20
client_reregister........0x1
mcast_pkey_trap_suppr....0x0
subnet_timeout...........0x12
resp_time_value..........0x10
error_threshold..........0x88
max_credit_hint..........0x0
link_round_trip_latency..0x0
capability_mask2.........0x0
link_speed_ext_active....0x0
link_speed_ext_supported.0x0
link_speed_ext_enabled...0x0
Jun 07 14:56:58 090709 [161CB700] 0x08 -> Capability Mask:
IB_PORT_CAP_HAS_TRAP
IB_PORT_CAP_HAS_AUTO_MIG
IB_PORT_CAP_HAS_SL_MAP
IB_PORT_CAP_HAS_LED_INFO
IB_PORT_CAP_HAS_SYS_IMG_GUID
IB_PORT_CAP_HAS_VEND_CLS
IB_PORT_CAP_HAS_CAP_NTC
IB_PORT_CAP_HAS_CLIENT_REREG
Jun 07 14:56:58 090713 [161CB700] 0x08 -> osm_pi_rcv_process: Client
reregister received on response
Jun 07 14:56:58 091294 [12FC6700] 0x10 -> osm_db_store: ]
Jun 07 14:56:58 091301 [12FC6700] 0x10 -> osm_lid_mgr_process_subnet: ]
Jun 07 14:56:58 091308 [161CB700] 0x10 -> pi_rcv_process_set: [
Jun 07 14:56:58 091313 [161CB700] 0x08 -> pi_rcv_process_set: Received logical
SetResp() for GUID 0x5ad00000c5ced, port num 1
for parent node GUID 0x5ad00000c5cec TID 0x1311
Jun 07 14:56:58 091320 [161CB700] 0x08 -> osm_db_update:
Key:0x0005ad00000c5ced previously exists in:/var/cache/opensm/guid2mkey with
value:0x0000000000000000
Jun 07 14:56:58 091324 [161CB700] 0x10 -> pi_rcv_process_set: ]
Jun 07 14:56:58 091327 [161CB700] 0x10 -> osm_pi_rcv_process: ]
But I'm not really sure what I'm looking for.
--
Orion Poplawski
Technical Manager 303-415-9701 x222
NWRA, Boulder/CoRA Office FAX: 303-415-9702
3380 Mitchell Lane orion at nwra.com
Boulder, CO 80301 http://www.nwra.com
More information about the Users
mailing list