[ofw] RE: SM timeout

Fab Tillier ftillier at windows.microsoft.com
Thu Oct 2 12:21:00 PDT 2008


Hi Hal,

> On Thu, Oct 2, 2008 at 2:19 PM, Fab Tillier
> <ftillier at windows.microsoft.com> wrote:
>> Hi Slava,
>>
>>> When IB switch is heavy loaded, host will not receive reply from SM. It
>>> results with IB_TIMEOUT and IPoIB adapter is shown as disconnected.
>>> Suggested solution for IB_TIMEOUT problem.
>>  Why not eliminate the port info query?  The following fields of the
>> port info records are used:
>>
>> - base_lid
>> - link_width_active
>> - link_speed_active
>>
>> Why not get these from the port attributes and eliminate the query
>> altogether?  The ib_port_attr_t structure has:
>>
>> - lid (base_lid)
>> - active_speed (link_speed_active)
>
> Is this assuming a homogeneous subnet in terms of rate ?

No, in this case only the local information is needed, in order to report the link speed to the OS properly, and to create an endpoint for loopback traffic.

>> If you look at what vstat does, it issues a local MAD to the HCA for the
>> port info record to get the information missing from the port attributes.
>> Fixing the port attributes would simplify everybody's life.
>
> Do you mean PortInfo attribute (SM class) rather than PortInfoRecord
> (SA class/query) ?

Yes, PortInfo attribute, which is embedded in the PortInfoRecord.

> If this comes from the SMA, what happens when MKey protection is enabled ?

Dunno, just saying this is what vstat does.

Ideally the port info attribute would be embedded in the CA attributes, so that the information would be available (read only) to local clients.  That or the SMA should distinguish between local requests and requests that come from the wire.  Point being that the mkey stuff shouldn't come into play for local clients.

-Fab



More information about the ofw mailing list