[ofa-general] ofed 1.3.2 opensmd failover

Hal Rosenstock hal.rosenstock at gmail.com
Wed Aug 26 08:20:16 PDT 2009


On 8/25/09, PN <poknam at gmail.com> wrote:
>
> HI,
>
> I can think of a situation in which all servers have dual port IB cards and
> need failover of OpenSM to achieve HA.
> As I know, OpenSM can only bind to 1 port at a time,


Yes.

 so do I need to start 2 OpenSM in server A and 2 OpenSM in server B?


That would be one valid configuration. I'm assuming all ports are connected
to same subnet.

Will they use the same guid2lid file?


Depends how the OpenSM configuration is done.

 Do I need to set something in the config file or they will automatically
> communcate each other?


What communication are you referring to ? The all need to share the same
subnet prefix.


> Do I need to run sldd.sh manually or it will automatically sync with other
> OpenSM?


You can either manually copy the guid2lid file around to the appropriate
places. I'm not that familiar with sldd.sh but I think it can either be run
manually or made to run automatically but I'm not familiar with the details.

-- Hal


Thanks a lot.
>
> Regards,
> PN
>
>
>
>
> 2009/8/26 Hal Rosenstock <hal.rosenstock at gmail.com>
>
>>
>>
>>  On 8/25/09, kovlensky at interia.pl <kovlensky at interia.pl> wrote:
>>>
>>> Hi all,
>>>
>>> Quick question - is there a need to run anything except opensmd deamons
>>> to provide failover capability on ib network in ofed 1.3?
>>
>>
>> In terms of SM failover, modulo bugs fixed relative to this feature since
>> OFED 1.3 (there are a couple of things here which may affect your
>> environment if I recall correctly), you only need to run more than 1 SM for
>> this (one will become master, the other standby).
>>
>> I'm aware that when master manager dies standby one comes in and manages
>>> the network, but that does not necessary means that lids are preserved,
>>> especially for nodes joining in. I used to run sldd.sh for distributing lids
>>> list on ofed 1.2.5, but while this script seems to be in place noone
>>> mentions necessity for it.
>>
>>
>> So subnet manager failover is provided by running standby opensm.
>>
>>
>> And how LID preservation is provided?
>>
>>
>> If you want LIDs to be preserved, the guid2lid file needs to be sync'd
>> (copied from the master SM once it's fully assembled to the node which is
>> running the standby SM). That's what the sldd.sh script does.
>>
>> -- Hal
>>
>> Regards,
>>>
>>> Zdenek Kovlensky
>>>
>>> ----------------------------------------------------------------------
>>> Kup wlasne mieszkanie za 33 tys. zl!
>>> Sprawdz >>> http://link.interia.pl/f22f2
>>>
>>> _______________________________________________
>>> general mailing list
>>> general at lists.openfabrics.org
>>> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
>>>
>>> To unsubscribe, please visit
>>> http://openib.org/mailman/listinfo/openib-general
>>>
>>
>>
>> _______________________________________________
>> general mailing list
>> general at lists.openfabrics.org
>> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
>>
>> To unsubscribe, please visit
>> http://openib.org/mailman/listinfo/openib-general
>>
>
>
>
> --
> Best Regards,
> PN Lai
> HPC Specialist
> Galactic Computng Corp.
> Tel: 86-755-26733939 ext 826
> Mobile: 86-13823161729
> Fax: 86-755-26733780
> URL: http://www.galactic.com.hk
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20090826/7daf7c54/attachment.html>


More information about the general mailing list