[openfabrics-ewg] Problems with OFED IPoIB HA on SLES10

Vladimir Sokolovsky vlad at mellanox.co.il
Tue Oct 3 03:52:58 PDT 2006


Hi Scott,
Please see my comments below:

On Mon, 2006-10-02 at 23:24 -0700, Scott Weitzenkamp (sweitzen) wrote:
> Vlad,
>  
> I filed a bug for these issues.
>  
> 1) If I start IPoIB HA with ib0 IB port shut down (from IB switch) and
> ib1 IB port enabled, then IPoIB does not work because "ip monitor link
> all" does not report NO-CARRIER at startup like ipoib_ha.pl is looking
> for.  This is a major hole.

Fixed, will be updated in OFED-1.1-rc7.

>  
> 
> 2) /etc/init.d/openibd runs ipoib_ha.pl with its stdout and stderr
> redirected to /dev/null, should we run with -v for verbose instead and
> redirect log file to /var/log?
>  
> # fgrep ipoib_ha.pl /etc/init.d/openibd
>                 ipoib_ha.pl -p ${PRIMARY_IPOIB_DEV} -s
> ${SECONDARY_IPOIB_DEV} --
> with-arping --with-multicast > /dev/null 2>&1 &
> 

Added /var/log/ipoib_ha.log and the verbose output of ipoib_ha.pl
redirected into this log file.

> 3) I got IPoIB HA working on SLES 10, but the documentation is a
> little lacking.   Looks like I have to put the same IP address in
> ifcfg-ib0 and ifcfg-ib1, is this correct?
>  

Yes, IP address should be the same. Actually the configuration of the
secondary interface does not matter. 
The High Availability daemon reads the configuration of the primary
interface and migrates it between the interfaces in case of failure.

> # pwd
> /etc/sysconfig/network
> # cat ifcfg-ib0
> DEVICE=ib0
> BOOTPROTO=static
> IPADDR=192.168.2.46
> NETMASK=255.255.255.0
> ONBOOT=yes
> # cat ifcfg-ib1
> DEVICE=ib1
> BOOTPROTO=static
> IPADDR=192.168.2.46
> NETMASK=255.255.255.0
> ONBOOT=yes
>  
> 4) If I shutdown ib0 IB port, I see this from
> "/usr/local/ofed/bin/ipoib_ha.pl -v --with-arping --with-multicast"
>  
>     Use of uninitialized value in concatenation (.) or string
> at /usr/local/ofed/bin/ipoib_ha.pl line 287.
> 

Fixed.

>  
> Scott Weitzenkamp
> SQA and Release Manager
> Server Virtualization Business Unit
> Cisco Systems
>  
> 
> _______________________________________________
> openfabrics-ewg mailing list
> openfabrics-ewg at openib.org
> http://openib.org/mailman/listinfo/openfabrics-ewg


Best Regards, 
Vladimir Sokolovsky 
Software Integration Engineer 
Mellanox Technologies Ltd. 
Tell: +972 (4) 909-7200 ext. 338   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/ewg/attachments/20061003/51573b3c/attachment.html>


More information about the ewg mailing list