[openib-general] [Bug 371] New: IPoIB HA not working properly with OFED1.2-alpha

bugzilla-daemon at lists.openfabrics.org bugzilla-daemon at lists.openfabrics.org
Mon Feb 19 01:50:30 PST 2007


https://bugs.openfabrics.org/show_bug.cgi?id=371

           Summary: IPoIB HA not working properly with OFED1.2-alpha
           Product: OpenFabrics Linux
           Version: 1.2alpha1
          Platform: X86-64
        OS/Version: RHEL 4
            Status: NEW
          Severity: major
          Priority: P2
         Component: IPoIB
        AssignedTo: bugzilla at openib.org
        ReportedBy: karun.sharma at qlogic.com


I configured IPoIB HA with OFED1.2-alpha release and it is not working for me.
I have configured IPoIB HA on a RHEL4up4 machine with both ports up. Before
configuring IPoIB HA, both IB interfaces are able to ping the other machine.

Then I executed ipoib_ha.pl script and configured ib0 as primary and ib1 as
secondary interface. The ip address of ib1 interface has gone and till this
point the things seems to be working fine. 

The problem starts when I pulled the IB cable connecting port1. I can see ib0
interface going down and ib1 interface taking IP address of ib0 interface but
ping doesn't work after that. Even if I reinsert the cable in port1, ping is
not working. I have attached some logs below. 

################################################################
[root at ss27 ~]# ibv_devinfo
hca_id: mthca0
        fw_ver:                         5.1.400
        node_guid:                      0006:6a00:9800:6b90
        sys_image_guid:                 0006:6a00:9800:6b90
        vendor_id:                      0x066a
        vendor_part_id:                 25218
        hw_ver:                         0xA0
        board_id:                       SS_0000000002
        phys_port_cnt:                  2
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                2048 (4)
                        active_mtu:             2048 (4)
                        sm_lid:                 6
                        port_lid:               2
                        port_lmc:               0x00
                port:   2
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                2048 (4)
                        active_mtu:             2048 (4)
                        sm_lid:                 6
                        port_lid:               3
                        port_lmc:               0x00
[root at ss27 ~]# ifconfig
eth0      Link encap:Ethernet  HWaddr 00:A0:D1:E4:53:DA  
          inet addr:172.20.50.227  Bcast:172.20.50.255  Mask:255.255.255.0
          inet6 addr: fe80::2a0:d1ff:fee4:53da/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:125 errors:0 dropped:0 overruns:0 frame:0
          TX packets:115 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:17236 (16.8 KiB)  TX bytes:15347 (14.9 KiB)
          Interrupt:201 
ib0       Link encap:UNSPEC  HWaddr
00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00  
          inet addr:172.20.51.227  Bcast:172.20.51.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:2044  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)
ib1       Link encap:UNSPEC  HWaddr
00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00  
          inet addr:172.20.52.227  Bcast:172.20.52.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:2044  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)
lo        Link encap:Local Loopback  
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:1543 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1543 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0 
          RX bytes:1648528 (1.5 MiB)  TX bytes:1648528 (1.5 MiB)
[root at ss27 ~]# ping 172.20.51.226 -c 1
PING 172.20.51.226 (172.20.51.226) 56(84) bytes of data.
64 bytes from 172.20.51.226: icmp_seq=0 ttl=64 time=1.44 ms
--- 172.20.51.226 ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 1.442/1.442/1.442/0.000 ms, pipe 2
[root at ss27 ~]# ping 172.20.52.226 -c 1
PING 172.20.52.226 (172.20.52.226) 56(84) bytes of data.
64 bytes from 172.20.52.226: icmp_seq=0 ttl=64 time=1.67 ms
--- 172.20.52.226 ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 1.671/1.671/1.671/0.000 ms, pipe 2
[root at ss27 ~]# 

[root at ss27 ~]# ipoib_ha.pl -p ib0 -s ib1 --with-arping -vv
get_cfg: Got /etc/sysconfig/network-scripts/ifcfg-ib0

Date:Mon Feb 19 02:32:22 2007
ib0:
======================================
BOOTPROTO = static
status = 
HA = 0
DEVICE = ib0
NETMASK = 255.255.255.0
BROADCAST = 172.20.51.255
IPADDR = 172.20.51.227
NETWORK = 172.20.51.0
ONBOOT = yes
pkey = ffff

Date:Mon Feb 19 02:32:22 2007
Bond:
======================================
BOOTPROTO = static
status = 
HA = 0
DEVICE = ib0
NETMASK = 255.255.255.0
BROADCAST = 172.20.51.255
IPADDR = 172.20.51.227
NETWORK = 172.20.51.0
ONBOOT = yes
pkey = ffff

Date:Mon Feb 19 02:32:23 2007
Got NO-CARRIER event on ib0.
Interface ib0 is down.
Currently Active : ib0
Other device: ib1 is UP
migrate_conf: Migrating from ib0 to ib1
Date:Mon Feb 19 02:33:37 2007
Date:Mon Feb 19 02:33:37 2007
set_up_bond: Going to set up ib1 with 172.20.51.227
set_up_bond: Arping ib1 172.20.51.227.
Got CARRIER-ON event on ib1.
Got CARRIER-ON event on ib1.
Got CARRIER-ON event on ib1.
Got NO-CARRIER event on ib0.
Interface ib0 is down.
Currently Active : ib1
Got CARRIER-ON event on ib1.
Got CARRIER-ON event on ib0.
Got CARRIER-ON event on ib0.
Got NO-CARRIER event on ib1.
Interface ib1 is down.
Currently Active : ib1
Other device: ib0 is UP
migrate_conf: Migrating from ib1 to ib0
Date:Mon Feb 19 02:35:48 2007
Date:Mon Feb 19 02:35:48 2007
set_up_bond: Going to set up ib0 with 172.20.51.227
set_up_bond: Arping ib0 172.20.51.227.
Got CARRIER-ON event on ib0.
Got CARRIER-ON event on ib0.
Got CARRIER-ON event on ib1.

[root at ss27 ~]# 
#######################################################


-- 
Configure bugmail: https://bugs.openfabrics.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.




More information about the general mailing list