[ewg] Infiniband Interoperability

Matt Breitbach matthewb at flash.shanje.com
Wed Jun 30 12:09:46 PDT 2010


Switch 0x003048ffffa12591 MT47396 Infiniscale-III Mellanox Technologies:
           3    1[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       1    1[  ] "MT25208 InfiniHostEx Mellanox Technologies" ( )
           3    2[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       4    2[  ] "MT25208 InfiniHostEx Mellanox Technologies" ( )
           3    3[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3    4[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3    5[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3    6[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3    7[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3    8[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3    9[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   10[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   11[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       7    1[  ] "MT25218 InfiniHostEx Mellanox Technologies" ( )
           3   12[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   13[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   14[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   15[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   16[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   17[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   18[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       6    1[  ] "ibcontrol HCA-1" ( )
           3   19[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       2    1[  ] "xen1 HCA-1" ( )
           3   20[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       5    1[  ] "MT25408 ConnectX Mellanox Technologies" ( )
           3   21[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   22[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   23[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
           3   24[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ] "" ( )
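
For anyone who wants to check link rates from this kind of output without reading it by eye, here is a rough Python sketch. It is only an illustration: the regular expression is written against the line layout pasted above (it is not an official description of the iblinkinfo format), and the names LINK_RE, parse_links and active_rates are made up for this example. The per-link rate is simply width times lane speed, so 4X at 5.0 Gbps gives 20 Gbps, which is the member rate ibdiagnet reports in the warning quoted below.

```python
import re

# Matches the link portion of one port line as pasted above, e.g.
#   3   18[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       6    1[  ]
# Assumption: this layout is inferred from the pasted output only.
LINK_RE = re.compile(
    r"^\s*(?P<lid>\d+)\s+(?P<port>\d+)\[.*?\]\s+==\(\s*"
    r"(?P<width>\d+)X\s+(?P<speed>[\d.]+)\s+Gbps\s+"
    r"(?P<state>\w+)/\s*(?P<phys>\w+)\s*\)==>"
)


def parse_links(text):
    """Return one dict per port line; lines that do not match are skipped."""
    links = []
    for line in text.splitlines():
        m = LINK_RE.match(line)
        if not m:
            continue
        links.append({
            "lid": int(m["lid"]),
            "port": int(m["port"]),
            "state": m["state"],
            "phys": m["phys"],
            # Link rate = lane count * per-lane speed (4 * 5.0 = 20 Gbps).
            "rate_gbps": int(m["width"]) * float(m["speed"]),
        })
    return links


def active_rates(links):
    """Map switch port number to link rate for ports in the Active state."""
    return {l["port"]: l["rate_gbps"] for l in links if l["state"] == "Active"}


SAMPLE = """\
           3    1[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       1    1[  ]
           3    3[  ] ==( 4X 2.5 Gbps   Down/ Polling)==>             [  ]
           3   18[  ] ==( 4X 5.0 Gbps Active/  LinkUp)==>       6    1[  ]
"""

if __name__ == "__main__":
    for port, rate in sorted(active_rates(parse_links(SAMPLE)).items()):
        print(f"port {port}: {rate:g} Gbps")
```

Only the link portion of each line is matched, so the sketch works whether or not the peer name has been wrapped onto the following line.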


-----Original Message-----
From: Ira Weiny [mailto:weiny2 at llnl.gov] 
Sent: Wednesday, June 30, 2010 1:57 PM
To: richard at informatix-sol.com
Cc: Matt Breitbach; ewg at lists.openfabrics.org
Subject: Re: [ewg] Infiniband Interoperability

On Wed, 30 Jun 2010 11:13:50 -0700
"richard at informatix-sol.com" <richard at informatix-sol.com> wrote:

> I'm still suspicious that you have more than one SM running. Mellanox
> switches have it enabled by default.
> It's common that ARP requests, as caused by ping, will result in multicast
> group activity.
> InfiniBand creates these on demand and tears them down if there are no
> current members. There is no broadcast address. It uses a dedicated MC
> group.
> They all seem to originate from LID 6 so you can trace the source.
> 
> If you have ports at non-optimal speeds, try toggling their enable state.
> This often fixes it.

One other way of checking for SMs is to use the console in OpenSM.  The
"status" command will list the SMs it sees and which one is currently master.

As for the network config, could you send the iblinkinfo output?  I would be
curious to see it.

Thanks,
Ira

> 
> Richard
> 
> ----- Reply message -----
> From: "Matt Breitbach" <matthewb at flash.shanje.com>
> Date: Wed, Jun 30, 2010 15:33
> Subject: [ewg] Infiniband Interoperability
> To: <ewg at lists.openfabrics.org>
> 
> Well, let me throw out a little about the environment :
> 
> 
> 
> We are running one SuperMicro 4U system with a Mellanox InfiniHost III EX
> card w/ 128MB RAM.  This box is the OpenSolaris box.  It's running the
> OpenSolaris Infiniband stack, but no SM.  Both ports are cabled to the IB
> Switch to ports 1 and 2.
> 
> 
> 
> The other systems are in a SuperMicro Bladecenter.  The switch in the
> BladeCenter is an InfiniScale III switch with 10 internal ports and 10
> external ports.
> 
> 
> 
> 3 blades are connected with Mellanox ConnectX Mezzanine cards.  1 blade is
> connected with an InfiniHost III EX Mezzanine card.
> 
> 
> 
> One of the blades is running CentOS and the 1.5.1 OFED release.  OpenSM is
> running on that system, and is the only SM running on the network.  This
> blade is using a ConnectX Mezzanine card.
> 
> 
> 
> One blade is running Windows 2008 with the latest OFED drivers installed.
> It is using an InfiniHost III EX Mezzanine card.
> 
> 
> 
> One blade is running Windows 2008 R2 with the latest OFED drivers
> installed.  It is using a ConnectX Mezzanine card.
> 
> 
> 
> One blade has been switching between Windows 2008 R2 and CentOS with Xen.
> Windows 2008 is running the latest OFED drivers, CentOS is running the
> 1.5.2 RC2.  That blade is using a ConnectX Mezzanine card.
> 
> 
> 
> All of the firmware has been updated on the Mezzanine cards, the PCI-E
> InfiniHost III EX card, and the switch.  All of the Windows boxes are
> configured to use Connected mode.  I have not changed any other settings
> on the Linux boxes.
> 
> 
> 
> As of right now, the network seems stable.  I've been running pings for
> the last 12 hours, and nothing has dropped.
> 
> 
> 
> I did notice some odd entries in the OpenSM log, though, that I do not
> believe belong there.
> 
> 
> 
> Jun 30 06:56:26 832438 [B5723B90] 0x02 -> log_notice: Reporting Generic
> Notice type:3 num:67 (Mcast group deleted) from LID:6
> GID:ff12:1405:ffff::3333:1:2
> 
> Jun 30 06:57:53 895990 [B5723B90] 0x02 -> log_notice: Reporting Generic
> Notice type:3 num:66 (New mcast group created) from LID:6
> GID:ff12:1405:ffff::3333:1:2
> 
> Jun 30 07:18:06 770861 [B6124B90] 0x02 -> log_notice: Reporting Generic
> Notice type:3 num:67 (Mcast group deleted) from LID:6
> GID:ff12:1405:ffff::3333:1:2
> 
> Jun 30 07:19:14 835273 [B5723B90] 0x02 -> log_notice: Reporting Generic
> Notice type:3 num:66 (New mcast group created) from LID:6
> GID:ff12:1405:ffff::3333:1:2
> 
> 
> 
> 
> 
> I would not think that new mcast groups should be created or deleted when
> there are no new adapters being added to the network, especially in this
> small of a network.  Is it odd to see those messages?
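
One way to see how quickly the group comes back after each delete is to pair the notices up. The Python below is a minimal sketch under a few assumptions: each wrapped log entry has first been rejoined onto a single line, the year (2010) is supplied by hand because the log timestamps do not carry one, and the names NOTICE_RE and recreate_gaps are invented for this example. The notice numbers 66 (created) and 67 (deleted) are taken from the log text itself.

```python
import re
from datetime import datetime

# Pattern written against the four entries quoted above, after rejoining
# the wrapped lines. It is not a general OpenSM log grammar.
NOTICE_RE = re.compile(
    r"^(?P<ts>\w{3} \d{1,2} \d{2}:\d{2}:\d{2}) \d+ \[\w+\] 0x[0-9A-Fa-f]+ -> "
    r"log_notice: Reporting Generic Notice type:\d+ num:(?P<num>\d+) "
    r"\((?P<what>[^)]*)\) from LID:(?P<lid>\d+) GID:(?P<gid>\S+)"
)

CREATED, DELETED = 66, 67  # notice numbers as they appear in the log


def recreate_gaps(lines, year=2010):
    """Return (gid, lid, seconds) for each delete followed by a re-create."""
    last_deleted = {}
    gaps = []
    for line in lines:
        m = NOTICE_RE.match(line)
        if not m:
            continue
        # The year is an assumption; the log only records month and day.
        when = datetime.strptime(f"{year} {m['ts']}", "%Y %b %d %H:%M:%S")
        gid, num = m["gid"], int(m["num"])
        if num == DELETED:
            last_deleted[gid] = when
        elif num == CREATED and gid in last_deleted:
            delta = when - last_deleted.pop(gid)
            gaps.append((gid, int(m["lid"]), delta.total_seconds()))
    return gaps


PREFIX = "0x02 -> log_notice: Reporting Generic Notice type:3 "
GID = "GID:ff12:1405:ffff::3333:1:2"
SAMPLE_LOG = [
    f"Jun 30 06:56:26 832438 [B5723B90] {PREFIX}num:67 (Mcast group deleted) from LID:6 {GID}",
    f"Jun 30 06:57:53 895990 [B5723B90] {PREFIX}num:66 (New mcast group created) from LID:6 {GID}",
    f"Jun 30 07:18:06 770861 [B6124B90] {PREFIX}num:67 (Mcast group deleted) from LID:6 {GID}",
    f"Jun 30 07:19:14 835273 [B5723B90] {PREFIX}num:66 (New mcast group created) from LID:6 {GID}",
]

if __name__ == "__main__":
    for gid, lid, seconds in recreate_gaps(SAMPLE_LOG):
        print(f"{gid} (LID {lid}) re-created after {seconds:.0f} s")
```

On the four entries above this reports gaps of 87 and 68 seconds, both from LID 6, so the same group is being torn down and rebuilt rather than new groups appearing.
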
> 
> 
> 
> Also, I have a warning when I run ibdiagnet - "Suboptimal rate for group.
> Lowest member rate: 20Gbps > group-rate: 10gbps"
> 
> 
> 
> I also have a few things that I'm concerned about in the "PM Counters
> Info" section of ibdiagnet as follows:
> 
> 
> 
> -W- lid=0x0003 guid=0x003048ffffa12591 dev=47396 Port=1
> 
>      Performance Monitor counter     : Value
> 
>      symbol_error_counter            : 0xffff (overflow)
> 
> -W- lid=0x0004 guid=0x0002c9020029a492 dev=25208 MT25208/P2
> 
>      Performance Monitor counter     : Value
> 
>      symbol_error_counter            : 0xffff (overflow)
> 
> -W- lid=0x0003 guid=0x003048ffffa12591 dev=47396 Port=18
> 
>      Performance Monitor counter     : Value
> 
>      symbol_error_counter            : 0xffff (overflow)
> 
>      port_xmit_constraint_errors     : 0xff (overflow)
> 
> -W- lid=0x0003 guid=0x003048ffffa12591 dev=47396 Port=19
> 
>      Performance Monitor counter     : Value
> 
>      symbol_error_counter            : 0xffff (overflow)
> 
> 
> 
> I'm not sure if those are bad or not, and if they would point to any sort
> of problem, but that's what I'm seeing.
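
A counter shown as "(overflow)" has hit its maximum, so it only says "at least this many" errors since it was last cleared. To list which ports are in that state, something like the following Python sketch could be run over the saved report. The two regular expressions are written against the excerpt quoted above rather than any documented ibdiagnet format, and HEADER_RE, COUNTER_RE and saturated_counters are names invented for this example.

```python
import re

# "-W- lid=0x0003 guid=0x003048ffffa12591 dev=47396 Port=18"
HEADER_RE = re.compile(
    r"^-W- lid=(?P<lid>0x[0-9a-fA-F]+) guid=\S+ dev=\d+ (?P<port>\S+)\s*$"
)
# "symbol_error_counter            : 0xffff (overflow)"
COUNTER_RE = re.compile(
    r"^\s*(?P<name>\w+)\s*:\s*(?P<value>0x[0-9a-fA-F]+)\s*(?P<ovf>\(overflow\))?"
)


def saturated_counters(report):
    """Map (lid, port label) to the counters flagged as overflowed."""
    result = {}
    current = None
    for raw in report.splitlines():
        # Drop mail quote markers and indentation so pasted text also works.
        line = raw.lstrip("> ").rstrip()
        header = HEADER_RE.match(line)
        if header:
            current = (header["lid"], header["port"])
            continue
        counter = COUNTER_RE.match(line)
        if counter and current and counter["ovf"]:
            result.setdefault(current, []).append(counter["name"])
    return result


SAMPLE_REPORT = """\
-W- lid=0x0004 guid=0x0002c9020029a492 dev=25208 MT25208/P2
     Performance Monitor counter     : Value
     symbol_error_counter            : 0xffff (overflow)
-W- lid=0x0003 guid=0x003048ffffa12591 dev=47396 Port=18
     Performance Monitor counter     : Value
     symbol_error_counter            : 0xffff (overflow)
     port_xmit_constraint_errors     : 0xff (overflow)
"""

if __name__ == "__main__":
    for (lid, port), names in saturated_counters(SAMPLE_REPORT).items():
        print(f"lid {lid} {port}: {', '.join(names)}")
```

Since a saturated counter cannot show whether errors are still arriving, the useful follow-up is to clear the counters and read them again after a known interval.
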
> 
> 
> 
> Hopefully this gives you a bit more insight into the setup and the issues.
> If I can provide anything else that would help debug this issue, please
> let me know.
> 
> 
> 
> From: Richard Croucher [mailto:richard at informatix-sol.com]
> Sent: Wednesday, June 30, 2010 3:12 AM
> To: 'Jeff Becker'; 'Matt Breitbach'; ewg at lists.openfabrics.org
> Subject: RE: [ewg] Infiniband Interoperability
> 
> 
> 
> The InfiniBand fabric knows very little about IPoIB; it is handled by the
> host OS stack. However, it does need capabilities such as multicast to work
> properly for ARP name resolution.
> 
> 
> 
> The problem you describe sounds similar to a situation I encountered
> running multiple, incompatible SMs.
> 
> Make sure you only have a single vendor SM.   Whilst the OFED SM build is
> fine, I have found many vendors hack their distros so they either ignore
> or always win the SM election.   Explicitly disable all SMs on all
> environments you don't want to be running. Don't rely on SM priority
> across different implementations.  I'd recommend running OpenSM on Linux
> and disabling all others.
> 
> 
> 
> Identifying that no SM is running is easy, since the ports don't get LIDs.
> However, when multiple SMs are running it sort of works, since the
> different SMs discover which LIDs have been allocated when they scan the
> fabric.  The problem I saw was with multicast, each SM
> 




