[openib-general] Solaris IPoIB MTU with OpenSM

Nitin Hande Nitin.Hande at Sun.COM
Tue Feb 22 19:56:25 PST 2005


Hal,


On Thu, 2005-02-17 at 13:12, Hal Rosenstock wrote:
> Hi Nitin,
> 
> On Wed, 2005-02-16 at 17:33, Nitin Hande wrote: 
> > On Wed, 2005-02-16 at 13:26, Hal Rosenstock wrote:
> > > On Wed, 2005-02-16 at 16:08, Nitin Hande wrote:
> > > > Hal,
> > [snip..]
[snip...]
> > > > 
> > > > 
> > 
> > Before the patch the selector was coming 04. Do you reply 84 seeing a
> > particular component mask and otherwise 01 ??(I think not..) 
> 
> I agree that OpenSM responds/should respond the same regardless of the
> component mask in the request.
> 
> I was unaware of OpenSM responding with MTU of 01 until now. I have a
> theory as to this. Any chance I can get the osm logs from a -V run of
> the above ? 
> 
> I also have a simple patch below to try which is just to test the
> theory. This is off the latest version but should be easy to apply to
> any version of osm_sa_mcmember_record.c.
> 
> This is separate from the support for PathRecords with multicast DGID
> and/or DLID. I have the changes for this scoped out and should be able
> to implement by early next week.

So I tried the latest patches and preliminarily things seem to be
working fine. The PathRecord response is successful and so is the MTU
correct. I need to spend some more time looking at MAD and confirm it. I
could configure both interfaces and ping each other this time. Here is
some out on the solaris side:

[root at dongon ~]# ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu
8232 index 1
        inet 127.0.0.1 netmask ff000000 
ibd0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 2044 index
28
        inet 192.168.100.105 netmask ffffff00 broadcast 192.168.100.255
        ipib 0:2c:0:16:fe:80:0:0:0:0:0:0:0:2:c9:1:9:76:51:d1 
.....
[root at dongon ~]# ping -s 192.168.100.104
PING 192.168.100.104: 56 data bytes
64 bytes from 192.168.100.104: icmp_seq=0. time=0.590 ms
64 bytes from 192.168.100.104: icmp_seq=1. time=0.434 ms
64 bytes from 192.168.100.104: icmp_seq=2. time=0.365 ms

the other side is a openib interface runing OpenSM.

So after this test above, I try to run snoop on the solaris interface
and get the following error message from the layer below IPoIB:

Feb 22 19:50:25 dongon.SFBay.Sun.COM ibd: [ID 517869 kern.info] NOTICE:
ibd0: HCA GUID 0002c901097651d0 port 1 PKEY ffff Could not get list of
IBA multicast groups

My preliminary assumption is that OpenSm is not returning the list of
multicast groups that the ibd interface has joined. I will look at the
MAD's tomorrow and try to ascertain that.

Thanks
Nitin


 



> 
> Thanks.
> 
> -- Hal
> 
> Index: osm_sa_mcmember_record.c
> ===================================================================
> --- osm_sa_mcmember_record.c	(revision 1821)
> +++ osm_sa_mcmember_record.c	(working copy)
> @@ -1325,11 +1325,13 @@
>    /* copy qkey mlid tclass pkey sl_flow_hop mtu rate pkt_life
> sl_flow_hop */
>    __copy_from_create_mc_rec(&mcmember_rec, &p_mgrp->mcmember_rec);
>  
> +#if 0
>    if(p_mgrp->well_known)
>    {
>      p_mgrp->mcmember_rec.mtu = mtu;
>      mcmember_rec.mtu = mtu;
>    }
> +#endif
>  
>    /* Release the lock as we don't need it. */
>    CL_PLOCK_RELEASE( p_rcv->p_lock );
> 
> 
> 
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
> 
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general




More information about the general mailing list