[ewg] [GIT PULL compat-rdma] qib for OFED 4.8

RSD@SFI rsdance at soft-forge.com
Fri Jun 30 15:12:07 PDT 2017


I suggest that we limit OFED 4.8-1 to the following items:

 

* Broadcom Drivers
* Cavium Drivers
* SRP fix

 

Both the Broadcom and Cavium drivers were tested with OFED 4.8 RC1 during the IBTA Plugfest when we did the RoCE Interoperability testing. The results were very good – see the RoCE Interoperability List: https://cw.infinibandta.org/document/dl/8203

 

So I feel that if Bart and DDN can work on an SRP fix, we should be able to release OFED 4.8-1 GA within approximately 30 days after the release of OFED 4.8 GA. 

 

Therefore, the sooner we release OFED 4.8 GA, the sooner Vlad can start integrating the Broadcom and Cavium drivers into OFED so that we can get daily builds and start testing. Hopefully an SRP fix will be available in the next week or so.

 

I am therefore supportive of releasing OFED 4.8 GA, on the condition that we do a very fast turnaround on OFED 4.8-1.

 

Thanks

 

Rupert

 

-----Original Message-----
From: Woodruff, Robert J [mailto:robert.j.woodruff at intel.com] 
Sent: Friday, June 30, 2017 12:33 PM
To: Davis, Arlin R <arlin.r.davis at intel.com>; Vladimir Sokolovsky <vlad at dev.mellanox.co.il>; Bart Van Assche <Bart.VanAssche at wdc.com>; rsdance at soft-forge.com; Hanania, Amir <amir.hanania at intel.com>; mschlining at ddn.com
Cc: mdavis at ddn.com; ewg at lists.openfabrics.org; cfernandes at ddn.com
Subject: RE: [ewg] [GIT PULL compat-rdma] qib for OFED 4.8

 

I vote for documenting the problem as a known issue, along with any known workarounds (such as using Bart's ib_srp-backport driver), and then working on getting an in-box fix into OFED-4.8-1, especially since we are attempting a quick turnaround for OFED-4.8-1: the new content being added is limited, so it should not take too long to get done.

 

Vlad, what is your position on this one?

And also, does anyone else have an opinion?

 

-----Original Message-----
From: Davis, Arlin R
Sent: Friday, June 30, 2017 9:12 AM
To: Woodruff, Robert J <robert.j.woodruff at intel.com>; Vladimir Sokolovsky <vlad at dev.mellanox.co.il>; Bart Van Assche <Bart.VanAssche at wdc.com>; rsdance at soft-forge.com; Hanania, Amir <amir.hanania at intel.com>; mschlining at ddn.com
Cc: mdavis at ddn.com; ewg at lists.openfabrics.org; cfernandes at ddn.com
Subject: RE: [ewg] [GIT PULL compat-rdma] qib for OFED 4.8

 

We need community input here or some direction from the co-chairs (Woody and Vlad) so we can move forward. I have no problem waiting for a fix if someone can give the EWG a timely ETA on an upstream fix. From what I can tell, the maintainer doesn't have the bandwidth and DDN doesn't have the expertise to fix it, so I don't see any quick resolution. There are several OFA members anxiously waiting for OFED 4.8-1 to add their new drivers and many that need to get OFED 4.8 GA done. I am not sure we can wait weeks and/or months for this to get resolved.

 

The EWG meeting next Monday falls on a US holiday and will be cancelled, and we don't want to wait 2 weeks for a decision.

 

I am asking EWG and OFA members to please reply with their recommendations and suggestions. We are clearly at an impasse.

 

 

> -----Original Message-----
> From: Woodruff, Robert J
> Sent: Thursday, June 29, 2017 1:25 PM
> To: Davis, Arlin R <arlin.r.davis at intel.com>; Vladimir Sokolovsky
> <vlad at dev.mellanox.co.il>; Bart Van Assche <Bart.VanAssche at wdc.com>;
> rsdance at soft-forge.com; Hanania, Amir <amir.hanania at intel.com>;
> mschlining at ddn.com
> Cc: mdavis at ddn.com; ewg at lists.openfabrics.org; cfernandes at ddn.com
> Subject: RE: [ewg] [GIT PULL compat-rdma] qib for OFED 4.8
>
> I know that DDN really wants to hold the release until a fix is
> available. I am personally OK with just documenting the known issues
> for now and fixing them in OFED-4.8-1, as I know there are some people
> that want to use what is there now and do not care about SRP.
>
> What do others think? Are we going to have to have a vote to break
> this log jam?
>

> -----Original Message-----
> From: Davis, Arlin R
> Sent: Thursday, June 29, 2017 11:10 AM
> To: Davis, Arlin R <arlin.r.davis at intel.com>; Vladimir Sokolovsky
> <vlad at dev.mellanox.co.il>; Bart Van Assche <Bart.VanAssche at wdc.com>;
> rsdance at soft-forge.com; Hanania, Amir <amir.hanania at intel.com>;
> Woodruff, Robert J <robert.j.woodruff at intel.com>; mschlining at ddn.com
> Cc: mdavis at ddn.com; ewg at lists.openfabrics.org; cfernandes at ddn.com
> Subject: RE: [ewg] [GIT PULL compat-rdma] qib for OFED 4.8
>
> All,
>
> Here is a quick summary of OFED 4.8 and SRP blocking bugs. Please
> correct/comment if I have it wrong.
>
> - SL7.2 and SL7.3 SRP support is a must for DDN, need latest upstream
>   SRP 4.11 fixes
> - No resources available to backport or integrate the SRP 4.11 base or
>   fixes into OFED 4.8
> - ib_srp-backport (4.11 base) installs/builds/runs on OFED 4.8 for
>   both 7.2 and 7.3
> - SL7.3+OFED+srp_backport fixes 2632
> - SL7.2+OFED+srp_backport still hitting bug 2632
> - SL7.2+inbox_infiniband+srp_backport (no OFED) still hitting 2632 and
>   2634 (slightly different but still seeing CM DREQ)
> - Bug 2632, 2634 still open, no resolution and no ETA on fix from
>   maintainer
>
> Given that we still have the same issues without OFED installed and no
> ETA on fixes, I would suggest moving forward and releasing OFED 4.8
> GA. We can document issues and process for applying upstream SRP fixes
> in the release notes.
>
> If there are no objections, I would like Vlad to modify OFED release
> notes, add the 2 remaining bugs to known issues, and add backport
> procedure for SRP upstream as follows:
>
> http://bugs.openfabrics.org/show_bug.cgi?id=2632
> http://bugs.openfabrics.org/show_bug.cgi?id=2634
>
> The ib_srp-backport procedure is as follows:
>  1. Install the OS (RHEL, SLES, ...).
>  2. Install OFED.
>  3. Install the ib_srp-backport driver:
>          git clone https://github.com/bvanassche/ib_srp-backport
>          cd ib_srp-backport
>          make rpm
>          sudo rpm -U $PWD/rpmbuilddir/RPMS/*/*.rpm
>
> We can then release GA package tomorrow with these changes and latest
> daily build.
> Any other release notes changes?
>
> Thanks,
>
> Arlin
>

> > -----Original Message-----
> > From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of
> > Davis, Arlin R
> > Sent: Friday, June 23, 2017 12:59 PM
> > To: Vladimir Sokolovsky <vlad at dev.mellanox.co.il>; Bart Van Assche
> > <Bart.VanAssche at wdc.com>; rsdance at soft-forge.com; Hanania, Amir
> > <amir.hanania at intel.com>; Woodruff, Robert J
> > <robert.j.woodruff at intel.com>; mschlining at ddn.com
> > Cc: mdavis at ddn.com; ewg at lists.openfabrics.org; cfernandes at ddn.com
> > Subject: Re: [ewg] [GIT PULL compat-rdma] qib for OFED 4.8
> >
> > It seems like we now have a way to apply the latest SRP upstream
> > fixes to OFED 4.8, tested with RH 7.2 and RH 7.3. However, we still
> > have 2 critical bugs (1 new one) on RH 7.2 even after the upstream
> > fixes are applied. RH 7.3+OFED4.8+srp_backport seems to be fine.
> >
> > http://bugs.openfabrics.org/show_bug.cgi?id=2632
> > http://bugs.openfabrics.org/show_bug.cgi?id=2634
> >
> > How does DDN want to proceed?
> > Is it acceptable to apply upstream version/fixes to OFED 4.8 as
> > follows, and document this in the release notes?
> > Can you tell us if the remaining bugs are related to OFED 4.8 or if
> > RH7.2+infiniband+srp_backport has the same issues?
> >
> > The ib_srp-backport procedure is as follows:
> > 1. Install the OS (RHEL, SLES, ...).
> > 2. Install OFED.
> > 3. Install the ib_srp-backport driver:
> >       git clone https://github.com/bvanassche/ib_srp-backport
> >       cd ib_srp-backport
> >       make rpm
> >       sudo rpm -U $PWD/rpmbuilddir/RPMS/*/*.rpm
> >
> > Thanks,
> >
> > Arlin
> >

> > > SRP is included already in OFED-4.8 as a part of linux-4.8 kernel
> > > and I don't see any reason to include SRP code twice.
> > > If some bug fixes have to be added there then there is a specific
> > > place in compat-rdma git tree for the relevant patches.
> > > If somebody is interested to add an additional SRP standalone
> > > package then please send me the relevant patches for
> > > compat-rdma/ofed_scripts and build git trees.
> > >
> > > Regards,
> > > Vladimir
> > >
> > >

> > > On 06/20/2017 09:01 PM, Davis, Arlin R wrote:
> > > > Ok. Vlad, what would it take to get this backported version of
> > > > SRP into OFED 4.8?
> > > >

> > > >> -----Original Message-----
> > > >> From: Bart Van Assche [mailto:Bart.VanAssche at wdc.com]
> > > >> Sent: Tuesday, June 20, 2017 10:49 AM
> > > >> To: vlad at dev.mellanox.co.il; rsdance at soft-forge.com; Davis,
> > > >> Arlin R <arlin.r.davis at intel.com>; Hanania, Amir
> > > >> <amir.hanania at intel.com>; Woodruff, Robert J
> > > >> <robert.j.woodruff at intel.com>; mschlining at ddn.com
> > > >> Cc: ewg at lists.openfabrics.org; cfernandes at ddn.com;
> > > >> mdavis at ddn.com
> > > >> Subject: Re: [ewg] [GIT PULL compat-rdma] qib for OFED 4.8
> > > >>
> > > >> Hello Arlin,
> > > >>
> > > >> Adding a URL to the OFED README is not the same as including a
> > > >> version of the ib_srp-backport driver in OFED. How can OFED
> > > >> users know what version of the ib_srp-backport driver has been
> > > >> tested in combination with which OFED version?
> > > >>
> > > >> Thanks,
> > > >>
> > > >> Bart.
> > > >>

> > > >> On Tue, 2017-06-20 at 17:18 +0000, Davis, Arlin R wrote:
> > > >>> Marty,
> > > >>>
> > > >>> Would this work for DDN? We could add the URL and the SRP
> > > >>> backport readme to the OFED release notes. I believe you had
> > > >>> an issue when applying the backport to SL 7.2 and OFED 4.8 RC4
> > > >>> but I think Bart fixed that already.
> > > >>> Thanks, Arlin
> > > >>>

> > > >>>> Hello Arlin and Robert,
> > > >>>>
> > > >>>> A backported version of the 4.11 ib_srp driver that already
> > > >>>> has been tested is available at
> > > >>>> https://github.com/bvanassche/ib_srp-backport.
> > > >>>> That driver builds fine against kernel.org kernels,
> > > >>>> RHEL/CentOS kernels, openSuSE and SLES kernels and also
> > > >>>> against MOFED and OFED.
> > > >>>> Has it already been considered to use that code base instead
> > > >>>> of duplicating the backporting effort of the ib_srp driver?
> > > >>>>
> > > >>>> Thanks,
> > > >>>>
> > > >>>> Bart.

> >
> > _______________________________________________
> > ewg mailing list
> > ewg at lists.openfabrics.org
> > http://lists.openfabrics.org/mailman/listinfo/ewg
