[ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Davis, Arlin R arlin.r.davis at intel.com
Fri Feb 9 09:11:24 PST 2018


Thanks Pradeep!

Let’s move forward with RC1, given that the one remaining bug is not critical. There was a new critical bug (2671) for NFS-RDMA on RH7.4, but that combination is not supported in OFED 4.8, so it was closed.

Reminder to all: ULP support in 4.8-2 will remain the same as in 4.8-1, as follows (see the 4.8-1 release notes):

ULP and Driver restrictions:
    - NVMe-oF (kernel 4.8 only)
    - NFS-RDMA (SLES12.2 and RH7.3 only)
    - i40iw 10GbE iWARP Adapter (kernel 4.8 only)
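
As a rough illustration of how the restriction list above can be checked mechanically, here is a minimal Python sketch. The table name, the function name, and the platform label strings are hypothetical and chosen only for this example; treating unlisted components as unrestricted is also an assumption, not something the release notes state.

```python
# Hypothetical support table built from the restriction list above.
# The platform labels are illustrative strings, not official identifiers.
ULP_RESTRICTIONS = {
    "NVMe-oF": {"kernel 4.8"},
    "NFS-RDMA": {"SLES12.2", "RH7.3"},
    "i40iw": {"kernel 4.8"},
}


def is_supported(ulp: str, platform: str) -> bool:
    """Return True if the ULP/driver is listed as supported on the platform.

    Assumption: components with no entry in the table are unrestricted.
    """
    allowed = ULP_RESTRICTIONS.get(ulp)
    return allowed is None or platform in allowed


print(is_supported("NFS-RDMA", "RH7.3"))
print(is_supported("NFS-RDMA", "RH7.4"))
```

Under this table, NFS-RDMA on RH7.4 is reported as unsupported, which is the combination that led to bug 2671 being closed.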

Vlad, please promote latest build to RC1:  http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180208-0920.tgz

Paul and Stefan, if the interop event needs NFS-RDMA support, we can support that with RH7.3.

Thanks,

Arlin





From: Pradeep Kankipati [mailto:pradeep.kankipati at broadcom.com]
Sent: Friday, February 09, 2018 8:07 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>
Cc: Vladimir Sokolovsky <vlad at dev.mellanox.co.il>; Kalderon, Michal <Michal.Kalderon at cavium.com>; ewg at lists.openfabrics.org; Woodruff, Robert J <robert.j.woodruff at intel.com>; Stefan Oesterreich <soesterreich at iol.unh.edu>; Devesh Sharma <devesh.sharma at broadcom.com>; Bowden, Paul <paul.bowden at intel.com>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Hi Arlin,

I think one of them has been fixed, but the other may not be serious enough to block RC1. The engineers will update the BZ before end of day.

Thanks,
Pradeep
--


On Thu, Feb 8, 2018 at 10:53 PM, Davis, Arlin R <arlin.r.davis at intel.com> wrote:
Hello all,

We still have a few things left before we can get to RC1 this Friday. IWG is anxiously waiting for RC1.

Michal, thanks for the fix for the following (please close the bug):
#2669 “Cannot set mtu greater than 1500 on SLES12.3”

Pradeep, do you have an update from your team for the following (please update bugs):
#2666 “hwrm req_type 0x51 seq id 0x140 error 0x2 failure with lldpad service on RH7.4”
#2668 “Cannot change MTU to greater than default”

Vlad, can you look at the rdmavt issue and fix the driver load problem?
#2670 “unable to see QLE7340/QLE7342 adapters, rdmavt missing in ofed 4.8-2 install”

Thanks,

Arlin


From: Vladimir Sokolovsky [mailto:vlad at dev.mellanox.co.il]
Sent: Wednesday, February 07, 2018 8:11 AM
To: Kalderon, Michal <Michal.Kalderon at cavium.com>; Pradeep Kankipati <pradeep.kankipati at broadcom.com>; Davis, Arlin R <arlin.r.davis at intel.com>
Cc: ewg at lists.openfabrics.org; Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>

Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes


Hi Michal,

Merged + build: OFED-4.8-2-20180207-0745



Regards,

Vladimir

On 02/07/2018 09:41 AM, Kalderon, Michal wrote:
Hi Arlin,

Sorry for the delay, had some logistic stuff to work out.

Hope this is sufficient:
Vlad,

Please pull the following fix:
https://github.com/mkalderon/ofed-compat-rdma/commit/01d945c12286b1ad8960ffa74b64fcd256c873e7

Let me know if you prefer that I email a patch.
I updated the bug but left it assigned to me -> not sure who I’m supposed to assign it to at this point?

Thanks,
Michal

From: Pradeep Kankipati [mailto:pradeep.kankipati at broadcom.com]
Sent: Wednesday, February 07, 2018 9:27 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; Kalderon, Michal <Michal.Kalderon at cavium.com>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>; ewg at lists.openfabrics.org; Vladimir Sokolovsky <vlad at mellanox.com>; Woodruff, Robert J <robert.j.woodruff at intel.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Hi Arlin,

Sorry, just coming back today from sick leave. Let me look into this.

Thanks,
Pradeep
--

From: Davis, Arlin R [mailto:arlin.r.davis at intel.com]
Sent: Wednesday, February 7, 2018 1:18 AM
To: 'Kalderon, Michal'; 'Pradeep Kankipati'
Cc: 'Srikakulam, Venkata'; 'ewg at lists.openfabrics.org'; Vladimir Sokolovsky; Woodruff, Robert J
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Michal and Pradeep,

OFA Interop Working Group is anxiously waiting for RC1 for Interop testing (scheduled to start this week).
Is it possible to get Vlad some patches soon so we can move to RC1 by end of the week?

Thanks,
Arlin


From: Davis, Arlin R
Sent: Thursday, February 01, 2018 11:25 AM
To: Kalderon, Michal <Michal.Kalderon at cavium.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>; Pradeep Kankipati <pradeep.kankipati at broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Michal, thanks for the update. This is what we have so far:

Bug 2662<http://bugs.openfabrics.org/show_bug.cgi?id=2662> Chelsio: Cannot set mtu greater than 1500 on SLES12Sp3 – Fixed/Closed, Thanks!
Bug 2668<http://bugs.openfabrics.org/show_bug.cgi?id=2668> Broadcom: Cannot change MTU to greater than default – No ETA on Fix
Bug 2669<http://bugs.openfabrics.org/show_bug.cgi?id=2669> Cavium: Cannot set mtu greater than 1500 on SLES12Sp3 – ETA for fix, next week

We will need fixes before moving to RC1.

-arlin


From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com]
Sent: Thursday, February 01, 2018 7:08 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; Schmidt, William R <william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>; Nikolova, Tatyana E <tatyana.e.nikolova at intel.com>; Pradeep Kankipati <pradeep.kankipati at broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Opened bugzilla: http://bugs.openfabrics.org/show_bug.cgi?id=2669

Will provide a patch to qedr next week.

Thanks,
Michal

From: Davis, Arlin R [mailto:arlin.r.davis at intel.com]
Sent: Wednesday, January 31, 2018 8:56 PM
To: Kalderon, Michal <Michal.Kalderon at cavium.com>; Schmidt, William R <william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>; Nikolova, Tatyana E <tatyana.e.nikolova at intel.com>; Pradeep Kankipati <pradeep.kankipati at broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

True, thanks for catching this. Please open a qedr bug so we can track it as critical/blocking.

Tatyana and Pradeep, do Intel and/or Broadcom drivers need similar changes?

Anyone else?


From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com]
Sent: Wednesday, January 31, 2018 10:29 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; Schmidt, William R <william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

We assumed this was a generic issue since it happened on several adapters. I see now from the bug resolution that Steve made changes specific to cxgb3.
We need to make similar changes in qede.

Thanks,
Michal


From: Davis, Arlin R [mailto:arlin.r.davis at intel.com]
Sent: Wednesday, January 31, 2018 8:10 PM
To: Kalderon, Michal <Michal.Kalderon at cavium.com>; Schmidt, William R <william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes


Are you using the latest daily builds? Steve Wise reported this and forwarded patches to Vlad on Jan 19th.



Fix went into build: http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180122-1411.tgz


Bug: http://bugs.openfabrics.org/show_bug.cgi?id=2662

Please let us know if you still have issues with latest builds.

-arlin


From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com]
Sent: Wednesday, January 31, 2018 9:15 AM
To: Schmidt, William R <william.r.schmidt at intel.com>; Davis, Arlin R <arlin.r.davis at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Hi

We’re seeing an issue on SLES12SP3 with modifying mtu.


linux-p4eo:~ # ifconfig eth7 mtu 9000

SIOCSIFMTU: Invalid argument

Dmesg: eth7: Invalid MTU 9000 requested, hw max 1500
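
The dmesg line above has the same wording as the core net-device MTU range check that upstream Linux gained in 4.10, where Ethernet devices default to a maximum MTU of 1500 unless the driver advertises a larger one. Assuming that is the check being hit here (the thread does not confirm it), the behaviour can be modelled with a minimal Python sketch. The function name and the 9600 maximum are illustrative only; this is a model of the logic, not kernel code.

```python
import errno


def check_mtu(name, new_mtu, min_mtu, max_mtu):
    """Model of a min/max MTU range check.

    Returns (0, None) on success, or (-errno.EINVAL, message) on failure.
    A max_mtu of 0 means "no upper limit enforced".
    """
    if new_mtu < 0 or new_mtu < min_mtu:
        msg = f"{name}: Invalid MTU {new_mtu} requested, hw min {min_mtu}"
        return -errno.EINVAL, msg
    if max_mtu > 0 and new_mtu > max_mtu:
        msg = f"{name}: Invalid MTU {new_mtu} requested, hw max {max_mtu}"
        return -errno.EINVAL, msg
    return 0, None


# Driver left at the Ethernet default maximum of 1500: the request is rejected.
print(check_mtu("eth7", 9000, 68, 1500))

# Driver that advertises a larger maximum (9600 is an illustrative value).
print(check_mtu("eth7", 9000, 68, 9600))
```

If this is indeed the cause, the fix belongs in each affected driver, which has to advertise its real hardware maximum instead of inheriting the default.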

We’ve seen this with other vendor devices as well.
Chelsio reported an issue in the past regarding MTU changes,
but I didn’t see any related open bugs or discussions.

Has this been discussed since?
Attaching relevant email

Thanks,
Michal


From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Schmidt, William R
Sent: Monday, January 29, 2018 10:15 PM
To: Davis, Arlin R <arlin.r.davis at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Yes please. The omitted commits are listed in the OFED bugs.

From: Davis, Arlin R
Sent: Monday, January 29, 2018 2:06 PM
To: Schmidt, William R <william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: vlad at dev.mellanox.co.il; Woodruff, Robert J <robert.j.woodruff at intel.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Bill, thanks for the update. Do you need Vlad’s help getting these fixes into OFED 4.8 compat-rdma?

From: Schmidt, William R
Sent: Monday, January 29, 2018 10:37 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes


>>2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 – Need update from Bill Schmidt (Intel)

Bug 2664<http://bugs.openfabrics.org/show_bug.cgi?id=2664> - Bonding doesn't work on RHEL 7.4

The bonding driver from RHEL 7.4 has integrated commit: https://github.com/torvalds/linux/commit/b5bf0f5b16b9c316c34df9f31d4be8729eb86845

This requires the ipoib driver to return the correct speed and duplex mode. The commit adding this feature to the ipoib driver is:

https://github.com/torvalds/linux/commit/0d7e2d2166f6b0b7d1959ca858052a15feb574cc

It was added in the 4.12 kernel, so it is missing from OFED 4.8 compat-rdma.

As a consequence, the bonding driver cannot retrieve the required data and fails.

Bug 2665<http://bugs.openfabrics.org/show_bug.cgi?id=2665> - Bonding causes kernel panic on SLES 12.1 and SLES 12.2

The bonding driver from SLES 12.1/12.2 is missing the kernel panic fix commit: https://github.com/torvalds/linux/commit/1533e77315220dc1d5ec3bd6d9fe32e2aa0a74c0

which was added in Linux kernel 4.8. This makes it incompatible with the ipoib drivers from OFED 4.8 compat-rdma.
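
Both bugs come down to comparing the kernel release in which a commit first appeared with the kernel a given component is based on. A minimal Python sketch of that comparison, using hypothetical helper names and only the version numbers quoted for bug 2664, might look like this:

```python
def parse_version(text):
    """Turn a dotted kernel version such as '4.12' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def missing_from_base(fix_version, base_version):
    """Return True if a commit first released in fix_version is newer than
    the base kernel, i.e. it would have to be backported explicitly."""
    return parse_version(fix_version) > parse_version(base_version)


# Bug 2664: the ipoib speed/duplex commit landed in 4.12,
# while OFED 4.8 compat-rdma is based on 4.8.
print(missing_from_base("4.12", "4.8"))
```

The versions are compared as integer tuples on purpose: a plain string comparison would rank "4.12" below "4.8" and hide the missing commit.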


From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Monday, January 29, 2018 11:57 AM
To: 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Subject: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Attendees                      Company
Pradeep Kankipati         Broadcom
Steve Wise                     Chelsio
Robert Woodruff            Intel
Arlin Davis                      Intel
Vladimir Sokolovsky       Mellanox
Ariel Elior                       Cavium
Michal Kalderon             Cavium
Michael Rice                       HPE



Minutes:



•         Opens - none


•         OFED 4.8-2:  http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180124-0818.tgz



Status:  RH7.4 and SLES12.3 backports added.

                Updated packages:  rdma_core-v16, perftest 4.1-0.2, libfabric 1.5.3

                Installation changes:  --without-depcheck docs, vmw_pvrdma moved out of tech preview

             Test Status:  Intel - build RH 7.0, 7.1, 7.2, 7.3, 7.4 SLES 12, 12.1, 12.2, and 12.3 - Passed



Known issues to be resolved before RC1:



1. Bug #2663 - (P1) rping fails, iwpmd hitting segfault on SLES12.3 –
•Chelsio validation team hits bug, engineering team cannot reproduce.
•Steve (Chelsio) needs help from Tatyana’s (Intel) team to reproduce and isolate.

2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 – Need update from Bill Schmidt (Intel)


•         OFED 4.8-2 RC1 schedule: (2 blocking bugs)



Plan is to clean up bugs this week and push hard for RC1 by Friday.

                The GA plan is to go from RC1 to GA, with 1-2 weeks of RC1 validation and a GA target of Feb 16th.

Board approved OFED 4.8-2 so we can move to GA as soon as EWG is ready.



Regards,



Arlin





_______________________________________________

ewg mailing list

ewg at lists.openfabrics.org

http://lists.openfabrics.org/mailman/listinfo/ewg

