[ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Steve Wise swise at opengridcomputing.com
Thu Feb 1 11:59:10 PST 2018


And 2663 has been fixed in rdma-core-16.2 stable release.  I've asked Vlad
to pull in that release.

Steve.
 
 
 
From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Davis,
Arlin R
Sent: Thursday, February 01, 2018 1:25 PM
To: Kalderon, Michal; 'ewg at lists.openfabrics.org'
Cc: Srikakulam, Venkata
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US
Pacific Time (12pm EST) - Minutes
 
Michal, thanks for the update.
 
Thanks Michal for the update. This is what we have so far: 
 
 <http://bugs.openfabrics.org/show_bug.cgi?id=2662> Bug 2662 Chelsio: Cannot
set mtu greater than 1500 on SLES12Sp3 - Fixed/Closed, Thanks!
 <http://bugs.openfabrics.org/show_bug.cgi?id=2668> Bug 2668 Broadcom:
Cannot change MTU to greater than default - No ETA on Fix
 <http://bugs.openfabrics.org/show_bug.cgi?id=2669> Bug 2669 Cavium: Cannot
set mtu greater than 1500 on SLES12Sp3 - ETA for fix, next week 
 
We will need fixes before moving to RC1. 
 
-arlin
 
 
From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com] 
Sent: Thursday, February 01, 2018 7:08 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; Schmidt, William R
<william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>; Nikolova, Tatyana E
<tatyana.e.nikolova at intel.com>; Pradeep Kankipati
<pradeep.kankipati at broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
Opened bugzilla: http://bugs.openfabrics.org/show_bug.cgi?id=2669
 
Will provide a patch to qedr next week. 
 
Thanks, 
Michal
 
From: Davis, Arlin R [mailto:arlin.r.davis at intel.com] 
Sent: Wednesday, January 31, 2018 8:56 PM
To: Kalderon, Michal <Michal.Kalderon at cavium.com>; Schmidt, William R
<william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>; Nikolova, Tatyana E
<tatyana.e.nikolova at intel.com>; Pradeep Kankipati
<pradeep.kankipati at broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
True, thanks for catching this. Please open a qedr bug so we can track as
critical/blocking.
 
Tatyana and Pradeep, do Intel and/or Broadcom drivers need similar changes? 
 
Anyone else?
 
 
From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com] 
Sent: Wednesday, January 31, 2018 10:29 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; Schmidt, William R
<william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
We assumed this is a generic issue since it happened on several adapters, I
see now with the bug resolve that Steve made changes specific to cxgb3
We need to make similar changes in qede.
 
Thanks,
Michal
 
 
From: Davis, Arlin R [mailto:arlin.r.davis at intel.com] 
Sent: Wednesday, January 31, 2018 8:10 PM
To: Kalderon, Michal <Michal.Kalderon at cavium.com>; Schmidt, William R
<william.r.schmidt at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
Are you using the latest daily builds? Steve Wise reported this and
forwarded patches to Vlad on Jan 19th. 
 
Fix went into build:
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180122-1
411.tgz
 
Bug: http://bugs.openfabrics.org/show_bug.cgi?id=2662
 
Please let us know if you still have issues with latest builds.
 
-arlin
 
 
From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com] 
Sent: Wednesday, January 31, 2018 9:15 AM
To: Schmidt, William R <william.r.schmidt at intel.com>; Davis, Arlin R
<arlin.r.davis at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
Hi
 
We're seeing an issue on SLES12SP3 with modifying mtu. 
 
linux-p4eo:~ # ifconfig eth7 mtu 9000
SIOCSIFMTU: Invalid argument
Dmesg: eth7: Invalid MTU 9000 requested, hw max 1500
 
We've seen this with other vendor devices as well. 
Chelsio reported an issue in the past regarding MTU change,
But I didn't see any related bugs open or discussions. 
 
Has this been discussed since? 
Attaching relevant email
 
Thanks,
Michal
 
 
From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Schmidt,
William R
Sent: Monday, January 29, 2018 10:15 PM
To: Davis, Arlin R <arlin.r.davis at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US
Pacific Time (12pm EST) - Minutes
 
Yes please. The omitted commits are listed in the OFED bugs.
 
From: Davis, Arlin R 
Sent: Monday, January 29, 2018 2:06 PM
To: Schmidt, William R <william.r.schmidt at intel.com>;
'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Cc: vlad at dev.mellanox.co.il; Woodruff, Robert J
<robert.j.woodruff at intel.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
Bill, thanks for the update. Do you need Vlad's help getting these fixes
into OFED 4.8 compat-rdma?
 
From: Schmidt, William R 
Sent: Monday, January 29, 2018 10:37 AM
To: Davis, Arlin R <arlin.r.davis at intel.com>; 'ewg at lists.openfabrics.org'
<ewg at lists.openfabrics.org>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time
(12pm EST) - Minutes
 
>>2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 -
Need update from Bill Schmidt (Intel)
 
Bug <http://bugs.openfabrics.org/show_bug.cgi?id=2664>  2664 - Bonding
doesn't work on RHEL 7.4
Bonding driver form RHEL 7.4 has integrated commit:
<https://github.com/torvalds/linux/commit/b5bf0f5b16b9c316c34df9f31d4be8729e
b86845>
https://github.com/torvalds/linux/commit/b5bf0f5b16b9c316c34df9f31d4be8729eb
86845
This requires ipoib driver to return correct speed and duplex mode.  Commit
adding this feature on ipoib driver:
 
<https://github.com/torvalds/linux/commit/0d7e2d2166f6b0b7d1959ca858052a15fe
b574cc>
https://github.com/torvalds/linux/commit/0d7e2d2166f6b0b7d1959ca858052a15feb
574cc was added in 4.12 kernel, so it is missing from OFED 4.8 compat-rdma.
In consequence Bonding driver cannot retrieve required data and fails.
 
 <http://bugs.openfabrics.org/show_bug.cgi?id=2665> Bug 2665 - Bonding
causes kernel panic on SLES 12.1 and SLES 12.2
Bonding driver from SLES 12.1/12.2 is missing kernel panic fix commit:
<https://github.com/torvalds/linux/commit/1533e77315220dc1d5ec3bd6d9fe32e2aa
0a74c0>
https://github.com/torvalds/linux/commit/1533e77315220dc1d5ec3bd6d9fe32e2aa0
a74c0
added in linux kernel 4.8. This makes it incompatible with ipoib drivers
from OFED 4.8 compat-rdma.
 
From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Davis,
Arlin R
Sent: Monday, January 29, 2018 11:57 AM
To: 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org>
Subject: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific
Time (12pm EST) - Minutes
 
Attendees                      Company     
Pradeep Kankipati         Broadcom      
Steve Wise                     Chelsio
Robert Woodruff            Intel
Arlin Davis                      Intel                                 
Vladimir Sokolovsky       Mellanox                            
Ariel Elior                       Cavium   
Michal Kalderon             Cavium   
Michael Rice                       HPE
 
Minutes:   
 
*         Opens - none
    
*	OFED 4.8-2:
<http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180124-
0818.tgz>
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180124-0
818.tgz
 
Status:  RH7.4 and SLES12.3 backports added. 
                Updated packages:  rdma_core-v16, perftest 4.1-0.2,
libfabric 1.5.3
                Installation changes:  --without-depcheck docs, vmw_pvrdma
moved out of tech preview
             Test Status:  Intel - build RH 7.0, 7.1, 7.2, 7.3, 7.4 SLES 12,
12.1, 12.2, and 12.3 - Passed
                        
Known issues to be resolved before RC1:
 
1. Bug #2663 - (P1) rping fails, iwpmd hitting segfault on SLES12.3 - 
*	Chelsio validation team hits bug, engineering team cannot reproduce.

*	Steve (Chelsio) needs help from Tatyana's (Intel) team to reproduce
and isolate.
2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 -
Need update from Bill Schmidt (Intel)
        
*	OFED 4.8-2 RC1 schedule: (2 blocking bugs) 
 
Plan is to clean up bugs this week and push hard for RC1 by Friday.
                The GA plan is to go from RC1 to GA, 1-2 week RC1
validation, and Feb 16th for a GA target.
Board approved OFED 4.8-2 so we can move to GA as soon as EWG is ready.
 
Regards,
 
Arlin
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/ewg/attachments/20180201/15246144/attachment.html>


More information about the ewg mailing list