[ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Vladimir Sokolovsky vlad at mellanox.com
Thu Feb 1 12:45:01 PST 2018


Done (rdma-core-v16.2) : OFED-4.8-2-20180201-1219.tgz

Regards,
Vladimir

From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Steve Wise
Sent: Thursday, February 1, 2018 1:59 PM
To: 'Davis, Arlin R' <arlin.r.davis at intel.com>; 'Kalderon, Michal' <Michal.Kalderon at cavium.com>; ewg at lists.openfabrics.org
Cc: 'Srikakulam, Venkata' <Venkata.Srikakulam at cavium.com>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

And 2663 has been fixed in rdma-core-16.2 stable release.  I've asked Vlad to pull in that release.

Steve.



From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Thursday, February 01, 2018 1:25 PM
To: Kalderon, Michal; 'ewg at lists.openfabrics.org'
Cc: Srikakulam, Venkata
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Michal, thanks for the update.

Thanks Michal for the update. This is what we have so far:

Bug 2662<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2662&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=tAo8RuEwJ01xA%2Fsz%2FMaVOhEs%2Bn91HI7bMD3mi4bhItU%3D&reserved=0> Chelsio: Cannot set mtu greater than 1500 on SLES12Sp3 - Fixed/Closed, Thanks!
Bug 2668<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2668&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=SA6SgyqGb74nANM344ll0SYsHxPhU%2BccAfDriSO6fek%3D&reserved=0> Broadcom: Cannot change MTU to greater than default - No ETA on Fix
Bug 2669<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2669&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=Y54lcfC3PikhIyaYOwbQ3kyKBzPUN8Ob8V6qwGdy%2Bl0%3D&reserved=0> Cavium: Cannot set mtu greater than 1500 on SLES12Sp3 - ETA for fix, next week

We will need fixes before moving to RC1.

-arlin


From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com]
Sent: Thursday, February 01, 2018 7:08 AM
To: Davis, Arlin R <arlin.r.davis at intel.com<mailto:arlin.r.davis at intel.com>>; Schmidt, William R <william.r.schmidt at intel.com<mailto:william.r.schmidt at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com<mailto:Venkata.Srikakulam at cavium.com>>; Nikolova, Tatyana E <tatyana.e.nikolova at intel.com<mailto:tatyana.e.nikolova at intel.com>>; Pradeep Kankipati <pradeep.kankipati at broadcom.com<mailto:pradeep.kankipati at broadcom.com>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Opened bugzilla: http://bugs.openfabrics.org/show_bug.cgi?id=2669<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2669&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=Y54lcfC3PikhIyaYOwbQ3kyKBzPUN8Ob8V6qwGdy%2Bl0%3D&reserved=0>

Will provide a patch to qedr next week.

Thanks,
Michal

From: Davis, Arlin R [mailto:arlin.r.davis at intel.com]
Sent: Wednesday, January 31, 2018 8:56 PM
To: Kalderon, Michal <Michal.Kalderon at cavium.com<mailto:Michal.Kalderon at cavium.com>>; Schmidt, William R <william.r.schmidt at intel.com<mailto:william.r.schmidt at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com<mailto:Venkata.Srikakulam at cavium.com>>; Nikolova, Tatyana E <tatyana.e.nikolova at intel.com<mailto:tatyana.e.nikolova at intel.com>>; Pradeep Kankipati <pradeep.kankipati at broadcom.com<mailto:pradeep.kankipati at broadcom.com>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

True, thanks for catching this. Please open a qedr bug so we can track as critical/blocking.

Tatyana and Pradeep, do Intel and/or Broadcom drivers need similar changes?

Anyone else?


From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com]
Sent: Wednesday, January 31, 2018 10:29 AM
To: Davis, Arlin R <arlin.r.davis at intel.com<mailto:arlin.r.davis at intel.com>>; Schmidt, William R <william.r.schmidt at intel.com<mailto:william.r.schmidt at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com<mailto:Venkata.Srikakulam at cavium.com>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

We assumed this is a generic issue since it happened on several adapters, I see now with the bug resolve that Steve made changes specific to cxgb3
We need to make similar changes in qede.

Thanks,
Michal


From: Davis, Arlin R [mailto:arlin.r.davis at intel.com]
Sent: Wednesday, January 31, 2018 8:10 PM
To: Kalderon, Michal <Michal.Kalderon at cavium.com<mailto:Michal.Kalderon at cavium.com>>; Schmidt, William R <william.r.schmidt at intel.com<mailto:william.r.schmidt at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com<mailto:Venkata.Srikakulam at cavium.com>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes


Are you using the latest daily builds? Steve Wise reported this and forwarded patches to Vlad on Jan 19th.



Fix went into build: http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180122-1411.tgz<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdownloads.openfabrics.org%2FOFED%2Fofed-4.8-2-daily%2FOFED-4.8-2-20180122-1411.tgz&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=Cg1wyDPhJ%2F%2BH7pLFpfKo3aJZBomJ0IYOiryfuEG6XOE%3D&reserved=0>


Bug: http://bugs.openfabrics.org/show_bug.cgi?id=2662<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2662&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=tAo8RuEwJ01xA%2Fsz%2FMaVOhEs%2Bn91HI7bMD3mi4bhItU%3D&reserved=0>

Please let us know if you still have issues with latest builds.

-arlin


From: Kalderon, Michal [mailto:Michal.Kalderon at cavium.com]
Sent: Wednesday, January 31, 2018 9:15 AM
To: Schmidt, William R <william.r.schmidt at intel.com<mailto:william.r.schmidt at intel.com>>; Davis, Arlin R <arlin.r.davis at intel.com<mailto:arlin.r.davis at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Cc: Srikakulam, Venkata <Venkata.Srikakulam at cavium.com<mailto:Venkata.Srikakulam at cavium.com>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Hi

We're seeing an issue on SLES12SP3 with modifying mtu.


linux-p4eo:~ # ifconfig eth7 mtu 9000

SIOCSIFMTU: Invalid argument

Dmesg: eth7: Invalid MTU 9000 requested, hw max 1500

We've seen this with other vendor devices as well.
Chelsio reported an issue in the past regarding MTU change,
But I didn't see any related bugs open or discussions.

Has this been discussed since?
Attaching relevant email

Thanks,
Michal


From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Schmidt, William R
Sent: Monday, January 29, 2018 10:15 PM
To: Davis, Arlin R <arlin.r.davis at intel.com<mailto:arlin.r.davis at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Yes please. The omitted commits are listed in the OFED bugs.

From: Davis, Arlin R
Sent: Monday, January 29, 2018 2:06 PM
To: Schmidt, William R <william.r.schmidt at intel.com<mailto:william.r.schmidt at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Cc: vlad at dev.mellanox.co.il<mailto:vlad at dev.mellanox.co.il>; Woodruff, Robert J <robert.j.woodruff at intel.com<mailto:robert.j.woodruff at intel.com>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Bill, thanks for the update. Do you need Vlad's help getting these fixes into OFED 4.8 compat-rdma?

From: Schmidt, William R
Sent: Monday, January 29, 2018 10:37 AM
To: Davis, Arlin R <arlin.r.davis at intel.com<mailto:arlin.r.davis at intel.com>>; 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes


>>2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 - Need update from Bill Schmidt (Intel)

Bug 2664<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2664&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=ol6qYfsvWvNtVxkY0pinBhI7mp5cMRe9dcoSuikyWD0%3D&reserved=0> - Bonding doesn't work on RHEL 7.4

Bonding driver form RHEL 7.4 has integrated commit: https://github.com/torvalds/linux/commit/b5bf0f5b16b9c316c34df9f31d4be8729eb86845<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Ftorvalds%2Flinux%2Fcommit%2Fb5bf0f5b16b9c316c34df9f31d4be8729eb86845&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=SbYM7B0aSA9LFMrTCCSRu%2BQXt2rjJIJZzFHNZCq9Dk0%3D&reserved=0>

This requires ipoib driver to return correct speed and duplex mode.  Commit adding this feature on ipoib driver:

https://github.com/torvalds/linux/commit/0d7e2d2166f6b0b7d1959ca858052a15feb574cc<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Ftorvalds%2Flinux%2Fcommit%2F0d7e2d2166f6b0b7d1959ca858052a15feb574cc&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=N6UBmqF0kVccIpG%2FBaK6g4uRHRPdjmDAjFwCbVGhaxw%3D&reserved=0> was added in 4.12 kernel, so it is missing from OFED 4.8 compat-rdma.

In consequence Bonding driver cannot retrieve required data and fails.

Bug 2665<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fbugs.openfabrics.org%2Fshow_bug.cgi%3Fid%3D2665&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=DIcZRtv7%2BW6WawndvELdPHc5%2FmOgy3rODxPlM8LW4Zs%3D&reserved=0> - Bonding causes kernel panic on SLES 12.1 and SLES 12.2

Bonding driver from SLES 12.1/12.2 is missing kernel panic fix commit: https://github.com/torvalds/linux/commit/1533e77315220dc1d5ec3bd6d9fe32e2aa0a74c0<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Ftorvalds%2Flinux%2Fcommit%2F1533e77315220dc1d5ec3bd6d9fe32e2aa0a74c0&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=%2BewXwTldMAwfffw9Ov3sryygsQBOrzdcxRcshKu3AL0%3D&reserved=0>

added in linux kernel 4.8. This makes it incompatible with ipoib drivers from OFED 4.8 compat-rdma.


From: ewg [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Monday, January 29, 2018 11:57 AM
To: 'ewg at lists.openfabrics.org' <ewg at lists.openfabrics.org<mailto:ewg at lists.openfabrics.org>>
Subject: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Attendees                      Company
Pradeep Kankipati         Broadcom
Steve Wise                     Chelsio
Robert Woodruff            Intel
Arlin Davis                      Intel
Vladimir Sokolovsky       Mellanox
Ariel Elior                       Cavium
Michal Kalderon             Cavium
Michael Rice                       HPE



Minutes:



*         Opens - none



  *   OFED 4.8-2:  http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180124-0818.tgz<https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdownloads.openfabrics.org%2FOFED%2Fofed-4.8-2-daily%2FOFED-4.8-2-20180124-0818.tgz&data=02%7C01%7Cvlad%40mellanox.com%7C446eb3a2e6c049530a2008d569ae4ab4%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636531119663401238&sdata=%2B%2FNurCyB92TPIZVqoT0eU90EaLhC66Q1w%2FnETBlzngo%3D&reserved=0>



Status:  RH7.4 and SLES12.3 backports added.

                Updated packages:  rdma_core-v16, perftest 4.1-0.2, libfabric 1.5.3

                Installation changes:  --without-depcheck docs, vmw_pvrdma moved out of tech preview

             Test Status:  Intel - build RH 7.0, 7.1, 7.2, 7.3, 7.4 SLES 12, 12.1, 12.2, and 12.3 - Passed



Known issues to be resolved before RC1:



1. Bug #2663 - (P1) rping fails, iwpmd hitting segfault on SLES12.3 -

        *   Chelsio validation team hits bug, engineering team cannot reproduce.
        *   Steve (Chelsio) needs help from Tatyana's (Intel) team to reproduce and isolate.

2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 - Need update from Bill Schmidt (Intel)



  *   OFED 4.8-2 RC1 schedule: (2 blocking bugs)



Plan is to clean up bugs this week and push hard for RC1 by Friday.

                The GA plan is to go from RC1 to GA, 1-2 week RC1 validation, and Feb 16th for a GA target.

Board approved OFED 4.8-2 so we can move to GA as soon as EWG is ready.



Regards,



Arlin


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/ewg/attachments/20180201/28d07cb6/attachment.html>


More information about the ewg mailing list