[Users] Troubleshoot low LinkSpeed
Rupert Dance - SFI
rsdance at soft-forge.com
Wed Apr 14 12:01:54 PDT 2021
Firmware updates on the HCAs are a great place to start and solve 90% of
these problems.
Thanks
Rupert
From: Users <users-bounces at lists.openfabrics.org> On Behalf Of Michael
Robbert
Sent: Wednesday, April 14, 2021 2:42 PM
To: users at lists.openfabrics.org
Subject: [Users] Troubleshoot low LinkSpeed
I just upgraded a couple of nodes in one of our clusters from CentOS 6 to
CentOS 7 and after the upgrade the Infiniband connection dropped from QDR
rates to DDR rates.
I'm trying to figure out how to troubleshoot or fix this. If anybody has
seen this and knows what is going on that would be great too.
The 2 hosts in question have QLogic IBA7322 HCAs which is using the ib_qib
driver.
There are other hosts connected to the same switch that have the same HCA,
but haven't been upgraded from CentOS 6 to 7 yet and they are connecting at
full QDR speeds.
Ibstatus shows:
Infiniband device 'qib0' port 1 status:
default gid: fe80:0000:0000:0000:0011:7500:0070:2f82
base lid: 0xb4
sm lid: 0x115
state: 4: ACTIVE
phys state: 5: LinkUp
rate: 20 Gb/sec (4X DDR)
link_layer: InfiniBand
ibportstate shows an active link speed of 5.0 Gbps, but also shows that 10.0
Gbps is supported
[root at compute128 ~]# ibportstate 180 1 query
CA/RT PortInfo:
# Port info: Lid 180 port 1
LinkState:.......................Active
PhysLinkState:...................LinkUp
Lid:.............................180
SMLid:...........................277
LMC:.............................0
LinkWidthSupported:..............1X or 4X
LinkWidthEnabled:................1X or 4X
LinkWidthActive:.................4X
LinkSpeedSupported:..............2.5 Gbps or 5.0 Gbps or 10.0 Gbps
LinkSpeedEnabled:................2.5 Gbps or 10.0 Gbps
LinkSpeedActive:.................5.0 Gbps
Mkey:............................<not displayed>
MkeyLeasePeriod:.................0
ProtectBits:.....................0
I have tried changing the speed with the ibportstate command, but it fails:
[root at compute128 ~]# ibportstate 180 1 speed 10
Initial CA/RT PortInfo:
# Port info: Lid 180 port 1
LinkState:.......................Active
PhysLinkState:...................LinkUp
Lid:.............................180
SMLid:...........................277
LMC:.............................0
LinkWidthSupported:..............1X or 4X
LinkWidthEnabled:................1X or 4X
LinkWidthActive:.................4X
LinkSpeedSupported:..............2.5 Gbps or 5.0 Gbps or 10.0 Gbps
LinkSpeedEnabled:................2.5 Gbps or 10.0 Gbps
LinkSpeedActive:.................5.0 Gbps
Mkey:............................<not displayed>
MkeyLeasePeriod:.................0
ProtectBits:.....................0
ibportstate: iberror: failed: smp set portinfo failed
Any thoughts on how to troubleshoot or fix this would be appreciated.
Thanks,
Mike Robbert
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/users/attachments/20210414/2483ff6d/attachment.htm>
More information about the Users
mailing list