[ofa-general] Ofed v1.2rc2 IPoIB
Scott Shaw
sshaw at sgi.com
Fri Jun 29 14:42:55 PDT 2007
Hi,
I have a small cluster set up with NFS running over an IPoIB device, and
I am seeing a high rate of "transmit timed out" errors being logged in
/var/log/messages. What could be causing the problem, and is there a
fix?
I am using a dual port DDR Mellanox Technologies MT25208 HCA within a
DDR IB fabric.
/etc/init.d/openibd status reports:
HCA driver loaded
Configured devices:
ib0
Currently active devices:
ib0
The following OFED modules are loaded:
rdma_ucm
rdma_cm
ib_addr
ib_local_sa
ib_ipoib
ib_ipath
ib_mthca
ib_uverbs
ib_umad
ib_sa
ib_cm
ib_mad
ib_core
SUSE Linux Enterprise Server 10 (x86_64)
VERSION = 10
PATCHLEVEL = 1
Jun 29 15:46:57 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:46:57 service2 kernel: ib0: transmit timeout: latency 1576 msecs
Jun 29 15:46:57 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:46:58 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:46:58 service2 kernel: ib0: transmit timeout: latency 2576 msecs
Jun 29 15:46:58 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:46:59 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:46:59 service2 kernel: ib0: transmit timeout: latency 3576 msecs
Jun 29 15:46:59 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:47:00 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:47:00 service2 kernel: ib0: transmit timeout: latency 4576 msecs
Jun 29 15:47:00 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:47:01 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:47:01 service2 kernel: ib0: transmit timeout: latency 5576 msecs
Jun 29 15:47:01 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:47:02 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:47:02 service2 kernel: ib0: transmit timeout: latency 6576 msecs
Jun 29 15:47:02 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:47:03 service2 kernel: NETDEV WATCHDOG: ib0: transmit timed out
Jun 29 15:47:03 service2 kernel: ib0: transmit timeout: latency 7576 msecs
Jun 29 15:47:03 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
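In case it helps with diagnosis, here is a rough Python sketch for pulling the numbers out of these messages. It assumes the log lines have been rejoined so that each message sits on one line, and the names (summarize, LATENCY_RE, QUEUE_RE) are only illustrative, not part of OFED. For the excerpt above it shows that tx_head minus tx_tail stays at 64 and that tx_tail never advances while the reported latency keeps growing.

```python
import re

# Two of the timeout reports from the log above, one message per line.
LOG = """\
Jun 29 15:46:57 service2 kernel: ib0: transmit timeout: latency 1576 msecs
Jun 29 15:46:57 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
Jun 29 15:46:58 service2 kernel: ib0: transmit timeout: latency 2576 msecs
Jun 29 15:46:58 service2 kernel: ib0: queue stopped 1, tx_head 6355, tx_tail 6291
"""

# Patterns are written against the message text shown in this post only.
LATENCY_RE = re.compile(r"transmit timeout: latency (\d+) msecs")
QUEUE_RE = re.compile(r"queue stopped (\d+), tx_head (\d+), tx_tail (\d+)")


def summarize(text):
    """Return (latencies, outstanding, tail_stuck) for a block of log text."""
    latencies = [int(m.group(1)) for m in LATENCY_RE.finditer(text)]
    queues = [tuple(int(g) for g in m.groups()) for m in QUEUE_RE.finditer(text)]
    # Sends posted but not yet completed at each report.
    outstanding = [head - tail for _stopped, head, tail in queues]
    # True when every report shows the same tx_tail, i.e. no completions.
    tail_stuck = len({tail for _stopped, _head, tail in queues}) == 1
    return latencies, outstanding, tail_stuck


latencies, outstanding, tail_stuck = summarize(LOG)
print("latencies (ms):", latencies)
print("outstanding sends:", outstanding)
print("tx_tail unchanged:", tail_stuck)
```

Running the same function over the full /var/log/messages text should show whether tx_tail ever moves between watchdog reports.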
TIA!
Scott Shaw
SILICON GRAPHICS | The Source of Innovation and Discovery
Office Ph: 734.437.6397 Cell Ph: 734.564.3832
Email:sshaw at sgi.com http://www.sgi.com