[ofa-general] RHEL 5.3 (2.6.18-128.1.1.el5 kernel) and connected mode
Tziporet Koren
tziporet at dev.mellanox.co.il
Sun Aug 16 02:23:18 PDT 2009
Robert Cummins wrote:
> Hello,
>
> IHAC that is experiencing a problem with IB. Specifically, when placing
> the Infinihost III card in connected mode with 'echo connected
>
>> /sys/class/net/ib0/mode' some nodes stop responding. By 'stop
>>
> responding' I mean:
>
> - ping <ib ip address> doesn't work (no packets returned; 100% packet
> loss)
> - ib_rdma_bw -b node never runs
> - ibping does work
>
>
...
> It should be noted that the I have four nodes that fail and nearly 20
> that 'work'. The failing nodes are running the same kernel
> (2.6.18-128.el5) while the working nodes are running the
> 2.6.18-128.1.1.el5 kernel. I am at a loss as to how to proceed with
> debugging this short of getting the latest OFED distro and building it.
>
> Has anyone else run into this problem and if so, how did you get around
> it?
>
>
What is the FW version you use?
Can you see if there are any interesting messages in /var/log/messages,
especially from mthca driver
Tziporet
More information about the general
mailing list