[ofa-general] RHEL 5.3 (2.6.18-128.1.1.el5 kernel) and connected mode

Tziporet Koren tziporet at dev.mellanox.co.il
Sun Aug 16 02:23:18 PDT 2009


Robert Cummins wrote:
> Hello,
>
> IHAC that is experiencing a problem with IB.  Specifically, when placing
> the Infinihost III card in connected mode with 'echo connected
>   
>> /sys/class/net/ib0/mode' some nodes stop responding.  By 'stop
>>     
> responding' I mean:
>
>   - ping <ib ip address> doesn't work (no packets returned; 100% packet
> loss)
>   - ib_rdma_bw -b node never runs
>   - ibping does work
>
>   
...


> It should be noted that the I have four nodes that fail and nearly 20
> that 'work'.   The failing nodes are running the same kernel
> (2.6.18-128.el5) while the working nodes are running the
> 2.6.18-128.1.1.el5 kernel.  I am at a loss as to how to proceed with
> debugging this short of getting the latest OFED distro and building it.
>
> Has anyone else run into this problem and if so, how did you get around
> it?  
>
>   
What is the FW version you use?
Can you see if there are any interesting messages in /var/log/messages, 
especially from mthca driver

Tziporet




More information about the general mailing list