[ofa-general] [NFS/RDMA] Can't mount NFS/RDMA partition

Celine Bourde celine.bourde at ext.bull.net
Thu Dec 18 00:30:31 PST 2008


Jeff Becker wrote:
> Hi Celine.
>
> Celine Bourde wrote:
>   
>> Hi,
>>
>> I can't mount an NFS/RDMA partition.
>> I've followed the instructions in
>> http://www.openfabrics.org//downloads/OFED/ofed-1.4/OFED-1.4-docs/nfs-rdma.release-notes.txt
>>
>>
>> Every step (loading modules, setting up /etc/exports, starting the
>> nfs daemon, etc.) seems to be OK, but when I run the last command:
>> mount -o rdma,port=2050 192.168.0.13:/export /tmp/nfs_client/
>> the mount process hangs, even though the last dmesg output looks correct:
>> "RPC: Registered rdma transport module.
>> rpcrdma: connection to 192.168.0.13:2050 on mlx4_0, memreg 5 slots 32 ird 16
>> "
>>     
>
> I've successfully tested 2.6.27 + OFED1.4 + nfs-utils 1.3 +  mthca. Does
> your mlx4 card work correctly independent of NFSRDMA? 
Yes, it works correctly; I have no other problems. To be sure, I ran
performance tests with qperf (bandwidth, latency)
and everything is OK. The IB link is connected back to back, with the same
ConnectX cards on both computers.

> Also, given later
> replies, I'm a little concerned about the mad issues you see. Please
> keep me updated. Thanks.
>
> -jeff
>   
Of course.
I will wait for Tom's results and keep you updated.

Céline.
>   
>> If I run "ibstat" after that, I get an ibpanic error message:
>> "ibpanic: [4826] main: stat of IB device 'mlx4_0' failed: (Device or resource busy)"
>> because the device is in use.
>>
>> ib_mad1 is using 100% CPU:
>> [root at test]top
>> top - 14:55:07 up 19 min,  3 users,  load average: 2.00, 1.87, 1.12
>> Tasks: 190 total,   2 running, 188 sleeping,   0 stopped,   0 zombie
>> Cpu(s):  0.0%us, 12.5%sy,  0.0%ni, 87.5%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
>> Mem:   8066156k total,   615096k used,  7451060k free,    45604k buffers
>> Swap:  8193140k total,        0k used,  8193140k free,   343436k cached
>>  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
>> 2952 root      15  -5     0    0    0 R  100  0.0   5:23.55 ib_mad1
>>    1 root      20   0 10320  688  572 S    0  0.0   0:02.04 init
>>    2 root      15  -5     0    0    0 S    0  0.0   0:00.00 kthreadd
>>    3 root      RT  -5     0    0    0 S    0  0.0   0:00.00 migration/0
>>    4 root      15  -5     0    0    0 S    0  0.0   0:00.01 ksoftirqd/0
>>
>>
>> I can't kill the mount process (with kill -9, shutdown -R, or
>> echo b > sysrq-trigger), and I have to restart the computer using
>> "ipmitool target chassis power reset".
>>
>> Any ideas?
>>
>> I also sometimes see this dmesg message:
>> mlx4_core 0000:01:00.0: HW2SW_MPT failed (-16).
>> (I don't think it is related to the mount bug.) I read that this
>> error can occur with old firmware versions, but mine is 2.5.9.
>> For more details, see the bug report:
>> https://bugs.openfabrics.org/show_bug.cgi?id=1459
>>
>> Thanks for your help.
>>
>> Céline Bourde.



