[ofa-general] [NFS/RDMA] Can't mount NFS/RDMA partition
Jeff Becker
Jeffrey.C.Becker at nasa.gov
Wed Dec 17 09:51:44 PST 2008
Hi Celine.
Celine Bourde wrote:
> Hi,
>
> I can't mount an NFS/RDMA partition.
> I've applied
> http://www.openfabrics.org//downloads/OFED/ofed-1.4/OFED-1.4-docs/nfs-rdma.release-notes.txt
>
> instructions.
>
> Every steps (loading modules, /etc/exports implementation, starting
> nfs daemon,
> etc..) seems to be ok, but when I do the last command :
> mount -o rdma,port=2050 192.168.0.13:/export /tmp/nfs_client/
> the mount processus blocks even last dmesg output seems correct :
> "RPC: Registered rdma transport module.
> rpcrdma: connection to 192.168.0.13:2050 on mlx4_0, memreg 5 slots 32
> ird 16
> "
I've successfully tested 2.6.27 + OFED1.4 + nfs-utils 1.3 + mthca. Does
your mlx4 card work correctly independent of NFSRDMA? Also, given later
replies, I'm a little concerned about the mad issues you see. Please
keep me updated. Thanks.
-jeff
> If I try "ibstat" after that, I have a kernel panic message :
> "ibpanic: [4826] main: stat of IB device 'mlx4_0' failed: (Device or
> resource
> busy)" because device is in use.
>
> 100 % of processus is used by ib_mad1
> [root at test]top
> top - 14:55:07 up 19 min, 3 users, load average: 2.00, 1.87, 1.12
> Tasks: 190 total, 2 running, 188 sleeping, 0 stopped, 0 zombie
> Cpu(s): 0.0%us, 12.5%sy, 0.0%ni, 87.5%id, 0.0%wa, 0.0%hi,
> 0.0%si, 0.0%st
> Mem: 8066156k total, 615096k used, 7451060k free, 45604k buffers
> Swap: 8193140k total, 0k used, 8193140k free, 343436k cached
> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
> 2952 root 15 -5 0 0 0 R 100 0.0 5:23.55 ib_mad1
> 1 root 20 0 10320 688 572 S 0 0.0 0:02.04 init
> 2 root 15 -5 0 0 0 S 0 0.0 0:00.00 kthreadd
> 3 root RT -5 0 0 0 S 0 0.0 0:00.00 migration/0
> 4 root 15 -5 0 0 0 S 0 0.0 0:00.01 ksoftirqd/0
>
>
> I can't kill mount process (kill -9 or shutdown -R or echo b >
> sysrq-trigger)
> and I have to restart the computer using "ipmitool target chassis
> power reset".
>
> Have any idea ?
>
> Moreover, I sometimes have this dmesg log: mlx4_core 0000:01:00.0:
> HW2SW_MPT
> failed (-16). (I don't think there is an agreement with mount bug). I
> saw this
> error could be occured with old firmeware version but mine is 2.5.9 ..
> For more details see bug report :
> https://bugs.openfabrics.org/show_bug.cgi?id=1459
>
> Thanks for your help.
>
> CĂ©line Bourde.
>
>
>
>
>
> _______________________________________________
> general mailing list
> general at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
>
> To unsubscribe, please visit
> http://openib.org/mailman/listinfo/openib-general
More information about the general
mailing list