[ofa-general] RE: [ewg] /dev/infiniband/rdma_cm not created
Woodruff, Robert J
robert.j.woodruff at intel.com
Wed May 13 11:39:15 PDT 2009
Is the driver loaded ? ie., do an /sbin/lsmod to see.
Also are there any messages that would indicate a
problem when you do a dmesg.
-----Original Message-----
From: ewg-bounces at lists.openfabrics.org [mailto:ewg-bounces at lists.openfabrics.org] On Behalf Of Jeff Squyres
Sent: Wednesday, May 13, 2009 11:34 AM
To: OpenFabrics General; OpenFabrics EWG
Subject: [ewg] /dev/infiniband/rdma_cm not created
I'm running on rhel4u6 with the 1.4.1 nightly from last night and
sometimes /dev/infiniband/rdma_cm is not created. I can see its entry
in /etc/udev/rules.d/90-ib.rules:
KERNEL="umad*", NAME="infiniband/%k"
KERNEL="issm*", NAME="infiniband/%k"
KERNEL="ucm*", NAME="infiniband/%k", MODE="0666"
KERNEL="uverbs*", NAME="infiniband/%k", MODE="0666"
KERNEL="ucma", NAME="infiniband/%k", MODE="0666"
KERNEL="rdma_cm", NAME="infiniband/%k", MODE="0666"
But only some of these are created:
[11:29] svbu-mpi005:/etc/udev/rules.d % l /dev/infiniband/
total 0
drwxr-xr-x 2 root root 120 May 13 02:39 ./
drwxr-xr-x 10 root root 5740 May 13 09:39 ../
crw------- 1 root root 231, 64 May 13 02:39 issm0
crw------- 1 root root 231, 0 May 13 02:39 umad0
crw-rw-rw- 1 root root 231, 192 May 13 02:39 uverbs0
crw-rw-rw- 1 root root 231, 193 May 13 02:39 uverbs1
[11:29] svbu-mpi005:/etc/udev/rules.d %
I have both an IB HCA and an iWARP RNIC in this server:
hca_id: mthca0
fw_ver: 1.2.917
node_guid: 0005:ad00:0008:bd60
sys_image_guid: 0005:ad00:0100:d050
vendor_id: 0x05ad
vendor_part_id: 25204
hw_ver: 0xA0
board_id: MT_03B0120002
phys_port_cnt: 1
port: 1
state: PORT_ACTIVE (4)
max_mtu: 2048 (4)
active_mtu: 2048 (4)
sm_lid: 2
port_lid: 34
port_lmc: 0x00
hca_id: nes0
node_guid: 0012:5502:b58c:0000
sys_image_guid: 0012:5502:b58c:0000
vendor_id: 0x1255
vendor_part_id: 256
hw_ver: 0x5
board_id: NES020 Board ID
phys_port_cnt: 1
port: 1
state: PORT_ACTIVE (4)
max_mtu: 2048 (4)
active_mtu: 2048 (4)
sm_lid: 0
port_lid: 1
port_lmc: 0x00
I don't see any obvious errors occurring in syslog or dmesg.
What could cause this failure?
--
Jeff Squyres
Cisco Systems
_______________________________________________
ewg mailing list
ewg at lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg
More information about the general
mailing list