[ofa-general] Installing openIB on Linux FC5

Hossein Pourreza pourreza at cs.umanitoba.ca
Tue Jun 5 15:04:28 PDT 2007


Hi all,

I am new to infiniband stuff and am trying to configure an infiniband-based
cluster using Linux FC 5. I downloaded the OFED-1.0 and tried to install it on
cluster nodes. Now I can load the kernel modules without any error but I cannot
run a simple test like ibv_ud_pingpong to check the connectivity of nodes in
user-level.

I loaded the following devices:

ib_umad                25713  0 
ib_ucm                 26569  0 
ib_cm                  42521  1 ib_ucm
ib_uverbs              47889  1 ib_ucm
ib_mthca              133445  0 
ib_ipoib               61361  0 
ib_sa                  25341  2 ib_cm,ib_ipoib
ib_mad                 46969  4 ib_umad,ib_cm,ib_mthca,ib_sa
ib_core                63809  8 ib_umad,ib_ucm,ib_cm,ib_uverbs,ib_mthca,ib_ipoib,ib_sa,ib_mad

Also I have the following devices in /dev/infiniband

crw-rw---- 1 root root 231,  64 Jun  4 14:54 issm0
crw-rw---- 1 root root 231,  65 Jun  4 14:54 issm1
crw-rw---- 1 root root 231, 224 Jun  4 14:34 ucm0
crw-rw---- 1 root root 231,   0 Jun  4 14:54 umad0
crw-rw---- 1 root root 231,   1 Jun  4 14:54 umad1
crw-rw-rw- 1 root root 231, 192 Jun  4 14:34 uverbs0

ibroute shows all nodes and the switch and everything looks fine.
When I run ibv_ud_pingpong on the two nodes I am getting the following messages:

node 1 (server):
local address:  LID 0x0002, QPN 0x150406, PSN 0xb3a00d
remote address: LID 0x0003, QPN 0x0c0406, PSN 0x8f0f99


node 2 (client):
local address:  LID 0x0003, QPN 0x0c0406, PSN 0x8f0f99
remote address: LID 0x0002, QPN 0x150406, PSN 0xb3a00d

There is no message after these two lines. I am wondering if they are sending
any packets or not. I should say that although I have given ip addresses to
infiniband cards (ib0) they cannot ping each other using the normal Linux ping
tool.

Here is the result of ifconfig on these nodes:

node 1 (server)

ib0       Link encap:InfiniBand  HWaddr 00:00:04:04:00:00:00:00:00:00:00:00:00:00:00:00:00:00:00:00  
          inet addr:172.16.28.61  Bcast:172.16.255.255  Mask:255.255.0.0
          UP BROADCAST MULTICAST  MTU:2044  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)

node 2 (client):
ib0       Link encap:InfiniBand  HWaddr 00:00:04:04:00:00:00:00:00:00:00:00:00:00:00:00:00:00:00:00  
          inet addr:172.16.28.62  Bcast:172.16.255.255  Mask:255.255.0.0
          UP BROADCAST MULTICAST  MTU:2044  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)

Any help will be greatly appreciated.

Hossein

-- 
Hossein Pourreza		 			mail:<pourreza at cs.umanitoba.ca>    
Department of Computer Science		URL: http://www.cs.umanitoba.ca/~pourreza
University of Manitoba  			Phone: 204-488-5611            
Winnipeg, Manitoba, Canada R3T 2N2




More information about the general mailing list