[ofa-general] Re: should it be possible to run SDP over a T320?

Rick Jones rick.jones2 at hp.com
Fri Jul 13 15:45:24 PDT 2007


> I am only interested in IPoIB at this moment for both ofed-1.1 and 
> ofed-1.2. Is the device PCI-X or PCI-e based?

Well, I guess that's better than "everything" :)  but it is still a triffle 
broad.  Anyway, I'll suppress my "sending to another .com" paranoia by remiding 
myself that all this is shipping :) and include the results here.

The device is PCIe.  lspci shows:

03:00.0 InfiniBand: Mellanox Technologies MT25208 InfiniHost III Ex (Tavor 
compatibility mode) (rev 20)

RHEL5 included OFED 1.1 bits:

rx2660 to rx2660, rhel5, AD313A, IPoIB and now SDP, same
sysctl settings, irqbalance killed to keep things from moving around,
interrups on the HCA now on cpu0 on one system and cpu 1 on the other

[here are the sysctl.conf settings:

[root at hpcpc107 ~]# tail /etc/sysctl.conf
# Controls the maximum number of shared memory segments, in pages
kernel.shmall = 2147483648


net.core.rmem_max = 2097152
net.core.wmem_max = 2097152
net.ipv4.tcp_wmem = 4096 87380 2097152
net.ipv4.tcp_rmem = 4096 87380 2097152
net.ipv4.conf.default.arp_ignore = 1
net.ipv4.conf.default.arp_filter = 1

]

[ the first number is the CPU to which netperf is bound, the second is the CPU 
to which netserver is bound.  the systems under test had _four_ cores, which 
means that when netperf reports 25% CPU util it means the equivalent of a full 
core was consumed etc etc ]

single-connection, unidirectional TCP_STREAM 1Mx64:
[root at hpcpc106 netperf2_work]# for i in 0 1; do for j in 0 1; do echo $i $j 
`netperf -P 0 -T $i,$j -c -C -H 192.168.0.107 -t TCP_STREAM -i 30,3 -l 30 -- -s 
1M -S 1M -m 64K`; done; done
0 0 2097152 2097152 65536 30.00 2382.94 25.00 37.29 3.438 5.128
0 1 2097152 2097152 65536 30.00 2315.88 19.97 25.03 2.826 3.542
1 0 2097152 2097152 65536 30.00 2974.46 40.10 41.25 4.418 4.544 *
1 1 2097152 2097152 65536 30.00 2358.11 27.39 25.01 3.807 3.476


[NOTE NOTE NOTE - the units here are still transactions per second! so, to get 
to mbit/s multiply by 2x65536x8 and divide by 1000000...  To get the service 
demand in usec per KB transferred, divide the service demand by 128 since that 
was the number of KB transferred per transaction]

single-connection, bidrectional TCP_RR 1Mx64x, ad313a hca in x8 slot
[root at hpcpc106 netperf2_work]# for i in 0 1 ; do for j in 0 1 ; do echo $i $j 
`netperf -P 0 -T $i,$j -c -C -H 192.168.0.107 -t TCP_RR -i 30,3 -l 30 -- -s 1M 
-S 1M -r 64K -b 12`; done; done
0 0 2097152 2097152 65536 65536 30.00 2485.29 25.01 31.45 402.595 506.196 
2097152 2097152
0 1 2097152 2097152 65536 65536 30.00 2414.18 23.38 23.33 387.354 386.507 
2097152 2097152
1 0 2097152 2097152 65536 65536 30.00 3368.20 38.75 38.72 460.153 459.788 
2097152 2097152 *
1 1 2097152 2097152 65536 65536 30.00 2504.03 31.54 25.05 503.753 400.236 
2097152 2097152

[NOTE NOTE NOTE - when netperf reports a confidence of 20.7% it means +/- 10.35%]

single-connection, single-byte, TCP_RR, ad313a hca in x8 slot:
[root at hpcpc106 netperf2_work]# for i in 0 1 ; do for j in 0 1 ; do echo $i $j 
`netperf -P 0 -T $i,$j -c -C -H 192.168.0.107 -t TCP_RR -i 30,3 -l 30 `; done; done
0 0 !!! WARNING !!! Desired confidence was not achieved within the specified 
iterations. !!! This implies that there was variability in the test environment 
that !!! must be investigated before going further. !!! Confidence intervals: 
Throughput : 0.1% !!! Local CPU util : 20.7% !!! Remote CPU util : 13.7% 87380 
87380 1 1 30.00 15743.40 4.84 10.08 12.293 25.610 87380 87380
0 1 !!! WARNING !!! Desired confidence was not achieved within the specified 
iterations. !!! This implies that there was variability in the test environment 
that !!! must be investigated before going further. !!! Confidence intervals: 
Throughput : 0.4% !!! Local CPU util : 59.3% !!! Remote CPU util : 51.1% 87380 
87380 1 1 30.00 19298.77 4.70 7.09 9.751 14.694 87380 87380
1 0 !!! WARNING !!! Desired confidence was not achieved within the specified 
iterations. !!! This implies that there was variability in the test environment 
that !!! must be investigated before going further. !!! Confidence intervals: 
Throughput : 0.2% !!! Local CPU util : 28.6% !!! Remote CPU util : 34.4% 87380 
87380 1 1 30.00 13016.11 6.15 6.57 18.912 20.195 87380 87380
1 1 !!! WARNING !!! Desired confidence was not achieved within the specified 
iterations. !!! This implies that there was variability in the test environment 
that !!! must be investigated before going further. !!! Confidence intervals: 
Throughput : 0.1% !!! Local CPU util : 8.7% !!! Remote CPU util : 23.4% 87380 
87380 1 1 30.00 15375.13 9.93 6.30 25.839 16.393 87380 87380

And now the OFED 1.2 bits I installed overtop of the 1.1 stuff which shipped 
with RHEL5

RHEL5 rx2660 to rx2660, AD313A, OFED 1.2 GA software, TCP_STREAM
1Mx64K. CPU 0 taking interrupts, IB switch in place:

[root at hpcpc106 ~]# for i in 0 1; do for j in 0 1; do echo $i $j `netperf -P 0 -T 
$i,$j -t TCP_STREAM -H 192.168.1.107 -c -C -l 30 -i 30,3 -- -s 1M -S 1M -m 64K`; 
done; done
0 0 2097152 2097152 65536 30.00 5227.08 6.19 25.00 0.388 1.568
0 1 2097152 2097152 65536 30.00 5449.90 6.47 26.77 0.389 1.610
1 0 2097152 2097152 65536 30.00 5235.90 6.70 25.01 0.420 1.565
1 1 2097152 2097152 65536 30.00 5511.77 7.16 26.80 0.426 1.593 *

RHEL5 rx2660 to rx2660, AD313A, OFED 1.2 GA software, bidirectional
TCP_RR 1Mx64Kx12, CPU 0 taking interrupts, IB switch in place:

[root at hpcpc106 ~]# for i in 0 1; do for j in 0 1; do echo $i $j `netperf -P 0 -T 
$i,$j -t TCP_RR -H 192.168.1.107 -c -C -l 30 -i 30,3 -- -s 1M -S 1M -r 64K -b 
12`; done; done
0 0 2097152 2097152 65536 65536 30.00 5314.44 16.13 16.08 121.431 121.049 
2097152 2097152
0 1 !!! WARNING !!! Desired confidence was not achieved within the specified 
iterations. !!! This implies that there was variability in the test environment 
that !!! must be investigated before going further. !!! Confidence intervals: 
Throughput : 0.3% !!! Local CPU util : 20.4% !!! Remote CPU util : 48.2% 2097152 
2097152 65536 65536 30.00 5384.71 17.24 23.42 128.082 174.245 2097152 2097152
1 0 2097152 2097152 65536 65536 30.00 5388.18 17.06 16.27 126.619 120.784 
2097152 2097152
1 1 !!! WARNING !!! Desired confidence was not achieved within the specified 
iterations. !!! This implies that there was variability in the test environment 
that !!! must be investigated before going further. !!! Confidence intervals: 
Throughput : 0.3% !!! Local CPU util : 45.3% !!! Remote CPU util : 0.3% 2097152 
2097152 65536 65536 30.00 5469.22 22.58 17.08 165.328 124.947 2097152 2097152 *

RHEL5 rx2660 to rx2660, AD313A, OFED 1.2 GA software, TCP_RR, CPU 0
taking interrupts, IB switch in place:

[root at hpcpc106 ~]# for i in 0 1; do for j in 0 1; do echo $i $j `netperf -P 0 -T 
$i,$j -t TCP_RR -H 192.168.1.107 -l 30 -i 30,3`; done; done
0 0 87380 87380 1 1 30.00 18990.16 87380 87380 *
0 1 87380 87380 1 1 30.00 14985.03 87380 87380
1 0 87380 87380 1 1 30.00 15045.17 87380 87380
1 1 87380 87380 1 1 30.00 12408.56 87380 87380

(I didn't bother asking for CPU util in the single-byte TCP_RR tests because I 
knew that the confidence intervals wouldn't be met and it would only lengthen 
the runtime)

Sorry that the confidence interval warnings make things hard to read there.

rick jones

> 
> Thanks
> Shirley




More information about the general mailing list