[openib-general] rdma_lat vs. perf_main

yipee yipeeyipeeyipeeyipee at yahoo.com
Thu Sep 15 06:21:59 PDT 2005


Hi, 
 
I'm trying to measure rdma write latencies. 
The platforms I'm using are a pair of 3GHz 64bit Xeons, 2GB ram, 2.6.13 vanilla 
kernel machines. 
The two machines are connected back-to-back through PCIe memfree DDR Mellanox 
HCA's (OpenSM handles the initialization of the "fabric"). 
 
When using the rdma_lat utility (gen2/trunk/src/userspace/perftest/rdma_lat/) 
to measure 4K rdma write latencies I get 22.8 usec (one way). 
Notice that I removed the IBV_SEND_INLINE flag from rdma_lat.c:376 in order to 
be able to send large non-inlined data. 
Also ".max_inline_data=size" was changed to ".max_inline_data=128" (line 342) 
otherwise the qp creation fails. 
 
When trying the perf_main utility from Mellanox 4.1 stack 
(vapi-linux-4_1_0.tgz) I get latencies of 12.8 usec. 
 
Is there a reason for the major latency differences? Do the two programs do 
rdma write latency tests in a fundamentally different way? 
Am I doing something wrong? 
 
 
The commands I used for rdma_lat are 
10.100.1.130] ./rdma_lat 
 
10.100.1.129] ./rdma_lat -s 4096 10.100.1.130 
 
 
The commands for perf_main are: 
10.100.1.129] perf_main --send --test=lat --rdma=write --size=4096 --iter=1000 
-a10.100.1.130 
10.100.1.130] perf_main --server 
 
 
 
Thanks, 
y 
 
 
 




More information about the general mailing list