[libfabric-users] connection-less send/recv with verbs
Maurizio Drocco
drocco at di.unito.it
Fri Jul 14 12:24:47 PDT 2017
> Now the problem is that, with the following code:
> ========================================
> ret = fi_send(ep, tx_buf, 32, NULL, to, ctx);
> assert(!ret);
> while(true) {
> ret = fi_cq_read(cq, &comp, 1);
> /*…*/
> }
> ========================================
> no completion event is ever found in the queue (fi_cq_read always returns -FI_EAGAIN).
>
> With gdb, I noticed that the functions used by fi_send and fi_cq_read are, respectively:
> - fi_ibv_rdm_send
> - fi_ibv_rdm_tagged_cq_read
>
> Do you think there can be some other issue related to modes/capabilities?
Let me add some details that may help narrow this down.
I ran some basic tests on the two InfiniBand clusters I have access to.
The following libfabric test completes successfully:
fi_pingpong -p verbs -e rdm & fi_pingpong -p verbs -e rdm localhost
Whereas the following fabtests test just stalls on the first call to fi_cq_read (in the very same manner as my code):
fi_rdm_pingpong -p verbs & fi_rdm_pingpong -p verbs localhost
I am getting confused :)
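To tell a real stall apart from an error that the loop silently ignores, the polling loop can be bounded and made to treat -FI_EAGAIN differently from any other negative return. Below is a minimal sketch of that idea. It deliberately does not include the libfabric headers: cq_read_fn, poll_cq_bounded and stub_read are hypothetical names introduced only for illustration, and the callback merely mimics the fi_cq_read return convention (positive count on success, -EAGAIN when the queue is empty, another negative value on error). In real code the callback would wrap fi_cq_read, and on -FI_EAVAIL the error entry would be fetched with fi_cq_readerr.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical callback type mimicking fi_cq_read's return convention:
 * > 0 = number of completions read, -EAGAIN = queue empty,
 * any other negative value = error. */
typedef long (*cq_read_fn)(void *cq, void *buf, size_t count);

enum poll_result { POLL_DONE = 0, POLL_TIMEOUT = 1, POLL_ERROR = 2 };

/* Poll at most max_polls times for one completion.
 * On POLL_ERROR the failing return code is stored in *err_out (if non-NULL). */
static enum poll_result poll_cq_bounded(cq_read_fn read_fn, void *cq,
                                        void *entry, unsigned long max_polls,
                                        long *err_out)
{
    assert(read_fn != NULL);
    for (unsigned long i = 0; i < max_polls; i++) {
        long ret = read_fn(cq, entry, 1);
        if (ret > 0)
            return POLL_DONE;           /* a completion was read */
        if (ret == 0 || ret == -EAGAIN)
            continue;                   /* nothing yet, keep polling */
        if (err_out != NULL)
            *err_out = ret;             /* real error: report it, do not spin */
        return POLL_ERROR;
    }
    return POLL_TIMEOUT;                /* still empty after max_polls reads */
}

/* Stand-in for a completion queue, for illustration only (not libfabric):
 * returns -EAGAIN for empty_reads calls, then final_ret. */
struct stub_cq {
    unsigned long empty_reads;
    long final_ret;
};

static long stub_read(void *cq, void *buf, size_t count)
{
    struct stub_cq *s = cq;
    (void)buf;
    (void)count;
    if (s->empty_reads > 0) {
        s->empty_reads--;
        return -EAGAIN;
    }
    return s->final_ret;
}
```

With a helper along these lines, a POLL_TIMEOUT result would correspond to the behaviour described above (the queue stays empty), whereas a POLL_ERROR result would point to a failure reported by the provider rather than to a missing completion.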
Maurizio
---
Maurizio Drocco
PhD Candidate
University of Torino, department of Computer Science
Via Pessinetto 12, 10149 Torino - Italy