[openib-general] Detecting when an RDMA writer process disappears

Mike Heffner mike.heffner at evergrid.com
Thu Feb 1 21:10:09 PST 2007


Is there any way for a receiving process, which polls preregistered 
memory regions for data from a sender performing RDMA writes, to detect 
that the sender has been killed, for example by SIGKILL? The RC 
connection is set up using the RDMA CM, but no CM events appear on the 
event channel, and no asynchronous events appear to be generated either. 
Occasionally, depending on where the communication flow is, I get a 
failed CQE on the QP that I can use to mark the connection as failed, 
but this happens only about 50% of the time.

An alternative would be to send periodic "keep-alives" and detect the 
resulting CQE failure, but I'd like to know whether there are any other 
options that don't require sending keep-alives.
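
To make the keep-alive idea concrete, here is a rough sketch of the 
bookkeeping the polling loop might do. It is only an illustration under 
stated assumptions: the names (struct keepalive, keepalive_init, 
keepalive_tick, enum ka_completion) and the interval are hypothetical, 
and the code deliberately does not call the verbs API. The caller is 
assumed to post the probe itself (for example with ibv_post_send) and to 
translate what it sees from ibv_poll_cq into KA_OK or KA_ERROR.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* What the caller saw on the CQ for the outstanding probe (hypothetical). */
enum ka_completion { KA_NONE, KA_OK, KA_ERROR };

/* Hypothetical keep-alive state kept per connection. */
struct keepalive {
    uint64_t interval_ms;  /* how often to probe the peer (assumed value) */
    uint64_t last_sent_ms; /* time the last probe was posted */
    bool in_flight;        /* a probe is posted but not yet completed */
    bool peer_failed;      /* set once a probe completes in error */
};

void keepalive_init(struct keepalive *ka, uint64_t interval_ms,
                    uint64_t now_ms)
{
    assert(ka != NULL && interval_ms > 0);
    ka->interval_ms = interval_ms;
    ka->last_sent_ms = now_ms;
    ka->in_flight = false;
    ka->peer_failed = false;
}

/*
 * Call once per pass of the polling loop. Returns true when the caller
 * should post a new probe now. Sets ka->peer_failed when the outstanding
 * probe completed in error, after which no further probes are requested.
 */
bool keepalive_tick(struct keepalive *ka, uint64_t now_ms,
                    enum ka_completion completion)
{
    if (ka->peer_failed)
        return false;

    if (ka->in_flight && completion == KA_ERROR) {
        /* The probe failed: mark the connection as failed. */
        ka->peer_failed = true;
        ka->in_flight = false;
        return false;
    }

    if (ka->in_flight && completion == KA_OK)
        ka->in_flight = false;

    /* Only one probe is outstanding at a time. */
    if (!ka->in_flight && now_ms - ka->last_sent_ms >= ka->interval_ms) {
        ka->in_flight = true;
        ka->last_sent_ms = now_ms;
        return true;
    }
    return false;
}
```

The receiver would call keepalive_tick() alongside its memory polling, 
post a probe whenever it returns true, and tear down the connection once 
peer_failed is set. The detection latency is then bounded by the probe 
interval plus however long the QP takes to report the error.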


Thanks,

Mike

-- 

   Mike Heffner <mike.heffner at evergrid.com>
   EverGrid Software
   Blacksburg, VA USA

   Voice: (540) 443-3500 #603



