[openib-general] Detecting when an RDMA writer process disappears

Michael S. Tsirkin mst at mellanox.co.il
Sun Feb 4 06:02:49 PST 2007


> Quoting Or Gerlitz <ogerlitz at voltaire.com>:
> Subject: Re: Detecting when an RDMA writer process disappears
> 
> Mike Heffner wrote:
> > Is there any method by which a receiving process that is polling in 
> > preregistered memory regions for data from a sender performing RDMA 
> > writes, can detect if the sender is killed? Say by a SIGKILL signal? The 
> > RC connection is setup using the RDMA CM and there do not appear to be 
> > any CM events created on the event channel
> 
> If you have a process with connected RDMA CM ID whose associated peer 
> process died you should get DISCONNECTED event. how do you verify that 
> there is no rdma cm event present at the polling side?

You may or may not get this event in case of packet loss - same as with sockets.
Sending keepalives is really the only way if you want to handle all
cases such as remote node crash.

-- 
MST




More information about the general mailing list