[ewg] Re: [ofa-general] possible mvapich2 problem
Steve Wise
swise at opengridcomputing.com
Wed Mar 7 15:23:19 PST 2007
Um,
Please ignore this email.
This turned out to be a problem in the test case itself...
:-\
Steve.
On Wed, 2007-03-07 at 16:57 -0600, Steve Wise wrote:
> Hey Shaun,
>
> I have an MPI test program that detects buffer corruption when run
> on mvapich2-0.9.8-5. The same program works on mvapich2-0.9.8-4. The
> corruption happens over IB as well as iWARP, on alpha libs and a recent
> set of kernel modules from ofa 1.2.
>
> At this point in this (complicated) test, all ranks enter
> MPI_Bcast(). The root rank, which is sending the data, checksums part of
> the data buffer before entering MPI_Bcast() and, if the call returns
> without error, checksums it again afterwards to validate that the data
> in the send buffer wasn't corrupted. The buffer checksum differs after
> the bcast, so somehow the data in the buffer was altered, presumably by
> the MPI layer (but I don't know that yet).
>
> Have y'all seen this problem? Maybe it was fixed in -6? I'm going to
> try to reduce this to a simple test, but I wanted to see if this is a
> known mvapich2 problem with the 0.9.8-5 release.
>
> Steve.
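
For readers following the thread, the check described in the quoted message (checksum part of the root's send buffer, broadcast, checksum again) can be sketched roughly as below. This is not the original test program, which was not posted. The names checksum(), checked_bcast(), bcast_ok() and bcast_corrupting() are hypothetical, and the collective is abstracted behind a function pointer so the sketch compiles without an MPI installation. In a real test on the root rank the callback would wrap something like MPI_Bcast(buf, count, MPI_BYTE, root, MPI_COMM_WORLD).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: simple polynomial checksum over the first n bytes. */
static uint32_t checksum(const unsigned char *buf, size_t n)
{
    uint32_t sum = 0;
    for (size_t i = 0; i < n; i++)
        sum = sum * 31u + buf[i];
    return sum;
}

/* Stand-in for the collective call (assumption: returns 0 on success,
 * like MPI_SUCCESS). A real test would call MPI_Bcast() here. */
typedef int (*bcast_fn)(void *buf, int count);

/* Checksum the first nsum bytes of the send buffer, run the broadcast,
 * then checksum again.
 * Returns 0 if the buffer is unchanged, 1 if it changed,
 * and -1 if the broadcast itself reported an error. */
static int checked_bcast(bcast_fn bcast, unsigned char *buf, int count,
                         size_t nsum)
{
    assert(count >= 0 && nsum <= (size_t)count);

    uint32_t before = checksum(buf, nsum);
    if (bcast(buf, count) != 0)
        return -1;                 /* only validate when there was no error */
    uint32_t after = checksum(buf, nsum);

    return before == after ? 0 : 1;
}

/* Fake broadcast that leaves the root's buffer alone. */
static int bcast_ok(void *buf, int count)
{
    (void)buf;
    (void)count;
    return 0;
}

/* Fake broadcast that flips bits in the first byte, to show detection. */
static int bcast_corrupting(void *buf, int count)
{
    if (count > 0)
        ((unsigned char *)buf)[0] ^= 0xFF;
    return 0;
}
```

A check like this only compares two snapshots of the buffer, so it cannot tell whether the MPI layer or the test itself modified the data in between, which is consistent with the follow-up above.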