[ewg] FW: complete test summary
Rupert Dance
rsdance at lampreynetworks.com
Sun Dec 14 16:13:38 PST 2008
Tziporet,
They have tested with same-vendor HCAs and the MPI tests are passing.
Rupert
-----Original Message-----
From: Tziporet Koren [mailto:tziporet at dev.mellanox.co.il]
Sent: Thursday, December 11, 2008 5:22 AM
To: Rupert Dance
Cc: ewg at lists.openfabrics.org; ofalab at postal.iol.unh.edu
Subject: Re: [ewg] FW: complete test summary
Rupert Dance wrote:
> Hello Tziporet,
>
> Here is the final UNH IOL summary report of testing done on RC6.
>
Many thanks
> Regarding the mandatory tests, the Link Init failure is a specific
> vendor issue and IPoIB failure has been documented in Bug 1287.
>
> The MPI failures in the Beta tests are being researched now. I have
> asked Jeff and Arlin Davis to look into these failures. UNH has noted
> that they only occur when the cluster includes HCAs from multiple
> vendors and when the number of processors exceeds 38. Jeff made the
> following comment "I'm not entirely surprised that OMPI fails when
> used with multiple vendor HCAs; I don't know if anyone has ever tested
> that before...? I would not make it a requirement for passing that
> OMPI has to work in a single MPI job with multiple vendor HCAs; I
> don't know of many (any?) real-world environments that do this."
>
Can you test with the same vendor's HCAs on all nodes and see whether this passes?
> Thanks
>
> Rupert
>
> -----Original Message-----
> From: Nickolas Wood [mailto:ndv2 at iol.unh.edu]
> Sent: Wednesday, December 10, 2008 9:41 AM
> To: Rupert Dance
> Cc: ofalab at postal.iol.unh.edu
> Subject: complete test summary
>
> Hi,
> It was my understanding that incremental status reports were
> acceptable regarding the rc6 testing. I have been told that they were
> not; therefore I have combined the previous emails into one for easier
> use.
>
> All the results below were gathered while using the complete,
> multi-vendor cluster with OFED 1.4 rc6 and the topology used during
> the debug event. This results in a 62-process MPI cluster.
>
> Mandatory test results:
> Link Init: FAIL - link speed issue
> Fabric init: pass
> IPoIB-Datagram: FAIL - initial packet loss
> iSER: NA - no iSER target to test against
> SRP: pass
> SDP: pass
>
> BETA tests completed:
> IPoIB-Connected: pass
> mvapich1: pingping, pingpong tests - pass
>     all tests - FAIL
> mvapich2: pingping, pingpong tests - pass
>     all tests - FAIL
> openmpi: pingping, pingpong tests - pass
>     all tests - FAIL
> intelmpi: pingping, pingpong tests - pass
>     all tests - FAIL
> hpmpi: all tests - FAIL
> dapltest: pass
>
> -Nick
>