[ewg] FW: complete test summary

Tziporet Koren tziporet at dev.mellanox.co.il
Thu Dec 11 02:21:56 PST 2008


Rupert Dance wrote:
> Hello Tziporet,
>
> Here is the final UNH IOL summary report of testing done on RC6. 
>   
Many thanks
> Regarding the mandatory tests, the Link Init failure is a specific vendor
> issue and IPoIB failure has been documented in Bug 1287. 
>
> The MPI failures in the Beta tests are being research now. I have asked Jeff
> and Arlin Davis to look into these failures. UNH has noted that it only
> occurs when the cluster includes HCA from multiple vendors and when the
> number of processors exceeds 38. Jeff made the following comment "I'm not
> entirely surprised that OMPI fails when used with multiple vendor HCAs; I
> don't know if anyone has ever tested that before...?  I would not make it a
> requirement for passing that OMPI has to work in a single MPI job with
> multiple vendor HCAs; I don't know of many (any?) real-world environments
> that do this."
>   
Can you test with same vendor HCA on all nodes and see this is passing
> Thanks
>
> Rupert
>
> -----Original Message-----
> From: Nickolas Wood [mailto:ndv2 at iol.unh.edu] 
> Sent: Wednesday, December 10, 2008 9:41 AM
> To: Rupert Dance
> Cc: ofalab at postal.iol.unh.edu
> Subject: complete test summary
>
> Hi,
>     It was my understanding that incremental status reports were acceptable
> regarding the rc6 testing. I have been told that it was not, there fore I
> have combined the previous emails into one for easier use.
>
>     All the below results were gathered while using the complete, multi
> vendor cluster with ofed 1.4 rc6 and the topology used during the debug
> event. This results in a 62 process mpi cluster.
>
> Mandatory test results:
>    Link Init: FAIL - link speed issue
>    Fabric init: pass
>    IPoIB-Datagram: FAIL - initial packet loss
>    iSER: NA - no iSER target to test against
>    SRP: pass
>    SDP: pass
>
> BETA tests completed:
>    IPoIB-Connected: pass
>    mvapich1: pingping, pingpong tests - pass
>                                             all tests - FAIL
>    mvapich2: pingping, pingpong tests - pass
>                                             all tests - FAIL
>    openmpi:   pingping, pingpong tests - pass
>                                              all tests - FAIL
>    intelmpi:     pingping, pingpong tests - pass
>                                              all tests - FAIL
>    hpmpi:                                all tests - FAIL
>    dapltest: pass
>
> -Nick
>
>
> _______________________________________________
> ewg mailing list
> ewg at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg
>
>   




More information about the ewg mailing list