[openib-general] MVAPICH failure on SGI Altix SLES10

John Partridge johnip at sgi.com
Fri Jun 16 13:51:07 PDT 2006


Thank You Boris that seems to have fixed it.

Regards
John


Boris Shpolyansky wrote:
> Hi John,
> 
> Most probably you need to upgrade the FW on your HCAs.
> See the following section from MVAPICH 0.9.7 User Guide:
> 
> 7.2.5 Couldn't modify SRQ limit
> 
> This means that your HCA card doesn't support the ibv_modify_srq
> feature. Please upgrade
> the firmware version and OpenIB Gen2 libraries on your cluster. You can
> obtain the latest
> Mellanox firmware images from this webpage.
> Even after updating your firmware and OpenIB Gen2 libraries, you
> continue to experience
> this problem, please edit make.mvapich.gcc and replace -DMEMORY_SCALE
> with
> -DADAPTIVE_RDMA_FAST_PATH. After making this change you need to re-build
> the MVAPICH
> library. Note that you should first try to update your firmware and
> OpenIB Gen2 libraries
> before taking this measure.
> If you believe that your HCA supports this feature, yet you are
> experiencing this problem,
> please contact the MVAPICH community at
> mvapich-discuss at cse.ohio-state.edu. 
> 
> Regards,
> Boris Shpolyansky
> Application Engineer
> Mellanox Technologies Inc.
> 2900 Stender Way
> Santa Clara, CA 95054
> Tel.: (408) 916 0014
> Fax: (408) 970 3403
> Cell: (408) 834 9365
> www.mellanox.com
> 
> 
> -----Original Message-----
> From: openib-general-bounces at openib.org
> [mailto:openib-general-bounces at openib.org] On Behalf Of John Partridge
> Sent: Friday, June 16, 2006 12:51 PM
> To: openib-general at openib.org
> Subject: [openib-general] MVAPICH failure on SGI Altix SLES10
> 
> I am trying to run the example from MPI_README.txt (and other MPI apps
> like pallas), but I keep getting a Couldn't modify SRQ limit error
> message :-
> 
> mig129:~/OFED-1.0-pre1 #
> /usr/local/ofed/mpi/gcc/mvapich-0.9.7-mlx2.1.0/bin/mpirun_rsh -rsh -np 2
> -hostfile /root/cluster
> /usr/local/ofed/mpi/gcc/mvapich-0.9.7-mlx2.1.0/tests/osutests-1.0/bw
> 1000 16 [1] Abort: Couldn't modify SRQ limit
>   at line 995 in file viainit.c
> mpirun_rsh: Abort signaled from [1]
> [0] Abort: [mig125:0] Got completion with error, code=12
>   at line 2143 in file viacheck.c
> done.
> 
> I am using OFED-1.0-pre1 (kernel modules are from OFED-1.0-pre1 also) OS
> is SLES10 SUSE Linux Enterprise Server 10 (ia64) VERSION = 10
> 
> HW is SGI Altix ia64
> 
> Can anyone help please ?
> 
> Thanks
> John
> 
> --
> John Partridge
> 
> Silicon Graphics Inc
> Tel:  651-683-3428
> Vnet: 233-3428
> E-Mail: johnip at sgi.com
> 
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
> 
> To unsubscribe, please visit
> http://openib.org/mailman/listinfo/openib-general
> 


-- 
John Partridge

Silicon Graphics Inc
Tel:  651-683-3428
Vnet: 233-3428
E-Mail: johnip at sgi.com




More information about the general mailing list