[openib-general] MVAPICH failure on SGI Altix SLES10
John Partridge
johnip at sgi.com
Fri Jun 16 13:51:07 PDT 2006
Thank You Boris that seems to have fixed it.
Regards
John
Boris Shpolyansky wrote:
> Hi John,
>
> Most probably you need to upgrade the FW on your HCAs.
> See the following section from MVAPICH 0.9.7 User Guide:
>
> 7.2.5 Couldn't modify SRQ limit
>
> This means that your HCA card doesn't support the ibv_modify_srq
> feature. Please upgrade
> the firmware version and OpenIB Gen2 libraries on your cluster. You can
> obtain the latest
> Mellanox firmware images from this webpage.
> Even after updating your firmware and OpenIB Gen2 libraries, you
> continue to experience
> this problem, please edit make.mvapich.gcc and replace -DMEMORY_SCALE
> with
> -DADAPTIVE_RDMA_FAST_PATH. After making this change you need to re-build
> the MVAPICH
> library. Note that you should first try to update your firmware and
> OpenIB Gen2 libraries
> before taking this measure.
> If you believe that your HCA supports this feature, yet you are
> experiencing this problem,
> please contact the MVAPICH community at
> mvapich-discuss at cse.ohio-state.edu.
>
> Regards,
> Boris Shpolyansky
> Application Engineer
> Mellanox Technologies Inc.
> 2900 Stender Way
> Santa Clara, CA 95054
> Tel.: (408) 916 0014
> Fax: (408) 970 3403
> Cell: (408) 834 9365
> www.mellanox.com
>
>
> -----Original Message-----
> From: openib-general-bounces at openib.org
> [mailto:openib-general-bounces at openib.org] On Behalf Of John Partridge
> Sent: Friday, June 16, 2006 12:51 PM
> To: openib-general at openib.org
> Subject: [openib-general] MVAPICH failure on SGI Altix SLES10
>
> I am trying to run the example from MPI_README.txt (and other MPI apps
> like pallas), but I keep getting a Couldn't modify SRQ limit error
> message :-
>
> mig129:~/OFED-1.0-pre1 #
> /usr/local/ofed/mpi/gcc/mvapich-0.9.7-mlx2.1.0/bin/mpirun_rsh -rsh -np 2
> -hostfile /root/cluster
> /usr/local/ofed/mpi/gcc/mvapich-0.9.7-mlx2.1.0/tests/osutests-1.0/bw
> 1000 16 [1] Abort: Couldn't modify SRQ limit
> at line 995 in file viainit.c
> mpirun_rsh: Abort signaled from [1]
> [0] Abort: [mig125:0] Got completion with error, code=12
> at line 2143 in file viacheck.c
> done.
>
> I am using OFED-1.0-pre1 (kernel modules are from OFED-1.0-pre1 also) OS
> is SLES10 SUSE Linux Enterprise Server 10 (ia64) VERSION = 10
>
> HW is SGI Altix ia64
>
> Can anyone help please ?
>
> Thanks
> John
>
> --
> John Partridge
>
> Silicon Graphics Inc
> Tel: 651-683-3428
> Vnet: 233-3428
> E-Mail: johnip at sgi.com
>
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
>
> To unsubscribe, please visit
> http://openib.org/mailman/listinfo/openib-general
>
--
John Partridge
Silicon Graphics Inc
Tel: 651-683-3428
Vnet: 233-3428
E-Mail: johnip at sgi.com
More information about the general
mailing list