[openfabrics-ewg] MVAPICH on PCI-X fails with [0] Abort: Couldn'tmodify SRQ limit
Aviram Gutman
aviram at mellanox.co.il
Tue May 2 07:51:01 PDT 2006
Thanks. We are out today and tomorrow but we will take a look at it.
Aviram
Regards,
Aviram
-----Original Message-----
From: openfabrics-ewg-bounces at openib.org
To: Openfabrics-ewg at openib.org
Sent: Tue May 02 04:36:04 2006
Subject: [openfabrics-ewg] MVAPICH on PCI-X fails with [0] Abort: Couldn'tmodify SRQ limit
Configuration:- RHEL4U3,ia32,rc3,PCI-X
I have been seeing the following error when I try to run MVAPICH test programs
----
[0] Abort: Couldn't modify SRQ limit
at line 999 in file viainit.c
-----
On debugging the issue it seems that MVAPICH is being incorrectly compiled for the PCI_EX cards rather than PCI_X cards. From the MPI code it seems that PCI-X cards do not support modifications of SRQs. The source of the problem lies in a bug in the mvapich.make script(new IBED addition) which incorrectly always returns the card as PCI_EX . I have appended a diff with the fix. I am not sure as to how the patch can be correctly generated as the whole MVAPICH is in the form of a zipped tarball.
There seems to be one more issue in the script where it deviates from the MVAPICH build. The script is treating the older PCI_EX cards(cards with the lspci signature of 15b3:6278) as PCI-X. Was this done because these cards also don't support resizing SRQs?
-Amit
-------------------
diff -u mvapich.make.old mvapich.make
--- mvapich.make.old 2006-05-02 15:32:11.000000000 +0530
+++ mvapich.make 2006-05-02 15:33:37.000000000 +0530
@@ -251,13 +251,13 @@
DEF_BUILDID="$DEF_BUILDID"
fi
fi
-if (/sbin/lspci -n | grep "15b3:6282" | wc -l | tr -d '[:space:]'); then
+if (test `/sbin/lspci -n | grep "15b3:6282" | wc -l | tr -d '[:space:]'` -gt 0); then
# Arbel
CFLAGS="$CFLAGS -D_PCI_EX_"
-elif (/sbin/lspci -n | grep -E "15b3:5e8c|15b3:6274" | wc -l | tr -d '[:space:]'); then
+elif (test `/sbin/lspci -n | grep -E "15b3:5e8c|15b3:6274" | wc -l | tr -d '[:space:]'` -gt 0); then
# Sinai
CFLAGS="$CFLAGS -D_PCI_EX_"
-elif (/sbin/lspci -n | grep -E "15b3:5a44|15b3:6278" | wc -l | tr -d '[:space:]'); then
+elif (test `/sbin/lspci -n | grep -E "15b3:5a44|15b3:6278" | wc -l | tr -d '[:space:]'` -gt 0); then
# Tavor
CFLAGS="$CFLAGS -D_PCI_X_"
fi
-----------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/ewg/attachments/20060502/4b256a82/attachment.html>
More information about the ewg
mailing list