[openib-general] ibv_reg_mr failure with pvfs on ehca?
Troy Benjegerdes
troy at scl.ameslab.gov
Mon Oct 16 13:22:35 PDT 2006
I am running PVFS2 on OpenIB, with IBM's ehca.
When we start writing/reading large files, either with the NetPIPE
PVFS module we have or a modified GAMESS executable that uses
libpvfs2 directly, the 'ibv_reg_mr' function fails, and we get an error.
The failures coincide with kernel log messages like these:
Oct 16 11:14:45 p5l8 kernel: PU0003 000e0091:ehca_hcall_7arg_7ret HCAD_ERROR opcode=160 ret=fffffffffffffff7 arg1=1000000003000004 arg2=5 arg3=14f0ebc8 arg4=10000 arg5=e0000000000000 arg6=e3e9f200 arg7=0 out1=0 out2=0 out3=0 out4=0 out5=0 out6=0 out7=0
Oct 16 11:14:45 p5l8 kernel: PU0003 00090454:ehca_reg_mr HCAD_ERROR hipz_alloc_mr failed, h_ret=fffffffffffffff7 hca_hndl=1000000003000004
Oct 16 11:14:45 p5l8 kernel: PU0003 00090478:ehca_reg_mr <<< ret=ffffffea shca=c0000000e796b000 e_mr=c0000000d22c7d80 iova_start=0000000014f0ebc8 size=10000 acl=7 e_pd=c0000000e3e9f200 pginfo=c0000001ad37fa70 num_pages=11 num_4k=11
Oct 16 11:14:45 p5l8 kernel: PU0003 00090176:ehca_reg_user_mr <<< rc=ffffffffffffffea pd=c0000000e3e9f200 region=c0000000cb73a9d0 mr_access_flags=7 udata=c0000001ad37fba0
We are able to run on a 4x PCI-X Mellanox HCA, but obviously I'd like
to be using the 12x ehca.
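
For anyone reading the kernel log above, the return values are printed as raw hex and are easier to interpret as signed numbers. The following is a minimal sketch, not part of the ehca driver or PVFS2; the helper names to_signed and pages_spanned are made up for illustration. It assumes the size and num_pages fields are hexadecimal (the log does not say so explicitly) and that pages are 4 KiB, as the num_4k field suggests.

```python
import errno


def to_signed(hex_str, bits):
    """Interpret a hex string as a two's-complement integer of the given width."""
    value = int(hex_str, 16)
    if value >= 1 << (bits - 1):
        value -= 1 << bits
    return value


def pages_spanned(start, length, page_size=4096):
    """Count the page_size pages touched by the byte range [start, start + length)."""
    first = start // page_size
    last = (start + length - 1) // page_size
    return last - first + 1


if __name__ == "__main__":
    # Values copied from the log lines above.
    h_ret = to_signed("fffffffffffffff7", 64)   # hcall return code
    ret = to_signed("ffffffea", 32)             # ehca_reg_mr return code
    rc = to_signed("ffffffffffffffea", 64)      # ehca_reg_user_mr return code
    print("h_ret =", h_ret, " ret =", ret, " rc =", rc)

    # Map the negative kernel return code to its errno name on this platform.
    print("errno name for rc:", errno.errorcode.get(-rc))

    # Assumption: iova_start, size and num_pages are all hexadecimal.
    iova_start = int("14f0ebc8", 16)
    size = int("10000", 16)
    print("size in bytes:", size)
    print("4 KiB pages spanned:", pages_spanned(iova_start, size))
    print("num_pages from log:", int("11", 16))
```

Read this way, the hcall returns -9 and the driver hands -22 back up to ibv_reg_mr, which is EINVAL on Linux. As far as I can tell -9 is the value the Linux powerpc hvcall.h header names H_NO_MEM, but that should be checked against the header for the kernel in use. The page count also works out: a 64 KiB buffer that starts 0xbc8 bytes into a page touches 17 pages, which matches num_pages=11 if that field is hex.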