[openib-general] mvapich2 ofed 1.2 problem

Steve Wise swise at opengridcomputing.com
Tue Feb 13 07:30:10 PST 2007


Hey Roland, 

Does this stack indicate that libibverbs is accessing a 1.0 provider?
cxgb3 shouldn't be 1.0 right?


Core was generated by `IMB_2.3/src/IMB-MPI1'.
Program terminated with signal 11, Segmentation fault.

...

(gdb) bt
#0  __ibv_alloc_pd (context=0x1) at src/verbs.c:143
#1  0x00002b832d4d4381 in __ibv_alloc_pd_1_0 (context=0x617830)
    at src/compat-1_0.c:572
#2  0x00002b832cfef04e in rdma_cm_init_pd_cq ()
   from /usr/local/ofed/mpi/gcc/mvapich2-0.9.8-3/lib/libmpich.so
#3  0x00002b832cfef415 in rdma_cm_create_qp ()
   from /usr/local/ofed/mpi/gcc/mvapich2-0.9.8-3/lib/libmpich.so
#4  0x00002b832cfefa37 in ib_cma_event_handler ()
   from /usr/local/ofed/mpi/gcc/mvapich2-0.9.8-3/lib/libmpich.so
#5  0x00002b832cfefcc0 in cm_thread ()
   from /usr/local/ofed/mpi/gcc/mvapich2-0.9.8-3/lib/libmpich.so
#6  0x0000003cd9406305 in start_thread () from /lib64/libpthread.so.0
#7  0x0000003cd88cd66d in clone () from /lib64/libc.so.6
#8  0x0000000000000000 in ?? ()
(gdb)  p *context
Cannot access memory at address 0x1
(gdb) up
#1  0x00002b832d4d4381 in __ibv_alloc_pd_1_0 (context=0x617830)
    at src/compat-1_0.c:572
572     src/compat-1_0.c: No such file or directory.
        in src/compat-1_0.c
(gdb) p *context
$1 = {device = 0x617100, ops = {
    query_device = 0x2b832dcf2bc0 <iwch_query_device>,
    query_port = 0x2b832dcf2ba0 <iwch_query_port>,
    alloc_pd = 0x2b832dcf2b30 <iwch_alloc_pd>,
    dealloc_pd = 0x2b832dcf2af0 <iwch_free_pd>,
    reg_mr = 0x2b832dcf29b0 <iwch_reg_mr>,
    dereg_mr = 0x2b832dcf2c30 <iwch_dereg_mr>,
    create_cq = 0x2b832dcf3050 <iwch_create_cq>,
    poll_cq = 0x2b832dcf1770 <t3b_poll_cq>,
    req_notify_cq = 0x2b832dcf10c0 <iwch_arm_cq>, cq_event = 0,
    resize_cq = 0x2b832dcf2870 <iwch_resize_cq>,
    destroy_cq = 0x2b832dcf2f50 <iwch_destroy_cq>,
    create_srq = 0x2b832dcf2880 <iwch_create_srq>,
    modify_srq = 0x2b832dcf2890 <iwch_modify_srq>, query_srq = 0,
    destroy_srq = 0x2b832dcf28a0 <iwch_destroy_srq>,
    post_srq_recv = 0x2b832dcf28b0 <iwch_post_srq_recv>,
    create_qp = 0x2b832dcf2d30 <iwch_create_qp>, query_qp = 0,
    modify_qp = 0x2b832dcf2900 <iwch_modify_qp>,
    destroy_qp = 0x2b832dcf3200 <iwch_destroy_qp>,
    post_send = 0x2b832dcf1fa0 <t3b_post_send>,
    post_recv = 0x2b832dcf2460 <t3b_post_recv>,
    create_ah = 0x2b832dcf28c0 <iwch_create_ah>,
    destroy_ah = 0x2b832dcf28d0 <iwch_destroy_ah>,
    attach_mcast = 0x2b832dcf28e0 <iwch_attach_mcast>,
    detach_mcast = 0x2b832dcf28f0 <iwch_detach_mcast>}, cmd_fd = 768552128,
  async_fd = 11139, num_comp_vectors = 8, real_context = 0x1}






More information about the general mailing list