[ofa-general] opensm hang and osmtest report ERR 0130

Wen Hao Wang wangwhao at cn.ibm.com
Wed Aug 6 20:20:53 PDT 2008


Hi Yevgeny,

Thanks for your answer.

>This part is OK - opensm enters the stand-by state and
>waits in this state indefinitely. This happened because
>opensm detects other opensm in the subnet.
>If you kill that other opensm, the stand-by opensm will
>enter MASTER state after a short period.
>You can see who's the master opensm in your subnet by
>running 'sminfo' tool.

Here is the output of sminfo:
[root@gaia-07 nodedef]# sminfo
sminfo: sm lid 2 sm guid 0x5ad0000094038, activity count 4999946 priority 10 state 3 SMINFO_MASTER
[root@gaia-07 nodedef]# ibnetdiscover |grep 0x5ad0000094038
switchguid=0x5ad0000094038(5ad0000094038)
[root@gaia-07 nodedef]# ibnetdiscover |grep "lid 2"
Switch  24 "S-0005ad0000094038"         # "Topspin Switch" enhanced port 0 lid 2 lmc 0
[1](2c903000134f5)      "S-0005ad0000094038"[13]                # lid 7 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f1040398b9f1)      "S-0005ad0000094038"[11]                # lid 8 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f104039955a5)      "S-0005ad0000094038"[10]                # lid 6 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f10403995879)      "S-0005ad0000094038"[9]         # lid 10 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f1040398ba19)      "S-0005ad0000094038"[7]         # lid 9 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f10403995861)      "S-0005ad0000094038"[4]         # lid 5 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f10403995875)      "S-0005ad0000094038"[3]         # lid 4 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](2c90300013371)      "S-0005ad0000094038"[14]                # lid 3 lmc 0 "Topspin Switch" lid 2 4xSDR

It seems the Cisco switch has a subnet manager running.
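
For anyone who wants to script this check, here is a minimal sketch that pulls the master SM's LID, GUID and state out of the sminfo summary line. The regular expression is only inferred from the single line of output shown above, and parse_sminfo is a hypothetical helper name, not part of any OFED tool; other sminfo versions may print a different layout.

```python
import re

# Pattern inferred from the one sminfo summary line shown above (assumption:
# other sminfo versions may format this line differently).
SMINFO_RE = re.compile(
    r"sm lid (?P<lid>\d+) sm guid (?P<guid>0x[0-9a-fA-F]+), "
    r"activity count (?P<activity>\d+) priority (?P<priority>\d+) "
    r"state (?P<state>\d+) (?P<state_name>\w+)"
)


def parse_sminfo(text):
    """Return a dict of the sminfo fields, or None if the text does not match."""
    # Mail clients may wrap the line, so collapse all whitespace first.
    flat = " ".join(text.split())
    match = SMINFO_RE.search(flat)
    if match is None:
        return None
    info = match.groupdict()
    for key in ("lid", "activity", "priority", "state"):
        info[key] = int(info[key])
    return info


sample = """sminfo: sm lid 2 sm guid 0x5ad0000094038, activity count 4999946 priority
10 state 3 SMINFO_MASTER"""
info = parse_sminfo(sample)
print(info["guid"], info["state_name"])
```

The GUID it returns can then be searched for in the ibnetdiscover output, as done by hand above, to see which node hosts the master SM.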



>By default, osmtest runs all validation tests, which is similar
>to 'osmtest -f a'. This flow expects to get an input inventory file.
>You should first run 'osmtest -f c' to create such file, and then
>'osmtest' or 'osmtest -f a' to run the tests.
>See 'man osmtest' for more details.


"osmtest -f c" failed to create the inventory file.
[root@gaia-07 ~]# osmtest -f c

Command Line Arguments
Done with args
        Flow = Create Inventory
Aug 07 04:57:04 561325 [516EF3B0] 0x7f -> Setting log level to: 0x03
Aug 07 04:57:04 579744 [516EF3B0] 0x02 -> osm_vendor_bind: Binding to port 0x2c90300013371
Aug 07 04:57:04 602919 [516EF3B0] 0x02 -> osmtest_validate_sa_class_port_info:
-----------------------------
SA Class Port Info:
 base_ver:1
 class_ver:2
 cap_mask:0x2601
 cap_mask2:0x0
 resp_time_val:0x14
-----------------------------
Aug 07 04:57:08 604366 [4236E940] 0x01 -> umad_receiver: ERR 5409: send completed with error (method=0x12 attr=0x35 trans_id=0x2a00000004) -- dropping
Aug 07 04:57:08 604396 [4236E940] 0x01 -> umad_receiver: ERR 5410: class 0x3 LID 0x2
Aug 07 04:57:08 604420 [4236E940] 0x01 -> osmtest_query_res_cb: ERR 0003: Error on query (IB_TIMEOUT)
Aug 07 04:57:08 604454 [516EF3B0] 0x01 -> osmtest_get_all_recs: ERR 0004: ib_query failed (IB_TIMEOUT)
Aug 07 04:57:08 604476 [516EF3B0] 0x01 -> osmtest_write_all_path_recs: ERR 0025: osmtest_get_all_recs failed (IB_TIMEOUT)
Aug 07 04:57:08 604500 [516EF3B0] 0x01 -> osmtest_run: ERR 0139: Inventory file create failed (IB_TIMEOUT)
OSMTEST: TEST "Create Inventory" FAIL
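
To make the failure chain easier to follow in a long log such as the attached -V output, the ERR entries can be listed in order with a small script. This is only a sketch: the "function: ERR code:" layout is assumed from the log lines above, and extract_errors is a hypothetical helper, not something shipped with osmtest.

```python
import re

# Matches "<function>: ERR <4-digit code>:" as it appears in the log above
# (assumption: all error entries follow this layout).
ERR_RE = re.compile(r"(?P<func>\w+): ERR (?P<code>[0-9A-Fa-f]{4}):")


def extract_errors(log_text):
    """Return (function, error code) pairs in the order they were logged."""
    # Collapse whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    return [(m.group("func"), m.group("code")) for m in ERR_RE.finditer(flat)]


sample = """Aug 07 04:57:08 604454 [516EF3B0] 0x01 -> osmtest_get_all_recs: ERR 0004:
ib_query failed (IB_TIMEOUT)
Aug 07 04:57:08 604476 [516EF3B0] 0x01 -> osmtest_write_all_path_recs: ERR
0025: osmtest_get_all_recs failed (IB_TIMEOUT)
Aug 07 04:57:08 604500 [516EF3B0] 0x01 -> osmtest_run: ERR 0139: Inventory
file create failed (IB_TIMEOUT)"""
errors = extract_errors(sample)
for func, code in errors:
    print(func, code)
```

Read top to bottom, the list shows the timeout propagating from the query up to osmtest_run.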

I have attached the output of "osmtest -f c -V".
(See attached file: output)

-- Yevgeny


Wen Hao Wang
Email: wangwhao at cn.ibm.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: output
Type: application/octet-stream
Size: 18773 bytes
Desc: not available
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20080807/1e781f9a/attachment.obj>