[ofa-general] opensm hang and osmtest report ERR 0130
Wen Hao Wang
wangwhao at cn.ibm.com
Wed Aug 6 20:20:53 PDT 2008
Hi, Yevgeny:
Thanks for your answer.
>This part is OK - opensm enters the stand-by state and
>waits in this state indefinitely. This happened because
>opensm detects other opensm in the subnet.
>If you kill that other opensm, the stand-by opensm will
>enter MASTER state after a short period.
>You can see who's the master opensm in your subnet by
>running 'sminfo' tool.
Here is the output of sminfo
[root@gaia-07 nodedef]# sminfo
sminfo: sm lid 2 sm guid 0x5ad0000094038, activity count 4999946 priority 10 state 3 SMINFO_MASTER
[root@gaia-07 nodedef]# ibnetdiscover |grep 0x5ad0000094038
switchguid=0x5ad0000094038(5ad0000094038)
[root@gaia-07 nodedef]# ibnetdiscover |grep "lid 2"
Switch 24 "S-0005ad0000094038" # "Topspin Switch" enhanced port 0 lid 2 lmc 0
[1](2c903000134f5) "S-0005ad0000094038"[13] # lid 7 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f1040398b9f1) "S-0005ad0000094038"[11] # lid 8 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f104039955a5) "S-0005ad0000094038"[10] # lid 6 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f10403995879) "S-0005ad0000094038"[9] # lid 10 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f1040398ba19) "S-0005ad0000094038"[7] # lid 9 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f10403995861) "S-0005ad0000094038"[4] # lid 5 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](8f10403995875) "S-0005ad0000094038"[3] # lid 4 lmc 0 "Topspin Switch" lid 2 4xSDR
[1](2c90300013371) "S-0005ad0000094038"[14] # lid 3 lmc 0 "Topspin Switch" lid 2 4xSDR
It seems the Cisco switch has a subnet manager running.
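If this check needs to be scripted, here is a minimal Python sketch that pulls the master SM's LID, GUID, priority and state out of the sminfo line. It assumes the output format shown above; the function name parse_sminfo and the regular expression are my own illustration, not part of any OFED tool.

```python
import re

# Pattern for the single-line sminfo report shown above (assumed format).
SMINFO_RE = re.compile(
    r"sm lid (?P<lid>\d+) sm guid (?P<guid>0x[0-9a-fA-F]+), "
    r"activity count (?P<activity>\d+) priority (?P<priority>\d+) "
    r"state (?P<state>\d+) (?P<state_name>\S+)"
)


def parse_sminfo(text):
    """Parse one sminfo report (hypothetical helper) into a dict."""
    # Collapse whitespace so output wrapped by a mail client still matches.
    flat = " ".join(text.split())
    match = SMINFO_RE.search(flat)
    if match is None:
        raise ValueError("unrecognized sminfo output")
    return {
        "lid": int(match.group("lid")),
        "guid": int(match.group("guid"), 16),
        "activity": int(match.group("activity")),
        "priority": int(match.group("priority")),
        "state": int(match.group("state")),
        "state_name": match.group("state_name"),
    }
```

The returned GUID can then be compared against the switchguid reported by ibnetdiscover to confirm which node hosts the master SM.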
>By default, osmtest runs all validation tests, which is similar
>to 'osmtest -f a'. This flow expects to get an input inventory file.
>You should first run 'osmtest -f c' to create such file, and then
>'osmtest' or 'osmtest -f a' to run the tests.
>See 'man osmtest' for more details.
"osmtest -f c" failed to create the inventory file.
[root@gaia-07 ~]# osmtest -f c
Command Line Arguments
Done with args
Flow = Create Inventory
Aug 07 04:57:04 561325 [516EF3B0] 0x7f -> Setting log level to: 0x03
Aug 07 04:57:04 579744 [516EF3B0] 0x02 -> osm_vendor_bind: Binding to port 0x2c90300013371
Aug 07 04:57:04 602919 [516EF3B0] 0x02 -> osmtest_validate_sa_class_port_info:
-----------------------------
SA Class Port Info:
base_ver:1
class_ver:2
cap_mask:0x2601
cap_mask2:0x0
resp_time_val:0x14
-----------------------------
Aug 07 04:57:08 604366 [4236E940] 0x01 -> umad_receiver: ERR 5409: send completed with error (method=0x12 attr=0x35 trans_id=0x2a00000004) -- dropping
Aug 07 04:57:08 604396 [4236E940] 0x01 -> umad_receiver: ERR 5410: class 0x3 LID 0x2
Aug 07 04:57:08 604420 [4236E940] 0x01 -> osmtest_query_res_cb: ERR 0003: Error on query (IB_TIMEOUT)
Aug 07 04:57:08 604454 [516EF3B0] 0x01 -> osmtest_get_all_recs: ERR 0004: ib_query failed (IB_TIMEOUT)
Aug 07 04:57:08 604476 [516EF3B0] 0x01 -> osmtest_write_all_path_recs: ERR 0025: osmtest_get_all_recs failed (IB_TIMEOUT)
Aug 07 04:57:08 604500 [516EF3B0] 0x01 -> osmtest_run: ERR 0139: Inventory file create failed (IB_TIMEOUT)
OSMTEST: TEST "Create Inventory" FAIL
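To make the two umad_receiver lines easier to read, here is a small Python sketch that decodes the numeric fields. The lookup tables cover only the values in this log, and the names are as I recall them from the InfiniBand management definitions (class 0x3 = SubnAdm, method 0x12 = SubnAdmGetTable, attribute 0x35 = PathRecord), so please treat them as an assumption. The helper name describe_dropped_mad is hypothetical.

```python
import re

# Partial tables: only the values seen in the log above (assumed names).
MGMT_CLASSES = {0x03: "SubnAdm"}
METHODS = {0x12: "SubnAdmGetTable"}
ATTRIBUTES = {0x35: "PathRecord"}

ERR_5409 = re.compile(
    r"ERR 5409: .*?method=(0x[0-9a-fA-F]+) attr=(0x[0-9a-fA-F]+)"
)
ERR_5410 = re.compile(r"ERR 5410: class (0x[0-9a-fA-F]+) LID (0x[0-9a-fA-F]+)")


def _name(table, value):
    # Fall back to the raw number for anything not in the partial tables.
    return table.get(value, "unknown (%#x)" % value)


def describe_dropped_mad(log_text):
    """Decode the ERR 5409/5410 pair from an osmtest log, or return None."""
    # Collapse whitespace so wrapped log lines still match.
    flat = " ".join(log_text.split())
    sent = ERR_5409.search(flat)
    dest = ERR_5410.search(flat)
    if sent is None or dest is None:
        return None
    method, attr = (int(g, 16) for g in sent.groups())
    mgmt_class, lid = (int(g, 16) for g in dest.groups())
    return {
        "class": _name(MGMT_CLASSES, mgmt_class),
        "method": _name(METHODS, method),
        "attribute": _name(ATTRIBUTES, attr),
        "dest_lid": lid,
    }
```

If those names are right, the dropped request is a PathRecord table query sent to LID 2, which sminfo reports as the master SM on the Topspin switch. That would match osmtest_write_all_path_recs being the function that fails.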
Attached is the output of "osmtest -f c -V".
(See attached file: output)
>-- Yevgeny
Wen Hao Wang
Email: wangwhao at cn.ibm.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: output
Type: application/octet-stream
Size: 18773 bytes
Desc: not available
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20080807/1e781f9a/attachment.obj>