[openib-general] building and running IBMgtsim?

Eitan Zahavi eitan at mellanox.co.il
Thu Dec 21 11:09:24 PST 2006


Chris Elmquist wrote:
> Hi Guys...
>
> Thank you very much for the recipe.  We actually had a success getting
> it to go just after posting to the list but these instructions will now
> confirm whether we did it the right way or not.
>
> Are there any guidelines for how big of a network the simulator can
> deal with?  Maybe something that relates it to available memory on the
> platform it is running or other resource issues?  We threw one model
> at it already which tipped it over but we are certainly not sure we are
> using it the right way yet.
>   
I was able to simulate 10K nodes in the past.
What I did to get there was to use two machines: one for the simulator 
and one for the SM.
I also used 64bit (x86_64) machines to avoid the ~3GB data limit.
> Thanks again.  We hope to be activate participants in this space going
> forward and as soon as we know what we are doing, we'll feed it back to
> the group.
>
> Chris 
>
> On Thursday (12/21/2006 at 04:59PM +0200), Eitan Zahavi wrote:
>   
>> Hi Chris,
>>
>> Sorry for my late response on this:
>>
>> The simulator is a standalone "server" where clients connect to it 
>> through a TCP/IP socket.
>>
>> OpenSM which is not built with "sim" vendor (using --with-osmv=sim 
>> --with-sim=<prefix used for ibmgtsim install>)
>> will not try to connect to the simulator but will go to the real IB 
>> network instead.
>>
>> So you need a second "simulator" install of OpenSM.
>> You can simply clone the GIT tree and
>> ./autogen.sh
>> ./configure --with-osmv=sim --with-sim=<prefix used for ibmgtsim 
>> install> --prefix=<somewhere>
>> make
>> make install
>>
>> RunSimTest -f OsmTest.sim.tcl -c OsmTest.check.tcl -t IS1-16.topo -o 
>> <somewhere>/bin/opensm
>>
>> Actually OsmTest is a test that currently fail (due to last changes in 
>> InformInfo),
>> but any other *.check.tcl/*.sim.tcl pair should work.
>>
>> Eitan
>>
>>
>> Chris Elmquist wrote:
>>     
>>> Folks,
>>>
>>> I am trying to build and run IBMgtsim so that I can explore some different
>>> topologies and system sizes.  But I am having a lot of trouble getting
>>> OpenSM to work with the simulator.
>>>
>>> I pulled down Eitan's ibutils git tree (to get the simulator) and
>>> am otherwise using the OFED 1.1 tarball for the rest of the stuff.
>>> I suspect I have a problem with OpenSM not being built correctly to use
>>> the simulator.
>>>
>>> Does anyone have a recipe on how to build and install all of these pieces
>>> (ie, openib, openSM and ibmgtsim) so that they will work together?
>>>
>>> I have been just trying to run one of the tests provided with the
>>> simulator like this:
>>>
>>> % cd ~/ibutils/ibmgtsim/tests
>>> % RunSimTest -c OsmTest.check.tcl -f OsmTest.sim.tcl -t IS1-16.topo -o 
>>> /usr/local/bin/opensm
>>>
>>> but we get this sort of output:
>>>
>>> -I- Using random seed:43204
>>> -I- Simulation directory is: /tmp/ibmgtsim.29716
>>> -I- Calling IBMgtSim -s 43204 -V 0xA3 -t 
>>> /root/ibutils/ibmgtsim/tests/IS1-16.top
>>> o -f /root/ibutils/ibmgtsim/tests/OsmTest.sim.tcl -l 
>>> /tmp/ibmgtsim.29716/sim.log
>>> -I- Simulator Ready
>>> -I- Connecting to the simulator control server:pcplod.americas.sgi.com 
>>> port:3726
>>> 5 
>>> -I- Connected to the simulator control server
>>> -I- Defined 51 guids
>>> -I- Node H-1 data: 0x0002c90000000008 {0x0002c90000000009 1} 
>>> {0x0002c9000000000a
>>> 2}
>>> -I- Starting: /usr/local/bin/opensm -g 0x0002c90000000009  ...
>>> -I- Waiting for OpenSM subnet up ...
>>> -I- OpenSM Event:ERR Dec 20 10:53:09 470415 [5100E100] -> 
>>> osm_vendor_open_port: ERR 5422: Unable to find requested CA guid 
>>> 0x2c90000000009
>>> -I- New 1 events of /tmp/ibmgtsim.29716/osm.log
>>> -I- OpenSM Event:ERR Dec 20 10:53:09 470419 [5100E100] -> osm_vendor_bind: 
>>> ERR 5
>>> 424: Unable to Open Port 0x2c90000000009
>>> -I- New 1 events of /tmp/ibmgtsim.29716/osm.log
>>> -I- OpenSM Event:ERR Dec 20 10:53:09 470422 [5100E100] -> 
>>> osm_sm_mad_ctrl_bind: ERR 3118: Vendor specific bind failed
>>> -I- New 1 events of /tmp/ibmgtsim.29716/osm.log
>>> -I- OpenSM Event:ERR Dec 20 10:53:09 470427 [5100E100] -> osm_sm_bind: ERR 
>>> 2E10:
>>> SM MAD Controller bind failed (IB_ERROR)
>>> -I- New 1 events of /tmp/ibmgtsim.29716/osm.log
>>> -I- OpenSM Event:ERR Dec 20 10:53:09 470434 [5100E100] -> 
>>> osm_sa_mad_ctrl_unbind
>>> : ERR 1A11: No previous bind
>>> -I- New 1 events of /tmp/ibmgtsim.29716/osm.log
>>>
>>> Thank you.
>>>
>>> Chris
>>> SGI Network Engineering
>>>  
>>>       
>
>   





More information about the general mailing list