[ofa-general] RE: running opensm 3.0.3 on 4000+ node system

Sasha Khapyorsky sashak at voltaire.com
Wed Apr 9 17:50:10 PDT 2008


On 14:17 Wed 09 Apr     , Hal Rosenstock wrote:
> On Wed, 2008-04-09 at 15:13 -0600, Maestas, Christopher Daniel wrote:
> > I think we may have fixed it:
> > ---
> >  3998 pts/0    Sl     1:47 /usr/sbin/opensm -maxsmps 15 -t 200 -f /var/log/osm.log -g 0
> > --
> > 
> > I changed maxsmps to 15 (from default of 0 => unlimited) and it seems to be working now. 
> >  That is the same value we use for the cisco host based sm.
> 
> Yes, an infinite value could overrun the unflow controlled VL15 buffers
> in the switches.

Even if not - it overflows mad response matching table in vendor layer
(there are 4k+ nodes and only 1k entries in the table). In recent
version (master) this table size can be redefined with
OSM_UMAD_MAX_PENDING environment variable.

Sasha



More information about the general mailing list