[ofa-general] Re: [PATCH] opensm: enforce routing paths rebalancing on switch reconnection

Albert Chu chu11 at llnl.gov
Sun Mar 2 07:46:59 PST 2008


Hey Sasha,

In order to make things work, I also had to add this patch.  Seems like a
corner case that needs to be handled since we never fall into
__osm_pi_rcv_process_switch_port().  (BTW, I am working off a 3.1.10
branch for the test cluster, so this patch is forward ported and
technically untested.)

--- a/opensm/opensm/osm_port_info_rcv.c
+++ b/opensm/opensm/osm_port_info_rcv.c
@@ -564,6 +564,7 @@ void osm_pi_rcv_process(IN void *context, IN void *data)
                        ", Commencing heavy sweep\n",
                        cl_ntoh64(node_guid), cl_ntoh64(port_guid));
                sm->p_subn->force_heavy_sweep = 1;
+               sm->p_subn->ignore_existing_lfts = 1;
                goto Exit;
        }

Al

> Hey Sasha,
>
> This patch should definitely work.  I'll let you know after I get a chance
> to try it.
>
> Al
>
>> Hi Al,
>>
>> On 16:08 Sat 01 Mar     , Sasha Khapyorsky wrote:
>>>
>>> When switch ports were reconnected we need to recalculate routing paths
>>> balancing. Reconnection is detected by port state examination - when it
>>> becomes INIT routing paths rebalancing (ignore_existing_lfts flag) is
>>> enforced.
>>>
>>> Signed-off-by: Sasha Khapyorsky <sashak at voltaire.com>
>>
>> This patch is simpler than all previous ones. I tested it with ibsim
>> already. Could you test in your environment?
>>
>> Sasha
>>
>
>
> --
> Albert Chu
> chu11 at llnl.gov
> 925-422-5311
> Computer Scientist
> High Performance Systems Division
> Lawrence Livermore National Laboratory
>
>


-- 
Albert Chu
chu11 at llnl.gov
925-422-5311
Computer Scientist
High Performance Systems Division
Lawrence Livermore National Laboratory




More information about the general mailing list