[ofa-general] [PATCH 2/4] opensm/main.c rescan subnet configuration after SIGHUP

Eli Dorfman (Voltaire) dorfman.eli at gmail.com
Tue Feb 3 05:40:50 PST 2009


Sasha Khapyorsky wrote:
> On 11:28 Tue 03 Feb     , Eli Dorfman (Voltaire) wrote:
>> Sasha Khapyorsky wrote:
>>> On 16:32 Mon 26 Jan     , Eli Dorfman (Voltaire) wrote:
>>>>  rescan subnet configuration after SIGHUP
>>>>  call osm_subn_rescan_conf_files() after SIGHUP.
>>>>  this is important when priority is changed and SM is in standby.
>>>>  in that case it will not send capability mask trap and will not become master.
>>>>
>>>> Signed-off-by: Eli Dorfman <elid at voltaire.com>
>>>> ---
>>>>  opensm/opensm/main.c |    1 +
>>>>  1 files changed, 1 insertions(+), 0 deletions(-)
>>>>
>>>> diff --git a/opensm/opensm/main.c b/opensm/opensm/main.c
>>>> index f786192..0f7b822 100644
>>>> --- a/opensm/opensm/main.c
>>>> +++ b/opensm/opensm/main.c
>>>> @@ -507,6 +507,7 @@ int osm_manager_loop(osm_subn_opt_t * p_opt, osm_opensm_t * p_osm)
>>>>  			osm_hup_flag = 0;
>>>>  			/* a HUP signal should only start a new heavy sweep */
>>>>  			p_osm->subn.force_heavy_sweep = TRUE;
>>>> +			osm_subn_rescan_conf_files(&p_osm->subn);
>>> Is it synchronized with sweep? If regular (scheduled by timer) sweep
>>> starts in a middle of osm_subn_rescan_conf_files() (when QoS parameters
>>> are freed..., etc.). I think it is not.
>>>
>> i assume it is not.
>> what about the the following (though it uses yet another flag...)
>>
>> diff --git a/opensm/include/opensm/osm_subnet.h b/opensm/include/opensm/osm_subnet.h
>> index 8863e47..88c977d 100644
>> --- a/opensm/include/opensm/osm_subnet.h
>> +++ b/opensm/include/opensm/osm_subnet.h
>> @@ -169,6 +169,7 @@ typedef struct osm_subn_opt {
>>  	uint32_t polling_retry_number;
>>  	uint32_t max_msg_fifo_timeout;
>>  	boolean_t force_heavy_sweep;
>> +	boolean_t rescan_conf_file;
>>  	uint8_t log_flags;
>>  	char *dump_files_dir;
>>  	char *log_file;
>> diff --git a/opensm/opensm/main.c b/opensm/opensm/main.c
>> index de38056..f2d7846 100644
>> --- a/opensm/opensm/main.c
>> +++ b/opensm/opensm/main.c
>> @@ -507,7 +507,7 @@ int osm_manager_loop(osm_subn_opt_t * p_opt, osm_opensm_t * p_osm)
>>  			osm_hup_flag = 0;
>>  			/* a HUP signal should only start a new heavy sweep */
>>  			p_osm->subn.force_heavy_sweep = TRUE;
>> -			osm_subn_rescan_conf_files(&p_osm->subn);
>> +			p_osm->subn.rescan_conf_file  = TRUE;
>>  			osm_opensm_sweep(p_osm);
>>  		}
>>  	}
>> diff --git a/opensm/opensm/osm_state_mgr.c b/opensm/opensm/osm_state_mgr.c
>> index fc7ceb9..87a5db9 100644
>> --- a/opensm/opensm/osm_state_mgr.c
>> +++ b/opensm/opensm/osm_state_mgr.c
>> @@ -1042,6 +1042,13 @@ static void do_sweep(osm_sm_t * sm)
>>  	ib_api_status_t status;
>>  	osm_remote_sm_t *p_remote_sm;
>>  
>> +	if (sm->p_subn->rescan_conf_file) {
>> +		if (osm_subn_rescan_conf_files(sm->p_subn) < 0)
>> +			OSM_LOG(sm->p_log, OSM_LOG_ERROR, "ERR 331A: "
>> +				"osm_subn_rescan_conf_file failed\n");
>> +		sm->p_subn->rescan_conf_file = FALSE;
>> +	}
>> +
> 
> What would be wrong with using exiting 'force_heavy_sweep' flag?
> 
'force_heavy_sweep' flag is set in other occasions as well

> Another issue with this patch - config file will be rescanned later
> again (during heavy sweep). It would be really nice to avoid such
> obviously unneeded double parsing.
>
that is correct, but we need a special flag to handle the priority change when SM
is in standby.
In that case a rescan at the beginning of do_sweep is a must, otherwise it will 
simply return without doing anything.
what was the reason of putting rescan not in the beginning of do_sweep().
If none then we can simply rescan as first step.

Eli

> Sasha
> 
>>  	if (sm->p_subn->sm_state != IB_SMINFO_STATE_MASTER &&
>>  	    sm->p_subn->sm_state != IB_SMINFO_STATE_DISCOVERING)
>>  		return;




More information about the general mailing list