[openib-general] [PATCH] opensm fails to find HCA if port is down.

Hal Rosenstock halr at voltaire.com
Tue Jan 10 14:12:37 PST 2006


Hi Ralph,

On Tue, 2006-01-10 at 13:31, Ralph Campbell wrote:
> If opensm is started with no arguments, the default algorithm
> for finding a port to bind to will skip ports which are present
> but the link is DOWN.  If there is only one port in the system,
> no port is selected and opensm tries the default HCA name "mthca0"
> which, if not present, confuses opensm and it exits.
> 
> The following patch changes the port selection so that the first
> active port is selected, and if none, the first non-disabled port.

This is close and headed in the right direction, but it has one property
I'm not too fond of: when there are no active ports, it does not prefer a
port whose physical state is link up over one in some other state (say,
polling), so the subnet may not come up in a case where it could.
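
To make the ordering concrete, here is a minimal sketch of a three-tier
selection: first active port, else first port that is physically link up,
else first port that is not disabled. It is not the umad.c code. The
struct, the constant names and pick_port() are hypothetical, and it assumes
ports are numbered from 1 with missing ports stored as NULL. The numeric
values are the ones the patch already uses (state 4 is active, phys_state 5
is link up, phys_state 3 is disabled).

```c
#include <stddef.h>

/* Hypothetical names for the magic numbers that appear in the patch. */
enum { PORT_STATE_ACTIVE = 4 };
enum { PHYS_STATE_DISABLED = 3, PHYS_STATE_LINK_UP = 5 };

/* Hypothetical, simplified stand-in for a per-port record. */
struct port_info {
	int state;		/* logical port state */
	int phys_state;		/* physical port state */
};

/*
 * Return the number of the preferred port, or -1 if no port is usable.
 * Assumes ports[1..numports] are the candidates and that a missing port
 * is stored as NULL.
 *
 * Preference order:
 *   1. the first active port,
 *   2. otherwise the first port whose physical state is link up,
 *   3. otherwise the first port that is not disabled.
 */
static int pick_port(struct port_info *const ports[], int numports)
{
	int link_up = -1, enabled = -1;
	int i;

	for (i = 1; i <= numports; i++) {
		if (!ports[i])
			continue;
		if (ports[i]->state == PORT_STATE_ACTIVE)
			return i;	/* best case, stop looking */
		if (link_up < 0 && ports[i]->phys_state == PHYS_STATE_LINK_UP)
			link_up = i;
		if (enabled < 0 && ports[i]->phys_state != PHYS_STATE_DISABLED)
			enabled = i;
	}

	return link_up > 0 ? link_up : enabled;
}
```

With port 1 polling and port 2 link up (neither active), this returns 2,
whereas picking the first non-disabled port would return 1.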

-- Hal

> Signed-off-by: Ralph Campbell <ralphc at pathscale.com>
> 
> Index: umad.c
> ===================================================================
> --- umad.c	(revision 4900)
> +++ umad.c	(working copy)
> @@ -207,9 +207,9 @@
>  }
>  
>  /*
> - * if *port > 0 checks ca[port] state. Otherwise set *port to
> + * if *port > 0, check ca[port] state. Otherwise set *port to
>   * the first port that is active, and if such is not found, to
> - * the first port that is (physically) up. Otherwise return -1;
> + * the first port that is not disabled.  Otherwise return -1;
>   */
>  static int
>  resolve_ca_port(char *ca_name, int *port)
> @@ -228,14 +228,14 @@
>  		return 1;
>  	}
>  
> -	if (*port > 0) {	/* user wants user gets */
> +	if (*port > 0) {	/* check only the port the user wants */
>  		if (*port > ca.numports)
>  			return -1;
>  		if (!ca.ports[*port])
>  			return -1;
>  		if (ca.ports[*port]->state == 4)
>  			return 1;
> -		if (ca.ports[*port]->phys_state == 5)
> +		if (ca.ports[*port]->phys_state != 3)
>  			return 0;
>  		return -1;
>  	}
> @@ -244,7 +244,7 @@
>  		DEBUG("checking port %d", i);
>  		if (!ca.ports[i])
>  			continue;
> -		if (up < 0 && ca.ports[i]->phys_state == 5)
> +		if (up < 0 && ca.ports[i]->phys_state != 3)
>  			up = *port = i;
>  		if (ca.ports[i]->state == 4) {
>  			active = *port = i;
> @@ -278,10 +278,11 @@
>  		return ca_name;
>  	}
>  		
> -	/* find first existing HCA with Active port */
> +	/* Get the list of CA names. */
>  	if ((n = umad_get_cas_names((void *)names, UMAD_CA_NAME_LEN)) < 0)
>  		return 0;
>  
> +	/* Find the first existing CA with an active port. */
>  	for (caidx = 0; caidx < n; caidx++) {
>  		TRACE("checking ca '%s'", names[caidx]);
>  	


