[openib-general] [PATCH] opensm fails to find HCA if port is down.
Ralph Campbell
ralphc at pathscale.com
Tue Jan 10 14:47:39 PST 2006
I understand. Maybe it should be the first active, if none, then the
first UP, and if none, the first !disabled.
Mostly I was trying to get something that picked ipath0 port 1
when it was the only port in the system even if the link
is down.
On Tue, 2006-01-10 at 17:12 -0500, Hal Rosenstock wrote:
> Hi Ralph,
>
> On Tue, 2006-01-10 at 13:31, Ralph Campbell wrote:
> > If opensm is started with no arguments, the default algorithm
> > for finding a port to bind to will skip ports which are present
> > but the link is DOWN. If there is only one port in the system,
> > no port is selected and opensm tries the default HCA name "mthca0"
> > which, if not present, confuses opensm and it exits.
> >
> > The following patch changes the port selection so that the first
> > active port is selected, and if none, the first non-disabled port.
>
> This is close and headed in the right direction but has one property I'm
> not too fond of: when there are no active ports, it does not prefer a
> port whose physical state is link up over one in (say) polling (anything
> other than link up) so the subnet may not come up when it could in that
> case.
>
> -- Hal
>
> > Signed-off-by: Ralph Campbell <ralphc at pathscale.com>
> >
> > Index: umad.c
> > ===================================================================
> > --- umad.c (revision 4900)
> > +++ umad.c (working copy)
> > @@ -207,9 +207,9 @@
> > }
> >
> > /*
> > - * if *port > 0 checks ca[port] state. Otherwise set *port to
> > + * if *port > 0, check ca[port] state. Otherwise set *port to
> > * the first port that is active, and if such is not found, to
> > - * the first port that is (physically) up. Otherwise return -1;
> > + * the first port that is not disabled. Otherwise return -1;
> > */
> > static int
> > resolve_ca_port(char *ca_name, int *port)
> > @@ -228,14 +228,14 @@
> > return 1;
> > }
> >
> > - if (*port > 0) { /* user wants user gets */
> > + if (*port > 0) { /* check only the port the user wants */
> > if (*port > ca.numports)
> > return -1;
> > if (!ca.ports[*port])
> > return -1;
> > if (ca.ports[*port]->state == 4)
> > return 1;
> > - if (ca.ports[*port]->phys_state == 5)
> > + if (ca.ports[*port]->phys_state != 3)
> > return 0;
> > return -1;
> > }
> > @@ -244,7 +244,7 @@
> > DEBUG("checking port %d", i);
> > if (!ca.ports[i])
> > continue;
> > - if (up < 0 && ca.ports[i]->phys_state == 5)
> > + if (up < 0 && ca.ports[i]->phys_state != 3)
> > up = *port = i;
> > if (ca.ports[i]->state == 4) {
> > active = *port = i;
> > @@ -278,10 +278,11 @@
> > return ca_name;
> > }
> >
> > - /* find first existing HCA with Active port */
> > + /* Get the list of CA names. */
> > if ((n = umad_get_cas_names((void *)names, UMAD_CA_NAME_LEN)) < 0)
> > return 0;
> >
> > + /* Find the first existing CA with an active port. */
> > for (caidx = 0; caidx < n; caidx++) {
> > TRACE("checking ca '%s'", names[caidx]);
> >
>
--
Ralph Campbell <ralphc at pathscale.com>
More information about the general
mailing list