[openib-general] gen2 opensm

Roland Fehrenbacher rf at q-leap.de
Tue Apr 5 13:33:34 PDT 2005


>>>>> "Hal" == Hal Rosenstock <halr at voltaire.com> writes:

    Hal> On Tue, 2005-04-05 at 13:26, Roland Fehrenbacher wrote:

    >> I have tried the kernel 2.6.11 drivers on an x86-64 machine
    >> with a MT23108 card. The driver loads ok after
    >> $ modprobe ib_mthca; modprobe ib_umad

    >> Since I use devfs, I have to manually create

    >> $ mknod /dev/infiniband/umad0 c 231 0
    >> $ mknod /dev/infiniband/umad1 c 231 1
    >> $ mknod /dev/infiniband/issm0 c 231 64
    >> $ mknod /dev/infiniband/issm1 c 231 65

    Hal> What are the permissions on those ? Are they crw ?

$ ls -l /dev/infiniband
total 0
crw-r--r--  1 root root 231, 64 Apr  5 18:53 issm0
crw-r--r--  1 root root 231, 65 Apr  5 18:54 issm1
crw-r--r--  1 root root 231,  0 Apr  5 18:52 umad0
crw-r--r--  1 root root 231,  1 Apr  5 18:54 umad1

    >> I get 
    >> 
    >> $ /usr/local/ib/bin/ibstat
    >> CA 'mthca0'
    >>         CA type: MT23108
    >>         Number of ports: 2
    >>         Firmware version: 3.2.0
    >>         Hardware version: a1
    >>         Node GUID: 0x000000008815bcaa
    >>         System image GUID: 0x000000008815bcaa
    >>         Port 1:
    >>                 State: Initializing
    >>                 Physical state: LinkUp
    >>                 Rate: 10
    >>                 Base lid: 0
    >>                 LMC: 0
    >>                 SM lid: 0
    >>                 Capability mask: 0x00500a68
    >>                 Port GUID: 0x0000000000000000
    >>         Port 2:
    >>                 State: Down
    >>                 Physical state: Polling
    >>                 Rate: 2
    >>                 Base lid: 0
    >>                 LMC: 0
    >>                 SM lid: 0
    >>                 Capability mask: 0x00500a68
    >>                 Port GUID: 0x0000000000000000
    >> 
    >> which already looks strange (GUID 0 ???).

    Hal> It looks like the port GUIDs are not set in NVRAM.

They seem to be shown alright with ibstatus (or isn't gid = GUID?):

$ /usr/local/ib/bin/ibstatus
Infiniband device 'mthca0' port 1 status:
        default gid:     fe80:0000:0000:0000:0002:c902:0000:771d
        base lid:        0x0
        sm lid:          0x0
        state:           2: INIT
        phys state:      5: LinkUp
        rate:            10 Gb/sec (4X)

Infiniband device 'mthca0' port 2 status:
        default gid:     fe80:0000:0000:0000:0002:c902:0000:771e
        base lid:        0x0
        sm lid:          0x0
        state:           1: DOWN
        phys state:      2: Polling
        rate:            2.5 Gb/sec (1X)

> Running opensm then doesn't activate the ports:
> 
> Apr 05 19:18:25 [4000] -> OpenSM Rev:openib-1.0.0
> Apr 05 19:18:25 [4000] -> osm_opensm_init: Forcing single threaded dispatcher.
> Apr 05 19:18:25 [4000] -> osm_report_notice: Reporting Generic Notice type:3 num:66 from LID:0x0000 GID:0x0000000030f2ffff,0x0000000000000000
> Apr 05 19:18:25 [4000] -> osm_report_notice: Reporting Generic Notice type:3 num:66 from LID:0x0000 GID:0x0000000030f2ffff,0x0000000000000000
> Apr 05 19:18:25 [4000] -> osm_vendor_get_all_port_attr: assign CA  0x7fffffffd010ort 1 guid (0x65babaa) as the default port.

    Hal> I see a bug in this message. I will fix it. Please sync
    Hal> OpenSM to at least version 2111 and rerun.

I will recompile tomorrow, and try a firmware upgrade.

Roland





More information about the general mailing list