***SPAM*** Re: [ofa-general] Mellanox Gen3, Linux and ibpanic - "Resource Temporarily unavailable"

Hal Rosenstock hal.rosenstock at gmail.com
Tue Nov 25 06:56:41 PST 2008


Hi Rob,

On Tue, Nov 25, 2008 at 9:46 AM, Robert Dunkley <Robert at saq.co.uk> wrote:
> Hi Hal,
>
> Machine A is powered on. It was after powering down machine B and OpenSM
> with it that Machine A went weird.

> /sys/class/infiniband/mthca0 exists on Machine A, contents is:
> board_id  fw_ver    hw_rev     node_guid  ports      sys_image_guid
> device    hca_type  node_desc  node_type  subsystem  uevent

What about machine B ? Do these files exist ? Also what is the port
state (down or init or something else) ?

-- Hal

> Thanks,
>
> Rob
>
> -----Original Message-----
> From: Hal Rosenstock [mailto:hal.rosenstock at gmail.com]
> Sent: 25 November 2008 14:46
> To: Robert Dunkley
> Cc: general at lists.openfabrics.org
> Subject: Re: [ofa-general] Mellanox Gen3, Linux and ibpanic - "Resource
> Temporarily unavailable"
>
> On Tue, Nov 25, 2008 at 9:20 AM, Robert Dunkley <Robert at saq.co.uk>
> wrote:
>> Hi everyone,
>>
>> I'm using a setup of two machines (Lets call them A and B) directly
>> connected by 1 cable. Each machine has a Mellanox MT25204 (Gen3
> Mellanox
>> PCI-E Infiniband card) and uses IPOIB, they run Centos 5.2 with OFED
> 1.3
>> installed, Machine B runs OpenSM.
>>
>> All was working fine. I shutdown Machine A did some maintenance and
> then
>> powered it on again, everything is OK again. I then shutdown Machine B
>> (The one running OpenSM), this seemed to really upset Machine A. After
>> booting Machine B again, Machine B looks OK with the port down and in
>> polling state.
>
> Is this with machine A powered off ?
>
>> Machine A however gives the following error if I run
>> ibstat: ibpanic: [11406] main: stat of IB device 'mthca0' failed:
>> (Resource temporarily unavailable)
>
> Does /sys/class/infiniband/mthca0 exist on machine A ? If so, what
> files are there ?
>
> -- Hal
>
>> I don't want to reboot Machine A as it must synch data with Machine B
>> over the Infiniband link first. Does anyone have any idea how to fix
>> machine A?
>>
>> Thanks,
>>
>> Rob
>>
>> The SAQ Group
>>
>> Registered Office: 18 Chapel Street, Petersfield, Hampshire GU32 3DZ
>> SEMTEC Limited Trading as SAQ is Registered in England & Wales
>> Company Number: 06481952
>>
>>
>>
>> http://www.saqnet.co.uk AS29219
>>
>> SAQ Group Delivers high quality, honestly priced communication and
> I.T. services to UK Business.
>>
>> DSL : Domains : Email : Hosting : CoLo : Servers : Racks : Transit :
> Backups : Managed Networks : Remote Support.
>>
>> Find us in http://www.thebestof.co.uk/petersfield
>>
>> _______________________________________________
>> general mailing list
>> general at lists.openfabrics.org
>> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
>>
>> To unsubscribe, please visit
> http://openib.org/mailman/listinfo/openib-general
>>
>



More information about the general mailing list