[Users] OpenSM high cpu usage on ESXi

Raphaël SCHITZ raphael at schitz.net
Fri Jun 7 05:52:22 PDT 2013


Yes,

The vib file attached to the post is a standard package for ESXi.
The daemon will start an openSM instance on each port found on the server and separate logs and partitions.conf as well.

I hope you guys would have a use case for using it in the future and I would be please to help or add features if needed!

Cheers

From: Hal Rosenstock [mailto:hal.rosenstock at gmail.com]
Sent: Friday, June 07, 2013 2:26 PM
To: Raphaël SCHITZ
Cc: users at lists.openfabrics.org
Subject: Re: [Users] OpenSM high cpu usage on ESXi


On Fri, Jun 7, 2013 at 7:22 AM, Raphaël SCHITZ <raphael at schitz.net<mailto:raphael at schitz.net>> wrote:
Hi,

Thanks to Hal Rosenstock, i was finally able to package a functional vib for ESXi 5.x : http://www.hypervisor.fr/?p=4662

It is now possible to manage opensm on ESX side so no switch needed for back to back connection.

Thanks again Hal!


Thanks! I can understand a good bit of what was written but my French isn't quite enough for everything written there ;-)

Is this now packaged up for others to use ?

-- Hal



On 6 avr. 2013, at 02:32, "Raphaël SCHITZ" <raphael at schitz.net<mailto:raphael at schitz.net>> wrote:
Hi,

To start practicing infinband in my personal home lab, i managed to compile OpenSM for ESXi to avoid buying an expansive switch and do back-to-back wiring between two HP ML110 servers and Mellanox Connect X cards. The trick is compiling the binary on CentOS 3.9 i386 and that makes is usable on ESXi but i had to modify some device path access (/sys/class/infiniband to /proc/infiniband and /dev/infiniband to /dev) in the source files of OpenSM.

It’s working but i have some issues and they might be related.

First, the cpu usage of two of the OpenSM processes are too high (almost 100% each) and makes me think of a cpu loop or something similar.
Second, i got a constant massive flow of this error in the opensm.log : [2F66AB90] 0x01 -> umad_receiver: ERR 5404: recv error on MAD sized umad (Resource temporarily unavailable)

Could some one help me to understand and solve this ?

Thanks
RS
_______________________________________________

Users mailing list
Users at lists.openfabrics.org<mailto:Users at lists.openfabrics.org>
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/users

_______________________________________________
Users mailing list
Users at lists.openfabrics.org<mailto:Users at lists.openfabrics.org>
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/users/attachments/20130607/71a87218/attachment.html>


More information about the Users mailing list