[ofa-general] How many processes on a node can open IB device ?
Tang, Changqing
changquing.tang at hp.com
Tue Aug 19 13:08:31 PDT 2008
Roland:
Thank you very much for the info.
I hope Mellanox can tell me what to do next. Our project needs to run 2048 ranks
on a node, and every rank has IB communication (most of them are sleeping; only a few are active).
--CQ
> -----Original Message-----
> From: Roland Dreier [mailto:rdreier at cisco.com]
> Sent: Tuesday, August 19, 2008 2:53 PM
> To: Tang, Changqing
> Cc: general at lists.openfabrics.org
> Subject: Re: [ofa-general] How many processes on a node can open IB device ?
>
> > I have simple IBV code, which only opens the device and creates a PD
> > (attached below), then the code sleeps there.
> >
> > When I start as many processes as I could, it fails at
> > 895 copies, it fails with error:
>
> That sounds right for mlx4 with default firmware on a 4KB page size
> (i.e. x86) system. There are 1024 pages of user access registers
> available, but 128 + 1 = 129 are reserved for internal driver use.
> So that would leave 895 available for userspace use, exactly as you
> found.
>
> You should be able to build firmware that supports more processes,
> but I believe there may be some performance/stability tradeoffs
> related to that -- Mellanox could tell you more.
>
> - R.
>
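
The arithmetic in the quoted reply can be written out as a small C sketch. The page counts (1024 total, 128 + 1 reserved) are the figures given in the thread for mlx4 with default firmware on a 4KB page size system. The function and constant names are hypothetical, and the one-UAR-page-per-process mapping is an assumption inferred from the 895 limit matching the 895 remaining pages, not something taken from the driver source.

```c
#include <assert.h>

/* Figures quoted in the thread: mlx4, default firmware, 4KB page size. */
enum {
    UAR_PAGES_TOTAL    = 1024,    /* pages of user access registers */
    UAR_PAGES_RESERVED = 128 + 1  /* reserved for internal driver use */
};

/*
 * Number of processes that can open the device at the same time.
 * Assumption (inferred from the thread): each process that opens
 * the device consumes exactly one UAR page.
 */
int max_user_processes(int total_pages, int reserved_pages)
{
    assert(total_pages >= reserved_pages);
    return total_pages - reserved_pages;
}

/*
 * Total UAR pages the firmware would have to expose so that `ranks`
 * processes on one node can all open the device, under the same
 * one-page-per-process assumption.
 */
int uar_pages_needed(int ranks, int reserved_pages)
{
    assert(ranks >= 0 && reserved_pages >= 0);
    return ranks + reserved_pages;
}
```

With the figures above, max_user_processes(UAR_PAGES_TOTAL, UAR_PAGES_RESERVED) gives 895, and uar_pages_needed(2048, UAR_PAGES_RESERVED) gives 2177, more than twice the 1024 pages available by default. Under these assumptions, that gap is why 2048 ranks per node would need firmware built to support more processes.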