[openib-general] Updated IPoIB IETF WG presentation
David M. Brean
David.Brean at Sun.COM
Sun Aug 1 15:51:43 PDT 2004
Hello,
Some comments/questions about these slides:
* slide 1 - nit: perhaps the title should be "Some Experience with Linux
IPoIB Implementations" since the information is coming from Linux
developers.
* slide 4 - nit: move the first bullet after bullet containing "single
implementation"
* slide 6 - nit: first bullet should be highlighted as the "problem" and
the second bullet as the "solution".
* slide 7 and 8 - In section 5.0 of the I-D, there is text stating that
the "broadcast group may be created by the first IPoIB node to be
initialized or it can be created administratively before the IPoIB
subnet is setup". The mechanism used to administratively create the
group is intentionally beyond the scope of the I-D. For example, an
implementation could enable the fabric (or "network" as you say)
administrator to control membership in a partition and therefore make
sure that the first node added to that partition creates the broadcast
group correctly. In any case, mentioning the administrative option is
kinda a "helpful" hint. All the IPoIB nodes are free to create the
broadcast group, just like they can create any multicast group, as long
as the IPoIB node has enough information to specify the necessary
parameters as required by the SA interface. The I-D suggests how to
find the necessary parameters for the multicast groups and leaves open
how IPoIB nodes obtain that information if they need to create that group.
Are these slides suggesting that the I-D be changed to specify the
IPoIB parameters via defaults for the case where the IPoIB node must
create the broadcast group?
[Note, Q_Key is provided by broadcast group, so it isn't necessary to
distribute to all IPoIB nodes.]
* slide 9 and 10 - "Running" may be the description of a state that is
be OS is beyond the scope of the I-D (does Windows network interface
support a "running" state?). However, the I-D does say that an IPoIB
link is "formed" only when the broadcast group exists. The I-D doesn't
say anything about operation in a "degraded" mode, for example, when a
IPoIB node can't join a multicast group. Behavior in degraded mode
seems like an implementation issue. It's not clear what you would want
to change in the I-D, perhaps you can suggest what you want changed in
the presentation.
* slide 12 - I recall that during the email discussion:
1) a boot-time scenario where the IPoIB nodes had to access the SA to
obtain pathrecord information to fill the pathrecord cache and send
unicast ARP messages
2) a SM failover/restart scenario
For #1, the speed at which the IPoIB nodes can begin normal operation
depends on the fabric and SA implementation. I guess the question is
whether this is an architecture or implementation problem. Is it
impossible to implement a working system based on the current
architecture? I think the proposed alternative would require changes to
the encapsulation scheme plus specifying some defaults such as the SL so
that SA queries are eliminated. Some of that might require input from
the IBTA.
For #2, how long is too long for a subnet to operate without
successful SA queries? 10 seconds? 20 seconds? Or is this change
suggesting that the subnet should continue operating, perhaps
establishing new IP connections (note, this proposal doesn't attempt to
fix the situation at the IB transport level) even in the case where no
SA exists. Please clarify in the slides.
* slide 13 - An IB CA should perform as well as a "dumb" ethernet NIC
with respect to bandwidth and CPU utilization. If not, someone should
look at the overheads in the IB access layer and the CA implementation,
right? The statement "not equivalent to ethernet" is highlighting the
lack offload mechanisms in the CA such as checksum, correct? If so,
perhaps that point should be made explicit.
Note, I'm not attempting to respond to the issues raised on the slides
since that will happen at the meeting, but merely seeking clarification
of the issues being raised.
-David
Hal Rosenstock wrote:
> Here's an updated presentation based on the comments from yesterday:
> - Separate slide and more detail on openib
> - Eliminate checksum slide
>
> It is also available as
> https://openib.org/svn/trunk/contrib/voltaire/ietf_ipoib/ipoib_exp.pdf.
>
> -- Hal
>
>
More information about the general
mailing list