[openib-general] mwrite64 - need for uar object in access layer
Michael S. Tsirkin
mst at mellanox.co.il
Tue Sep 21 10:17:14 PDT 2004
Quoting r. Roland Dreier (roland at topspin.com) "Re: [openib-general] mwrite64 - need for uar object in access layer":
> Troy> How does just using the floating point unit compare the the
> Troy> SSE codepath? In a past life I had to get a flash driver for
> Troy> a 32 bit PPC board working that *had* to have 64 bit access
> Troy> to flash.
>
> It's a pretty huge loss, because saving/restoring the FPU state
> requires writing/reading something like 170 bytes. With SSE we can
> just save the 8 bytes of XMM register that we actually use. Even so
> I'm not convinced SSE is a win over just using a lock because saving
> CR0 is so expensive.
>
> As I said, I'd be curious to see benchmarks of other approaches. I
> think there's definitely room for improvement if someone is interested
> in working on this.
>
Well, profiling is always nice but hard to do.
But just doing a two word write is clearly a win, no?
MST
More information about the general
mailing list