[cairo] [PATCH] Rely less on the FPU, take 3
Jonathan Morton
jonathan.morton at movial.com
Wed Jun 2 10:29:13 PDT 2010
It turns out, of course, that memcmp() isn't the most efficient
mechanism to compare small, aligned, known-size blocks of memory.
Attached is yet another version of the patch, which is verified to
produce small but consistent increases in benchmark numbers on a
Cortex-A8. Note that the compare function unrolls and inlines to
branch-free code on ARM, and should behave similarly on PPC and x86.
What I don't yet know is whether this version is as fast as the original
on desktop platforms, given that these have properly integrated FPUs.
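
For readers without the attachment, here is a minimal sketch of the kind of
compare function described above. It is an illustration only, not the code
from the patch: the type example_matrix_t, the function example_blocks_equal
and the choice of 32-bit words are all assumptions. The idea is to OR together
the XOR of each pair of words, so the fixed-count loop can be fully unrolled
and only a single test remains at the end.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a small, aligned, fixed-size block:
 * six doubles, as in a 2D affine matrix. Not taken from the patch. */
typedef struct {
    double xx, yx, xy, yy, x0, y0;
} example_matrix_t;

/* Number of 32-bit words in the block; a compile-time constant. */
#define EXAMPLE_WORDS (sizeof (example_matrix_t) / sizeof (uint32_t))

/* Returns 1 if the two blocks are bit-for-bit identical, 0 otherwise.
 * No floating-point instructions are needed for the comparison. */
static inline int
example_blocks_equal (const example_matrix_t *a, const example_matrix_t *b)
{
    uint32_t wa[EXAMPLE_WORDS], wb[EXAMPLE_WORDS];
    uint32_t diff = 0;
    size_t i;

    /* Copy into integer arrays to avoid aliasing problems; compilers
     * typically turn these fixed-size copies into plain loads. */
    memcpy (wa, a, sizeof wa);
    memcpy (wb, b, sizeof wb);

    /* Accumulate differences without branching on each word. */
    for (i = 0; i < EXAMPLE_WORDS; i++)
        diff |= wa[i] ^ wb[i];

    return diff == 0;
}
```

Note that a bitwise comparison like this is not the same as comparing the
doubles with ==: 0.0 and -0.0 have different bit patterns and so compare as
different here, while two NaNs with the same bit pattern compare as equal.
Whether that matters depends on the caller.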
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-Rely-less-on-DP-FPU-for-common-matrix-funcs.patch
Type: text/x-patch
Size: 2014 bytes
Desc: not available
URL: <http://lists.cairographics.org/archives/cairo/attachments/20100602/24ddb03f/attachment.bin>