SkHalfToFloat_01 / SkFloatToHalf_01 - skia

author	mtklein <mtklein@chromium.org>	2016-02-11 06:30:03 -0800
committer	Commit bot <commit-bot@chromium.org>	2016-02-11 06:30:03 -0800
commit	fff055cc5f9ca5015678f4f13a4f842084bd62d5 (patch)
tree	f7e00567455fbd81ab5c1b401e4e80ed52a2095e /bench/AndroidCodecBench.h
parent	cbefc5e4ca7fd7aaa5d2a3aa85b30f16148c3d2f (diff)

SkHalfToFloat_01 / SkFloatToHalf_01

These are basically inlined, 4-at-a-time versions of our existing functions, but cut down to avoid any work that's only necessary outside [0,1].

Both f16 and f32 denorms should work fine modulo the usual ARMv7 NEON denorm==zero caveat.

In exchange for a little speed, f32->f16 does not round properly. Instead it truncates, so it's never off by more than 1 bit.

Support for finite values >1 or <0 is straightforward to add back. >1 might already work as-is.

Getting close to _u16 performance:

    micros      bench
     261.13     xferu64_bw_1_opaque_u16
    1833.51     xferu64_bw_1_alpha_u16
    2762.32  ?  xferu64_aa_1_opaque_u16
    3334.29     xferu64_aa_1_alpha_u16
     249.78     xferu64_bw_1_opaque_f16
    3383.18     xferu64_bw_1_alpha_f16
    4214.72     xferu64_aa_1_opaque_f16
    4701.19     xferu64_aa_1_alpha_f16

BUG=skia:
GOLD_TRYBOT_URL= https://gold.skia.org/search2?unt=true&query=source_type%3Dgm&master=false&issue=1685133005

Committed: https://skia.googlesource.com/skia/+/9ea11a4235b3e3521cc8bf914a27c2d0dc062db9

CQ_EXTRA_TRYBOTS=client.skia:Test-Ubuntu-GCC-GCE-CPU-AVX2-x86_64-Release-SKNX_NO_SIMD-Trybot

Review URL: https://codereview.chromium.org/1685133005
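
For illustration, here is a scalar sketch of one common way to do [0,1]-only half/float conversions with the properties the message describes (denorms handled, f32->f16 truncating). It is not the code from this commit, which is 4-at-a-time: the helper names `half_to_float_01` and `float_to_half_01` are hypothetical, and the shift-and-rescale approach is an assumption about how such a conversion might be written.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Bit-cast helpers; memcpy avoids undefined behavior from pointer punning.
static inline float bits_to_float(uint32_t bits) {
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}
static inline uint32_t float_to_bits(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);
    return bits;
}

// Hypothetical helper: f16 bits -> f32, valid only for values in [0,1].
static inline float half_to_float_01(uint16_t h) {
    // Shifting left by 13 lines up the 5-bit exponent and 10-bit mantissa
    // with the f32 fields. The exponent is still biased by 15, so scale by
    // 2^(127-15) = 2^112 to rebias it. f16 denorms pass through as f32
    // denorms and come out correctly scaled, unless the hardware flushes
    // denorms to zero.
    return bits_to_float(uint32_t(h) << 13) * std::ldexp(1.0f, 112);
}

// Hypothetical helper: f32 -> f16 bits, valid only for values in [0,1].
static inline uint16_t float_to_half_01(float f) {
    assert(f >= 0.0f && f <= 1.0f);
    // Scale by 2^-112 to rebias the exponent from 127 to 15, then drop the
    // low 13 mantissa bits. Dropping bits truncates rather than rounding
    // to nearest.
    return uint16_t(float_to_bits(f * std::ldexp(1.0f, -112)) >> 13);
}
```

With these definitions, `float_to_half_01(1.0f)` gives `0x3C00` and `half_to_float_01(0x3C00)` gives `1.0f`. Because the sign bit is never inspected and infinities and NaNs are not special-cased, the sketch only makes sense for inputs already known to lie in [0,1].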

Diffstat (limited to 'bench/AndroidCodecBench.h')

0 files changed, 0 insertions, 0 deletions

