]> granicus.if.org Git - libvpx/commit
Remove unnecessary calculation in 4-tap interpolation filter
authorchiyotsai <chiyotsai@google.com>
Sat, 3 Nov 2018 00:08:05 +0000 (17:08 -0700)
committerChi Yo Tsai <chiyotsai@google.com>
Tue, 15 Jan 2019 20:02:19 +0000 (20:02 +0000)
commitc182725cbc9e1e4892784a24c32b1bed80047b0c
tree4395b294c8b94afada7bee4393771ad82e26ebea
parent19882cdbf9518f35eaeccff9702dbb5bab708e06
Remove unnecessary calculation in 4-tap interpolation filter

Reduces the number of rows calculated for 2D 4-tap interpolation filter
from h+7 rows to h+3 rows.
Also fixes a bug in the avx2 function for 4-tap filters where the last
row is computed incorrectly.

Performance:
           | Baseline |  Result  | Pct Gain |
bitdepth lo| 4.00 fps | 4.02 fps |   0.5%   |
bitdepth 10| 1.90 fps | 1.91 fps |   0.5%   |

The performance is evaluated on speed 1 on jets.y4m br 500 over 100
frames.

No BDBR loss is observed.

Change-Id: I90b0d4d697319b7bba599f03c5dc01abd85d13b1
vpx_dsp/x86/convolve.h
vpx_dsp/x86/highbd_convolve_avx2.c
vpx_dsp/x86/vpx_subpixel_4t_intrin_sse2.c
vpx_dsp/x86/vpx_subpixel_8t_intrin_avx2.c
vpx_dsp/x86/vpx_subpixel_8t_intrin_ssse3.c