granicus.if.org Git - libvpx/commit
Optimize 8x8 idct function
author: Yunqing Wang <yunqingwang@google.com>
Fri, 15 Mar 2013 18:33:10 +0000 (11:33 -0700)
committer: Yunqing Wang <yunqingwang@google.com>
Mon, 18 Mar 2013 22:34:14 +0000 (15:34 -0700)
commit: 6344c84c82f6a4f82a6a4f9f33a6d1ec85691930
tree: da2f70ee88c0080ef3c0755db0bee6c6fa8bd7c1
parent: 449f136886e96fcf448bf9b68952977da703c614

Wrote SSE2 versions of vp9_short_idct8x8 and vp9_short_idct10_8x8.
Compared to the C version, the SSE2 version is 2x faster. The decoder
test didn't show a noticeable gain, since the 8x8 idct takes little
of the decoding time (less than 1% in my test).
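
For readers unfamiliar with how an idct stage maps onto SSE2, here is a
minimal sketch of one fixed-point butterfly written both ways. It is not
the code from this commit. The function names (butterfly_c,
butterfly_sse2, butterfly_mismatches) are hypothetical, and the constants
(14 fractional bits, 11585 for cos(pi/4)) are assumptions in the style of
libvpx's DCT code. The SSE2 version interleaves the two inputs so that a
single _mm_madd_epi16 computes a*c + b*c (or a*c - b*c) for four pairs
at once.

```c
#include <assert.h>
#include <emmintrin.h> /* SSE2 intrinsics */
#include <stdint.h>

/* Assumed fixed-point constants, not taken from the commit. */
#define DCT_CONST_BITS 14
#define COSPI_16_64 11585 /* round(cos(pi/4) * 2^14) */

/* Scalar reference: scaled sum and difference, rounded to nearest. */
void butterfly_c(const int16_t *a, const int16_t *b,
                 int16_t *sum, int16_t *diff, int n) {
  const int32_t rnd = 1 << (DCT_CONST_BITS - 1);
  for (int i = 0; i < n; ++i) {
    sum[i] = (int16_t)(((a[i] + b[i]) * COSPI_16_64 + rnd) >> DCT_CONST_BITS);
    diff[i] = (int16_t)(((a[i] - b[i]) * COSPI_16_64 + rnd) >> DCT_CONST_BITS);
  }
}

/* SSE2 version: processes 8 input pairs per loop iteration. */
void butterfly_sse2(const int16_t *a, const int16_t *b,
                    int16_t *sum, int16_t *diff, int n) {
  assert(n % 8 == 0);
  const short c = COSPI_16_64;
  const __m128i k_pp = _mm_set1_epi16(c);                          /* ( c,  c) */
  const __m128i k_pm = _mm_setr_epi16(c, -c, c, -c, c, -c, c, -c); /* ( c, -c) */
  const __m128i rnd = _mm_set1_epi32(1 << (DCT_CONST_BITS - 1));
  for (int i = 0; i < n; i += 8) {
    const __m128i va = _mm_loadu_si128((const __m128i *)(a + i));
    const __m128i vb = _mm_loadu_si128((const __m128i *)(b + i));
    /* Interleave to a0 b0 a1 b1 ... so madd pairs each a with its b. */
    const __m128i lo = _mm_unpacklo_epi16(va, vb);
    const __m128i hi = _mm_unpackhi_epi16(va, vb);
    __m128i s_lo = _mm_madd_epi16(lo, k_pp); /* a*c + b*c, 32-bit */
    __m128i s_hi = _mm_madd_epi16(hi, k_pp);
    __m128i d_lo = _mm_madd_epi16(lo, k_pm); /* a*c - b*c, 32-bit */
    __m128i d_hi = _mm_madd_epi16(hi, k_pm);
    /* Round and shift back down to the 16-bit range. */
    s_lo = _mm_srai_epi32(_mm_add_epi32(s_lo, rnd), DCT_CONST_BITS);
    s_hi = _mm_srai_epi32(_mm_add_epi32(s_hi, rnd), DCT_CONST_BITS);
    d_lo = _mm_srai_epi32(_mm_add_epi32(d_lo, rnd), DCT_CONST_BITS);
    d_hi = _mm_srai_epi32(_mm_add_epi32(d_hi, rnd), DCT_CONST_BITS);
    /* Pack 32-bit lanes to 16-bit (with saturation) and store. */
    _mm_storeu_si128((__m128i *)(sum + i), _mm_packs_epi32(s_lo, s_hi));
    _mm_storeu_si128((__m128i *)(diff + i), _mm_packs_epi32(d_lo, d_hi));
  }
}

/* Returns how many outputs differ between the two versions. */
int butterfly_mismatches(void) {
  int16_t a[16], b[16], s0[16], d0[16], s1[16], d1[16];
  int bad = 0;
  for (int i = 0; i < 16; ++i) {
    a[i] = (int16_t)(i * 37 - 250); /* small values, no 16-bit overflow */
    b[i] = (int16_t)(100 - i * 53);
  }
  butterfly_c(a, b, s0, d0, 16);
  butterfly_sse2(a, b, s1, d1, 16);
  for (int i = 0; i < 16; ++i) bad += (s0[i] != s1[i]) + (d0[i] != d1[i]);
  return bad;
}
```

The sketch assumes an x86 target with SSE2 and a compiler whose right
shift of negative integers is arithmetic. A full 8x8 idct would chain
several such stages with different constants and transpose the block
between the row and column passes.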

Change-Id: I56313e18cd481700b3b52c4eda5ca204ca6365f3
vp9/common/vp9_idct.h
vp9/common/vp9_rtcd_defs.sh
vp9/common/x86/vp9_idct_x86.c
vp9/decoder/vp9_dequantize.c
vp9/encoder/x86/vp9_dct_sse2_intrinsics.c