]> granicus.if.org Git - libx264/commit
x86: SSE2/AVX idct_dequant_2x4_(dc|dconly)
authorHenrik Gramner <henrik@gramner.com>
Sat, 20 Feb 2016 19:31:22 +0000 (20:31 +0100)
committerHenrik Gramner <henrik@gramner.com>
Tue, 12 Apr 2016 15:10:39 +0000 (17:10 +0200)
commit23d1d8e89be2d99f5c6924a6055fc80d69429503
tree08d9c8474ee0d0939091f8b610921b5d8012d8f3
parentdbbf1dd2836a21b65178442c1fb7a00ea089d7ec
x86: SSE2/AVX idct_dequant_2x4_(dc|dconly)

Only used in 4:2:2. Both 8-bit and high bit-depth implemented.

Approximate performance improvement compared to C on Ivy Bridge:

                         x86-32  x86-64
idct_dequant_2x4_dc      2.1x    1.7x
idct_dequant_2x4_dconly  2.7x    2.0x

Helps more on 32-bit due to the C versions being register starved.
common/quant.c
common/x86/quant-a.asm
common/x86/quant.h