]>
granicus.if.org Git - libvpx/log
Jingning Han [Mon, 16 Nov 2015 04:52:51 +0000 (20:52 -0800)]
Limit the reset range of inter_tx_size array
Reset the effective range of inter_tx_size, instead of the entire
array in the rate-distortion optimization loop.
Change-Id: Id453fbd6dddfe69f4e451ba8518c083326d5dd53
Yaowu Xu [Sat, 14 Nov 2015 13:00:06 +0000 (05:00 -0800)]
Merge branch 'master' into nextgenv2
jackychen [Sat, 14 Nov 2015 00:02:43 +0000 (16:02 -0800)]
Enable resize test(down&up) by changing the bitrate.
Change-Id: I5a4f1f7b9de20fbfc28cb743dcd29c0eeca736f8
Debargha Mukherjee [Fri, 13 Nov 2015 23:50:21 +0000 (23:50 +0000)]
Merge "Adding experiment for supertransform" into nextgenv2
Hui Su [Fri, 13 Nov 2015 21:19:27 +0000 (21:19 +0000)]
Merge "refactor ext-intra" into nextgenv2
hui su [Tue, 10 Nov 2015 20:09:59 +0000 (12:09 -0800)]
refactor ext-intra
Coding gain remains about the same, while overall speed is
substantially increased.
Change-Id: I2989bebcfd21092cd6a02653d4df4a3bf6780874
Debargha Mukherjee [Fri, 13 Nov 2015 19:32:17 +0000 (11:32 -0800)]
Adding experiment for supertransform
Change-Id: Ie43027f7d46c43df137fd4a7f731ff6ccb78fcee
Marco [Fri, 13 Nov 2015 16:09:17 +0000 (08:09 -0800)]
Reduce sampling time for noise estimate.
Change-Id: I46abd85e2187b8f4c2846416a23fab26d9b9f67d
Marco [Fri, 13 Nov 2015 14:20:41 +0000 (06:20 -0800)]
Fix resize internal test.
Temporary fix to make sure it always passes.
Change-Id: I56a0529986ad7049b6090f871c14e9e06d573d5f
Marco Paniconi [Fri, 13 Nov 2015 14:09:19 +0000 (14:09 +0000)]
Merge "VP9 noise estimation: add frame level motion metrics and adjust thresholds."
Yaowu Xu [Fri, 13 Nov 2015 13:00:05 +0000 (05:00 -0800)]
Merge branch 'master' into nextgenv2
JackyChen [Fri, 13 Nov 2015 02:36:17 +0000 (18:36 -0800)]
VP9 noise estimation: add frame level motion metrics and adjust thresholds.
Change-Id: Ia1aba00603b32cee6835951d3d8f740937cf20f4
James Zern [Fri, 13 Nov 2015 06:49:00 +0000 (06:49 +0000)]
Merge "libs.mk, testdata: rm redundant test of LIBVPX_TEST_DATA"
James Zern [Fri, 13 Nov 2015 06:29:20 +0000 (06:29 +0000)]
Merge "Add AVX vectorized vp9_diamond_search_sad"
Zoe Liu [Fri, 13 Nov 2015 04:55:54 +0000 (04:55 +0000)]
Merge "Fixed a few sanity checks." into nextgenv2
Marco [Fri, 13 Nov 2015 00:52:45 +0000 (16:52 -0800)]
Adjust variance threshold for 16x16 split at low resolutions.
Change-Id: I635e37f81237e9703d7d9a11ed76a043f4ec6eb0
Marco Paniconi [Fri, 13 Nov 2015 00:13:32 +0000 (00:13 +0000)]
Revert "Update to noise estimation."
This reverts commit
6b79a1e3e0dfc265f43445b8181aa95e1c02deac .
Change-Id: I5a4923ca8a6de842855ce0725e92567ccbed6fb7
Angie Chiang [Thu, 12 Nov 2015 23:38:44 +0000 (23:38 +0000)]
Merge changes Ifafbd497,I042bba27,Id6fd8558,Id5b79519 into nextgenv2
* changes:
Add adst_dct config to vp10_inv_txfm2d_cfg
Add adst_adst config to vp10_inv_txfm2d_cfg
Add dct_adst config to vp10_inv_txfm2d_cfg
Add dct_dct config to vp10_inv_txfm2d_cfg
Marco [Thu, 12 Nov 2015 17:06:12 +0000 (09:06 -0800)]
Update to noise estimation.
Add frame level global check and adjust some parameters.
Change-Id: I42103394f2d329781195d94ce6cbb5b3383eea17
Yaowu Xu [Thu, 12 Nov 2015 13:00:05 +0000 (05:00 -0800)]
Merge branch 'master' into nextgenv2
Marco Paniconi [Thu, 12 Nov 2015 06:08:38 +0000 (06:08 +0000)]
Merge "Non-rd partition: reduce variance threshold low resolutions."
Marco Paniconi [Thu, 12 Nov 2015 06:06:51 +0000 (06:06 +0000)]
Merge "Adjust varianace threshold for high noise condition."
Marco [Thu, 12 Nov 2015 02:21:20 +0000 (18:21 -0800)]
Adjust motion threshold to limit cyclic refresh.
Change-Id: Icfca27a567eb8929c312c6315856ee130d982a04
Marco [Thu, 12 Nov 2015 02:05:11 +0000 (18:05 -0800)]
Adjust varianace threshold for high noise condition.
Change-Id: I91c722e480328ff95b8c57614d8176ccaceb2539
Angie Chiang [Fri, 6 Nov 2015 19:06:04 +0000 (11:06 -0800)]
Add adst_dct config to vp10_inv_txfm2d_cfg
Change-Id: Ifafbd4974be44685ab2550ed159dbf0411b6f031
Angie Chiang [Wed, 28 Oct 2015 21:10:50 +0000 (14:10 -0700)]
Add adst_adst config to vp10_inv_txfm2d_cfg
Change-Id: I042bba27540ab2a3d8a00871980295e98f616480
Angie Chiang [Wed, 28 Oct 2015 21:10:11 +0000 (14:10 -0700)]
Add dct_adst config to vp10_inv_txfm2d_cfg
Change-Id: Id6fd8558452f64c4ac30d7cb656b659f0587b5d6
Angie Chiang [Wed, 28 Oct 2015 21:09:13 +0000 (14:09 -0700)]
Add dct_dct config to vp10_inv_txfm2d_cfg
Change-Id: Id5b795198552443a700413284a1015296e267dcf
Marco Paniconi [Wed, 11 Nov 2015 20:40:29 +0000 (20:40 +0000)]
Merge "vp9 denoiser: Add another noise level to denoising."
James Zern [Wed, 11 Nov 2015 20:39:12 +0000 (20:39 +0000)]
Merge "Revert "VPX: x86 asm version of vpx_idct32x32_34_add()""
Marco [Wed, 11 Nov 2015 18:09:40 +0000 (10:09 -0800)]
vp9 denoiser: Add another noise level to denoising.
Change-Id: Idc755ab54e4f78bb7d75bc97634c451804edad99
James Zern [Wed, 11 Nov 2015 19:11:37 +0000 (11:11 -0800)]
Revert "VPX: x86 asm version of vpx_idct32x32_34_add()"
This reverts commit
9aeaa2016e7470c4e316d90da33d883098eed6f4 .
This causes some test vectors to fail.
Change-Id: I3659a2068404ec5a0591fba5c88b1bec0c9059a4
James Zern [Wed, 11 Nov 2015 19:09:42 +0000 (11:09 -0800)]
Revert "VP9: Only zero counts when !frame_parallel_decoding_mode"
This reverts commit
380a5519cca999f54b17f37b709996a11f3204ea .
This causes an assertion failure in debug_check_frame_counts() which
probably isn't valid with this change; leaving the investigation for
later now.
Change-Id: Ieda5ca811ed2fa50a0cc6935919a8d10dca996e0
Zoe Liu [Tue, 10 Nov 2015 01:21:11 +0000 (17:21 -0800)]
Fixed a few sanity checks.
Change-Id: Ieec4a7be5945dc6de192e2d8292ab978baf47f53
Geza Lore [Wed, 28 Oct 2015 14:35:04 +0000 (14:35 +0000)]
Add AVX vectorized vp9_diamond_search_sad
This function now has an AVX intrinsics version which is about 80%
faster compared to the C implementation. This provides a 2-4% total
speed-up for encode, depending on encoding parameters. The function
utilizes 3 properties of the cost function lookup table, constructed
in 'cal_nmvjointsadcost' and 'cal_nmvsadcosts'.
For the joint cost:
- mvjointsadcost[1] == mvjointsadcost[2] == mvjointsadcost[3]
For the component costs:
- For all i: mvsadcost[0][i] == mvsadcost[1][i]
(equal per component cost)
- For all i: mvsadcost[0][i] == mvsadcost[0][-i]
(Cost function is even)
These must hold, otherwise the AVX version of the function cannot be used.
Change-Id: I6c2791d43022822a9e6ab43cd124a773946d0bdc
Yaowu Xu [Wed, 11 Nov 2015 13:00:06 +0000 (05:00 -0800)]
Merge branch 'master' into nextgenv2
James Zern [Wed, 11 Nov 2015 01:54:51 +0000 (17:54 -0800)]
libs.mk, testdata: rm redundant test of LIBVPX_TEST_DATA
the return value of enabled, which may be empty, is handled by the for
loop. this avoids making an unnecessarily long command line which may
fail in certain cases.
Change-Id: Ib88ecbbe2c0f6d7debb600b4caed4884497263b1
Yaowu Xu [Wed, 11 Nov 2015 01:31:14 +0000 (01:31 +0000)]
Merge "Replace inline with INLINE" into nextgenv2
Yaowu Xu [Wed, 11 Nov 2015 01:31:06 +0000 (01:31 +0000)]
Merge "Fix msvc compling" into nextgenv2
Jingning Han [Wed, 11 Nov 2015 01:25:55 +0000 (01:25 +0000)]
Merge "Fix an encoding failure case when speed features are on" into nextgenv2
Angie Chiang [Wed, 11 Nov 2015 00:28:35 +0000 (00:28 +0000)]
Merge "Add vp10_fwd_txfm2d_test" into nextgenv2
Yaowu Xu [Wed, 11 Nov 2015 00:09:09 +0000 (16:09 -0800)]
Replace inline with INLINE
Change-Id: I37b5ed9fef0e97feabd856bd4c1b4c7869991a34
Yaowu Xu [Wed, 11 Nov 2015 00:08:09 +0000 (16:08 -0800)]
Fix msvc compling
Change-Id: I5abd6d2fd198b3789732e81b23a5bac009af5290
Jingning Han [Wed, 11 Nov 2015 00:02:33 +0000 (16:02 -0800)]
Fix an encoding failure case when speed features are on
This commit fixes an encoding failure case triggered when early
termination feature is turned on for transform block size search.
It resolves the corresponding enc/dec mismatch issue.
Change-Id: I2c5b7d8b1efe25fe3810e6ed307f4b1865dede49
Marco [Tue, 10 Nov 2015 23:41:39 +0000 (15:41 -0800)]
Non-rd partition: reduce variance threshold low resolutions.
Change-Id: I06306905d187948a92f839357df5d21413823808
Marco Paniconi [Tue, 10 Nov 2015 23:10:31 +0000 (23:10 +0000)]
Merge "Add bias to zero/small motion for noisy source."
Yaowu Xu [Tue, 10 Nov 2015 23:08:51 +0000 (23:08 +0000)]
Merge "Get test to build with MSVC" into nextgenv2
James Zern [Tue, 10 Nov 2015 22:35:12 +0000 (22:35 +0000)]
Merge "convolve_copy_sse2: replace SSE w/SSE2 code"
Scott LaVarnway [Tue, 10 Nov 2015 21:40:11 +0000 (21:40 +0000)]
Merge "VPX: x86 asm version of vpx_idct32x32_34_add()"
Scott LaVarnway [Tue, 10 Nov 2015 19:54:56 +0000 (11:54 -0800)]
VPX: x86 asm version of vpx_idct32x32_34_add()
Change-Id: I8a933c63b7fbf3c65e2c06dbdca9646cadd0b7cb
Marco [Mon, 9 Nov 2015 21:36:56 +0000 (13:36 -0800)]
Add bias to zero/small motion for noisy source.
Change is only for real-time mode, speed >= 5, and non-screen content mode.
Add bias to zero/low motion for big blocks, if noise estimation
is enabled and noise level is above threshold.
Change-Id: I3a0a4608ede6aa535bda6eca528d20f8aba738e7
James Zern [Thu, 5 Nov 2015 04:01:34 +0000 (20:01 -0800)]
convolve_copy_sse2: replace SSE w/SSE2 code
this should be neutral or slightly faster on modern (P4+) architectures
Change-Id: Iec4c080275941eb8c9e05a66a2daf0405d86a69b
JackyChen [Tue, 10 Nov 2015 03:18:29 +0000 (19:18 -0800)]
VP9 noise estimate: no noise estimate if frame size change.
Change-Id: I521f7b53c143d562a88fe7de330aa3f0ef09f414
Jacky Chen [Tue, 10 Nov 2015 03:05:30 +0000 (03:05 +0000)]
Merge "VP9: add unit test for realtime external resize."
Johann [Tue, 10 Nov 2015 01:00:37 +0000 (17:00 -0800)]
Merge branch 'javanwhistlingduck'
Change-Id: Ib63fde31ae7b3f71e608830f7433113733b2a275
jackychen [Mon, 9 Nov 2015 22:58:14 +0000 (14:58 -0800)]
VP9: add unit test for realtime external resize.
Change-Id: I9bfa80de73847d9be88b6ce9865d7bb5fafaaa57
Angie Chiang [Wed, 28 Oct 2015 21:01:38 +0000 (14:01 -0700)]
Add vp10_fwd_txfm2d_test
Change-Id: Icbc17403430751d3a841f822a190f0c30450d603
Jacky Chen [Mon, 9 Nov 2015 22:54:53 +0000 (22:54 +0000)]
Merge "VP9 dynamic resize: enable resize unit test(DownUp)."
James Zern [Mon, 9 Nov 2015 22:23:34 +0000 (22:23 +0000)]
Merge "VP9: Only zero counts when !frame_parallel_decoding_mode"
Johann [Wed, 28 Oct 2015 23:03:55 +0000 (16:03 -0700)]
Release v1.5.0
Javan Whistling Duck release.
Change-Id: If44c9ca16a8188b68759325fbacc771365cb4af8
jackychen [Mon, 9 Nov 2015 22:04:58 +0000 (14:04 -0800)]
VP9 dynamic resize: enable resize unit test(DownUp).
The unit test requires a longer clip which is already in the repo.
Change-Id: Ic42e8d83e636fafd20d485a7f5f8422835319245
Marco Paniconi [Mon, 9 Nov 2015 21:11:51 +0000 (21:11 +0000)]
Merge "VP9 dynamic resize: increase waiting time after key frame."
Yaowu Xu [Mon, 9 Nov 2015 20:07:25 +0000 (12:07 -0800)]
Get test to build with MSVC
Added _USE_MATH_DEFINES to make sure M_PI can be refered to definitions
in math.h for MSVC
Change-Id: Idca128910384593a002eb08bae72c739fb998e19
jackychen [Mon, 9 Nov 2015 19:47:26 +0000 (11:47 -0800)]
VP9 dynamic resize: increase waiting time after key frame.
For 1 pass CBR mode: increase waiting time after key frame
before we start sampling rate control behavior for determining
resize. This change need to disable one internal resize(DownUp)
temporally since it requires a longer clip to do so.
Change-Id: If21beda1be23f169ee541ab4dd642f718347887a
Marco Paniconi [Mon, 9 Nov 2015 19:29:35 +0000 (19:29 +0000)]
Merge "Use same bias (against non-zero mv for big blocks) for speed 5."
Alex Converse [Mon, 9 Nov 2015 18:27:40 +0000 (18:27 +0000)]
Merge "Expand unconstrained nodes in pack_mb_tokens and loop on zeros."
Scott LaVarnway [Mon, 9 Nov 2015 18:14:13 +0000 (10:14 -0800)]
VP9: Only zero counts when !frame_parallel_decoding_mode
The counts are never used when frame_parallel_decoding_mode
is set.
Change-Id: Ic7a566a048297f7373c9ffbb48929ea09eff674f
Marco [Mon, 9 Nov 2015 18:03:27 +0000 (10:03 -0800)]
Use same bias (against non-zero mv for big blocks) for speed 5.
Use same setting for speed 5 (as it is for speed > 5).
Change is only for real-time (non-rd) mode.
Change-Id: I830250eac654328373cb318baa89d4f0e63942e1
Yaowu Xu [Mon, 9 Nov 2015 16:58:31 +0000 (08:58 -0800)]
Merge branch 'master' into nextgenv2
Change-Id: I8811bfd8fc132b9f515707e795bb6308e4bf263b
James Zern [Sat, 7 Nov 2015 00:57:06 +0000 (00:57 +0000)]
Merge changes I8c83b86d,Ic53b2ed5,I4acc8a84
* changes:
variance_test: create fn pointers w/'&' ref
sixtap_predict_test: create fn pointers w/'&' ref
sad_test: create fn pointers w/'&' ref
Hui Su [Sat, 7 Nov 2015 00:22:50 +0000 (00:22 +0000)]
Merge "Use accurate bit cost for uv_mode in UV intra mode RD selection"
Alex Converse [Tue, 3 Nov 2015 00:28:10 +0000 (16:28 -0800)]
Expand unconstrained nodes in pack_mb_tokens and loop on zeros.
Reduces Linux perf estimated cycle count for pack_mb_tokens on a
lossless encode on my desktop from
61858501855 to
48154040219 or from
26% of the overall profile to 21%.
Change-Id: I9ca3426d7e3272bc7f7030abda4f0d0cec87fb4a
Debargha Mukherjee [Fri, 6 Nov 2015 23:35:07 +0000 (23:35 +0000)]
Merge "Resolve conficts caused by master branch merging" into nextgenv2
Angie Chiang [Fri, 6 Nov 2015 23:34:56 +0000 (23:34 +0000)]
Merge changes I7ca0cc34,I97189d6e,I4e2b51cf,I21158867,I8d73beee into nextgenv2
* changes:
Add adst_dct config to vp10_fwd_txfm2d_cfg
Add adst_adst config to vp10_fwd_txfm2d_cfg
Add dct_adst config to vp10_fwd_txfm2d_cfg
Add dct_dct config to vp10_fwd_txfm2d_cfg
Add vp10_fwd_txfm2d_8x8/16x16/32x32
Angie Chiang [Fri, 6 Nov 2015 23:34:35 +0000 (23:34 +0000)]
Merge "Add vp10_fwd_txfm2d_4x4" into nextgenv2
hui su [Fri, 6 Nov 2015 21:56:51 +0000 (13:56 -0800)]
Use accurate bit cost for uv_mode in UV intra mode RD selection
On derflr, +0.1% for VP10; however, -0.03% on VP9.
Change-Id: I09c724232ede74254043d61d3cadc506256af0af
James Zern [Fri, 6 Nov 2015 22:37:20 +0000 (22:37 +0000)]
Merge "Revert "Add AVX vectorized vp9_diamond_search_sad""
hui su [Fri, 6 Nov 2015 22:35:08 +0000 (14:35 -0800)]
Resolve conficts caused by master branch merging
Change-Id: I167e241b789331572581fcb0567ebe535b4b9345
James Zern [Fri, 6 Nov 2015 21:03:13 +0000 (13:03 -0800)]
Revert "Add AVX vectorized vp9_diamond_search_sad"
This reverts commit
f1342a7b070ef61b9fbdf03e899ac2107cfcb6bd .
This breaks 32-bit builds:
runtime error: load of misaligned address 0xf72fdd48 for type 'const
__m128i' (vector of 2 'long long' values), which requires 16 byte
alignment
+ _mm_set1_epi64x is incompatible with some versions of visual studio
Change-Id: I6f6fc3c11403344cef78d1c432cdc9147e5c1673
James Zern [Fri, 6 Nov 2015 03:00:45 +0000 (19:00 -0800)]
variance_test: create fn pointers w/'&' ref
this helps some toolchains (vs9) resolve the type of the parameter
Change-Id: I8c83b86da53b1783cd18c0f765b67ba33da91d72
James Zern [Fri, 6 Nov 2015 03:00:45 +0000 (19:00 -0800)]
sixtap_predict_test: create fn pointers w/'&' ref
this helps some toolchains (vs9) resolve the type of the parameter
Change-Id: Ic53b2ed5fbce05c5b5e633b4a4ef9ea75c55360a
Angie Chiang [Wed, 28 Oct 2015 20:57:48 +0000 (13:57 -0700)]
Add adst_dct config to vp10_fwd_txfm2d_cfg
Change-Id: I7ca0cc341ae36ac9f7aa24789f8872161b832b7b
Angie Chiang [Wed, 28 Oct 2015 20:02:39 +0000 (13:02 -0700)]
Add adst_adst config to vp10_fwd_txfm2d_cfg
Change-Id: I97189d6e917929c756a3f89fe0ab66077a0a5436
Angie Chiang [Wed, 28 Oct 2015 20:02:16 +0000 (13:02 -0700)]
Add dct_adst config to vp10_fwd_txfm2d_cfg
Change-Id: I4e2b51cf5b0dedb9ea1106747edb76835804fffc
Angie Chiang [Wed, 28 Oct 2015 20:01:23 +0000 (13:01 -0700)]
Add dct_dct config to vp10_fwd_txfm2d_cfg
Change-Id: I21158867fb2b762d3632d0664ebe70c68d0953e1
Angie Chiang [Wed, 28 Oct 2015 00:15:18 +0000 (17:15 -0700)]
Add vp10_fwd_txfm2d_8x8/16x16/32x32
Change-Id: I8d73beee5a619d26f3f8640a6679150d874522c4
Angie Chiang [Wed, 28 Oct 2015 00:13:33 +0000 (17:13 -0700)]
Add vp10_fwd_txfm2d_4x4
Change-Id: I9bca3b1c76b64575366d71ab65ffef7264ce0c9b
Debargha Mukherjee [Fri, 30 Oct 2015 16:19:36 +0000 (09:19 -0700)]
New interpolation experiment
Adds a new interpolation experiment.
Improves entropy coding to send the filter type only if
the motion vectors have subpel components.
Adds one new 8-tap smooth filter, and tweaks the others.
derflr: +0.695%
hevcmr: +0.305%
About 5% encode slowdown. No visible impact for decoding.
Also makes the interpolation framework flexible to support both
strictly interpolating filters as well as non-interpolating
filters that filter integer offsets. This is mainly for
further experimentation and if not found useful the code will
be removed.
Change-Id: I8db9cde56ca916be771fe54a130d608bf10786e6
Hui Su [Fri, 6 Nov 2015 17:40:49 +0000 (17:40 +0000)]
Merge "ext-intra experiment" into nextgenv2
Marco [Fri, 6 Nov 2015 16:33:41 +0000 (08:33 -0800)]
vp9: Disable noise estimate on resize trigger frame.
Change-Id: I35767a6320943582ee11d737b5f240cea2d01b25
Yaowu Xu [Fri, 6 Nov 2015 13:00:06 +0000 (05:00 -0800)]
Merge branch 'master' into nextgenv2
Debargha Mukherjee [Fri, 6 Nov 2015 09:20:46 +0000 (09:20 +0000)]
Merge "Flip the result of the inverse transform for FLIPADST." into nextgenv2
Debargha Mukherjee [Fri, 6 Nov 2015 08:37:25 +0000 (08:37 +0000)]
Merge "Eliminate copying for FLIPADST in fwd transforms." into nextgenv2
James Zern [Fri, 6 Nov 2015 03:00:45 +0000 (19:00 -0800)]
sad_test: create fn pointers w/'&' ref
this helps some toolchains (vs9) resolve the type of the parameter
Change-Id: I4acc8a844d1e55b766f66482bd6d32998174d70f
Marco Paniconi [Fri, 6 Nov 2015 06:51:11 +0000 (06:51 +0000)]
Merge "vp9: Updates to noise estimation."
Marco [Fri, 6 Nov 2015 00:00:15 +0000 (16:00 -0800)]
vp9: Updates to noise estimation.
Add threshold/condition on spatial_variance and brightness level.
Modification to normalization of block variance.
Change resolution limit below which we disable noise estimation.
Change-Id: If5be08a26ceda351242d8a58d2f0bc88c0a918f0
James Zern [Thu, 5 Nov 2015 23:22:39 +0000 (15:22 -0800)]
vp9_spatial_svc_encoder.sh: fix command line param
-l -> -sl, renamed in:
be3b08d [svc] Temporal svc with two pass rate control
Change-Id: I5a7b179b33d94e20e54825090659156dece928c0
Yunqing Wang [Thu, 5 Nov 2015 20:17:13 +0000 (20:17 +0000)]
Merge "Add AVX vectorized vp9_diamond_search_sad"
Geza Lore [Wed, 28 Oct 2015 14:35:04 +0000 (14:35 +0000)]
Add AVX vectorized vp9_diamond_search_sad
This function now has an AVX intrinsics version which is about 80%
faster compared to the C implementation. This provides a 2-4% total
speed-up for encode, depending on encoding parameters. The function
utilizes 3 properties of the cost function lookup table, constructed
in 'cal_nmvjointsadcost' and 'cal_nmvsadcosts'.
For the joint cost:
- mvjointsadcost[1] == mvjointsadcost[2] == mvjointsadcost[3]
For the component costs:
- For all i: mvsadcost[0][i] == mvsadcost[1][i]
(equal per component cost)
- For all i: mvsadcost[0][i] == mvsadcost[0][-i]
(Cost function is even)
These must hold, otherwise the AVX version of the function cannot be used.
Change-Id: I184055b864c5a2dc37b2d8c5c9012eb801e9daf6
Angie Chiang [Tue, 27 Oct 2015 23:59:02 +0000 (16:59 -0700)]
Add vp10_inv_txfm1d_test
Change-Id: I3b76c0146af7f191cdae31d2b53ab6d51ac791a4