granicus.if.org Git - libvpx/log
Yaowu Xu [Fri, 29 Jul 2016 16:50:33 +0000 (16:50 +0000)]
Merge "More vp8/vp9 clean up" into nextgenv2
Yaowu Xu [Wed, 27 Jul 2016 22:52:42 +0000 (15:52 -0700)]
More vp8/vp9 clean up
Change-Id: I8101de20e873c19d03c7fd2977bc22003e395807
hui su [Thu, 28 Jul 2016 16:54:50 +0000 (09:54 -0700)]
Fix a missing space
Change-Id: I6a9ab351012b731308d6b4fc78c09744c6afb61a
Jingning Han [Thu, 28 Jul 2016 16:39:44 +0000 (16:39 +0000)]
Merge "Separate frame context index for different frame types" into nextgenv2
Yaowu Xu [Mon, 25 Jul 2016 22:39:46 +0000 (15:39 -0700)]
Remove vp8, vp9 folders
Change-Id: I09b8acd22d031ece52e1fee18b998349bf1cf06b
Jingning Han [Wed, 27 Jul 2016 16:12:53 +0000 (09:12 -0700)]
Separate frame context index for different frame types
This commit makes the encoder use a different frame context index
for different frame types. In the baseline setting, it sets the
frame context index of the overlay frame to be different from other
regular inter frames. In the ext-refs setting, it further allows
the backward reference frame to use a different index.
It improves the compression performance for both settings.
Baseline
lowres 0.12%
ext-refs
lowres 0.50%
midres 0.56%
Change-Id: I7c63ddec9fc296c56a86353cf2c661a740b97a97
Hui Su [Tue, 26 Jul 2016 21:53:50 +0000 (21:53 +0000)]
Merge "ext-intra: squeeze the derivative table" into nextgenv2
hui su [Tue, 26 Jul 2016 01:08:08 +0000 (18:08 -0700)]
ext-intra: squeeze the derivative table
Reduce its size from 270x2 to 90.
Change-Id: Icaacc4258e43cdc19c6f06598fee17c3dd06d7e9
Yaowu Xu [Tue, 26 Jul 2016 17:21:57 +0000 (17:21 +0000)]
Merge "Fix compilation error under Clang 4.0." into nextgenv2
Yaowu Xu [Tue, 26 Jul 2016 17:21:46 +0000 (17:21 +0000)]
Merge "MinArfFreqTest: Don't leak video on failure." into nextgenv2
Yaowu Xu [Tue, 26 Jul 2016 17:21:37 +0000 (17:21 +0000)]
Merge "blockiness_test: fix implicit float conversion" into nextgenv2
Yaowu Xu [Tue, 26 Jul 2016 17:21:30 +0000 (17:21 +0000)]
Merge "resize_test: fix implicit float->int conversion" into nextgenv2
Yaowu Xu [Tue, 26 Jul 2016 17:21:22 +0000 (17:21 +0000)]
Merge "Add VPX_SWAP macro" into nextgenv2
Yaowu Xu [Tue, 26 Jul 2016 17:21:05 +0000 (17:21 +0000)]
Merge "Make test encoder test driver less likely to leak on failure." into nextgenv2
Yunqing Wang [Tue, 26 Jul 2016 04:16:21 +0000 (04:16 +0000)]
Merge "Combine vpxcx_set_ref example for VP9 and VP10" into nextgenv2
Ivan Krasin [Mon, 25 Jul 2016 19:11:02 +0000 (12:11 -0700)]
Fix compilation error under Clang 4.0.
The LLVM trunk has reached 4.0 and now __clang_major__ is not enough
to distinguish between old XCode Clang and the new 'real' Clang.
Using __apple_build_version__ makes it possible to draw this distinction.
BUG=chromium:631144
Change-Id: I0b6e46fddfe4f409c7b7e558bda34872e60ee2d9
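A minimal sketch of the distinction described in this commit, assuming only the compiler-predefined macros `__clang__` and `__apple_build_version__`. The `TOOLCHAIN_KIND` macro and the `toolchain_kind()` helper are hypothetical names used for illustration and are not taken from libvpx:

```c
/* Classify the compiler at preprocessing time.
 * Apple's Xcode Clang defines __apple_build_version__ and follows its own
 * version numbering, so __clang_major__ alone cannot tell it apart from
 * upstream LLVM Clang. */
#if defined(__clang__) && defined(__apple_build_version__)
#define TOOLCHAIN_KIND "apple-clang"
#elif defined(__clang__)
#define TOOLCHAIN_KIND "llvm-clang"
#else
#define TOOLCHAIN_KIND "other"
#endif

/* Hypothetical accessor so the result can be inspected at run time. */
static const char *toolchain_kind(void) { return TOOLCHAIN_KIND; }
```

Any version comparison on `__clang_major__` would then be placed inside the branch that matches the toolchain it is meant for.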
Alex Converse [Thu, 21 Jul 2016 18:38:27 +0000 (11:38 -0700)]
MinArfFreqTest: Don't leak video on failure.
Change-Id: I250379f0ac8d4929c9032e7343290e2980fc2e77
James Zern [Fri, 22 Jul 2016 22:08:59 +0000 (15:08 -0700)]
blockiness_test: fix implicit float conversion
float->int as reported by -Wfloat-conversion
Change-Id: Icb0ecb9e2d54edb95813d9f2de34cb6c27b63cbd
(cherry picked from commit 5e2791b54da066cc8543c066813373c9274ff53f)
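The kind of change that quiets -Wfloat-conversion can be sketched as follows. This is an illustration only; `scale_dimension` is a hypothetical helper and does not reproduce the code touched in blockiness_test:

```c
/* Hypothetical helper: scale an integer dimension by a floating-point
 * factor. The explicit cast documents the intended truncation and avoids
 * the implicit double->int conversion that -Wfloat-conversion reports. */
static int scale_dimension(int size, double factor) {
  /* Adding 0.5 rounds to nearest for non-negative values. */
  const double scaled = size * factor + 0.5;
  return (int)scaled;
}
```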
James Zern [Fri, 22 Jul 2016 20:11:07 +0000 (13:11 -0700)]
resize_test: fix implicit float->int conversion
Change-Id: I1efc16fa158740a06da719a1ea90c6dd6a182bb4
(cherry picked from commit 325bdddc38ad15cf7dd2bc618461a13cfb006812)
Yury Gitman [Fri, 15 Jul 2016 17:32:04 +0000 (10:32 -0700)]
Add VPX_SWAP macro
Change-Id: I60e233eddef238ad918183392794084673f27d2d
Alex Converse [Thu, 21 Jul 2016 18:36:41 +0000 (11:36 -0700)]
Make test encoder test driver less likely to leak on failure.
Individual tests still need to be updated.
Change-Id: Ic433d0f742e13560b136f136b72b2a9973970d78
James Zern [Wed, 20 Jul 2016 03:46:39 +0000 (20:46 -0700)]
y4minput.c: correct empty loop formatting
prefer {}s over ';'
Change-Id: I563fc82717e1deb4f42a40e03dca318c6adaa0c1
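The formatting preference stated above can be illustrated with a small sketch. `skip_spaces` is a hypothetical function, not code from y4minput.c:

```c
/* Hypothetical helper: advance past leading spaces.
 * The loop body is intentionally empty; writing it as {} rather than a
 * lone ';' makes that intent visible to the reader. */
static const char *skip_spaces(const char *p) {
  for (; *p == ' '; ++p) {
  }
  return p;
}
```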
James Zern [Mon, 18 Jul 2016 23:30:58 +0000 (16:30 -0700)]
build/make/Makefile: add a 'test_*' default target
allows 'make test_libvpx', etc. some reworking of the makefiles would be
needed to avoid hard coding targets here.
Change-Id: I18982dbf691e7d36ab8bcf5934bab9340687b061
(cherry picked from commit 25085a6ac21fc8e2341b92e2f1f14d5a7fef30c6)
James Zern [Mon, 18 Jul 2016 23:30:40 +0000 (16:30 -0700)]
build/make/Makefile: remove default suffix rules
Change-Id: I15c8976c6478bf75ec617398f49461b310ab7569
Yaowu Xu [Tue, 26 Jul 2016 04:02:36 +0000 (04:02 +0000)]
Merge "take II: variance_test partial clean-up" into nextgenv2
skal [Thu, 14 Jul 2016 00:33:22 +0000 (17:33 -0700)]
take II: variance_test partial clean-up
remove some (but not all yet!) tuple mis-use, and revamp the code a lot.
Factorize some common chores into MainTestClass.
Change-Id: Id37b7330eebe80d19b9d12a454f24ff9be6b1116
Yunqing Wang [Sat, 23 Jul 2016 00:14:22 +0000 (17:14 -0700)]
Combine vpxcx_set_ref example for VP9 and VP10
Modified the example so that the test script worked correctly. Also
included minor fixes.
Change-Id: If97525dd9b85004a07e91c384132eadae35cdbf8
Yaowu Xu [Mon, 25 Jul 2016 20:48:52 +0000 (20:48 +0000)]
Merge "inline->INLINE: vpx_dsp/x86/blend_a64_vmask_sse4.c" into nextgenv2
Yunqing Wang [Mon, 25 Jul 2016 18:53:27 +0000 (18:53 +0000)]
Merge "Adjust adaptive_rd_thresh speed feature" into nextgenv2
Yaowu Xu [Mon, 25 Jul 2016 17:43:12 +0000 (10:43 -0700)]
inline->INLINE: vpx_dsp/x86/blend_a64_vmask_sse4.c
This fixes the build using MSVC.
Change-Id: I7788e28fd4f0c6ff3d405c4b4a2ff59eda3ba8b6
James Zern [Fri, 22 Jul 2016 20:23:10 +0000 (13:23 -0700)]
vp10/ratectrl.c: fix implicit float conversion
float->int as reported by -Wfloat-conversion
Change-Id: I3c55990821d922bda7a7600c00ae8d5dcc3cee94
James Zern [Fri, 22 Jul 2016 20:21:55 +0000 (13:21 -0700)]
rd_pick_palette_intra_sbuv: fix implicit float conv
float->int as reported by -Wfloat-conversion
Change-Id: I9e3d6ce9dbb0689f214afc8d5950f209275e883d
Sarah Parker [Fri, 22 Jul 2016 18:51:15 +0000 (11:51 -0700)]
Fix compiler errors in warped_motion.c
A few functions and variables were named incorrectly
Change-Id: Ib32fa459c51b9e9aad8bb107e1b689a96d98b368
James Zern [Fri, 22 Jul 2016 22:24:58 +0000 (22:24 +0000)]
Merge "Restore vp10_default_scan_orders[]" into nextgenv2
Yunqing Wang [Thu, 21 Jul 2016 16:00:36 +0000 (09:00 -0700)]
Adjust adaptive_rd_thresh speed feature
Set adaptive_rd_thresh to 0 at speed 0. This allows a thorough mode
search, and eliminates a blocking artifact seen in an encoder test.
Borg test:
1. lowres
Overall PSNR: -0.135%; SSIM: -0.293%;
2. hdres
Overall PSNR: -0.122%; SSIM: -0.208%;
Encoder speed tests: 2% - 6% slower.
Change-Id: Ie7601cb8824df8f6f2ae0b2942bd938600f76990
Sarah Parker [Fri, 22 Jul 2016 18:29:31 +0000 (11:29 -0700)]
Restore vp10_default_scan_orders[]
vp10_default_scan_orders was removed in:
e5848de Rectangular transforms 4x8 & 8x4
This fixes a compiler error in vp10_quantize_test.cc
Change-Id: I1b8a637e011f9426c3b41e61e00e3babc80defba
James Zern [Thu, 14 Jul 2016 22:18:30 +0000 (15:18 -0700)]
add .clang-format, based on Google style
derived from clang-format 3.7.1; same as used in libaom
Change-Id: I8ea915a41d1f2ea3b0d4e4dab9ebc808e9116f11
Yaowu Xu [Fri, 22 Jul 2016 17:04:06 +0000 (10:04 -0700)]
Add and remove explicit type conversions
Change-Id: I8b791fda7c64a0363549add99dc9fcae3b29beae
Yaowu Xu [Fri, 22 Jul 2016 16:30:52 +0000 (09:30 -0700)]
Fix compiling issues
Change-Id: I530348b12a1c039842ce4e33d21046fe63878f19
Sarah Parker [Thu, 21 Jul 2016 23:17:33 +0000 (23:17 +0000)]
Merge "Add affine model to global motion" into nextgenv2
Debargha Mukherjee [Thu, 21 Jul 2016 21:54:12 +0000 (21:54 +0000)]
Merge "Rectangular transforms 4x8 & 8x4" into nextgenv2
Sarah Parker [Thu, 21 Jul 2016 00:11:39 +0000 (17:11 -0700)]
Add affine model to global motion
Change-Id: I9cd355a3ea344ef66a61028efa25d94f54e7e2bd
Debargha Mukherjee [Fri, 1 Jul 2016 19:57:14 +0000 (12:57 -0700)]
Rectangular transforms 4x8 & 8x4
Added a new expt rect-tx to be used in conjunction with ext-tx.
[rect-tx is a temporary config flag and will eventually be
merged into ext-tx once it works correctly with all other
experiments].
Added 4x8 and 8x4 transforms for use initially with rectangular
sub8x8 y blocks as part of this experiment.
There is about a -0.2% BDRATE improvement on lowres, others pending.
When var-tx is on, rectangular transforms are currently not used.
That will be enabled in a subsequent patch.
Change-Id: Iaf3f88ede2740ffe6a0ffb1ef5fc01a16cd0283a
Yaowu Xu [Thu, 21 Jul 2016 02:51:27 +0000 (02:51 +0000)]
Merge "Cleanup x86inc leftovers" into nextgenv2
Yi Luo [Wed, 20 Jul 2016 22:48:52 +0000 (22:48 +0000)]
Merge "HBD fast path quantization speed improvement" into nextgenv2
Yi Luo [Fri, 15 Jul 2016 23:41:30 +0000 (16:41 -0700)]
HBD fast path quantization speed improvement
- HBD encoder speed improvement (SSE4.1):
Enable CONFIG_VP9_HIGHBITDEPTH, on Xeon E5-2680,
50 frames, park_joy_1080p, 12-bit,
Encoding time reduces from 4846481 to 4177471 (ms)
- Add unit test to verify bit-exact and EOB calculation
Change-Id: I08e8ef3549ddad5ab36d86e78557df3b288537ea
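To put the quoted encoding times in relative terms, a small helper can compute the percentage reduction. `percent_reduction` is a hypothetical name introduced here for illustration:

```c
/* Hypothetical helper: relative reduction from `before` to `after`,
 * expressed as a percentage of `before`. */
static double percent_reduction(double before, double after) {
  return 100.0 * (before - after) / before;
}
```

Applied to the two times reported in this commit, `percent_reduction(4846481.0, 4177471.0)` comes to roughly 13.8%.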
Yaowu Xu [Wed, 20 Jul 2016 18:57:29 +0000 (11:57 -0700)]
Cleanup x86inc leftovers
Change-Id: I732d6942d56042a79c4259f775a6045fa95fbeff
Pascal Massimino [Wed, 20 Jul 2016 13:27:36 +0000 (06:27 -0700)]
make signatures match for vp10_init_plane_quantizers
Change-Id: I1efbc91d0ca9183fe34692315307c00c4b346e73
Sarah Parker [Tue, 19 Jul 2016 21:27:58 +0000 (21:27 +0000)]
Merge "Change order of warped motion parameters" into nextgenv2
Zoe Liu [Tue, 19 Jul 2016 20:55:14 +0000 (13:55 -0700)]
A small refactor on the rate controller
Change-Id: Ie39e16de2457dd201121c62967e4ddaf5a05c33a
James Zern [Tue, 19 Jul 2016 19:53:08 +0000 (19:53 +0000)]
Merge "fix vp10_convolve() signatures" into nextgenv2
sarahparker [Mon, 18 Jul 2016 23:40:06 +0000 (16:40 -0700)]
Change order of warped motion parameters
This makes it easier to interface between global motion and warped motion
Change-Id: I850e0a383969a1973f03fb207f100713cda6bb51
Hui Su [Tue, 19 Jul 2016 16:55:11 +0000 (16:55 +0000)]
Merge "Extra round of subpel MV search around second best full-pixel MV" into nextgenv2
Wei-ting Lin [Tue, 19 Jul 2016 04:28:07 +0000 (04:28 +0000)]
Merge "Allow OVERLAY frames to use the show_existing_frame flag" into nextgenv2
Sarah Parker [Mon, 11 Jul 2016 18:47:55 +0000 (11:47 -0700)]
Add global motion parameters to compressed header
Currently nothing is implemented to compute GM parameters; this
just adds the capability to send them in the bitstream if they
were computed. Still need to implement the reconstruction
based on the parameters in reconinter.
Change-Id: I72aea3c6a9de9f5a40f96da76c82b54a52781fe2
Wei-ting Lin [Tue, 12 Jul 2016 20:19:45 +0000 (13:19 -0700)]
Allow OVERLAY frames to use the show_existing_frame flag
An ARF with a zero-strength temporal filter can be reused by setting
show_existing_frame = 1, and in this case, there is no need to
refresh the reference frame buffer. However, we used the flag
"refresh_golden_frame" as the identifier for the starting point of a gf
group.
A new flag, "is_arf_filter_off", is used to record whether the filter with
strength zero is used.
Change-Id: I25971a760f6e1638d5147fe30488c48125512b1a
Yaowu Xu [Mon, 18 Jul 2016 22:43:40 +0000 (22:43 +0000)]
Merge "Merge changes from libvpx/master by cherry-pick" into nextgenv2
Sarah Parker [Mon, 18 Jul 2016 22:40:37 +0000 (22:40 +0000)]
Merge "Add buf0, width, height fields to buf_2d" into nextgenv2
Jingning Han [Mon, 18 Jul 2016 21:54:05 +0000 (21:54 +0000)]
Merge "Align the quantizers for inter/intra modes in the first pass coding" into nextgenv2
hui su [Mon, 11 Jul 2016 17:46:17 +0000 (10:46 -0700)]
Extra round of subpel MV search around second best full-pixel MV
Keep track of the best and second best full pixel motion vector
candidates, and do subpel search around both of them.
Compression improvement:
lowres 0.22% midres 0.23% hdres 0.18%
No noticeable encoding speed changes observed on lowres test clips.
Change-Id: I5f4df2a03d1db061cfdfdba6138b27e9ea91f089
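The bookkeeping described in this commit, keeping the best and second-best full-pixel candidates so that both can seed a subpel search, might look like the sketch below. All type and function names are hypothetical and do not come from the libvpx sources:

```c
#include <limits.h>

typedef struct {
  int row, col;
} MvCandidate; /* hypothetical type, not the libvpx MV struct */

typedef struct {
  MvCandidate best, second;
  unsigned int best_cost, second_cost;
} TopTwo;

/* Start with both slots empty (cost = UINT_MAX). */
static void top_two_init(TopTwo *t) {
  t->best.row = t->best.col = 0;
  t->second = t->best;
  t->best_cost = t->second_cost = UINT_MAX;
}

/* Offer one full-pixel candidate; keep the two lowest-cost ones. */
static void top_two_update(TopTwo *t, MvCandidate mv, unsigned int cost) {
  if (cost < t->best_cost) {
    /* The previous best is demoted to second best. */
    t->second = t->best;
    t->second_cost = t->best_cost;
    t->best = mv;
    t->best_cost = cost;
  } else if (cost < t->second_cost) {
    t->second = mv;
    t->second_cost = cost;
  }
}
```

After the full-pixel pass, a subpel refinement would be run once around `best` and once around `second`, keeping whichever refined result costs less.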
Zoe Liu [Mon, 18 Jul 2016 18:28:31 +0000 (11:28 -0700)]
Correct the experiment names for ext-refs
Change-Id: I83a2b22d12e4573453e2ad866c7ceb430ff062c6
Sarah Parker [Thu, 14 Jul 2016 23:50:00 +0000 (16:50 -0700)]
Add buf0, width, height fields to buf_2d
These are needed for the warping function in the global motion
experiment.
Change-Id: Iaab176d0c0b90f6b938e2bac48b24c07e87e3cd9
Johann [Wed, 22 Jun 2016 23:08:10 +0000 (16:08 -0700)]
Merge changes from libvpx/master by cherry-pick
This commit brings all up-to-date changes from master that are
applicable to nextgenv2. Due to the removal of VP10 code in master,
we had to cherry pick the following commits to get those changes:
Add default flags for arm64/armv8 builds
Allows building simple targets with sane default flags.
For example, using the Android arm64 toolchain from the NDK:
https://developer.android.com/ndk/guides/standalone_toolchain.html
./build/tools/make-standalone-toolchain.sh --arch=arm64 \
--platform=android-24 --install-dir=/tmp/arm64
CROSS=/tmp/arm64/bin/aarch64-linux-android- \
~/libvpx/configure --target=arm64-linux-gcc --disable-multithread
BUG=webm:1143
vpx_lpf_horizontal_4_sse2: Remove dead load.
Change-Id: I51026c52baa1f0881fcd5b68e1fdf08a2dc0916e
Fail early when android target does not include --sdk-path
Change-Id: I07e7e63476a2e32e3aae123abdee8b7bbbdc6a8c
configure: clean up var style and set_all usage
Use quotes whenever possible and {} always for variables.
Replace multiple set_all calls with *able_feature().
Conflicts:
build/make/configure.sh
vp9-svc: Remove some unneeded code/comment.
datarate_test,DatarateTestLarge: normalize bits type
quiets a msvc warning:
conversion from 'const int64_t' to 'size_t', possible loss of data
mips added p6600 cpu support
Removed -funroll-loops
psnr.c: use int64_t for sum of differences
Since the values can be negative.
*.asm: normalize label format
add a trailing ':', though it's optional with the tools we support, it's
more common to use it to mark a label. this also quiets the
orphan-labels warning with nasm/yasm.
BUG=b/29583530
Prevent negative variance
Due to rounding, hbd variance may become negative. This commit adds a
check and clamps negative values to 0.
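A minimal sketch of such a clamp, assuming the usual form variance = sse - sum^2 / N, where rounded inputs can push the difference below zero. `clamped_variance` and its parameters are hypothetical names, not the libvpx implementation:

```c
#include <stdint.h>

/* Hypothetical helper: variance from a (possibly rounded) sum of squared
 * errors and sum of differences over num_pixels samples. */
static uint32_t clamped_variance(uint64_t sse, int64_t sum, int num_pixels) {
  /* sum * sum is non-negative, so only the subtraction can go negative. */
  const int64_t var = (int64_t)sse - (sum * sum) / num_pixels;
  return var >= 0 ? (uint32_t)var : 0; /* clamp negative results to 0 */
}
```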
configure: remove old visual studio support (<2010)
BUG=b/29583530
Conflicts:
configure
configure: restore vs_version variable
inadvertently lost in the final patchset of:
078dff7 configure: remove old visual studio support (<2010)
this prevents an empty CONFIG_VS_VERSION and avoids make failure
Require x86inc.asm
Force enable x86inc.asm when building for x86. Previously there were
compatibility issues so a flag was added to simplify disabling this
code.
The known issues have been resolved and x86inc.asm is the preferred
abstraction layer (over x86_abi_support.asm).
BUG=b:29583530
convolve_test: fix byte offsets in hbd build
CONVERT_TO_BYTEPTR(x) was corrected in:
003a9d2 Port metric computation changes from nextgenv2
to use the more common (x) within the expansion. offsets should occur
after converting the pointer to the desired type.
+ factorized some common expressions
Conflicts:
test/convolve_test.cc
vpx_dsp: remove x86inc.asm distinction
BUG=b:29583530
Conflicts:
vpx_dsp/vpx_dsp.mk
vpx_dsp/vpx_dsp_rtcd_defs.pl
vpx_dsp/x86/highbd_variance_sse2.c
vpx_dsp/x86/variance_sse2.c
test: remove x86inc.asm distinction
BUG=b:29583530
Conflicts:
test/vp9_subtract_test.cc
configure: remove x86inc.asm distinction
BUG=b:29583530
Change-Id: I59a1192142e89a6a36b906f65a491a734e603617
Update vpx subpixel 1d filter ssse3 asm
Speed tests show that the new vertical filters degrade performance on a
Celeron Chromebook. Added "X86_SUBPIX_VFILTER_PREFER_SLOW_CELERON" to
control which vertical filter code is activated. For now, simply activate
the code that does not degrade on Celeron. Later there should be 2 sets of
vertical filter ssse3 functions, with a jump table choosing between them
based on CPU type.
improve vpx_filter_block1d* by replacing paddsw+psrlw with pmulhrsw
Make set_reference control API work in VP9
Moved the API patch from NextGenv2. An example was included.
To try it, for example, run the following command:
$ examples/vpx_cx_set_ref vp9 352 288 in.yuv out.ivf 4 30
Conflicts:
examples.mk
examples/vpx_cx_set_ref.c
test/cx_set_ref.sh
vp9/decoder/vp9_decoder.c
deblock filter : moved from vp8 code branch
The deblocking filters used in vp8 have been moved to vpx_dsp for
use by both vp8 and vp9.
vpx_thread.[hc]: update webp source reference
+ drop the blob hash, the updated reference will be updated in the
commit message
BUG=b/29583578
vpx_thread: use native windows cond var if available
BUG=b/29583578
original webp change:
commit 110ad5835ecd66995d0e7f66dca1b90dea595f5a
Author: James Zern <jzern@google.com>
Date: Mon Nov 23 19:49:58 2015 -0800
thread: use native windows cond var if available
Vista / Server 2008 and up. no speed difference observed.
100644 blob 4fc372b7bc6980a9ed3618c8cce5b67ed7b0f412 src/utils/thread.c
100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h
vpx_thread: use InitializeCriticalSectionEx if available
BUG=b/29583578
original webp change:
commit 63fadc9ffacc77d4617526a50c696d21d558a70b
Author: James Zern <jzern@google.com>
Date: Mon Nov 23 20:38:46 2015 -0800
thread: use InitializeCriticalSectionEx if available
Windows Vista / Server 2008 and up
100644 blob f84207d89b3a6bb98bfe8f3fa55cad72dfd061ff src/utils/thread.c
100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h
vpx_thread: use WaitForSingleObjectEx if available
BUG=b/29583578
original webp change:
commit 0fd0e12bfe83f16ce4f1c038b251ccbc13c62ac2
Author: James Zern <jzern@google.com>
Date: Mon Nov 23 20:40:26 2015 -0800
thread: use WaitForSingleObjectEx if available
Windows XP and up
100644 blob d58f74e5523dbc985fc531cf5f0833f1e9157cf0 src/utils/thread.c
100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h
vpx_thread: use CreateThread for windows phone
BUG=b/29583578
original webp change:
commit d2afe974f9d751de144ef09d31255aea13b442c0
Author: James Zern <jzern@google.com>
Date: Mon Nov 23 20:41:26 2015 -0800
thread: use CreateThread for windows phone
_beginthreadex is unavailable for winrt/uwp
Change-Id: Ie7412a568278ac67f0047f1764e2521193d74d4d
100644 blob 93f7622797f05f6acc1126e8296c481d276e4047 src/utils/thread.c
100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h
vp9_postproc.c missing extern.
BUG=webm:1256
deblock: missing const on extern const.
postproc - move filling of noise buffer to vpx_dsp.
Fix encoder crashes for odd size input
clean-up vp9_intrapred_test
remove tuple and overkill VP9IntraPredBase class.
postproc: noise style fixes.
gtest-all.cc: quiet an unused variable warning
under windows / mingw builds
vp9_intrapred_test: follow-up cleanup
address a few comments from ce050afaf3e288895c3bee4160336e2d2133b6ea
Change-Id: I3eece7efa9335f4210303993ef6c1857ad5c29c8
Jingning Han [Mon, 18 Jul 2016 17:14:13 +0000 (10:14 -0700)]
Align the quantizers for inter/intra modes in the first pass coding
Use regular extended zero bin quantizer for both inter and intra
modes in the first pass. This doesn't affect lowres and midres
significantly, but would bring back 0.9% coding gains for hdres.
Change-Id: Ifa5977fa7b141fc5be595c0f3a4fc81a93f6606f
skal [Mon, 18 Jul 2016 15:32:43 +0000 (08:32 -0700)]
fix vp10_convolve() signatures
fortunately, the call site was calling the function with
the correct parameter order.
Change-Id: Ia48099c18288a2416c8b9a7062d2b8d417fd07df
Yaowu Xu [Fri, 15 Jul 2016 04:45:52 +0000 (04:45 +0000)]
Merge "Merge branch 'master' into nextgenv2" into nextgenv2
Yaowu Xu [Thu, 14 Jul 2016 17:57:35 +0000 (10:57 -0700)]
Merge branch 'master' into nextgenv2
Change-Id: Ia3c0f2103fd997613d9f16156795028f89f63265
Sarah Parker [Thu, 14 Jul 2016 22:15:32 +0000 (22:15 +0000)]
Merge "Add new_quant quantization in rdopt for 4x4 blocks and intra" into nextgenv2
Debargha Mukherjee [Thu, 14 Jul 2016 19:28:21 +0000 (12:28 -0700)]
Remove unused zcoeff_blk
from PICK_MODE_CONTEXT and MACROBLOCK
Change-Id: I42f98ce51871948244bdcaaaeb3d0191622116ae
Pascal Massimino [Thu, 14 Jul 2016 18:59:37 +0000 (18:59 +0000)]
Merge "Fix highbd obmc_variance unit test" into nextgenv2
Sarah Parker [Tue, 12 Jul 2016 21:16:00 +0000 (14:16 -0700)]
Add new_quant quantization in rdopt for 4x4 blocks and intra
Originally the uniform quantization function was not being
replaced with the new_quant version in rdopt when new_quant
is turned on. This fixes the bug.
Change-Id: I593793bb909e1e1a6f89544eeca6783fe0576f25
Jingning Han [Thu, 14 Jul 2016 17:11:32 +0000 (10:11 -0700)]
Fix highbd obmc_variance unit test
Fix the compiling errors in highbd obmc_variance unit test.
Change-Id: Id1bdfd50aeaff996e54067d5e9b369a5fd2d87a8
Hui Su [Thu, 14 Jul 2016 00:13:46 +0000 (00:13 +0000)]
Merge "Refactor codes about motion search" into nextgenv2
Jingning Han [Wed, 13 Jul 2016 21:35:29 +0000 (21:35 +0000)]
Merge "Fix highbd inter prediction filter sse4 overwriting issue" into nextgenv2
Jingning Han [Wed, 13 Jul 2016 19:46:59 +0000 (12:46 -0700)]
Fix highbd inter prediction filter sse4 overwriting issue
Properly handle the case where the height is an integer multiple
of 4.
Change-Id: I11ac188c13f78db20902e2e333c60ce76ce837c5
Yue Chen [Wed, 13 Jul 2016 18:40:48 +0000 (18:40 +0000)]
Merge "Optimize and cleanup obmc predictor and rd search." into nextgenv2
hui su [Wed, 6 Jul 2016 17:37:18 +0000 (10:37 -0700)]
Refactor codes about motion search
1. Add "best_mv" in MACROBLOCK to store the best motion vector
during motion search, so that we don't need to pass its pointer
to various motion search functions.
2. Declare some functions as static when possible.
3. Fix some indents.
Change-Id: I0778146c0866cbc55e245988c59222577ea8260e
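Point 1 of this refactor, storing the result in the block context rather than threading an output pointer through every search function, can be sketched as follows. `SearchContext`, `Mv`, `toy_cost` and `full_pixel_search` are hypothetical stand-ins and do not mirror the real MACROBLOCK layout or search code:

```c
typedef struct {
  int row, col;
} Mv; /* hypothetical motion vector type */

typedef struct {
  Mv best_mv; /* result slot shared by all search stages */
} SearchContext; /* hypothetical stand-in for MACROBLOCK */

static int abs_diff(int a, int b) { return a > b ? a - b : b - a; }

/* Toy cost: L1 distance to a known target, standing in for a real SAD. */
static int toy_cost(Mv mv, Mv target) {
  return abs_diff(mv.row, target.row) + abs_diff(mv.col, target.col);
}

/* Exhaustive full-pixel search that writes its result into the context
 * instead of returning it through an extra output pointer. */
static void full_pixel_search(SearchContext *x, Mv target, int range) {
  Mv mv;
  int best_cost = -1; /* -1 means no candidate has been evaluated yet */
  for (mv.row = -range; mv.row <= range; ++mv.row) {
    for (mv.col = -range; mv.col <= range; ++mv.col) {
      const int cost = toy_cost(mv, target);
      if (best_cost < 0 || cost < best_cost) {
        best_cost = cost;
        x->best_mv = mv;
      }
    }
  }
}
```

A later stage, such as a subpel refinement, can then read `x->best_mv` directly without any change to its parameter list.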
Geza Lore [Fri, 8 Jul 2016 12:53:39 +0000 (13:53 +0100)]
Optimize and cleanup obmc predictor and rd search.
Use vpx_blend_a64_hmask and vpx_blend_a64_vmask to speed up
computing the obmc predictor. Clean up calc_target_weighted_pred.
Encoder speedup: 1.3%
Decoder speedup: 6.5%
Change-Id: I0c774fe53d22399e92a10d1daf3af0010d88d2c5
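As a rough illustration of what a horizontal-mask blend computes, the sketch below assumes that "a64" denotes alpha weights in the range 0..64 combined with a rounding shift by 6 bits. That reading, and the names `blend_a64` and `blend_hmask`, are assumptions made here; the actual vpx_blend_a64_hmask signature and internals are not shown in this log:

```c
#include <stdint.h>

#define BLEND_A64_MAX_ALPHA 64 /* assumed alpha range: 0..64 */
#define BLEND_A64_ROUND_BITS 6

/* Weighted average of v0 and v1 with rounding; alpha = 64 selects v0. */
static uint8_t blend_a64(int alpha, int v0, int v1) {
  return (uint8_t)((alpha * v0 + (BLEND_A64_MAX_ALPHA - alpha) * v1 +
                    (1 << (BLEND_A64_ROUND_BITS - 1))) >>
                   BLEND_A64_ROUND_BITS);
}

/* Horizontal mask: one alpha per column, shared by every row. A vertical
 * mask would instead index the mask by row. */
static void blend_hmask(uint8_t *dst, int dst_stride, const uint8_t *src0,
                        int src0_stride, const uint8_t *src1,
                        int src1_stride, const uint8_t *mask, int w, int h) {
  int i, j;
  for (i = 0; i < h; ++i) {
    for (j = 0; j < w; ++j) {
      dst[i * dst_stride + j] = blend_a64(mask[j], src0[i * src0_stride + j],
                                          src1[i * src1_stride + j]);
    }
  }
}
```

Because the mask is one-dimensional, it can be loaded once and reused across rows (or columns), which is one plausible source of the speedup reported above.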
Geza Lore [Tue, 12 Jul 2016 10:41:54 +0000 (11:41 +0100)]
Add SSE4.1 vpx_obmc_variance* implementations and cosmetics
Speedup for these functions: 4x
Also include some cosmetic changes to SAD functions
Change-Id: I344c32c795492507ae08742f52d035a13f583799
Pascal Massimino [Wed, 13 Jul 2016 03:09:52 +0000 (03:09 +0000)]
Merge "Clean up FunctionEquivalenceTest." into nextgenv2
Geza Lore [Tue, 12 Jul 2016 14:26:36 +0000 (15:26 +0100)]
Clean up FunctionEquivalenceTest.
remove use of tuple in favor of struct.
Change-Id: If3b1aa5c2fc3cfe1446fff7a8fd270f2ca85fedf
Aamir Anis [Tue, 12 Jul 2016 23:56:37 +0000 (23:56 +0000)]
Merge "Fix for loop filter selection procedure" into nextgenv2
Aamir Anis [Fri, 8 Jul 2016 21:36:36 +0000 (14:36 -0700)]
Fix for loop filter selection procedure
Fixed the best error reported by loop filter selection; this value is used
during loop restoration to pick the best mode. Baseline remains unchanged;
change in BDRate for loop restoration experiment:
-0.628 -> -0.625 for lowres,
-1.262 -> -1.283 for highres.
Change-Id: I69ef1608bc232b250ac46f59e31fdbed1a999dcd
Yi Luo [Tue, 12 Jul 2016 19:28:48 +0000 (19:28 +0000)]
Merge "HBD convolution filtering (10/12 taps) SSE4.1 optimization" into nextgenv2
Yi Luo [Fri, 8 Jul 2016 22:41:59 +0000 (15:41 -0700)]
HBD convolution filtering (10/12 taps) SSE4.1 optimization
- For experiment EXT_INTERP under high bit depth.
- Add unit test to verify bit-exact.
- Speed performance improvement:
On Xeon E5-2680, park_joy_1080p_12.y4m, 50 frames, encoding time
drops from 6682503 ms to 5390270 ms.
Change-Id: Iea4debf5414f3accf1eb5672abeab56a0539ac77
Geza Lore [Tue, 12 Jul 2016 12:20:04 +0000 (13:20 +0100)]
Cleanup obmc_sad function prototypes.
Name 'wsrc', 'mask' and 'pre' explicitly, rather than
using 'b', 'm' and 'a'.
Change-Id: Iaee6d1ac1211b0b05b47cf98b50570089b12d600
James Zern [Tue, 12 Jul 2016 02:39:24 +0000 (02:39 +0000)]
Merge changes from topic 'missing-proto' into nextgenv2
* changes:
vp10/encoder/rdopt.c: make a function static
vp10/encoder/rd.c: make a function static
vp10_convolve_ssse3.c: make some functions static
vp10/encoder/bitstream.[hc]: correct a prototype
vp10/common/idct.h: add some missing prototypes
highbd_quantize_intrin_sse2.c: add missing rtcd include
vp10: add some missing includes
Yue Chen [Tue, 12 Jul 2016 01:21:32 +0000 (01:21 +0000)]
Merge "Cosmetics for vp10/common/vp10_rtcd_defs.pl" into nextgenv2
James Zern [Sat, 2 Jul 2016 00:50:06 +0000 (17:50 -0700)]
vp10/encoder/rdopt.c: make a function static
+ remove vp10_ prefix
quiets a -Wmissing-prototypes warning
BUG=b/29584271
Change-Id: I8821c38009b90296280f9b14233e73c92076e81f
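The pattern behind this fix can be sketched as follows: a helper used in only one file is made static, while a function that must stay externally visible gets a prototype before its definition. `clamp_qindex` and `codec_pick_qindex` are hypothetical names, not functions from rdopt.c:

```c
/* File-local helper: static gives it internal linkage, so no separate
 * prototype is required and -Wmissing-prototypes stays quiet. */
static int clamp_qindex(int q) {
  if (q < 0) return 0;
  if (q > 255) return 255;
  return q;
}

/* Externally visible function: a prototype (normally placed in a header)
 * is declared before the definition. */
int codec_pick_qindex(int q);

int codec_pick_qindex(int q) { return clamp_qindex(q); }
```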
James Zern [Sat, 2 Jul 2016 00:48:43 +0000 (17:48 -0700)]
vp10/encoder/rd.c: make a function static
+ remove vp10_ prefix
quiets a -Wmissing-prototypes warning
BUG=b/29584271
Change-Id: I6b5d71f8120a6d1fee4c782beb4c6d6eef980f65
James Zern [Sat, 2 Jul 2016 00:43:20 +0000 (17:43 -0700)]
vp10_convolve_ssse3.c: make some functions static
quiets -Wmissing-prototypes warnings
BUG=b/29584271
Change-Id: I4d2eb7f4b45d7b829421976641b3212bcf29e7dd
James Zern [Sat, 2 Jul 2016 00:47:11 +0000 (17:47 -0700)]
vp10/encoder/bitstream.[hc]: correct a prototype
quiets a -Wmissing-prototypes warning
BUG=b/29584271
Change-Id: I91aba2a75dccd6752bdf91837564c2aa45817c09
James Zern [Sat, 2 Jul 2016 00:38:47 +0000 (17:38 -0700)]
vp10/common/idct.h: add some missing prototypes
quiets the warning of the same name
BUG=b/29584271
Change-Id: I220cd58e1060f77e3910472fed1b167add3a08f8
James Zern [Sat, 2 Jul 2016 00:45:46 +0000 (17:45 -0700)]
highbd_quantize_intrin_sse2.c: add missing rtcd include
quiets -Wmissing-prototypes warnings
BUG=b/29584271
Change-Id: Iff5214df0d1781810afbfc20bfaf664f109e2f29
James Zern [Sat, 2 Jul 2016 00:41:26 +0000 (17:41 -0700)]
vp10: add some missing includes
quiets some -Wmissing-prototypes warnings
BUG=b/29584271
Change-Id: I9174728459fcabb6d9ac0028ae58029e52c0da92
Yue Chen [Mon, 11 Jul 2016 21:23:08 +0000 (14:23 -0700)]
Cosmetics for vp10/common/vp10_rtcd_defs.pl
Change-Id: Iaf8c6f0b1e340f0406df2871a3dc2ded19b7009a
Debargha Mukherjee [Mon, 11 Jul 2016 22:30:33 +0000 (22:30 +0000)]
Merge "Add a few branch hints to vp10_optimize_b." into nextgenv2
Debargha Mukherjee [Mon, 11 Jul 2016 22:30:16 +0000 (22:30 +0000)]
Merge "Optimize and cleanup supertx predictor." into nextgenv2
Debargha Mukherjee [Mon, 11 Jul 2016 19:30:03 +0000 (19:30 +0000)]
Merge "Improve vpx_blend_* functions." into nextgenv2
Geza Lore [Fri, 8 Jul 2016 13:10:53 +0000 (14:10 +0100)]
Optimize and cleanup supertx predictor.
Use vpx_blend_a64_hmask and vpx_blend_a64_vmask to speed up
computing the supertx predictor.
Decoder speedup of up to 4% has been observed.
Change-Id: I255a5ba4cc24f78dc905d25b6e2f7fbafac13253