]> granicus.if.org Git - libvpx/log
libvpx
10 years agoMerge "adapt the adjustment limit for rate correction factor in RTC mode"
Yaowu Xu [Thu, 13 Nov 2014 23:50:30 +0000 (15:50 -0800)]
Merge "adapt the adjustment limit for rate correction factor in RTC mode"

10 years agoMerge "Prepare for dynamic frame resizing in the recode loop"
Adrian Grange [Thu, 13 Nov 2014 23:01:49 +0000 (15:01 -0800)]
Merge "Prepare for dynamic frame resizing in the recode loop"

10 years agoMerge "Fix 32 bit build emms problem."
Paul Wilkins [Thu, 13 Nov 2014 23:00:41 +0000 (15:00 -0800)]
Merge "Fix 32 bit build emms problem."

10 years agoMerge "Revert "Fix SSSE3 partial_idct_test detection""
Johann [Thu, 13 Nov 2014 22:49:22 +0000 (14:49 -0800)]
Merge "Revert "Fix SSSE3 partial_idct_test detection""

10 years agoMerge "Use reconstructed pixels for intra prediction"
Jingning Han [Thu, 13 Nov 2014 21:59:02 +0000 (13:59 -0800)]
Merge "Use reconstructed pixels for intra prediction"

10 years agoMerge "Refactor nonrd_use_partition coding process"
Jingning Han [Thu, 13 Nov 2014 21:58:21 +0000 (13:58 -0800)]
Merge "Refactor nonrd_use_partition coding process"

10 years agoPrepare for dynamic frame resizing in the recode loop
Adrian Grange [Mon, 10 Nov 2014 17:51:38 +0000 (09:51 -0800)]
Prepare for dynamic frame resizing in the recode loop

Prepare for the introduction of frame-size change
logic into the recode loop.

Separated the speed dependent features into
separate static and dynamic parts, the latter being
those features that are dependent on the frame size.

Change-Id: Ia693e28c5cf069a1a7bf12e49ecf83e440e1d313

10 years agoFix 32 bit build emms problem.
Paul Wilkins [Wed, 12 Nov 2014 17:36:49 +0000 (09:36 -0800)]
Fix 32 bit build emms problem.

Add extra vp9_clear_system_state() calls to fix
double / mmx issue introduced into first pass
code for 32 bit builds.

Change-Id: I84cd2986b80d83650a091ab25c43755efeb82e03

10 years agoRevert "Fix SSSE3 partial_idct_test detection"
James Zern [Thu, 13 Nov 2014 18:59:54 +0000 (10:59 -0800)]
Revert "Fix SSSE3 partial_idct_test detection"

This reverts commit 7d07f512cd87446eef541e9af4af19b1e8c6342a.

this breaks visual studio builds:
'#' : invalid character : possibly the result of a macro expansion

Change-Id: I77170d549afb71e75a878fa0f6acd204fe8d9e67

10 years agoadapt the adjustment limit for rate correction factor in RTC mode
Yaowu Xu [Thu, 13 Nov 2014 19:20:04 +0000 (11:20 -0800)]
adapt the adjustment limit for rate correction factor in RTC mode

Rate correction factor is used to correct the estimated rate for any
given quantizer, and feeds into rate control for quantizer selection.
We make use of the actual bits used to calculate this rate correction
factor with an adjustment limit to prevent over-adjustment.

This commit adapts the adjustment limit to the difference between the
estimated bits and the actual bits, allows the adjustment limit to vary
between 0.125 (when estimate is close to actual) and 0.625 (when there
is >10X factor off between estimated and actual bits). By doing this,
the commit appears to have largely corrected two observed issues:
1. Adjustment is too slow when the actual bits used is way off from
estimate due to the small adjustment limit.
2. Extreme oscillating quantizer choices due to the feedback loop.

Change-Id: I4ee148d2c9d26d173b6c48011313ddb07ce2d7d6

10 years agoMerge "Vidyo: Turn off keyframes in higher spatial layers"
Deb Mukherjee [Thu, 13 Nov 2014 11:27:11 +0000 (03:27 -0800)]
Merge "Vidyo: Turn off keyframes in higher spatial layers"

10 years agoMerge "Added highbitdepth sse2 SAD acceleration and tests"
Debargha Mukherjee [Thu, 13 Nov 2014 05:20:34 +0000 (21:20 -0800)]
Merge "Added highbitdepth sse2 SAD acceleration and tests"

10 years agoRemove vp9_temporal_denoising from experimental list.
Marco [Thu, 13 Nov 2014 01:34:54 +0000 (17:34 -0800)]
Remove vp9_temporal_denoising from experimental list.

Change-Id: I7a941c0efbb5912aa97348da6fd1aba28df4482a

10 years agoAdded highbitdepth sse2 SAD acceleration and tests
Peter de Rivaz [Thu, 16 Oct 2014 12:41:55 +0000 (13:41 +0100)]
Added highbitdepth sse2 SAD acceleration and tests

Change-Id: I1a74a1b032b198793ef9cc526327987f7799125f
(cherry picked from commit b1a6f6b9cb47eafe0ce86eaf0318612806091fe5)

10 years agoMerge "Fix SSSE3 partial_idct_test detection"
Johann [Wed, 12 Nov 2014 20:20:36 +0000 (12:20 -0800)]
Merge "Fix SSSE3 partial_idct_test detection"

10 years agoMerge "Use normal rate_correction_factor for gf in CBR mode"
Yaowu Xu [Wed, 12 Nov 2014 16:00:26 +0000 (08:00 -0800)]
Merge "Use normal rate_correction_factor for gf in CBR mode"

10 years agoVidyo: Turn off keyframes in higher spatial layers
Deb Mukherjee [Wed, 12 Nov 2014 00:09:07 +0000 (16:09 -0800)]
Vidyo: Turn off keyframes in higher spatial layers

Change-Id: Icdd5e71cd6a2b59bc4b3b972af9e4d4a36821792

10 years agoMerge "Vidyo: Support for one-pass rc-enabled SVC encoder"
Deb Mukherjee [Wed, 12 Nov 2014 00:03:11 +0000 (16:03 -0800)]
Merge "Vidyo: Support for one-pass rc-enabled SVC encoder"

10 years agoUse reconstructed pixels for intra prediction
Jingning Han [Sat, 8 Nov 2014 01:50:55 +0000 (17:50 -0800)]
Use reconstructed pixels for intra prediction

This commit makes the speed -6 and above use the reconstructed
boundary pixels for precise intra prediction. This allows more
intra prediction modes to be tested in the non-RD coding process.

Enabling horizontal and vertical intra prediction modes can
improve the speed -6 compression performance for rtc set
by 0.331%.

Change-Id: I3a99f9d12c6af54de2bdbf28c76eab8e0905f744

10 years agoMerge "AQ1 - remove first pass weights."
Paul Wilkins [Tue, 11 Nov 2014 17:17:33 +0000 (09:17 -0800)]
Merge "AQ1 - remove first pass weights."

10 years agoUse normal rate_correction_factor for gf in CBR mode
Yaowu Xu [Mon, 10 Nov 2014 19:46:58 +0000 (11:46 -0800)]
Use normal rate_correction_factor for gf in CBR mode

I0c5f010 changed to allow update golden reference buffer in CBR mode,
this commit changes the use of rate_correction_factor for those frames
to be aligned with the new usage. This commit attempts to solve two
issues:

a. Initialization of rate correction factor for Golden Frame
Prior to this patch, even the regular inter frame has been update
the rate correction factor based on content and encoding results,
the first golden frame would still use the ininitialized value
that can be way off.

b. Allowing rate correction factor update to be slightly faster
Prior to this patch, when the rate correction factor is off, the
update to the factor is too slow, the factor could not get close
to a semi-correct value even after many frames.

The commit helps all clips in psnr/ssim metric, but especially to
a few clip in RTC set that rate correction was way off. For example
thaloundeskmtgvga gained about .5dB for both overall/average psnr.

Change-Id: I0be5c41691be57891d824505348b64be87fa3545

10 years agoVidyo: Support for one-pass rc-enabled SVC encoder
Deb Mukherjee [Fri, 7 Nov 2014 19:01:53 +0000 (11:01 -0800)]
Vidyo: Support for one-pass rc-enabled SVC encoder

Adds support for one-pass rc-enabled SVC encoder with callbacks for
getting per-layer packets.
- the callback function registration is implemented as an encoder
control function.
- if the callback function is not registered, the old way of
aggregating packets with superframe will take effect.
- one more control function “VP9E_GET_SVC_LAYER_ID” has been
implemented to get the temporal/spatial id from the encoder
within the callback. This can be used to get the ids to put on RTP
packet.

Change-Id: I1a90e00135dde65da128b758e6c00b57299a111a

10 years agoMerge "Iadst transforms to use internal low precision"
Deb Mukherjee [Mon, 10 Nov 2014 23:39:46 +0000 (15:39 -0800)]
Merge "Iadst transforms to use internal low precision"

10 years agoMerge "Remove asm offset dependencies"
Johann [Mon, 10 Nov 2014 18:22:09 +0000 (10:22 -0800)]
Merge "Remove asm offset dependencies"

10 years agoMerge "Fix LAST SKIP when considering GOLDEN"
Alex Converse [Sat, 8 Nov 2014 00:21:37 +0000 (16:21 -0800)]
Merge "Fix LAST SKIP when considering GOLDEN"

10 years agoIadst transforms to use internal low precision
Deb Mukherjee [Thu, 6 Nov 2014 20:50:33 +0000 (12:50 -0800)]
Iadst transforms to use internal low precision

Change-Id: I266777d40c300bc53b45b205144520b85b0d6e58
(cherry picked from commit a1b726117f5470f227bc90cd030b7d25045dc510)

10 years agoTest name changes to use SSE/SSE2 exactly
Deb Mukherjee [Fri, 7 Nov 2014 18:19:46 +0000 (10:19 -0800)]
Test name changes to use SSE/SSE2 exactly

Change-Id: I3b5a478d198868c2796366f0ac59d0e2036308b8

10 years agoFix LAST SKIP when considering GOLDEN
Alex Converse [Fri, 7 Nov 2014 21:44:17 +0000 (13:44 -0800)]
Fix LAST SKIP when considering GOLDEN

Change-Id: I39d9f13fa34984ee9dad0c4f303ef672635f420e

10 years agoMerge "Add intra complexity and brightness weight to first pass."
Paul Wilkins [Fri, 7 Nov 2014 17:22:12 +0000 (09:22 -0800)]
Merge "Add intra complexity and brightness weight to first pass."

10 years agoMerge "Change the use of a reserved color space entry"
Yaowu Xu [Fri, 7 Nov 2014 14:24:59 +0000 (06:24 -0800)]
Merge "Change the use of a reserved color space entry"

10 years agoAQ1 - remove first pass weights.
Paul Wilkins [Fri, 7 Nov 2014 14:11:29 +0000 (14:11 +0000)]
AQ1 - remove first pass weights.

Removed redundant weighting function tied for AQ1 from first
pass code.

Improvment in baseline AQ1 results:-
Derf  opsnr +0.142% SSIm +0.258%
YT  opsnr +0.173% SSIm +0.3%

Change-Id: I16ef91caf2d7f302cd5940cc5e2626d48ebcb212

10 years agoChange the use of a reserved color space entry
Yaowu Xu [Fri, 7 Nov 2014 03:09:30 +0000 (19:09 -0800)]
Change the use of a reserved color space entry

This commit rename a reserved color space entry to BT_2020, it intends
to provide support for VP9 bitstream to pass along the color space
type defined in BT.2020(Rec.2020)

please note this entry does not have any effect on encoding/decoding
behavior, but allow applications to the pass the information along
from encoding end to decoding end.

Change-Id: I4678520e89141ea5e8900f7bd1c0e95b710b7091

10 years agoRefactor nonrd_use_partition coding process
Jingning Han [Thu, 6 Nov 2014 19:53:18 +0000 (11:53 -0800)]
Refactor nonrd_use_partition coding process

This commit integrates the non-RD mode decision process and the
encoding process into a single recursion scheme.

Change-Id: I6a7e72a0b84d567554801ebbe01ec75d54c1f77d

10 years agoRemove asm offset dependencies
Johann [Thu, 31 Jul 2014 21:19:31 +0000 (14:19 -0700)]
Remove asm offset dependencies

The obj_int_extract code is no longer worth maintaining. It creates
significant issues when adapting for different build systems and no
longer offers as significant of a performance benefit due to
improvements in intrinsics.

Source files will remain until the various third-party builds are updated.

The neon fast quantizer has been moved to intrinsics. The armv6 version
has been removed because so few remaining targets require it.

Compilers and processors have improved significantly since the
pack_tokens code was written. The assembly is no longer faster than the
C code.

pack_tokens were the only optimizations for the armv5te targets so the targets
will be removed after the test infrastructure has been updated.

BUG=710

Change-Id: Ic785b167cd9f95eeff31c7c76b7b736c07fb30eb

10 years agoMerge "Modify the frame context memory deallocation"
Yunqing Wang [Thu, 6 Nov 2014 21:08:57 +0000 (13:08 -0800)]
Merge "Modify the frame context memory deallocation"

10 years agoMerge "Remove unused is_background function"
Jingning Han [Thu, 6 Nov 2014 20:03:15 +0000 (12:03 -0800)]
Merge "Remove unused is_background function"

10 years agoMerge "Rework cut-off decisions in cyclic refresh aq mode"
Jingning Han [Thu, 6 Nov 2014 20:03:07 +0000 (12:03 -0800)]
Merge "Rework cut-off decisions in cyclic refresh aq mode"

10 years agoModify the frame context memory deallocation
Yunqing Wang [Thu, 30 Oct 2014 01:38:18 +0000 (18:38 -0700)]
Modify the frame context memory deallocation

This patch was to fix the vpxdec fuzzing3 test failure. When an
error occurs, setjmp() is invoked, which calls the decoder
removing routine. In multiple thread situation, other threads
could try to access the frame context memory that is already
deallocated, thus causing a segfault.

An invalid unit test was added for this issue.

Change-Id: Ida7442154f3d89759483f0f4fe0324041fffb952

10 years agoAdd intra complexity and brightness weight to first pass.
Paul Wilkins [Tue, 4 Nov 2014 13:13:39 +0000 (13:13 +0000)]
Add intra complexity and brightness weight to first pass.

The aim of this patch is to apply a positive weighting to
frames that have a significant number of blocks that are
of low spatial complexity and are dark. The rationale behind
this is that artifacts tend to be more visible in such frames.

In this patch the weight is only applied in regard to the distribution
of bits between frames. Hence if all the frames share similar
characteristics (as is the case for most of our short test clips) there
will be little or no net effect.

However, the effect can be seen on some longer form test content.

For example Tears of steel baseline test:
2323.09 Kbit/s opsnr 39.915 ssim 74.729
With this patch:-
2213.34 Kbit/s opsnr 39.963 ssim 74.808
(Sligtly better metrics and about 5% smaller)

The weighting may well need some further tuning along side changes
to the aq modes.

Change-Id: Ieced379bca03938166ab87b2b97f55d94948904c

10 years agoRemove unused is_background function
Jingning Han [Tue, 4 Nov 2014 22:42:07 +0000 (14:42 -0800)]
Remove unused is_background function

Change-Id: Ia540eac5f066ae95280c2f898370eddf0110c279

10 years agoRework cut-off decisions in cyclic refresh aq mode
Jingning Han [Tue, 4 Nov 2014 22:32:04 +0000 (14:32 -0800)]
Rework cut-off decisions in cyclic refresh aq mode

This commit removes the cyclic aq mode dependency on
in_static_area and reworks the corresponding cut-off thresholds.
It improves the compression performance of speed -5 by 1.47% in
PSNR and 2.07% in SSIM, and the compression performance of speed
-6 by 3.10% in PSNR and 5.25% in SSIM. Speed wise, about 1% faster
in both settings at high bit-rates.

Change-Id: I1ffc775afdc047964448d9dff5751491ba4ff4a9

10 years agoMerge "Totally remove prev_mi in VP9 decoder."
hkuang [Thu, 6 Nov 2014 01:48:47 +0000 (17:48 -0800)]
Merge "Totally remove prev_mi in VP9 decoder."

10 years agoTotally remove prev_mi in VP9 decoder.
hkuang [Mon, 3 Nov 2014 19:23:22 +0000 (11:23 -0800)]
Totally remove prev_mi in VP9 decoder.

This will save the memory and improve the decode speed due to
removing unnecessary memset of big prev_mi array for
all the key frames.

Decoding a all key frames 1080p video shows speed improve around 2%.

Change-Id: I6284a445c1291056e3c15135c3c20d502f791c10

10 years agoMerge "Fix visual studio 2013 compiler warnings"
Yaowu Xu [Wed, 5 Nov 2014 23:34:13 +0000 (15:34 -0800)]
Merge "Fix visual studio 2013 compiler warnings"

10 years agoMerge "vp8 quantize is now in intrinsics, not asm"
Johann [Wed, 5 Nov 2014 22:05:26 +0000 (14:05 -0800)]
Merge "vp8 quantize is now in intrinsics, not asm"

10 years agoFix visual studio 2013 compiler warnings
Yaowu Xu [Wed, 5 Nov 2014 19:29:21 +0000 (11:29 -0800)]
Fix visual studio 2013 compiler warnings

For configured with --enable-vp9-highbitdepth

Change-Id: I2b181519d7192f8d7a241ad5760c3578255f24e6

10 years agoMerge "Simplify interface of write_selected_tx_size and read_tx_size"
Hui Su [Wed, 5 Nov 2014 21:33:09 +0000 (13:33 -0800)]
Merge "Simplify interface of write_selected_tx_size and read_tx_size"

10 years agoFix SSSE3 partial_idct_test detection
Johann [Wed, 5 Nov 2014 20:56:04 +0000 (12:56 -0800)]
Fix SSSE3 partial_idct_test detection

The test filter is not a prefix matcher. It requires test type to
contain no more than the optimization type. In this example, SSSE3_64
fails to match and the test is not skipped even when SSSE3 is not
available.

Change-Id: Ia74229a167c88da4e6da169012a7a77d438c3f75

10 years agovp8 quantize is now in intrinsics, not asm
Johann [Wed, 5 Nov 2014 18:58:26 +0000 (10:58 -0800)]
vp8 quantize is now in intrinsics, not asm

Change-Id: Ie106c5335c9ba5aac81e23150e7026fb6ea6196e

10 years agoMerge "Skip ref frame mode search conditioned on predicted mv residuals"
Jingning Han [Wed, 5 Nov 2014 20:04:10 +0000 (12:04 -0800)]
Merge "Skip ref frame mode search conditioned on predicted mv residuals"

10 years agoquantize_test: delete QuantizePair related tests
James Zern [Wed, 5 Nov 2014 12:11:34 +0000 (13:11 +0100)]
quantize_test: delete QuantizePair related tests

functions were removed in:
2134eb2 Remove pair quantization

fixes ARM build

Change-Id: I634de50c2752408381d70afb0f5088b61052853c

10 years agoMerge "Remove pair quantization"
Johann [Wed, 5 Nov 2014 00:29:19 +0000 (16:29 -0800)]
Merge "Remove pair quantization"

10 years agoMerge "vp8 quantization -> intrinsics"
Johann [Wed, 5 Nov 2014 00:28:46 +0000 (16:28 -0800)]
Merge "vp8 quantization -> intrinsics"

10 years agoSimplify interface of write_selected_tx_size and read_tx_size
Hui Su [Fri, 31 Oct 2014 00:56:36 +0000 (17:56 -0700)]
Simplify interface of write_selected_tx_size and read_tx_size

Change-Id: Ia2b2a895deefaaf7b34bf26df86add56dbab082c

10 years agoMerge "[spatial svc] Make spatial svc working for one pass rate control"
Minghai Shang [Tue, 4 Nov 2014 23:57:16 +0000 (15:57 -0800)]
Merge "[spatial svc] Make spatial svc working for one pass rate control"

10 years agoMerge "Fix the memory leak due to missing free frame_mvs."
hkuang [Tue, 4 Nov 2014 22:43:47 +0000 (14:43 -0800)]
Merge "Fix the memory leak due to missing free frame_mvs."

10 years agoMerge "Avoid divide-by-zero in vp8 initialization"
Johann [Tue, 4 Nov 2014 22:28:45 +0000 (14:28 -0800)]
Merge "Avoid divide-by-zero in vp8 initialization"

10 years agoFix the memory leak due to missing free frame_mvs.
hkuang [Mon, 3 Nov 2014 20:25:18 +0000 (12:25 -0800)]
Fix the memory leak due to missing free frame_mvs.

Change-Id: I2ceee7341d906259002c0ea31ea009ae32c04bfd

10 years agoAvoid divide-by-zero in vp8 initialization
Johann [Tue, 4 Nov 2014 18:46:24 +0000 (10:46 -0800)]
Avoid divide-by-zero in vp8 initialization

Check that the numerator is not zero. If it is, guess 30fps.

Fixes a clang IOC error in the quantize test. It's very unlikely for
this to occur in the wild because the setup in the quantize test is very
nonstandard.

Change-Id: Icdab7b81d4e168d3423e14db20787f960052e0c3

10 years ago[spatial svc] Make spatial svc working for one pass rate control
Minghai Shang [Tue, 4 Nov 2014 19:46:48 +0000 (11:46 -0800)]
[spatial svc] Make spatial svc working for one pass rate control

Change-Id: Ibd9114485c3d747f9d148f64f706bf873ea473ac

10 years agoAlign structures in quantize test
Johann [Tue, 4 Nov 2014 18:32:51 +0000 (10:32 -0800)]
Align structures in quantize test

MACROBLOCKD structures require alignment.

This fixes an issue caught by clang IOC.

Change-Id: Ibb5bcc122f531b4302a87e1144e4feaf46c1de64

10 years agoAdd some indirection to the quantize test
Johann [Tue, 4 Nov 2014 16:20:25 +0000 (08:20 -0800)]
Add some indirection to the quantize test

Visual Studio and XCode require the address of the functions be passed.

Change-Id: Id39cab8e50061fdc6ac6018371a3a158c713e14b

10 years agoMerge "Refactor sub-pixel motion search unit"
Jingning Han [Tue, 4 Nov 2014 17:11:15 +0000 (09:11 -0800)]
Merge "Refactor sub-pixel motion search unit"

10 years agoSkip ref frame mode search conditioned on predicted mv residuals
Jingning Han [Mon, 3 Nov 2014 19:36:34 +0000 (11:36 -0800)]
Skip ref frame mode search conditioned on predicted mv residuals

This commit makes the RTC coding mode to conditionally skip the
reference frame mode search, when the predicted motion vector of
the current reference frame gives more than two times sum of
absolute difference compared to that of other reference frames.

It reduces the runtim by 1% - 4% for speed -5 and -6. The average
compression performance is improved by about 0.1% in both settings.

It is of particular benefit to light change scenarios. The
compression performance of test clip mmmovingvga.y4m is improved by
6.39% and 15.69% at high bit rates for speed -5 and -6, respectively.

Speed -5
vidyo1 16555 b/f, 40.818 dB, 12422 ms ->
       16552 b/f, 40.804 dB, 12100 ms

nik    33211 b/f, 39.138 dB, 11341 ms ->
       33228 b/f, 39.139 dB, 11023 ms

mmmoving 33263 b/f, 40.935 dB, 13508 ms ->
         33256 b/f, 41.068 dB, 12861 ms

Speed -6
vidyo1 16541 b/f, 40.227 dB, 8437 ms ->
       16540 b/f, 40.220 dB, 8216 ms

nik    33272 b/f, 38.399 dB, 7610 ms ->
       33267 b/f, 38.414 dB, 7490 ms

mmmoving 33255 b/f, 40.555 dB, 7523 ms ->
         33257 b/f, 40.975 dB, 7493 ms

Change-Id: Id2aef76ef74a3cba5e9a82a83b792144948c6a91

10 years agoMerge "WORKAROUND FIX FOR GCC4.9.1"
Yunqing Wang [Tue, 4 Nov 2014 00:56:38 +0000 (16:56 -0800)]
Merge "WORKAROUND FIX FOR GCC4.9.1"

10 years agoMerge "vp8 quantize test"
Johann [Mon, 3 Nov 2014 23:45:21 +0000 (15:45 -0800)]
Merge "vp8 quantize test"

10 years agoMerge "Allow disable of refresh golden for more than 1 layer encoding."
Marco [Mon, 3 Nov 2014 22:38:05 +0000 (14:38 -0800)]
Merge "Allow disable of refresh golden for more than 1 layer encoding."

10 years agovp8 quantize test
Johann [Mon, 20 Oct 2014 17:04:45 +0000 (10:04 -0700)]
vp8 quantize test

Test for Regular, Fast and Pair quantization

Change-Id: I0a26c164afe632db869099402189826c0d43f9a2

10 years agoRefactor sub-pixel motion search unit
Jingning Han [Sun, 2 Nov 2014 18:45:32 +0000 (10:45 -0800)]
Refactor sub-pixel motion search unit

This commit unfolds the legacy macro definitions used in the
sub-pixel motion search and refactors the operational flow for
later optimizations.

Change-Id: I3e3f770cad961d03d1a6eb0b2a0186cc77eaf2b8

10 years agoMerge "Fix the THR_MODES array used in vp9_pick_inter_mode"
Jingning Han [Mon, 3 Nov 2014 16:46:42 +0000 (08:46 -0800)]
Merge "Fix the THR_MODES array used in vp9_pick_inter_mode"

10 years agoMerge "Fix speed 7 and speed 12 for rt"
Yaowu Xu [Mon, 3 Nov 2014 16:02:58 +0000 (08:02 -0800)]
Merge "Fix speed 7 and speed 12 for rt"

10 years agoAllow disable of refresh golden for more than 1 layer encoding.
Marco [Mon, 3 Nov 2014 06:10:56 +0000 (22:10 -0800)]
Allow disable of refresh golden for more than 1 layer encoding.

The current logic was allowing for disabling golden refresh only
for two pass svc encoding. This change disables it as long as
more than 1 layer encoding is used (for example temporal layers under 1pass CBR).

Change-Id: I4dc5204a7ad365c821ec7963e93b59da82e1826b

10 years agoFix the THR_MODES array used in vp9_pick_inter_mode
Jingning Han [Sun, 2 Nov 2014 20:18:53 +0000 (12:18 -0800)]
Fix the THR_MODES array used in vp9_pick_inter_mode

Fix the alignment of entries fo intra prediction modes.

Change-Id: Ie32ad87cf90694efd591a4b1cc29c916c4cd56f7

10 years agoWORKAROUND FIX FOR GCC4.9.1
levytamar82 [Thu, 16 Oct 2014 23:56:37 +0000 (16:56 -0700)]
WORKAROUND FIX FOR GCC4.9.1

In the function mb_lpf_horizontal_edge_w_avx2_16 the usage of the intrinsic
_mm256_cvtepu8_epi16 cause a compiler bug in gcc 4.9.1.
until it will be fixed I created a workaround that create the up convert by
using broadcast128+shuffle.
The bug was reported here:
https://code.google.com/p/webm/issues/detail?id=867

Change-Id: I73452e6806f42e0fadcde96b804ea3afa7eeb351

10 years agoFix speed 7 and speed 12 for rt
Yaowu Xu [Sat, 1 Nov 2014 00:27:54 +0000 (17:27 -0700)]
Fix speed 7 and speed 12 for rt

A recent change has introduced big quality drops for speed 7 and 12
for --rt mode. The change reverted the big drop and improved quality
by 9.5% for speed 7 and 13.4% for speed 12.

Change-Id: I07b82e3bb6002a73af486a083458c88877bdad01

10 years agoBind motion vectors with frame buffer structure.
hkuang [Mon, 27 Oct 2014 23:19:04 +0000 (16:19 -0700)]
Bind motion vectors with frame buffer structure.

This will save a lot of memory for decoder due to removing of prev_mi,
but prev_mi is still needed in encoder. So this will increase a little bit
memory for encoder.

Change-Id: I24b2f1a423ebffa55a9bd2fcee1077dac995b2ed

10 years agoRemove pair quantization
Johann [Fri, 31 Oct 2014 20:42:55 +0000 (13:42 -0700)]
Remove pair quantization

The intrinsics version of the pair quant is slower than running it
individually.

Change-Id: I7b4ea8599d4aab04be0a5a0c59b8b29a7fc283f4

10 years agovp8 quantization -> intrinsics
Johann [Mon, 20 Oct 2014 23:03:28 +0000 (16:03 -0700)]
vp8 quantization -> intrinsics

Use intrinsics for neon quantization. Slight loss (<5%) of performance
compared to the assembly. Roughly 10x faster on arm64 because that was
running C code before.

Change-Id: I7cf5242d8f29b7eab5bca6a1c20c89c9fc9ca66d

10 years agoMerge "Fix mode index use case in vp9_pick_inter_mode"
Jingning Han [Fri, 31 Oct 2014 15:55:40 +0000 (08:55 -0700)]
Merge "Fix mode index use case in vp9_pick_inter_mode"

10 years agoMerge "Refactor vp9_update_rd_thresh_fact"
Jingning Han [Fri, 31 Oct 2014 15:55:28 +0000 (08:55 -0700)]
Merge "Refactor vp9_update_rd_thresh_fact"

10 years agoMerge "Rework pred pixel buffer system in non-RD coding mode"
Jingning Han [Fri, 31 Oct 2014 15:55:24 +0000 (08:55 -0700)]
Merge "Rework pred pixel buffer system in non-RD coding mode"

10 years agoFix mode index use case in vp9_pick_inter_mode
Jingning Han [Thu, 30 Oct 2014 02:30:12 +0000 (19:30 -0700)]
Fix mode index use case in vp9_pick_inter_mode

This improves coding performance of speed -5 and -6 by 0.6%,
respectively.

Change-Id: Ic5a7746a88c73285f0b14333d35dc16b02152c25

10 years agoRefactor vp9_update_rd_thresh_fact
Jingning Han [Thu, 30 Oct 2014 00:37:54 +0000 (17:37 -0700)]
Refactor vp9_update_rd_thresh_fact

Reduce the scope of function parameters.

Change-Id: Ifef2cfb559908a97498ffdbd6ea53da1cd45a73c

10 years agoRework pred pixel buffer system in non-RD coding mode
Jingning Han [Wed, 29 Oct 2014 23:37:16 +0000 (16:37 -0700)]
Rework pred pixel buffer system in non-RD coding mode

This commit makes the inter prediction buffer system to support
hybrid partition search. It reduces the runtime of speed -5 by
about 3%. No compression performance change.

vidyo1 720p 1000 kbps
11831 ms -> 11497 ms

nik 720p 1000 kbps
10919 ms -> 10645 ms

Change-Id: I5b2da747c6395c253cd074d3907f5402e1840c36

10 years agoMerge "Move the definition of switchable filter numbers into enum INTERP_FILTER;...
Hui Su [Thu, 30 Oct 2014 18:05:04 +0000 (11:05 -0700)]
Merge "Move the definition of switchable filter numbers into enum INTERP_FILTER; Modify the macro ADD_MV_REF_LIST and IF_DIFF_REF_FRAME_ADD_MV."

10 years agoMerge "Combine vp9_encode_block_intra and encode_block_intra"
Hui Su [Thu, 30 Oct 2014 18:02:31 +0000 (11:02 -0700)]
Merge "Combine vp9_encode_block_intra and encode_block_intra"

10 years agoMerge "Remove unused speed feature"
Yunqing Wang [Thu, 30 Oct 2014 02:00:32 +0000 (19:00 -0700)]
Merge "Remove unused speed feature"

10 years agoMerge "Addd error resilience test for temporal layers."
Marco [Thu, 30 Oct 2014 01:23:13 +0000 (18:23 -0700)]
Merge "Addd error resilience test for temporal layers."

10 years agoRemove unused speed feature
Yunqing Wang [Thu, 30 Oct 2014 00:05:04 +0000 (17:05 -0700)]
Remove unused speed feature

Partition_check was unused and removed.

Change-Id: I15ec9162d86dc61f04c09229c498629878ed7155

10 years agoAddd error resilience test for temporal layers.
Marco [Wed, 29 Oct 2014 22:34:18 +0000 (15:34 -0700)]
Addd error resilience test for temporal layers.

Test for successful decoding when dropping enhancement layer frames.

Change-Id: Id3ae6e5676894f352680973e52352dc5d98bbf55

10 years agoMerge "Enable mode search threshold update in non-RD coding mode"
Jingning Han [Wed, 29 Oct 2014 19:42:22 +0000 (12:42 -0700)]
Merge "Enable mode search threshold update in non-RD coding mode"

10 years agoEnable mode search threshold update in non-RD coding mode
Jingning Han [Tue, 28 Oct 2014 21:50:10 +0000 (14:50 -0700)]
Enable mode search threshold update in non-RD coding mode

Adaptively adjust the mode thresholds after each mode search round
to skip checking less likely selected modes. Local tests indicate
5% - 10% speed-up in speed -5 and -6. Average coding performance
loss is -1.055%.

speed -5
vidyo1 720p 1000 kbps
16533 b/f, 40.851 dB, 12607 ms -> 16556 b/f, 40.796 dB, 11831 ms

nik 720p 1000 kbps
33229 b/f, 39.127 dB, 11468 ms -> 33235 b/f, 39.131 dB, 10919 ms

speed -6
vidyo1 720p 1000 kbps
16549 b/f, 40.268 dB, 10138 ms -> 16538 b/f, 40.212 dB, 8456 ms

nik 720p 1000 kbps
33271 b/f, 38.433 dB,  7886 ms -> 33279 b/f, 38.416 dB, 7843 ms

Change-Id: I2c2963f1ce4ed9c1cf233b5b2c880b682e1c1e8b

10 years agoSimplify vp9_set_rd_speed_thresholds_sub8x8
Adrian Grange [Wed, 29 Oct 2014 16:09:46 +0000 (09:09 -0700)]
Simplify vp9_set_rd_speed_thresholds_sub8x8

Change-Id: I4bf0f9a38697f5aea564a47afd7f02bb8b2888b6

10 years agoCombine vp9_encode_block_intra and encode_block_intra
Hui Su [Wed, 22 Oct 2014 20:01:40 +0000 (13:01 -0700)]
Combine vp9_encode_block_intra and encode_block_intra

Change-Id: I79091fb677b64892ecca2fb466fde14602d8cdfc

10 years agoMerge "Use zero motion vector in choose_partitioning"
Jingning Han [Tue, 28 Oct 2014 19:00:13 +0000 (12:00 -0700)]
Merge "Use zero motion vector in choose_partitioning"

10 years agoMerge "vp9_denoiser_sse2: refactor the code."
JackyChen [Tue, 28 Oct 2014 18:06:05 +0000 (11:06 -0700)]
Merge "vp9_denoiser_sse2: refactor the code."

10 years agoMerge "Allow update of golden refernce buffer in CBR mode"
Yaowu Xu [Tue, 28 Oct 2014 17:48:02 +0000 (10:48 -0700)]
Merge "Allow update of golden refernce buffer in CBR mode"

10 years agovp9_denoiser_sse2: refactor the code.
JackyChen [Sat, 25 Oct 2014 17:51:51 +0000 (10:51 -0700)]
vp9_denoiser_sse2: refactor the code.

Combined vp9_denoiser_8xM_sse2 and vp9_denoiser_4xM_sse2 into one
function vp9_denoiser_NxM_sse2_small and passed the bitexact testing.
Changed the name of the function vp9_denoiser_64_32_16xM_sse2 to
vp9_denoiser_NxM_sse2_big.

Change-Id: Ib22478df585994dd347ebae04202c0b701e7f451

10 years agoMerge "Add a new control of golden frame boost in CBR mode"
Yaowu Xu [Tue, 28 Oct 2014 16:32:58 +0000 (09:32 -0700)]
Merge "Add a new control of golden frame boost in CBR mode"

10 years agoAllow update of golden refernce buffer in CBR mode
Yaowu Xu [Fri, 17 Oct 2014 19:13:16 +0000 (12:13 -0700)]
Allow update of golden refernce buffer in CBR mode

This commit changes to allow the usage of golden reference frame in
VP9 CBR mode to improve quality. VP9 supports potentially up to 8
reference buffers, it has reference buffers available for this
purpose. This was not possible in VP8 as golden and alt-ref buffers
were used for temporal scalability purpose in CBR mode in WebRTC.

For frames that update golden frame, there can be a quality boost.
The amount of allowed bitrate boost can be controlled via parameter
rc_max_inter_bitrate_pct. The inital value of the boost ratior is
currently based on over_shoot_pct. Further experiments will work
out the adaption of this boost value.

Change-Id: I0c5f010c8fd8b7b598f69779c1b30e5b2ac30a4d