granicus.if.org Git - libvpx/log
libvpx
7 years agoSplit dsp/x86/inv_txfm_sse2.c
Linfeng Zhang [Wed, 3 May 2017 22:43:02 +0000 (15:43 -0700)]
Split dsp/x86/inv_txfm_sse2.c

Spin out highbd idct functions.

BUG=webm:1412

Change-Id: I0cfe4117c00039b6778c59c022eee79ad089a2af

7 years agoUpdate highbd idct functions arguments to use uint16_t dst
Linfeng Zhang [Wed, 3 May 2017 20:32:08 +0000 (13:32 -0700)]
Update highbd idct functions arguments to use uint16_t dst

BUG=webm:1388

Change-Id: I3581d80d0389b99166e70987d38aba2db6c469d5

7 years agoClean CONVERT_TO_BYTEPTR/SHORTPTR in idct
Linfeng Zhang [Tue, 2 May 2017 17:44:12 +0000 (10:44 -0700)]
Clean CONVERT_TO_BYTEPTR/SHORTPTR in idct

BUG=webm:1388

Change-Id: Ida62c941f2b836d6c9e27b427a7d5008ab6dc112

7 years agoMerge changes I8bb660de,Ica51d780,I6037525d
Linfeng Zhang [Wed, 3 May 2017 19:17:55 +0000 (19:17 +0000)]
Merge changes I8bb660de,Ica51d780,I6037525d

* changes:
  Clean specializes of idct functions
  Clean add_protos of highbd idct functions
  Clean add_protos of idct functions

7 years agoMerge changes Ia5293d94,I90d481d3,Ia509d622,I54549b03,I89b635d6
James Zern [Wed, 3 May 2017 03:31:18 +0000 (03:31 +0000)]
Merge changes Ia5293d94,I90d481d3,Ia509d622,I54549b03,I89b635d6

* changes:
  ppc: Add convolve8_vsx and convolve8_avg_vsx
  ppc: Add convolve8_avg_vert_vsx
  ppc: Add convolve8_vert
  ppc: Add convolve8_horiz_avg
  ppc: Add convolve8_horiz

7 years agoppc: Add convolve8_vsx and convolve8_avg_vsx
Luca Barbato [Mon, 1 May 2017 17:02:03 +0000 (17:02 +0000)]
ppc: Add convolve8_vsx and convolve8_avg_vsx

Change-Id: Ia5293d948003a7fff5a7cbad6e83d8a72717c857

7 years agoppc: Add convolve8_avg_vert_vsx
Luca Barbato [Mon, 1 May 2017 15:53:44 +0000 (15:53 +0000)]
ppc: Add convolve8_avg_vert_vsx

Only the generic one again, speedups for 8x8 and larger blocks to
come later.

Change-Id: I90d481d3a602d1e277ead8f3934eca126b86b72d

7 years agoppc: Add convolve8_vert
Luca Barbato [Mon, 1 May 2017 15:53:44 +0000 (15:53 +0000)]
ppc: Add convolve8_vert

Only the generic one again, speedups for 8x8 and larger blocks
to come later.

Change-Id: Ia509d6225984b4930ec03928c9bcbf51486da99f

7 years agoppc: Add convolve8_horiz_avg
Luca Barbato [Sat, 29 Apr 2017 12:34:20 +0000 (12:34 +0000)]
ppc: Add convolve8_horiz_avg

The 8x8 and larger blocks cases can be sped up further.

Change-Id: I54549b03ac6c7a4e3f485738b100c3cac7ac2e15

7 years agoppc: Add convolve8_horiz
Luca Barbato [Sat, 29 Apr 2017 12:34:20 +0000 (12:34 +0000)]
ppc: Add convolve8_horiz

The 8x8 and larger blocks cases can be sped up further.

Change-Id: I89b635d6b01c59f523f2d54b1284ed32916c5046

7 years agoClean specializes of idct functions
Linfeng Zhang [Tue, 2 May 2017 21:14:06 +0000 (14:14 -0700)]
Clean specializes of idct functions

Change-Id: I8bb660de47b5f97263ec381dc428db96e9c9a4b2

7 years agoClean add_protos of highbd idct functions
Linfeng Zhang [Tue, 2 May 2017 20:31:56 +0000 (13:31 -0700)]
Clean add_protos of highbd idct functions

Change-Id: Ica51d780b92b316ce9112740c56cdf7670816371

7 years agoClean add_protos of idct functions
Linfeng Zhang [Tue, 2 May 2017 18:19:44 +0000 (11:19 -0700)]
Clean add_protos of idct functions

Change-Id: I6037525d92ec172810edab720389eb1865ed3b1a

7 years agoMerge "block error sse2: sum in 32 bits when possible"
Johann Koenig [Tue, 2 May 2017 14:16:47 +0000 (14:16 +0000)]
Merge "block error sse2: sum in 32 bits when possible"

7 years agoblock error avx2: rename variables
Johann [Tue, 2 May 2017 00:54:19 +0000 (17:54 -0700)]
block error avx2: rename variables

Change-Id: I2b8a9253f2c3d1fd85304c2970ebe70213870fe9

7 years agoMerge "block error avx2: sum in 32 bits when possible"
Johann Koenig [Tue, 2 May 2017 00:52:58 +0000 (00:52 +0000)]
Merge "block error avx2: sum in 32 bits when possible"

7 years agoMerge "vp9: SVC: Early exit on golden ref in non-rd pickmode."
Marco Paniconi [Mon, 1 May 2017 21:04:51 +0000 (21:04 +0000)]
Merge "vp9: SVC: Early exit on golden ref in non-rd pickmode."

7 years agoMerge "Clean vp9_highbd_build_inter_predictor() and highbd_inter_predictor()"
Linfeng Zhang [Mon, 1 May 2017 19:54:40 +0000 (19:54 +0000)]
Merge "Clean vp9_highbd_build_inter_predictor() and highbd_inter_predictor()"

7 years agoMerge "move vp9_error_intrin_avx2.c"
Johann Koenig [Mon, 1 May 2017 19:52:36 +0000 (19:52 +0000)]
Merge "move vp9_error_intrin_avx2.c"

7 years agoblock error avx2: sum in 32 bits when possible
Kyle Siefring [Mon, 1 May 2017 16:19:11 +0000 (09:19 -0700)]
block error avx2: sum in 32 bits when possible

Add 31bit pairs before unpacking in x86 block error code

AVX2 code provides a very minor performance improvement.

BUG=webm:1210

Change-Id: I4c82308eaf65741dca2f5c6db9be9c85f905073a

7 years agoMerge "vpx_scale_test: fix segfault on alloc failure"
James Zern [Mon, 1 May 2017 19:22:21 +0000 (19:22 +0000)]
Merge "vpx_scale_test: fix segfault on alloc failure"

7 years agovp9: SVC: Early exit on golden ref in non-rd pickmode.
Marco [Mon, 1 May 2017 17:04:31 +0000 (10:04 -0700)]
vp9: SVC: Early exit on golden ref in non-rd pickmode.

For SVC 1 pass real-time: add a condition to skip the
golden (spatial) reference mode in non-rd pickmode.
The condition skips golden if the sse of zeromv-last mode
is below a threshold. Also change the order in ref_mode_set_svc
to make sure golden zeromv is tested after last-nearest.

Speedup ~3-4% with little/negligible quality loss.

Change-Id: I6cbe314a93210454ba2997945f714015f1b2fca3

7 years agoblock error sse2: sum in 32 bits when possible
Kyle Siefring [Mon, 1 May 2017 16:15:29 +0000 (09:15 -0700)]
block error sse2: sum in 32 bits when possible

Add 31bit pairs before unpacking in x86 block error code

BUG=webm:1210

Change-Id: I5ca8c7f7775585a17fe09d6bbfc25e1f2955eb0a
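
A scalar sketch of the idea behind "sum in 32 bits when possible", for illustration only. It is not the SSE2 code from this change. The function names are hypothetical, and the sketch assumes the block length is a multiple of 4 and that every coefficient difference has magnitude at most 32767, so that a sum of two squares stays below 2^31 and a sum of two such pairs still fits in an unsigned 32-bit value.

```c
#include <stddef.h>
#include <stdint.h>

/* Reference: widen every squared difference to 64 bits immediately. */
static uint64_t block_error_ref(const int16_t *coeff, const int16_t *dqcoeff,
                                size_t n) {
  uint64_t sum = 0;
  for (size_t i = 0; i < n; ++i) {
    const int32_t d = coeff[i] - dqcoeff[i];
    sum += (uint64_t)((int64_t)d * d);
  }
  return sum;
}

/* Sketch: add two 31-bit pair sums in 32 bits, then widen once.
 * Assumes n % 4 == 0 and |coeff[i] - dqcoeff[i]| <= 32767. */
static uint64_t block_error_pairs(const int16_t *coeff, const int16_t *dqcoeff,
                                  size_t n) {
  uint64_t sum = 0;
  for (size_t i = 0; i < n; i += 4) {
    const int32_t d0 = coeff[i + 0] - dqcoeff[i + 0];
    const int32_t d1 = coeff[i + 1] - dqcoeff[i + 1];
    const int32_t d2 = coeff[i + 2] - dqcoeff[i + 2];
    const int32_t d3 = coeff[i + 3] - dqcoeff[i + 3];
    /* Each pair sum is below 2^31 under the stated assumption. */
    const uint32_t lo = (uint32_t)(d0 * d0) + (uint32_t)(d1 * d1);
    const uint32_t hi = (uint32_t)(d2 * d2) + (uint32_t)(d3 * d3);
    /* Two 31-bit values added together still fit in 32 unsigned bits. */
    const uint32_t pair = lo + hi;
    sum += pair; /* a single widening step instead of two */
  }
  return sum;
}
```

In a vector implementation the same reasoning would halve the number of widening (unpack) operations per group of values, which is presumably where the saving comes from.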

7 years agomove vp9_error_intrin_avx2.c
Johann [Mon, 1 May 2017 16:12:25 +0000 (09:12 -0700)]
move vp9_error_intrin_avx2.c

There is only one avx2 implementation. Drop '_intrin'

Change-Id: I887a0d27d58567eaad49f749f127eca61313f312

7 years agovpx_scale_test: fix segfault on alloc failure
James Zern [Sat, 29 Apr 2017 20:12:53 +0000 (13:12 -0700)]
vpx_scale_test: fix segfault on alloc failure

check the return of ResetImage() before continuing

Change-Id: Iff0b038f7b9761113b8cf33a511a5306640d1273

7 years agoppc: Add convolve_avg
Luca Barbato [Wed, 26 Apr 2017 16:31:11 +0000 (16:31 +0000)]
ppc: Add convolve_avg

Change-Id: Ib203c444c708f42072e38301ee3db97b5b53d014

7 years agoppc: Add convolve_copy
Luca Barbato [Wed, 26 Apr 2017 15:43:19 +0000 (15:43 +0000)]
ppc: Add convolve_copy

Change-Id: Ie26d6dbe090e711d84bac01ba7da270db983f405

7 years agoMerge "Use uint32_t for accumulator"
Johann Koenig [Fri, 28 Apr 2017 18:32:08 +0000 (18:32 +0000)]
Merge "Use uint32_t for accumulator"

7 years agoMerge "vp9: Fix condition for disabling adaptive_rd_thresh."
Jerome Jiang [Fri, 28 Apr 2017 18:10:35 +0000 (18:10 +0000)]
Merge "vp9: Fix condition for disabling adaptive_rd_thresh."

7 years agovp9: Fix condition for disabling adaptive_rd_thresh.
Jerome Jiang [Thu, 27 Apr 2017 19:56:52 +0000 (12:56 -0700)]
vp9: Fix condition for disabling adaptive_rd_thresh.

Add speed constraints for disabling adaptive_rd_thresh when
row_mt_bit_exact is set.

Change-Id: I2445115c2f9a2e46b8a0966031a0fea488d4964e

7 years agoMerge "Generalize vp9 sse2 denoiser test for other platforms."
Jerome Jiang [Fri, 28 Apr 2017 15:45:52 +0000 (15:45 +0000)]
Merge "Generalize vp9 sse2 denoiser test for other platforms."

7 years agoUse uint32_t for accumulator
Johann [Fri, 28 Apr 2017 13:34:21 +0000 (06:34 -0700)]
Use uint32_t for accumulator

Be specific about the data type size.

Use convenience macro vp9_zero_array.

Change-Id: I5fadf7dbd408befb73820d85db0be4832e8cfcbd

7 years agoMerge "vp9 temporal filter: sse4 implementation"
Johann Koenig [Fri, 28 Apr 2017 13:22:40 +0000 (13:22 +0000)]
Merge "vp9 temporal filter: sse4 implementation"

7 years agoGeneralize vp9 sse2 denoiser test for other platforms.
Jerome Jiang [Thu, 27 Apr 2017 22:56:39 +0000 (15:56 -0700)]
Generalize vp9 sse2 denoiser test for other platforms.

Renamed to vp9_denoiser_test.

Change-Id: I0d8f4c94bcb81a60949a13d9fe839cee95d03f77

7 years agoMerge "VP9: enable trellis for high bitdepth intra"
Yaowu Xu [Fri, 28 Apr 2017 00:16:55 +0000 (00:16 +0000)]
Merge "VP9: enable trellis for high bitdepth intra"

7 years agoMerge "webm_read_frame: avoid NULL dereference"
James Zern [Thu, 27 Apr 2017 21:47:09 +0000 (21:47 +0000)]
Merge "webm_read_frame: avoid NULL dereference"

7 years agovp9 temporal filter: sse4 implementation
Johann [Wed, 15 Mar 2017 17:40:58 +0000 (10:40 -0700)]
vp9 temporal filter: sse4 implementation

Approximates division using multiply and shift.

Speeds up both sizes (8x8 and 16x16) by 30 times.

Fix the call sites to use the RTCD function.

Delete sse2 and mips implementation. They were based on a previous
implementation of the filter. It was changed in Dec 2015:
ece4fd5d2247c9512b31a93dd593de567beaf928

BUG=webm:1378

Change-Id: I0818e767a802966520b5c6e7999584ad13159276
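
As a rough illustration of "approximates division using multiply and shift", the sketch below replaces x / d with a multiply by a precomputed reciprocal followed by a right shift. The table, the divisor range 1..9 and the function name are assumptions made for this example, not details taken from the SSE4 implementation.

```c
#include <stdint.h>

/* Hypothetical table: kRecip16[d] = ceil(2^16 / d) for d in 1..9.
 * Index 0 is unused. */
static const uint32_t kRecip16[10] = { 0,     65536, 32768, 21846, 16384,
                                       13108, 10923, 9363,  8192,  7282 };

/* Approximate x / d as (x * M) >> 16 with M = ceil(2^16 / d).
 * The result equals the exact quotient while x * (M * d - 65536) < 65536;
 * for the divisors in the table that holds for every x below 8192. */
static uint32_t div_mul_shift(uint32_t x, uint32_t d) {
  return (x * kRecip16[d]) >> 16;
}
```

A multiply and a shift map directly onto SIMD instructions, whereas integer division generally has no vector equivalent on x86, so this form is the usual choice when a divisor comes from a small known set.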

7 years agovp9: Don't force disabling of adaptive_rd_thresh for realtime.
Jerome Jiang [Wed, 26 Apr 2017 18:12:21 +0000 (11:12 -0700)]
vp9: Don't force disabling of adaptive_rd_thresh for realtime.

Don't force disabling of adaptive_rd_thresh for realtime when
row_mt_bit_exact is set.

Row based adaptive rd is made usable in CL 454882
(https://chromium-review.googlesource.com/c/454882) for REALTIME.

Change-Id: Ief023414f0fd6eb86f299dd46ae58f4436875af5

7 years agoMerge "Make the row based multi-threaded encoder deterministic"
Yunqing Wang [Wed, 26 Apr 2017 16:12:14 +0000 (16:12 +0000)]
Merge "Make the row based multi-threaded encoder deterministic"

7 years agoMerge "Update highbd convolve functions arguments to use uint16_t src/dst"
Linfeng Zhang [Wed, 26 Apr 2017 15:50:45 +0000 (15:50 +0000)]
Merge "Update highbd convolve functions arguments to use uint16_t src/dst"

7 years agoMerge "vp9: SVC: Adjust some speed settings for temporal layers."
Marco Paniconi [Wed, 26 Apr 2017 15:45:05 +0000 (15:45 +0000)]
Merge "vp9: SVC: Adjust some speed settings for temporal layers."

7 years agoVP9: enable trellis for high bitdepth intra
Peter de Rivaz [Wed, 26 Apr 2017 10:40:58 +0000 (11:40 +0100)]
VP9: enable trellis for high bitdepth intra

BUG=webm:1409

Change-Id: I5236595aac1c09386c60ffe8ad621e01422ed5a7

7 years agoMerge "Fix the decoder seg fault when frame is corrupted."
Jerome Jiang [Wed, 26 Apr 2017 00:09:29 +0000 (00:09 +0000)]
Merge "Fix the decoder seg fault when frame is corrupted."

7 years agoMerge "vp9: speed >= 8: Skip uv variance in model_rd_sb_y_large"
Jerome Jiang [Wed, 26 Apr 2017 00:09:21 +0000 (00:09 +0000)]
Merge "vp9: speed >= 8: Skip uv variance in model_rd_sb_y_large"

7 years agovp9: SVC: Adjust some speed settings for temporal layers.
Marco [Tue, 25 Apr 2017 23:11:19 +0000 (16:11 -0700)]
vp9: SVC: Adjust some speed settings for temporal layers.

Make some speed setting changes for temporal enhancement layers,
and remove the switch in subpel_force_stop for the aggressive_base_mv
in non-rd pickmode.

Gain some 2-3% speed with little/negligible quality loss.

Change-Id: I3e2a7f80ff45f38c0a6ceb01b34dbca2f53edbf0

7 years agovp9: speed >= 8: Skip uv variance in model_rd_sb_y_large
Jerome Jiang [Fri, 21 Apr 2017 17:10:05 +0000 (10:10 -0700)]
vp9: speed >= 8: Skip uv variance in model_rd_sb_y_large

For speed >= 8 and color_sensitivity not set, skip the transform
skipping test in UV planes.
Add a new condition to check noise level to skip chroma check
for speed >= 8 if y_sad is high.

1~2% speedup on ARM for speed 8.

Borg tests show neutral results in both rtc and rtc_derf.

Change-Id: Idecd3ff6e28c97757a43bb6f3a7082c85f72109c

7 years agoClean vp9_highbd_build_inter_predictor() and highbd_inter_predictor()
Linfeng Zhang [Wed, 19 Apr 2017 21:48:07 +0000 (14:48 -0700)]
Clean vp9_highbd_build_inter_predictor() and highbd_inter_predictor()

BUG=webm:1388

Change-Id: I7ee32e0c08f0fb41712a8cc640b2c5bba872421d

7 years agoUpdate highbd convolve functions arguments to use uint16_t src/dst
Linfeng Zhang [Wed, 19 Apr 2017 20:08:25 +0000 (13:08 -0700)]
Update highbd convolve functions arguments to use uint16_t src/dst

BUG=webm:1388

Change-Id: I6912de2639895d817ce850da8ea9f6c8fe21da42

7 years agowebm_read_frame: avoid NULL dereference
James Zern [Sat, 22 Apr 2017 20:11:16 +0000 (13:11 -0700)]
webm_read_frame: avoid NULL dereference

block may be NULL with block_entry_eos or from return of GetBlock()

Change-Id: Ia0dd3ffa46305ee70efcdc55c05c2ad24efc993b

7 years agovp9: Reduce artifact in non-rd pickmode for lighting changes.
Marco [Fri, 14 Apr 2017 18:32:19 +0000 (11:32 -0700)]
vp9: Reduce artifact in non-rd pickmode for lighting changes.

Add a low-variance high-sumdiff to the superblock content state
and use it to limit the mv and bias some decisions in non-rd pickmode.
Only affects speed >= 6.

Reduces artifact for lighting changes.
Small/no difference in metrics on RTC set.

Change-Id: Ic84b2379fe0ae3fa71ae826ee6bae3eaf551a25b

7 years agoMake the row based multi-threaded encoder deterministic
Yunqing Wang [Mon, 24 Apr 2017 19:06:49 +0000 (12:06 -0700)]
Make the row based multi-threaded encoder deterministic

This patch followed the allow_exhaustive_searches feature modification
and continued to modify the encoder to achieve determinism in row
based multi-threaded encoding. With row-mt = 1 and multiple threads,
the adaptive feature in the encoder was disabled, which gave a
BDRate gain (at speed 1, -0.6% ~ -0.7%; at speed 2, -0.46% ~ -0.59%)
but some encoder speed loss (7% ~ 10% at speed 1 and 3% ~ 6% at
speed 2). These speed losses were acceptable considering the speed
gains obtained from row-mt.

Change-Id: I60d87a25346ebc487a864b57d559f560b7e398bb

7 years agoMerge "Make allow_exhaustive_searches feature no longer adaptive"
Yunqing Wang [Mon, 24 Apr 2017 17:41:10 +0000 (17:41 +0000)]
Merge "Make allow_exhaustive_searches feature no longer adaptive"

7 years agoMerge "vp9: SVC: fix condition for partition/skip threshold when denoising."
Marco Paniconi [Fri, 21 Apr 2017 21:28:16 +0000 (21:28 +0000)]
Merge "vp9: SVC: fix condition for partition/skip threshold when denoising."

7 years agoMake allow_exhaustive_searches feature no longer adaptive
Yunqing Wang [Thu, 20 Apr 2017 00:00:08 +0000 (17:00 -0700)]
Make allow_exhaustive_searches feature no longer adaptive

A previous patch turned on the allow_exhaustive_searches feature only
for FC_GRAPHICS_ANIMATION content. This patch further modified the
feature by removing the exhaustive search limit, and made it no longer
adaptive. As a result, the 2 counts that recorded the number of motion
searches were removed, which helped achieve determinism in row based
multi-threaded encoding. Tests showed that this patch didn't make
the encoder much slower.

Used exhaustive_searches_thresh for this speed feature, and removed
allow_exhaustive_searches. Also, refactored the speed feature code
to follow the general speed feature setting style.

Change-Id: Ib96b182c4c8dfff4c1ab91d2497cc42bb9e5a4aa

7 years agoMerge "vp9: Non-rd pickmode: Avoid computation duplication."
Jerome Jiang [Fri, 21 Apr 2017 00:51:46 +0000 (00:51 +0000)]
Merge "vp9: Non-rd pickmode: Avoid computation duplication."

7 years agovp9: SVC: fix condition for partition/skip threshold when denoising.
Marco [Thu, 20 Apr 2017 23:32:46 +0000 (16:32 -0700)]
vp9: SVC: fix condition for partition/skip threshold when denoising.

The more aggressive settings should only be used when denoise_svc
condition is satisfied (which means top spatial layer).

Change-Id: Ia8e3515b27f31bf21b1976ca80a2fa826daece3a

7 years agovp9: Non-rd pickmode: Avoid computation duplication.
Jerome Jiang [Thu, 20 Apr 2017 17:57:02 +0000 (10:57 -0700)]
vp9: Non-rd pickmode: Avoid computation duplication.

In non-rd pickmode (speed >= 5), avoid duplication of computations in
model_rd_for_sb_y when the speed feature use_simple_block_yrd is
enabled (or for high bitdepth build under certain conditions).

QVGA, VGA and HD have 1.23%, 2.68% and 1.7% speedup on ARM for speed 8,
respectively.

Encoding results are bitexact for speed >= 5.

Change-Id: I3f9130810c21439f5ad7e159e21cb2243dcd05f1

7 years agoFix the decoder seg fault when frame is corrupted.
Jerome Jiang [Thu, 20 Apr 2017 21:48:22 +0000 (14:48 -0700)]
Fix the decoder seg fault when frame is corrupted.

BUG=webm:1399

Change-Id: I1e006e0260d9b56a4d2273659ca19b86c69c474b

7 years agovp9: 1 pass SVC: Fix comment and condition for up-sampling reference.
Marco [Thu, 20 Apr 2017 21:13:57 +0000 (14:13 -0700)]
vp9: 1 pass SVC: Fix comment and condition for up-sampling reference.

No change in behavior.

Change-Id: I218fb30289091da623acb23324027435b8510d0e

7 years agoMerge "Only allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content"
Yunqing Wang [Thu, 20 Apr 2017 19:57:46 +0000 (19:57 +0000)]
Merge "Only allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content"

7 years agoMerge "vp9: Re-enable SVC datarate tests."
Marco Paniconi [Thu, 20 Apr 2017 19:53:19 +0000 (19:53 +0000)]
Merge "vp9: Re-enable SVC datarate tests."

7 years agovp9: Re-enable SVC datarate tests.
Marco [Wed, 19 Apr 2017 18:12:42 +0000 (11:12 -0700)]
vp9: Re-enable SVC datarate tests.

Re-enable the SVC tests, wrap the non-zero expectation
in GetMismatchFrames around #if CONFIG_VP9_DECODER.

Change-Id: I0e8a2d78b868c32f18fe597540f397d3a1b303b5

7 years agovp9: SVC: Redefine the source downsample filter choice.
Marco [Thu, 20 Apr 2017 16:50:16 +0000 (09:50 -0700)]
vp9: SVC: Redefine the source downsample filter choice.

Rename the source downsampling filter, and define it
per spatial layer. Used in 1 pass CBR SVC.

Change-Id: I8135f2ab89c535c53429b9c58b586f746bb668c7

7 years agoppc: Add the intra predictor tests
Luca Barbato [Tue, 18 Apr 2017 23:37:57 +0000 (23:37 +0000)]
ppc: Add the intra predictor tests

Change-Id: Idea15b916044ab3d8e74519337880a484ecfd87e

7 years agoppc: h predictor 8x8
Luca Barbato [Tue, 18 Apr 2017 22:55:53 +0000 (22:55 +0000)]
ppc: h predictor 8x8

Slightly faster with the current compiler.

Change-Id: Iae225fac08395eb430c97a2abec69c60f5cf5c47

7 years agoppc: d63 predictor 8x8
Luca Barbato [Tue, 11 Apr 2017 23:18:35 +0000 (01:18 +0200)]
ppc: d63 predictor 8x8

10x faster.

Change-Id: I7cedbf4df2ce7df5b6f1108b11815d088fdb9ba8

7 years agoppc: tm predictor 4x4
Luca Barbato [Sun, 9 Apr 2017 15:07:03 +0000 (15:07 +0000)]
ppc: tm predictor 4x4

Slightly faster.

Change-Id: I0ca43f309b3d9b50435d69bd5be64b53a99bd191

7 years agoppc: h predictor 4x4
Luca Barbato [Sun, 9 Apr 2017 13:44:41 +0000 (13:44 +0000)]
ppc: h predictor 4x4

2x faster.

Change-Id: I0583dec353299c6797401b646099f18db4e0420d

7 years agoppc: dc predictor 8x8
Luca Barbato [Sun, 9 Apr 2017 13:05:09 +0000 (13:05 +0000)]
ppc: dc predictor 8x8

Slightly faster. The other dc predictors cannot be faster since
the computation speedup is overwhelmed by the time spent reading
dst to write just the 8x8 part.

Change-Id: I94a0b50500adf8b7b6bb919dbf5c7adf5b9fba66

7 years agoppc: d45 predictor 8x8
Luca Barbato [Sun, 9 Apr 2017 11:07:22 +0000 (11:07 +0000)]
ppc: d45 predictor 8x8

11x faster.

Change-Id: I5b8f39213ee1f5260724fc254e3fb5c462435798

7 years agoppc: d63 predictor 32x32
Luca Barbato [Sun, 9 Apr 2017 00:09:56 +0000 (00:09 +0000)]
ppc: d63 predictor 32x32

About 10x faster.

Change-Id: If7d0645f75c5d7deb9751edd0bf47e2f9068e9e7

7 years agoppc: d63 predictor 16x16
Luca Barbato [Sun, 9 Apr 2017 00:09:56 +0000 (00:09 +0000)]
ppc: d63 predictor 16x16

About 18x faster.

Change-Id: Id043bf76c011e03e992085bb5e20f330d3e98cd4

7 years agoppc: d45 predictor 32x32
Luca Barbato [Sat, 8 Apr 2017 22:41:41 +0000 (22:41 +0000)]
ppc: d45 predictor 32x32

About 12x faster.

Change-Id: I22c150256aefb4941861ab1f6c17d554fb694bed

7 years agoppc: d45 predictor 16x16
Luca Barbato [Sat, 8 Apr 2017 22:41:41 +0000 (22:41 +0000)]
ppc: d45 predictor 16x16

About 16x faster.

Change-Id: Ie5469fb32d5fd11bb6cb06318cea475d8a5b00b9

7 years agoppc: dc predictor 32x32
Luca Barbato [Sat, 8 Apr 2017 02:55:33 +0000 (02:55 +0000)]
ppc: dc predictor 32x32

10x and 5x faster.

Change-Id: I7913c58c768334d818f541a5e219f1035791eeaf

7 years agoppc: dc top and left predictor 32x32
Luca Barbato [Sat, 8 Apr 2017 02:55:33 +0000 (02:55 +0000)]
ppc: dc top and left predictor 32x32

6x faster.

Change-Id: I717995b4056e5579c68191d11b495372971fe1ae

7 years agoppc: dc top and left predictor 16x16
Luca Barbato [Sat, 8 Apr 2017 02:55:33 +0000 (02:55 +0000)]
ppc: dc top and left predictor 16x16

13x faster.

Change-Id: I1771ac39fda599153f933cb3f0506c9f97a6cbe6

7 years agoppc: dc_128 predictor 32x32
Luca Barbato [Sat, 8 Apr 2017 00:39:24 +0000 (00:39 +0000)]
ppc: dc_128 predictor 32x32

6x faster.

Change-Id: I1da8f51b4262871cb98f0aa03ccda41b0ac2b08b

7 years agoppc: dc_128 predictor 16x16
Luca Barbato [Sat, 8 Apr 2017 00:26:54 +0000 (00:26 +0000)]
ppc: dc_128 predictor 16x16

20x faster.

Change-Id: I05f0deb2d38ae7966eae6b71fbc0aa51880e5709

7 years agoppc: tm predictor 32x32
Luca Barbato [Fri, 7 Apr 2017 14:49:00 +0000 (14:49 +0000)]
ppc: tm predictor 32x32

About 8x faster.

Change-Id: I9bad827ccbdf47ec95406e961c74ac2ff45f80cf

7 years agoMerge changes I1f5a3752,I95123051,I3bb724e0,Ie81077fa,Ic80f3c05, ...
James Zern [Thu, 20 Apr 2017 02:45:44 +0000 (02:45 +0000)]
Merge changes I1f5a3752,I95123051,I3bb724e0,Ie81077fa,Ic80f3c05, ...

* changes:
  ppc: tm predictor 16x16
  ppc: tm predictor 8x8
  ppc: horizontal predictor 32x32
  ppc: horizontal predictor 16x16
  ppc: vertical intrapred 16x16 and 32x32
  configure: Workaround clang not enabling altivec on -mvsx
  configure: Match power*64* as ppc64

7 years agoOnly allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content
Yunqing Wang [Wed, 19 Apr 2017 23:32:59 +0000 (16:32 -0700)]
Only allow allow_exhaustive_searches for FC_GRAPHICS_ANIMATION content

The allow_exhaustive_searches feature improves the encoding quality
of FC_GRAPHICS_ANIMATION content a lot. For non-FC_GRAPHICS_ANIMATION
content, the quality test result is almost neutral. This patch
restricts the feature to FC_GRAPHICS_ANIMATION content.

The motivation is to make this feature no longer adaptive, which will
be implemented in the following patch.

Change-Id: Ic911df6dd757402b6480789cc247801e99840369

7 years agoMerge changes I9e18a73b,Ie47c8cd4
Linfeng Zhang [Wed, 19 Apr 2017 23:55:57 +0000 (23:55 +0000)]
Merge changes I9e18a73b,Ie47c8cd4

* changes:
  Clean CONVERT_TO_BYTEPTR/SHORTPTR in convolve
  Create CAST_TO_BYTEPTR/SHORTPTR

7 years agoClean CONVERT_TO_BYTEPTR/SHORTPTR in convolve
Linfeng Zhang [Thu, 6 Apr 2017 00:54:42 +0000 (17:54 -0700)]
Clean CONVERT_TO_BYTEPTR/SHORTPTR in convolve

Replace by CAST_TO_BYTEPTR/SHORTPTR.
The rule is: if a short ptr is cast to a byte ptr, any offset
operation on the byte ptr must be doubled. We do this by casting to
short ptr first, adding the offset, then casting back to byte ptr.

BUG=webm:1388

Change-Id: I9e18a73ba45ddae58fc9dae470c0ff34951fe248
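
A minimal sketch of the offset rule described in this commit message. The two macros are illustrative stand-ins written as plain pointer casts, which is an assumption about how CAST_TO_BYTEPTR/SHORTPTR behave, and advance_pixels() is a hypothetical helper rather than a libvpx function.

```c
#include <stdint.h>

/* Illustrative stand-ins (assumed to be plain casts). */
#define CAST_TO_SHORTPTR(x) ((uint16_t *)((void *)(x)))
#define CAST_TO_BYTEPTR(x) ((uint8_t *)((void *)(x)))

/* Hypothetical helper: move a byte-typed handle to a 16-bit buffer forward
 * by `pixels` samples. Doing the arithmetic on the short pointer advances
 * the address by 2 * pixels bytes, which is the "doubled" offset. */
static uint8_t *advance_pixels(uint8_t *buf8, int pixels) {
  uint16_t *buf16 = CAST_TO_SHORTPTR(buf8);
  return CAST_TO_BYTEPTR(buf16 + pixels);
}
```

Writing `buf8 + pixels` directly would advance by only half the intended distance, since each sample occupies two bytes.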

7 years agoMerge "vp9: Add phase to get averaging filter for 1:2 downsampling."
Marco Paniconi [Wed, 19 Apr 2017 15:27:55 +0000 (15:27 +0000)]
Merge "vp9: Add phase to get averaging filter for 1:2 downsampling."

7 years agovp9: Fix the disabling of a SVC 3TL datarate test.
Marco [Wed, 19 Apr 2017 14:59:59 +0000 (07:59 -0700)]
vp9: Fix the disabling of a SVC 3TL datarate test.

Change-Id: Ib42d23ab5ee39ab3c85e1d9a84e36249e59fe74e

7 years agovp9: Add phase to get averaging filter for 1:2 downsampling.
Marco [Fri, 14 Apr 2017 00:19:06 +0000 (17:19 -0700)]
vp9: Add phase to get averaging filter for 1:2 downsampling.

The scaling filter with zero shift will give sub-sampling for
2x downsampling. Allow for a phase shift to get an averaging filter.

Usage is for source scaling in 1 pass SVC mode for 1:2 downscale.
Reduces aliasing in downsampled image.

Keep the phase to 0/off for now.

Change-Id: Ic547ea0748d151b675f877527e656407fcf4d51e
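
The effect of a phase shift on 2:1 downsampling can be sketched in one dimension. The example below uses a 2-tap bilinear filter as a simplified stand-in for the actual scaling filter, with the phase given in 1/16 sample units. The function name, the bilinear kernel and the unit are assumptions made for illustration.

```c
#include <stdint.h>

/* 1-D 2:1 downsample with bilinear interpolation (simplified stand-in).
 * phase16 is the sampling offset in 1/16 sample units:
 *   0 -> picks every other input sample (plain sub-sampling)
 *   8 -> samples halfway between neighbours (average of each pair) */
static void downsample_2to1(const uint8_t *src, int src_len, uint8_t *dst,
                            int phase16) {
  const int dst_len = src_len / 2;
  for (int i = 0; i < dst_len; ++i) {
    const int pos = i * 32 + phase16; /* source position in 1/16 units */
    const int x = pos >> 4;           /* integer sample index */
    const int frac = pos & 15;        /* fractional part, 0..15 */
    const int x1 = (x + 1 < src_len) ? x + 1 : src_len - 1;
    dst[i] = (uint8_t)((src[x] * (16 - frac) + src[x1] * frac + 8) >> 4);
  }
}
```

For an alternating input such as 0, 100, 0, 100, phase 0 keeps only the zeros, while phase 8 produces 50 for every output sample. This is the aliasing reduction the commit message refers to.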

7 years agoppc: tm predictor 16x16
Luca Barbato [Fri, 7 Apr 2017 14:49:00 +0000 (14:49 +0000)]
ppc: tm predictor 16x16

About 10x faster.

Change-Id: I1f5a3752d346459df3b45f92963208bf3e520f06

7 years agoppc: tm predictor 8x8
Luca Barbato [Fri, 7 Apr 2017 14:49:00 +0000 (14:49 +0000)]
ppc: tm predictor 8x8

About 5x faster.

Change-Id: I951230517f49c0dca9ac9eac2efa8916a303b85a

7 years agoppc: horizontal predictor 32x32
Luca Barbato [Fri, 7 Apr 2017 14:49:00 +0000 (14:49 +0000)]
ppc: horizontal predictor 32x32

About 5x faster.

Change-Id: I3bb724e07baffd901aa2d0f65060ba48882cc9b8

7 years agoppc: horizontal predictor 16x16
Luca Barbato [Fri, 7 Apr 2017 14:49:00 +0000 (14:49 +0000)]
ppc: horizontal predictor 16x16

About 10x faster.

Change-Id: Ie81077fa32ad214cdb46bdcb0be4e9e2c7df47c2

7 years agoppc: vertical intrapred 16x16 and 32x32
Luca Barbato [Fri, 7 Apr 2017 13:50:12 +0000 (13:50 +0000)]
ppc: vertical intrapred 16x16 and 32x32

Change-Id: Ic80f3c050cfbe7697e81a311b4edaaa597b85cab

7 years agoconfigure: Workaround clang not enabling altivec on -mvsx
Luca Barbato [Tue, 18 Apr 2017 18:31:18 +0000 (18:31 +0000)]
configure: Workaround clang not enabling altivec on -mvsx

The flag `-mvsx` implies `-maltivec`.

Change-Id: I7544553eba131a533467b387f8bf329d57f5af5c

7 years agoconfigure: Match power*64* as ppc64
Luca Barbato [Fri, 7 Apr 2017 13:14:35 +0000 (13:14 +0000)]
configure: Match power*64* as ppc64

Change-Id: Ie640dff50a5db935bb57c5a2570b423ce8946f2c

7 years agoCreate CAST_TO_BYTEPTR/SHORTPTR
Linfeng Zhang [Thu, 6 Apr 2017 00:40:12 +0000 (17:40 -0700)]
Create CAST_TO_BYTEPTR/SHORTPTR

They will replace CONVERT_TO_BYTEPTR/SHORTPTR module by module.

BUG=webm:1388

Change-Id: Ie47c8cd4897696481b9cbbf9e2d439dc22dc85ec

7 years agovp9: Disable some SVC tests for now.
Marco [Tue, 18 Apr 2017 16:43:32 +0000 (09:43 -0700)]
vp9: Disable some SVC tests for now.

Disable the 1 pass CBR SVC tests with temporal_layers > 1.
There is an issue with commit 863f860, which causes encoder/decoder
mismatch due to skipping the encoder loopfilter for non-reference frames.

Will re-enable the tests when fixed.

Change-Id: I74918a0045a17976b069c4be63fbeb921974df0d

7 years agovp9: Add key_frame condition to is_reference check for loopfilter.
Marco [Mon, 17 Apr 2017 21:36:11 +0000 (14:36 -0700)]
vp9: Add key_frame condition to is_reference check for loopfilter.

This condition is not needed as key_frame should set the refresh
of the reference frames, but it is good to have for clarity in the condition.

Change-Id: Icf9838e7e4f0ff5cf0a9562ae3b5d6c7e6f78702

7 years agoMerge "re-enable vpx_comp_avg_pred_sse2"
Johann Koenig [Mon, 17 Apr 2017 22:07:34 +0000 (22:07 +0000)]
Merge "re-enable vpx_comp_avg_pred_sse2"

7 years agoRevert "Revert "vp9: Avoid encoder loopfilter for non-reference frames.""
Marco Paniconi [Mon, 17 Apr 2017 18:00:09 +0000 (18:00 +0000)]
Revert "Revert "vp9: Avoid encoder loopfilter for non-reference frames.""

This reverts commit e9b7f98c56b3b9c99a60eb41b83bf8346b3ad25f.

Reason for revert:
Commit d578bdad fixes the issue (encoder/decoder mismatch
in 3TL datarate test) that caused the original revert.

Original change's description:
> Revert "vp9: Avoid encoder loopfilter for non-reference frames."
>
> This reverts commit 863f860bfcf3bdc26eeecb299aa481d0f63d11ac.
>
> This causes encoder / decoder mismatches in various
> VP9/DatarateTestVP9Large.BasicRateTargeting3TemporalLayers tests
>
> BUG=webm:1408
>
> Change-Id: Ic200c39d7ed9c0b0247ef562f5d6f7b2625f7e14
>

TBR=jzern@google.com,marpan@google.com,builds@webmproject.org,jianj@google.com
BUG=webm:1408

Change-Id: Ifeb81460856d1d56482d4e0477a70ee98f8bfaa6

7 years agovp9: Datarate test: modify frame flags for 3 TL.
Marco [Mon, 17 Apr 2017 16:19:03 +0000 (09:19 -0700)]
vp9: Datarate test: modify frame flags for 3 TL.

Modify the frame flags to update the ARF on top layer,
for the tests:
VP9/DatarateTestVP9Large.BasicRateTargeting3TemporalLayers
VP9/DatarateTestVP9Large.BasicRateTargeting3TemporalLayersFrameDropping

This is needed to fix the encoder/decoder mismatches caused by 863f860,
and removed in the revert e9b7f98.

Change-Id: I6b9fecfdd17315fc0179e29949338c77636026c0