]> granicus.if.org Git - libvpx/log
libvpx
7 years agoBug in scale_sse_threshold()
paulwilkins [Thu, 9 Feb 2017 16:30:38 +0000 (16:30 +0000)]
Bug in scale_sse_threshold()

The function scale_sse_threshold() returns a threshold scaled
if necessary for use with 10 and 12 bit from an 8 bit baseline.

SSE error values would be expected to rise for the 10 and 12
bit cases where there are more bits of precision.

Hence the threshold used for the test should also be scaled up.

Change-Id: I4009c98b6eecd1bf64c3c38aaa56598e0136b03d

7 years agoAdditional first pass stats.
paulwilkins [Mon, 12 Dec 2016 14:05:19 +0000 (14:05 +0000)]
Additional first pass stats.

Added counts that split the intra coded blocks into low and high variance.

Change-Id: Ic540144b34d5141659081bb22f7ee16fd6861f14

7 years agoMerge "Aggressive VBR method."
Paul Wilkins [Wed, 15 Feb 2017 10:37:02 +0000 (10:37 +0000)]
Merge "Aggressive VBR method."

7 years agoMerge "vp9 fdct higbd neon: connect existing highbd calls"
Johann Koenig [Wed, 15 Feb 2017 01:33:00 +0000 (01:33 +0000)]
Merge "vp9 fdct higbd neon: connect existing highbd calls"

7 years agoMerge "Row based multi-threading of encoding stage"
Yunqing Wang [Wed, 15 Feb 2017 00:54:10 +0000 (00:54 +0000)]
Merge "Row based multi-threading of encoding stage"

7 years agoRow based multi-threading of encoding stage
Ranjit Kumar Tulabandu [Fri, 10 Feb 2017 10:55:50 +0000 (16:25 +0530)]
Row based multi-threading of encoding stage

(Yunqing Wang)
This patch implements the row-based multi-threading within tiles in
the encoding pass, and substantially speeds up the multi-threaded
encoder in VP9.

Speed tests at speed 1 on STDHD(using 4 tiles) set show that the
average speedups of the encoding pass(second pass in the 2-pass
encoding) is 7% while using 2 threads, 16% while using 4 threads,
85% while using 8 threads, and 116% while using 16 threads.

Change-Id: I12e41dbc171951958af9e6d098efd6e2c82827de

7 years agoMerge "Replace 14 with DCT_CONST_BITS in idct NEON functions' shifts"
Linfeng Zhang [Wed, 15 Feb 2017 00:46:29 +0000 (00:46 +0000)]
Merge "Replace 14 with DCT_CONST_BITS in idct NEON functions' shifts"

7 years agovp9 fdct higbd neon: connect existing highbd calls
Johann [Fri, 3 Feb 2017 23:25:50 +0000 (15:25 -0800)]
vp9 fdct higbd neon: connect existing highbd calls

Change-Id: Ia8f822bd6e70b3911bc433a5a750bfb6f9a3a75c

7 years agoMerge "quantize_fp highbd neon: use tran_low_t for coeff"
Johann Koenig [Tue, 14 Feb 2017 21:28:22 +0000 (21:28 +0000)]
Merge "quantize_fp highbd neon: use tran_low_t for coeff"

7 years agoReplace 14 with DCT_CONST_BITS in idct NEON functions' shifts
Linfeng Zhang [Tue, 14 Feb 2017 20:44:57 +0000 (12:44 -0800)]
Replace 14 with DCT_CONST_BITS in idct NEON functions' shifts

Change-Id: I2a39a3bb87516b04d273bc1c0f4a634e3fb6f0f6

7 years agoapply clang-format
clang-format [Tue, 14 Feb 2017 04:06:18 +0000 (20:06 -0800)]
apply clang-format

Change-Id: I75e4a9e0b37bd4586f26c8d6c1fa27f3f6ff1bce

7 years ago.clang-format: update to 3.9.1
James Zern [Wed, 1 Feb 2017 02:06:43 +0000 (18:06 -0800)]
.clang-format: update to 3.9.1

Change-Id: Ia51f2201df897651067d09122075953382b59139

7 years agoMerge "Replace idct32x32_34_add_ssse3 assembly with intrinsics"
Yi Luo [Tue, 14 Feb 2017 20:13:26 +0000 (20:13 +0000)]
Merge "Replace idct32x32_34_add_ssse3 assembly with intrinsics"

7 years agoReplace idct32x32_34_add_ssse3 assembly with intrinsics
Yi Luo [Wed, 8 Feb 2017 19:09:03 +0000 (11:09 -0800)]
Replace idct32x32_34_add_ssse3 assembly with intrinsics

- No user-level speed performance change.
- Pass unit tests.

Change-Id: Idfc598e00f354265e41f6b3219f4734216c115c6

7 years agoquantize_fp highbd neon: use tran_low_t for coeff
Johann [Fri, 3 Feb 2017 22:24:32 +0000 (14:24 -0800)]
quantize_fp highbd neon: use tran_low_t for coeff

Change-Id: I90fd815f15884490ad138f35df575a00d31e8c95

7 years agovp8 onyx_if: assert divide by zero
Johann [Tue, 31 Jan 2017 23:51:15 +0000 (15:51 -0800)]
vp8 onyx_if: assert divide by zero

Clears up static clang analysis warning regarding divide by zero.

Trying to explain to the compiler how it's impossible to avoid
incrementing num_blocks at least once is difficult.

Change-Id: Ibaae43be572e5cd7a689b440dcd341c17d33443b

7 years agoMerge "Remove UNINITIALIZED_IS_SAFE"
Johann Koenig [Tue, 14 Feb 2017 03:02:50 +0000 (03:02 +0000)]
Merge "Remove UNINITIALIZED_IS_SAFE"

7 years agoMerge "Add vpx_highbd_idct16x16_256_add_neon()"
Linfeng Zhang [Tue, 14 Feb 2017 01:15:34 +0000 (01:15 +0000)]
Merge "Add vpx_highbd_idct16x16_256_add_neon()"

7 years agoRemove UNINITIALIZED_IS_SAFE
Johann [Tue, 31 Jan 2017 23:18:40 +0000 (15:18 -0800)]
Remove UNINITIALIZED_IS_SAFE

Where clang static analysis or gcc -Wmaybe-uninitialized warns of
uninitialized values, assign 0 to ints, MB_MODE_COUNT to
MB_PREDICTION_MODE, and B_MODE_COUNT to B_PREDICTION_MODE.

Assert that the modes have been changed from the invalid value by
the end of the function.

Change-Id: Ib11e1ffb08f0a6fe4b6c6729dc93b83b1c4b6350

7 years agoAdd vpx_highbd_idct16x16_256_add_neon()
Linfeng Zhang [Thu, 2 Feb 2017 00:32:57 +0000 (16:32 -0800)]
Add vpx_highbd_idct16x16_256_add_neon()

BUG=webm:1301

Change-Id: I6bb755552a39bdd26eef3f449601f6a9766c65ec

7 years agoMerge "fdct8x8 highbd neon: use tran_low_t for output"
Johann Koenig [Mon, 13 Feb 2017 23:11:30 +0000 (23:11 +0000)]
Merge "fdct8x8 highbd neon: use tran_low_t for output"

7 years agofdct8x8 highbd neon: use tran_low_t for output
Johann [Fri, 3 Feb 2017 23:14:06 +0000 (15:14 -0800)]
fdct8x8 highbd neon: use tran_low_t for output

Change-Id: I100c4a1955d80bec4d28e82796b3e7f57e84d0ba

7 years agoThe bitstream bit match test in multi-threaded encoder
Yunqing Wang [Mon, 13 Feb 2017 20:29:31 +0000 (12:29 -0800)]
The bitstream bit match test in multi-threaded encoder

While the new-mt mode is enabled(namely, allowing to use row-based
multi-threading in encoder), several speed features that adaptively
adjust encoding parameters during encoding would cause mismatch
between single-thread encoded bitstream and multi-thread encoded
bitstream. This patch provides a set_control API to disable these
features, so that the bit match bitstream is obtained in the unit
test.

Change-Id: Ie9868bafdfe196296d1dd29e0dca517f6a9a4d60

7 years agoMerge "Minor code style refactoring"
Yunqing Wang [Mon, 13 Feb 2017 21:01:41 +0000 (21:01 +0000)]
Merge "Minor code style refactoring"

7 years agoMerge "cosmetics,vp9_ratectrl: apply clang-format"
James Zern [Mon, 13 Feb 2017 21:01:18 +0000 (21:01 +0000)]
Merge "cosmetics,vp9_ratectrl: apply clang-format"

7 years agoMerge "vpx_usec_timer_elapsed: use 64-bit math"
James Zern [Mon, 13 Feb 2017 21:00:33 +0000 (21:00 +0000)]
Merge "vpx_usec_timer_elapsed: use 64-bit math"

7 years agoMinor code style refactoring
Yunqing Wang [Mon, 13 Feb 2017 20:59:01 +0000 (12:59 -0800)]
Minor code style refactoring

Change-Id: I20107693d0a87e08a10520bfb573ff3dcef69fdb

7 years agocosmetics,vp9_ratectrl: apply clang-format
James Zern [Mon, 13 Feb 2017 19:29:39 +0000 (11:29 -0800)]
cosmetics,vp9_ratectrl: apply clang-format

broken since:
c3f095c8b Merge "Fix to avoid abrupt relaxation of max qindex in recode path"
5f21aba4b Fix to avoid abrupt relaxation of max qindex in recode path

the original change pre-dated the addition of .clang-format

Change-Id: If5e399d9a805bcad9147360b13b36fbc8c560a7c

7 years agoAdd vpx_highbd_idct{16x16,32x32}_1_add_neon()
Linfeng Zhang [Fri, 13 Jan 2017 18:01:51 +0000 (10:01 -0800)]
Add vpx_highbd_idct{16x16,32x32}_1_add_neon()

and update vpx_highbd_idct8x8_1_add_neon()

BUG=webm:1301

Change-Id: I18d1a0cbe98ba822d5194c1b4e13a4c29c5c75f4

7 years agoAggressive VBR method.
paulwilkins [Thu, 5 Jan 2017 12:35:53 +0000 (12:35 +0000)]
Aggressive VBR method.

VBR method that allows a wider Q range for the first normal frame
in each ARF group and then centers the min - max range for the rest of
the arf group on the chosen Q value for that first frame.

This allows for quite rapid adjustment of the active Q range even if the
initial estimate is poor.

In some cases where the ARF frames themselves are tending to
undershoot but the normal frames are overshooting this can still give
net undershoot. This can be corrected by allowing a larger Q delta for
arf frames but is usually is a sign that the allocation to the arfs was to
high.

Change-Id: Icec87758925d8f7aeb2dca29aac0ff9496237469

7 years agoMerge "Add vpx_idct16x16_38_add_neon()"
James Zern [Sat, 11 Feb 2017 03:42:35 +0000 (03:42 +0000)]
Merge "Add vpx_idct16x16_38_add_neon()"

7 years agovp9: Non-rd mode: use simple block_yrd for 8 bit high bitdepth builds
Marco [Tue, 7 Feb 2017 19:46:36 +0000 (11:46 -0800)]
vp9: Non-rd mode: use simple block_yrd for 8 bit high bitdepth builds

Temporary fix until optimization work for block_yrd is completed.
This essentially reverts back to the state before the change:
https://chromium-review.googlesource.com/c/433821/

Compression loss is about ~5-6% on RTC set.
Speed-up (from using this simple/model-based block_yrd) over the low
bitdepth builds (which uses more complex block_yrd) is ~5% on 720p.

Change-Id: Ie0af9eb0d111e5595f587870c44f08317403b8d8

7 years agovpx_usec_timer_elapsed: use 64-bit math
James Zern [Fri, 10 Feb 2017 03:28:59 +0000 (19:28 -0800)]
vpx_usec_timer_elapsed: use 64-bit math

this prevents a rollover when tv_sec is a long:
signed integer overflow: 2776 * 1000000 cannot be represented in type
'long'

Change-Id: I03dc4476ee122b02e2856dad28358a20cf16a9f8

7 years agoMerge "Fix to avoid abrupt relaxation of max qindex in recode path"
Paul Wilkins [Thu, 9 Feb 2017 17:17:55 +0000 (17:17 +0000)]
Merge "Fix to avoid abrupt relaxation of max qindex in recode path"

7 years agoMerge "Fix for max qindex calculation of a gf interval"
Paul Wilkins [Thu, 9 Feb 2017 17:17:44 +0000 (17:17 +0000)]
Merge "Fix for max qindex calculation of a gf interval"

7 years agoAdd vpx_idct16x16_38_add_neon()
Linfeng Zhang [Tue, 7 Feb 2017 18:06:21 +0000 (10:06 -0800)]
Add vpx_idct16x16_38_add_neon()

The RunQuantCheck() test on it exposes 16-bit overflow in stage 7 of
pass 2. Change to use saturating add/sub for both
vpx_idct16x16_38_add_neon() and vpx_idct16x16_256_add_neon() for high
bitdepth.

Change-Id: Ibf4c107a887553a52852cc582e28d38a5a5a2712

7 years agoReplace idct8x8_12_add_ssse3 assembly code with intrinsics
Yi Luo [Wed, 8 Feb 2017 00:40:10 +0000 (16:40 -0800)]
Replace idct8x8_12_add_ssse3 assembly code with intrinsics

- Performance achieves the same as assembly.
- Unit tests pass.

Change-Id: I6eacfbbd826b3946c724d78fbef7948af6406ccd

7 years agoMerge "Add vpx_idct16x16_38_add_c()"
Linfeng Zhang [Wed, 8 Feb 2017 17:20:19 +0000 (17:20 +0000)]
Merge "Add vpx_idct16x16_38_add_c()"

7 years agoMerge "block_error_fp highbd sse2: use tran_low_t for coeff"
Johann Koenig [Tue, 7 Feb 2017 23:26:09 +0000 (23:26 +0000)]
Merge "block_error_fp highbd sse2: use tran_low_t for coeff"

7 years agoMerge "vp9: Denoiser speed-up: increase partition and ac skip thresholds."
Marco Paniconi [Tue, 7 Feb 2017 22:24:59 +0000 (22:24 +0000)]
Merge "vp9: Denoiser speed-up: increase partition and ac skip thresholds."

7 years agoMerge "Row based multi-threading of ARNR filtering stage"
Yunqing Wang [Tue, 7 Feb 2017 19:55:40 +0000 (19:55 +0000)]
Merge "Row based multi-threading of ARNR filtering stage"

7 years agoMerge "vp9: Adjust rate_err threshold for setting active_worst factor."
Marco Paniconi [Tue, 7 Feb 2017 19:25:47 +0000 (19:25 +0000)]
Merge "vp9: Adjust rate_err threshold for setting active_worst factor."

7 years agovp9: Denoiser speed-up: increase partition and ac skip thresholds.
Marco [Tue, 7 Feb 2017 01:02:28 +0000 (17:02 -0800)]
vp9: Denoiser speed-up: increase partition and ac skip thresholds.

Add factor to increase varianace partition and ac skip thresholds,
under certain conditions (noise level and sum_diff), to increase
denoiser speed.

Change-Id: I7671140ef3598bf5f114a72623d68792bcd7b77b

7 years agoAdd vpx_idct16x16_38_add_c()
Linfeng Zhang [Wed, 11 Jan 2017 20:22:37 +0000 (12:22 -0800)]
Add vpx_idct16x16_38_add_c()

When eob is less than or equal to 38 for 16x16 idct, call this function.

Change-Id: Ief6f3fb16a49ace3c92cebf4e220bf5bf52a6087

7 years agovp9: Adjust rate_err threshold for setting active_worst factor.
Marco [Tue, 7 Feb 2017 17:35:56 +0000 (09:35 -0800)]
vp9: Adjust rate_err threshold for setting active_worst factor.

Only affects 1 pass vbr.
Small improvement on ytlive set.

Change-Id: I09a7456fe658fbea82ece1035cf683bd8bd8bd14

7 years agoMerge "Update 16x16 8-bit idct NEON intrinsics"
Linfeng Zhang [Tue, 7 Feb 2017 16:52:39 +0000 (16:52 +0000)]
Merge "Update 16x16 8-bit idct NEON intrinsics"

7 years agoblock_error_fp highbd sse2: use tran_low_t for coeff
Johann [Thu, 2 Feb 2017 23:28:16 +0000 (15:28 -0800)]
block_error_fp highbd sse2: use tran_low_t for coeff

BUG=webm:1365

Change-Id: Id2ed3ebaaaa6a4b68628c23e08b64ea5f1341761

7 years agoRow based multi-threading of ARNR filtering stage
Ranjit Kumar Tulabandu [Mon, 6 Feb 2017 13:44:16 +0000 (19:14 +0530)]
Row based multi-threading of ARNR filtering stage

Change-Id: Ic238d32c7e10b730342224ab56712a89a6026a8f

7 years agoMerge "highbd x86: consolidate tran_low_t conversions"
Johann Koenig [Tue, 7 Feb 2017 02:49:58 +0000 (02:49 +0000)]
Merge "highbd x86: consolidate tran_low_t conversions"

7 years agovp9: speed 8: Tune threshold of ac skip and partitioning.
Jerome Jiang [Fri, 3 Feb 2017 01:51:01 +0000 (17:51 -0800)]
vp9: speed 8: Tune threshold of ac skip and partitioning.

Threshold for partitioning only affects VGA and lower res.
0.07% quality regression is observed in borg tests on rtc_derf
and 0.2% regression on rtc.
5.6% speed up for low res and 6.8% for VGA on Nexus 6.

Change-Id: If85a2919b48c991de66059c90f32ed06980452be

7 years agohighbd x86: consolidate tran_low_t conversions
Johann [Thu, 2 Feb 2017 22:17:26 +0000 (14:17 -0800)]
highbd x86: consolidate tran_low_t conversions

Create new helper files specifically for converting tran_low_t types.

Change-Id: I7c4c458ef910f3b3d10a3cfbf9df4de7682fd905

7 years agoMerge "Changes to facilitate multi-threading of encoding stage"
Yunqing Wang [Sat, 4 Feb 2017 01:02:29 +0000 (01:02 +0000)]
Merge "Changes to facilitate multi-threading of encoding stage"

7 years agoFix visual studio build failure
Yunqing Wang [Fri, 3 Feb 2017 16:29:21 +0000 (08:29 -0800)]
Fix visual studio build failure

Fixed the following issue.
..\test\vp9_ethread_test.cc(69): warning C4805: '|=' : unsafe mix of type 'bool' and type 'int' in operation [C:\src\buildbot\test-libvpx\tests\dveCPjwhBE\.build-x86_64-win64-vs10\test_libvpx.vcxproj]
..\test\vp9_ethread_test.cc(69): warning C4800: 'int' : forcing value to bool 'true' or 'false' (performance warning) [C:\src\buildbot\test-libvpx\tests\dveCPjwhBE\.build-x86_64-win64-vs10\test_libvpx.vcxproj]

Change-Id: I37f897cf12a0b7500d2fcbac9e4615f08a83fdb4

7 years agoMerge "Add unit tests for vp9_block_error_fp."
Jerome Jiang [Thu, 2 Feb 2017 22:20:42 +0000 (22:20 +0000)]
Merge "Add unit tests for vp9_block_error_fp."

7 years agoMerge "Add SSSE3 intrinsic 8x8 inverse 2D-DCT"
Jingning Han [Thu, 2 Feb 2017 22:18:31 +0000 (22:18 +0000)]
Merge "Add SSSE3 intrinsic 8x8 inverse 2D-DCT"

7 years agoAdd unit tests for vp9_block_error_fp.
Jerome Jiang [Wed, 1 Feb 2017 04:21:55 +0000 (20:21 -0800)]
Add unit tests for vp9_block_error_fp.

BUG=webm:1365

Change-Id: I004e5cd7ca331d14b31b7fc3edeee45fce064026

7 years agoMerge "Update third_party/googletest to 1.8.0"
Johann Koenig [Thu, 2 Feb 2017 20:15:45 +0000 (20:15 +0000)]
Merge "Update third_party/googletest to 1.8.0"

7 years agoUpdate third_party/googletest to 1.8.0
Johann [Thu, 26 Jan 2017 22:34:45 +0000 (14:34 -0800)]
Update third_party/googletest to 1.8.0

Change-Id: If61137e28291f2a0911e9260eb58f234e0d8594c

7 years agoChanges to facilitate multi-threading of encoding stage
Ranjit Kumar Tulabandu [Thu, 2 Feb 2017 08:49:29 +0000 (14:19 +0530)]
Changes to facilitate multi-threading of encoding stage

Modified the encoding stage to have row level entry points with relevant
initializations and to access the token information at row level

Change-Id: Ife10e55a7c1a420ee906d711caf75002688d9e39

7 years agoMerge "Add mips msa sum_squares_2d_i16 function"
Kaustubh Raste [Thu, 2 Feb 2017 08:09:21 +0000 (08:09 +0000)]
Merge "Add mips msa sum_squares_2d_i16 function"

7 years agoMerge "Remove neon assembly for idct 16x16 and 8x8"
Johann Koenig [Thu, 2 Feb 2017 03:25:31 +0000 (03:25 +0000)]
Merge "Remove neon assembly for idct 16x16 and 8x8"

7 years agoMerge changes I43521ad3,I013659f6
Johann Koenig [Thu, 2 Feb 2017 03:03:58 +0000 (03:03 +0000)]
Merge changes I43521ad3,I013659f6

* changes:
  satd highbd neon: use tran_low_t for coeff
  satd highbd sse2: use tran_low_t for coeff

7 years agoUpdate 16x16 8-bit idct NEON intrinsics
Linfeng Zhang [Mon, 30 Jan 2017 20:25:58 +0000 (12:25 -0800)]
Update 16x16 8-bit idct NEON intrinsics

Remove redundant memory accesses.

Change-Id: I8049074bdba5f49eab7e735b2b377423a69cd4c8

7 years agoAdd SSSE3 intrinsic 8x8 inverse 2D-DCT
Jingning Han [Wed, 1 Feb 2017 20:19:01 +0000 (12:19 -0800)]
Add SSSE3 intrinsic 8x8 inverse 2D-DCT

The intrinsic version reduces the average cycles from 183 to 175.

Change-Id: I7c1bcdb0a830266e93d8347aed38120fb3be0e03

7 years agoMerge "Changes to facilitate row based multi-threading of ARNR filtering"
Yunqing Wang [Wed, 1 Feb 2017 22:04:14 +0000 (22:04 +0000)]
Merge "Changes to facilitate row based multi-threading of ARNR filtering"

7 years agoMerge changes I374dfc08,I7e15192e,Ica414007
Johann Koenig [Wed, 1 Feb 2017 21:56:35 +0000 (21:56 +0000)]
Merge changes I374dfc08,I7e15192e,Ica414007

* changes:
  hadamard highbd ssse3: use tran_low_t for coeff
  hadamard highbd neon: use tran_low_t for coeff
  hadamard highbd sse2: use tran_low_t for coeff

7 years agoChanges to facilitate row based multi-threading of ARNR filtering
Ranjit Kumar Tulabandu [Wed, 1 Feb 2017 11:04:49 +0000 (16:34 +0530)]
Changes to facilitate row based multi-threading of ARNR filtering

Change-Id: I2fd72af00afbbeb903e4fe364611abcc148f2fbb

7 years agoMerge "vp9_rdopt: declare 'c' closer to use"
Johann Koenig [Wed, 1 Feb 2017 20:55:11 +0000 (20:55 +0000)]
Merge "vp9_rdopt: declare 'c' closer to use"

7 years agovp9_rdopt: declare 'c' closer to use
Johann [Tue, 31 Jan 2017 23:35:20 +0000 (15:35 -0800)]
vp9_rdopt: declare 'c' closer to use

Clears up static clang analysis warning regarding a dead store. Only
declare 'c' when it will be used.

Change-Id: I1ac0fc7f94bc44da63938c63cd1efcd6b95e0eb3

7 years agoMerge "deblock: annotate postproc parameters"
Johann Koenig [Wed, 1 Feb 2017 19:57:29 +0000 (19:57 +0000)]
Merge "deblock: annotate postproc parameters"

7 years agosatd highbd neon: use tran_low_t for coeff
Johann [Tue, 31 Jan 2017 00:03:05 +0000 (16:03 -0800)]
satd highbd neon: use tran_low_t for coeff

BUG=webm:1365

Change-Id: I43521ad32b6c96737a8ef2b8c327f901fd7eaf84

7 years agosatd highbd sse2: use tran_low_t for coeff
Johann [Mon, 30 Jan 2017 19:39:02 +0000 (11:39 -0800)]
satd highbd sse2: use tran_low_t for coeff

BUG=webm:1365

Change-Id: I013659f6b9fbf9cc52ab840eae520fe0b5f883fb

7 years agohadamard highbd ssse3: use tran_low_t for coeff
Johann [Tue, 31 Jan 2017 16:16:19 +0000 (08:16 -0800)]
hadamard highbd ssse3: use tran_low_t for coeff

BUG=webm:1365

Change-Id: I374dfc08732932382043905f128e928b08cb4f57

7 years agohadamard highbd neon: use tran_low_t for coeff
Johann [Tue, 31 Jan 2017 00:16:27 +0000 (16:16 -0800)]
hadamard highbd neon: use tran_low_t for coeff

BUG=webm:1365

Change-Id: I7e15192ead3a3631755b386f102c979f06e26279

7 years agohadamard highbd sse2: use tran_low_t for coeff
Johann [Fri, 27 Jan 2017 21:37:36 +0000 (13:37 -0800)]
hadamard highbd sse2: use tran_low_t for coeff

BUG=webm:1365

Change-Id: Ica414007d8412ceebfffa9e58e8416226a3fe934

7 years agoMerge "quantize ssse3: remove unused pxor"
Johann Koenig [Wed, 1 Feb 2017 19:41:41 +0000 (19:41 +0000)]
Merge "quantize ssse3: remove unused pxor"

7 years agoMake satd unit test support all bit-depth settings
Jingning Han [Fri, 27 Jan 2017 18:46:53 +0000 (10:46 -0800)]
Make satd unit test support all bit-depth settings

Turn on satd unit test for c function in both regular and high
bit-depth settings.

Change-Id: I4b0c56addfb84964ede0da3ab760fe0ee640cfd0

7 years agoUnify the hadamard transform unit test for bit-depth settings
Jingning Han [Fri, 27 Jan 2017 18:38:58 +0000 (10:38 -0800)]
Unify the hadamard transform unit test for bit-depth settings

Unify the 8x8 and 16x16 Hadamard unit test system for both 8-bit
and high bit-depth settings.

Change-Id: I53373c1d43f3ced514ad1e53e03f0fb9b25d9ead

7 years agoFix real-time compression regression in hbd mode
Jingning Han [Thu, 26 Jan 2017 23:00:04 +0000 (15:00 -0800)]
Fix real-time compression regression in hbd mode

This commit resolves the compression performance regression in
real-time encoding setting when high bit-depth mode is enabled.

The current solution temporarily disables the SIMD implementations
of vpx_satd, hadamard8x8, and hadamard16x16 in high bit-depth mode.

The commit makes the coding results bit-wise identical between
regular coding pipeline and high bit-depth at profile 0.

BUG=webm:1365

Change-Id: Icfb900821733749685370460a1a5a7e07f76f4bf

7 years agodeblock: annotate postproc parameters
Johann [Tue, 31 Jan 2017 23:05:51 +0000 (15:05 -0800)]
deblock: annotate postproc parameters

Clears a clang static analyzer warning where 'cols' is assumed to be
less than 0, preventing the for loop from executing.

The assembly already requires that the size be 8 or 16 (U/V or Y plane)
and cols is a multiple of 8.

Change-Id: Ica4612690ead1638c94cfe56b306e87f8ce644f9

7 years agoMerge "Use Buffer class for post proc tests"
Johann Koenig [Tue, 31 Jan 2017 15:28:28 +0000 (15:28 +0000)]
Merge "Use Buffer class for post proc tests"

7 years agoAdd mips msa sum_squares_2d_i16 function
Kaustubh Raste [Tue, 31 Jan 2017 12:07:10 +0000 (17:37 +0530)]
Add mips msa sum_squares_2d_i16 function

average improvement ~4x-5x

Change-Id: I8d91b71d0677009be52b412e4f52b40b98573a53

7 years agoAdd mips msa vpx_minmax_8x8 function
Kaustubh Raste [Tue, 31 Jan 2017 04:30:43 +0000 (10:00 +0530)]
Add mips msa vpx_minmax_8x8 function

average improvement ~4x-5x

Change-Id: I83aee9977534fddb8a9b80d31af646c0b6b1a8c3

7 years agoMerge "Add mips msa vpx_vector_var function"
Kaustubh Raste [Tue, 31 Jan 2017 02:34:51 +0000 (02:34 +0000)]
Merge "Add mips msa vpx_vector_var function"

7 years agoquantize ssse3: remove unused pxor
Johann [Tue, 31 Jan 2017 01:02:57 +0000 (17:02 -0800)]
quantize ssse3: remove unused pxor

Change-Id: Ifa22d77fd530827de0b32ae71810dc2213ab2937

7 years agovp9: Modify bsize condition for using model_rd_large for speed 7.
Marco [Mon, 30 Jan 2017 19:12:49 +0000 (11:12 -0800)]
vp9: Modify bsize condition for using model_rd_large for speed 7.

In non-rd pickmode: Allow speed 7 to also use larger block size in
model_rd. Small change in behavior for speed 7.

Change-Id: I8c5523e424308e8f0bc71b3f6324dec42a464cc8

7 years agoMerge "Disable multi-threading in first pass for SVC encoding"
Yunqing Wang [Sat, 28 Jan 2017 19:29:01 +0000 (19:29 +0000)]
Merge "Disable multi-threading in first pass for SVC encoding"

7 years agoAdd mips msa vpx_vector_var function
Kaustubh Raste [Fri, 27 Jan 2017 09:22:51 +0000 (14:52 +0530)]
Add mips msa vpx_vector_var function

average improvement ~4x-5x

Change-Id: I2f63ef83d816052ca8dc42421e7e9d42f7a7af6b

7 years agoMerge "vp9: Fix to pick_filter_level for highbitdepth build."
Marco Paniconi [Fri, 27 Jan 2017 22:47:44 +0000 (22:47 +0000)]
Merge "vp9: Fix to pick_filter_level for highbitdepth build."

7 years agoMerge "Add macOS Sierra support in configure"
Jerome Jiang [Fri, 27 Jan 2017 21:15:52 +0000 (21:15 +0000)]
Merge "Add macOS Sierra support in configure"

7 years agovp9: Fix to pick_filter_level for highbitdepth build.
Marco [Fri, 27 Jan 2017 18:42:29 +0000 (10:42 -0800)]
vp9: Fix to pick_filter_level for highbitdepth build.

Change-Id: I53b3fa8bfc0a0717eb1b730c29f2b70060b1b1b7

7 years agoAdd macOS Sierra support in configure
Jerome Jiang [Fri, 27 Jan 2017 01:53:00 +0000 (17:53 -0800)]
Add macOS Sierra support in configure

BUG=webm:1367

Change-Id: I3000b6d9f93ec49ca86d08151348d33d86bf0034

7 years agoDisable multi-threading in first pass for SVC encoding
Ranjit Kumar Tulabandu [Fri, 27 Jan 2017 13:37:45 +0000 (19:07 +0530)]
Disable multi-threading in first pass for SVC encoding

BUG=webm:1366

Change-Id: I204ef8496884ba7c4debe64f23f50d298b4090c3

7 years agoMerge "vp9: Modify bsize condition for using model_rd_large."
Marco Paniconi [Fri, 27 Jan 2017 15:15:36 +0000 (15:15 +0000)]
Merge "vp9: Modify bsize condition for using model_rd_large."

7 years agoMerge "vp9: Fixes for usage of skin_map for high bit depth."
Marco Paniconi [Fri, 27 Jan 2017 15:15:15 +0000 (15:15 +0000)]
Merge "vp9: Fixes for usage of skin_map for high bit depth."

7 years agovp9: Modify bsize condition for using model_rd_large.
Marco [Fri, 27 Jan 2017 04:01:41 +0000 (20:01 -0800)]
vp9: Modify bsize condition for using model_rd_large.

In non-rd pickmode: small change in behavior for speed 6 and 7.
Remove condition on HIGHBITDEPTH flag.

Change-Id: I360a13fcc313d72612fe9b918162ef4bb278cdea

7 years agoAdd mips msa vpx Integer projection row/col functions
Kaustubh Raste [Fri, 27 Jan 2017 05:41:42 +0000 (11:11 +0530)]
Add mips msa vpx Integer projection row/col functions

average improvement ~4x-5x

Change-Id: I17c41383250282b39f5ecae0197ef1df7de20801

7 years agoMerge "Add mips msa vpx satd function"
Kaustubh Raste [Fri, 27 Jan 2017 04:08:51 +0000 (04:08 +0000)]
Merge "Add mips msa vpx satd function"

7 years agovp9: Fixes for usage of skin_map for high bit depth.
Marco [Thu, 26 Jan 2017 22:09:57 +0000 (14:09 -0800)]
vp9: Fixes for usage of skin_map for high bit depth.

Also avoid noise_estimation and source_sad if use_highbitdepth is set.

Change-Id: I5fea396b8f8380ea377045d99ba22a52b92daa46

7 years agoUse Buffer class for post proc tests
Johann [Sat, 10 Dec 2016 00:51:01 +0000 (16:51 -0800)]
Use Buffer class for post proc tests

Add Buffer features for:
Setting the buffer to the output of an ACMRandom function.
Copying a buffer.
Comparing two buffers.
Printing two buffers.

Change-Id: Ib53fb602451a3abdcee279ea2b65b51fbc02d3df