]> granicus.if.org Git - libvpx/log
libvpx
11 years agoMerge "New mode_info_context storage -- undo revert"
Scott LaVarnway [Fri, 13 Sep 2013 15:56:20 +0000 (08:56 -0700)]
Merge "New mode_info_context storage -- undo revert"

11 years agoMerge "Deleted #include <inttypes.h>"
Paul Wilkins [Fri, 13 Sep 2013 08:05:31 +0000 (01:05 -0700)]
Merge "Deleted #include <inttypes.h>"

11 years agoMerge "fix clang warning in rdopt"
Jim Bankoski [Thu, 12 Sep 2013 23:39:05 +0000 (16:39 -0700)]
Merge "fix clang warning in rdopt"

11 years agoMerge "Add neon optimize iht8x8 which is 282% faster than C."
hkuang [Thu, 12 Sep 2013 22:42:44 +0000 (15:42 -0700)]
Merge "Add neon optimize iht8x8 which is 282% faster than C."

11 years agoMerge "First draft of vp9_short_idct32x32_add_neon."
Christian Duvivier [Thu, 12 Sep 2013 21:23:00 +0000 (14:23 -0700)]
Merge "First draft of vp9_short_idct32x32_add_neon."

11 years agoAdd neon optimize iht8x8 which is 282% faster than C.
hkuang [Thu, 12 Sep 2013 18:45:15 +0000 (11:45 -0700)]
Add neon optimize iht8x8 which is 282% faster than C.

Change-Id: I963dd4a6e8671957403ccbb9a16ea7de703e3530

11 years agofix clang warning in rdopt
Jim Bankoski [Thu, 12 Sep 2013 13:39:42 +0000 (06:39 -0700)]
fix clang warning in rdopt

either missed this or it crept back in

Change-Id: I6cc1519d09e558be7250254c25bde2ae720555ea

11 years agoclang warnings : ref frame enum mismatch
Jim Bankoski [Fri, 6 Sep 2013 17:13:35 +0000 (10:13 -0700)]
clang warnings : ref frame enum mismatch

Convert from refframe_type_t to VP9_REFFRAME

Change-Id: Iff4043c3fdb3e1c9c2b412bdffd5da8ed913ec13

11 years agoMerge "clang warnings : remove split and i4x4_pred fake modes"
Jim Bankoski [Thu, 12 Sep 2013 13:20:45 +0000 (06:20 -0700)]
Merge "clang warnings : remove split and i4x4_pred fake modes"

11 years agoDeleted #include <inttypes.h>
Paul Wilkins [Thu, 12 Sep 2013 12:43:07 +0000 (13:43 +0100)]
Deleted #include <inttypes.h>

This seems not to be needed and is not supported
in the Windows build.

Change-Id: Iaca3bbf8cca283aee6bc336cb31ba9dd4610322b

11 years agoMerge "Changes in speed 2 settings"
Paul Wilkins [Thu, 12 Sep 2013 09:22:45 +0000 (02:22 -0700)]
Merge "Changes in speed 2 settings"

11 years agoclang warnings : remove split and i4x4_pred fake modes
Jim Bankoski [Wed, 11 Sep 2013 23:34:55 +0000 (16:34 -0700)]
clang warnings : remove split and i4x4_pred fake modes

Change-Id: I8ef3c7c0f08f0f1f4ccb8ea4deca4cd8143526ee

11 years agoFirst draft of vp9_short_idct32x32_add_neon.
Christian Duvivier [Wed, 11 Sep 2013 22:18:47 +0000 (15:18 -0700)]
First draft of vp9_short_idct32x32_add_neon.

Lots of TODO which will be taken care in upcoming changes. As is,
about 6x faster than C version.

Change-Id: Ie2557b72fd2d8edca376dbf400a4d173aa5e63e0

11 years agoMerge "Improved 8t filters"
Scott LaVarnway [Wed, 11 Sep 2013 21:34:54 +0000 (14:34 -0700)]
Merge "Improved 8t filters"

11 years agoMerge "resolve clang issue : implicit convert tx_mode -> tx_size"
Jim Bankoski [Wed, 11 Sep 2013 20:40:11 +0000 (13:40 -0700)]
Merge "resolve clang issue : implicit convert tx_mode -> tx_size"

11 years agoImproved 8t filters
Scott LaVarnway [Wed, 11 Sep 2013 17:56:32 +0000 (13:56 -0400)]
Improved 8t filters

Reformatted version of a patch submitted by Erik/Tamar
from Intel.  For the test clips used, the decoder
performance improved by ~2%.

Change-Id: Ifbc37ac6311bca9ff1cfefe3f2e9b7f13a4a511b

11 years agoChanges in speed 2 settings
Deb Mukherjee [Tue, 10 Sep 2013 23:13:15 +0000 (16:13 -0700)]
Changes in speed 2 settings

Propose some changes to the speed 2 settings to improve quality.
In particular, turns off the adjust_thresholds_by_speed feature
which improves results by 6%. Also removes the code for
adjust_thresholds_by_speed since it conflicts with the adaptive
rd thresh feature.

Overall, with this change speed 2 is -15.2% from speed 0 settings,
on derf, which is significantly better than -21.6% down before.

Change-Id: I6e90a563470979eb0c258ec32d6183ed7ce9a505

11 years agoNew mode_info_context storage -- undo revert
Scott LaVarnway [Wed, 11 Sep 2013 17:45:44 +0000 (13:45 -0400)]
New mode_info_context storage -- undo revert

mode_info_context was stored as a grid of MODE_INFO structs.
The grid now constists of pointers to MODE_INFO structs.  The
MODE_INFO structs are now stored as a stream (decoder only),
eliminating unnecessary copies and is a little more cache
friendly.

Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d

11 years agocode cleanup
Yunqing Wang [Wed, 11 Sep 2013 16:32:00 +0000 (09:32 -0700)]
code cleanup

Removed unused function.

Change-Id: Icb12a09e4d303968be6aec9fae1ef05935913a4f

11 years agoMerge "Remove redundant condition check in 32x32 quant"
Jingning Han [Tue, 10 Sep 2013 23:39:18 +0000 (16:39 -0700)]
Merge "Remove redundant condition check in 32x32 quant"

11 years agoMerge "vpx_mem: increase default alignment"
James Zern [Tue, 10 Sep 2013 21:03:31 +0000 (14:03 -0700)]
Merge "vpx_mem: increase default alignment"

11 years agoMerge "Enable and fix resize_test for VP9"
Adrian Grange [Tue, 10 Sep 2013 19:46:38 +0000 (12:46 -0700)]
Merge "Enable and fix resize_test for VP9"

11 years agoMerge "Remove the use of uninitialized_safe in encode_sb_"
Jingning Han [Tue, 10 Sep 2013 19:05:22 +0000 (12:05 -0700)]
Merge "Remove the use of uninitialized_safe in encode_sb_"

11 years agoRemove redundant condition check in 32x32 quant
Jingning Han [Mon, 9 Sep 2013 23:02:07 +0000 (16:02 -0700)]
Remove redundant condition check in 32x32 quant

The c code implementation of 32x32 quantization does the zbin check
of all coefficients prior to the quant/dequant loop, hence removing
the redundant zbin check inside the loop. This only affects the
c code version. SSSE3 version does not separate the zbin check out.

Change-Id: Ic197a7d61d0b25fcac3cc092987651378cb56e4e

11 years agoEnable and fix resize_test for VP9
Adrian Grange [Tue, 10 Sep 2013 19:02:37 +0000 (12:02 -0700)]
Enable and fix resize_test for VP9

Added the resize_test unit test to the VP9 set.

Set g_in_frames = 0 to avoid a problem when the total
number of frames being encoded is smaller than
g_in_frames. In this case the test will not have
access to the encoded frames and will not be able to
compare them for testing for encoder/decoder mismatch.

Change-Id: I0d2ff8ef058de7002c5faa894ed6ea794d5f900b

11 years agoMerge "Small tweaks on the constant quality mode"
Deb Mukherjee [Tue, 10 Sep 2013 18:16:47 +0000 (11:16 -0700)]
Merge "Small tweaks on the constant quality mode"

11 years agoSmall tweaks on the constant quality mode
Deb Mukherjee [Tue, 10 Sep 2013 00:31:58 +0000 (17:31 -0700)]
Small tweaks on the constant quality mode

Improves results a little.
derf is now +1.078% over bitrate control.

Change-Id: I4812136f3e67be21d14ec089419976a32a841785

11 years agoStop partition checking when distortion is small
Yunqing Wang [Fri, 6 Sep 2013 23:59:55 +0000 (16:59 -0700)]
Stop partition checking when distortion is small

If the current obtained distortion is very small, which happens
for static image case, we pick the current partition type without
further split checking.

This won't affect regular videos. For static videos, we got 10%~12%
encoding speed gain. PSNR was better for some clips, and worse for
others. Overall it was even.

Change-Id: If787a57bedf46fc595ca4f5ded2b0c0a69e9fdef

11 years agoMerge "Modify encode breakout for static frames"
Yunqing Wang [Tue, 10 Sep 2013 17:04:30 +0000 (10:04 -0700)]
Merge "Modify encode breakout for static frames"

11 years agoModify encode breakout for static frames
Yunqing Wang [Fri, 6 Sep 2013 00:10:58 +0000 (17:10 -0700)]
Modify encode breakout for static frames

Thank Paul for the suggestions. While turning on static-thresh
for static-image videos, a big jump on bitrate was seen. In this
patch, we detected static frames in the video using first-pass
stats. For different cases, disable encode breakout or reduce
encode breakout threshold to limit the skipping.

More modification need be done to break incorrect partition
picking pattern for static frames while skipping happens.

Change-Id: Ia25f47041af0f04e229c70a0185e12b0ffa6047f

11 years agoMerge "Enable accuracy/memory check for 16x16 transforms"
Jingning Han [Tue, 10 Sep 2013 16:05:34 +0000 (09:05 -0700)]
Merge "Enable accuracy/memory check for 16x16 transforms"

11 years agoMerge "Rework 16x16 transform unit test"
Jingning Han [Tue, 10 Sep 2013 16:05:04 +0000 (09:05 -0700)]
Merge "Rework 16x16 transform unit test"

11 years agoMerge "Speed up idct16x16 by rearrange instructions."
hkuang [Tue, 10 Sep 2013 15:23:57 +0000 (08:23 -0700)]
Merge "Speed up idct16x16 by rearrange instructions."

11 years agoModified mode skip functionality.
Paul Wilkins [Thu, 5 Sep 2013 00:15:05 +0000 (17:15 -0700)]
Modified mode skip functionality.

A previous speed feature skipped modes not used in earlier
partitions but this not longer worked as intended following
changes to the partition coding order and in conjunction
with some other speed features (Especially speed 2 and above).

This modified mode skip feature sets a mask after the first X
modes have been tested in each partition depending on the
reference frame of the current best case.

This patch also makes some changes to the order modes are
tested to fit better with this skip functionality.

Initial testing suggests speed and rd hit count improvements
of up to 20% at speed 1. Quality results. (derf -1.9%, std hd  +0.23%).

Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1

11 years agoAdded extra check to rd_auto_partition_range()
Paul Wilkins [Tue, 3 Sep 2013 22:46:22 +0000 (15:46 -0700)]
Added extra check to rd_auto_partition_range()

Added check that the returned max and minimum are
valid in bottom and right border cases.

Change-Id: I2d6cdc9b5f04c7d0ff512ddcf3228331e028bf9b

11 years agotest/idct_test: add missing vpx_integer.h include
James Zern [Tue, 10 Sep 2013 05:20:41 +0000 (22:20 -0700)]
test/idct_test: add missing vpx_integer.h include

Change-Id: I9de764638ec981bb34fc8e183985d8c285b006fb

11 years agoSpeed up idct16x16 by rearrange instructions.
hkuang [Mon, 9 Sep 2013 18:36:51 +0000 (11:36 -0700)]
Speed up idct16x16 by rearrange instructions.

Speed improve from 376% to 400% faster base on assembly-perf.

Change-Id: If0b2eccc39d5793dc101ce9feb7fcadf88396ea2

11 years agoEnable accuracy/memory check for 16x16 transforms
Jingning Han [Tue, 10 Sep 2013 00:07:55 +0000 (17:07 -0700)]
Enable accuracy/memory check for 16x16 transforms

This commit completes the per coefficient accuracy check and memory
overflow check for SSE2 and other implemented versions of 16x16
transform.

Change-Id: If26a3e4f6ba82ccecc13f0b73cb8f7bb6ac14584

11 years agoMerge "API extensions and sample app for spacial scalable encoder"
Ivan Maltz [Mon, 9 Sep 2013 23:57:01 +0000 (16:57 -0700)]
Merge "API extensions and sample app for spacial scalable encoder"

11 years agoRework 16x16 transform unit test
Jingning Han [Thu, 5 Sep 2013 19:44:03 +0000 (12:44 -0700)]
Rework 16x16 transform unit test

This commit refactors the 16x16 transform unit test. It enables the
test on all implemented versions of forward and inverse 16x16 transform
modules.

Change-Id: I0c7d5f3c5fdd5d789a25f73e287aeeaf463b9d69

11 years agoAPI extensions and sample app for spacial scalable encoder
Ivan Maltz [Thu, 5 Sep 2013 15:55:47 +0000 (08:55 -0700)]
API extensions and sample app for spacial scalable encoder

Sample app: vp9_spatial_scalable_encoder
vpx_codec_control extensions:
  VP9E_SET_SVC
  VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER
  VP9E_SET_MIN_Q, VP9E_SET_MAX_Q
expanded buffer size for vp9_convolve

modified setting of initial width in vp9_onyx_if.c so that layer size
can be set prior to initial encode

Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS)
Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers

Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197

11 years agoRemove the use of uninitialized_safe in encode_sb_
Jingning Han [Mon, 9 Sep 2013 22:41:16 +0000 (15:41 -0700)]
Remove the use of uninitialized_safe in encode_sb_

Initialize the probability model context with default value in
encode_sb.

Change-Id: Id826114024dfc21c7ef41aea9f4a0316d4a5cb95

11 years agoMerge "Revert "New mode_info_context storage""
James Zern [Mon, 9 Sep 2013 21:38:01 +0000 (14:38 -0700)]
Merge "Revert "New mode_info_context storage""

11 years agoRevert "New mode_info_context storage"
James Zern [Mon, 9 Sep 2013 20:37:01 +0000 (13:37 -0700)]
Revert "New mode_info_context storage"

This reverts commit dae17734ece414091ba1184f7becd0aa6c0004f1

Encode crashes, leaks and increases integer overflow errors.

Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2

11 years agochanged to enable vp9_postproc
Yaowu Xu [Fri, 6 Sep 2013 22:05:05 +0000 (15:05 -0700)]
changed to enable vp9_postproc

In configure when internal-stats is enabled, because postprocessing
code is needed for computing stats for enabling internal-stats

Change-Id: I3601dc5a4aa65feb99465452486a21e75eb62c1f

11 years agoMerge "Reduce the amount of extension in src frames"
Yaowu Xu [Mon, 9 Sep 2013 15:09:56 +0000 (08:09 -0700)]
Merge "Reduce the amount of extension in src frames"

11 years agoMerge "Enable kf restrictions at speed 4"
Paul Wilkins [Mon, 9 Sep 2013 12:39:13 +0000 (05:39 -0700)]
Merge "Enable kf restrictions at speed 4"

11 years agoReduce the amount of extension in src frames
Yaowu Xu [Fri, 23 Aug 2013 23:22:50 +0000 (16:22 -0700)]
Reduce the amount of extension in src frames

The commit changes the border pixel extension from 160 pixel each side
to what is necessary in arnr filter or motion estimation portion, i.e.
16 pixel on top and left side. For right or bottom side, the extension
is changed to either round up image size to multiple of 64 or at least
16 pixels.

Change-Id: Ic05e19b94368c1ab4df568723aae5734e6c3d2c5

11 years agoresolve clang issue : implicit convert tx_mode -> tx_size
Jim Bankoski [Sun, 8 Sep 2013 13:55:17 +0000 (06:55 -0700)]
resolve clang issue : implicit convert tx_mode -> tx_size

Change-Id: Ifc9da470358f58e800e3d0d70a565b61e5f7834a

11 years agoMerge "New mode_info_context storage"
Jim Bankoski [Sun, 8 Sep 2013 14:16:25 +0000 (07:16 -0700)]
Merge "New mode_info_context storage"

11 years agoFix overflow issue in 16x16 quantization SSSE3
Jingning Han [Fri, 6 Sep 2013 23:06:21 +0000 (16:06 -0700)]
Fix overflow issue in 16x16 quantization SSSE3

The 16x16 transform unit test suggested that the peak coefficient
value can reach 32639. This could cause potential overflow issue
in the SSSE3 implmentation of 16x16 block quantization. This commit
fixes this issue by replacing addition with saturated addition.

Change-Id: I6d5bb7c5faad4a927be53292324bd2728690717e

11 years agovpx_mem: increase default alignment
James Zern [Sat, 7 Sep 2013 01:20:10 +0000 (18:20 -0700)]
vpx_mem: increase default alignment

this prevents returning an address smaller than the natural heap
alignment from vpx_malloc on e.g., x86_64

Change-Id: I283e858664a8529f28b22060c3815116a7798c0d

11 years agoMerge "Support a constant quality mode in VP9"
Deb Mukherjee [Fri, 6 Sep 2013 18:22:54 +0000 (11:22 -0700)]
Merge "Support a constant quality mode in VP9"

11 years agoEnable kf restrictions at speed 4
Paul Wilkins [Thu, 29 Aug 2013 23:29:53 +0000 (00:29 +0100)]
Enable kf restrictions at speed 4

Change-Id: I453409d3be3f5fe118b15affde45cb52184aef20

11 years agoSupport a constant quality mode in VP9
Deb Mukherjee [Thu, 29 Aug 2013 23:21:44 +0000 (16:21 -0700)]
Support a constant quality mode in VP9

Adds a new end-usage option for constant quality encoding in vpx. This
first version implemented for VP9, encodes all regular inter frames
using the quality specified in the --cq-level= option, while encoding
all key frames and golden/altref frames at a quality better than that.

The current performance on derfraw300 is +0.910% up from bitrate control,
but achieved without multiple recode loops per frame.

The decision for qp for each altref/golden/key frame will be improved
in subsequent patches based on better use of stats from the first pass.
Further, the qp for regular inter frames may also be varied around the
provided cq-level.

Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3

11 years agocleanup cpplint warnings
Yaowu Xu [Thu, 5 Sep 2013 15:45:56 +0000 (08:45 -0700)]
cleanup cpplint warnings

Suggested by James Zern to clear out cpplint warnings for all unit
test code.

Change-Id: I731a3fa4d2a257eb9ef733426ba84286fbd7ea34

11 years agoNew mode_info_context storage
Scott LaVarnway [Fri, 6 Sep 2013 16:33:34 +0000 (12:33 -0400)]
New mode_info_context storage

mode_info_context was stored as a grid of MODE_INFO structs.
The grid now constists of a pointer to a MODE_INFO struct and
a "in the image" flag.  The MODE_INFO structs are now stored
as a stream, eliminating unnecessary copies and is a little
more cache friendly.

For the test clips used, the decoder performance improved
by ~4.3% (1080p) and ~9.7% (720p).

Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p)
and 5.9% (720p).

Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256

11 years agoMerge "fix loop filter setup_mask could reach out of bounds issue"
Jim Bankoski [Fri, 6 Sep 2013 13:21:28 +0000 (06:21 -0700)]
Merge "fix loop filter setup_mask could reach out of bounds issue"

11 years agoMerge "Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270...
hkuang [Fri, 6 Sep 2013 00:37:13 +0000 (17:37 -0700)]
Merge "Speed up idct8x8 by rearrange instructions. Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf."

11 years agofix loop filter setup_mask could reach out of bounds issue
Jim Bankoski [Thu, 5 Sep 2013 22:53:31 +0000 (15:53 -0700)]
fix loop filter setup_mask could reach out of bounds issue

Change-Id: Ic8446c4f26b6782a6dc482c19ea73c77646df418

11 years agoMerge "Enable 32x32 Transform unit test"
Jingning Han [Thu, 5 Sep 2013 22:23:27 +0000 (15:23 -0700)]
Merge "Enable 32x32 Transform unit test"

11 years agoEnable 32x32 Transform unit test
Jingning Han [Tue, 3 Sep 2013 18:57:34 +0000 (11:57 -0700)]
Enable 32x32 Transform unit test

This commit enabled a full functional test on 32x32 forward/inverse
transform, including round-trip error and memory overflow check. It
tests the prototype functions in C and all other implementations if
applicable.

Change-Id: I9cc50b05abdb4863e7abbcb29209a19b1fe90da7

11 years agoMerge "Use saturated addition in SSSE3 of 32x32 quant"
Jingning Han [Thu, 5 Sep 2013 21:09:40 +0000 (14:09 -0700)]
Merge "Use saturated addition in SSSE3 of 32x32 quant"

11 years agoMerge "resolve clang warnings : uninitialized vars in vp9_entropy.h"
Jim Bankoski [Thu, 5 Sep 2013 19:55:32 +0000 (12:55 -0700)]
Merge "resolve clang warnings : uninitialized vars in vp9_entropy.h"

11 years agoUse saturated addition in SSSE3 of 32x32 quant
Jingning Han [Wed, 4 Sep 2013 19:03:28 +0000 (12:03 -0700)]
Use saturated addition in SSSE3 of 32x32 quant

The 32x32 forward transform can potentially reach peak coefficient
value close to 32700, while the rounding factor can go upto 610.
This could cause overflow issue in the SSSE3 implementation of 32x32
quantization process.

This commit resolves this issue by replacing the addition operations
with saturated addition operations in 32x32 block quantization.

Change-Id: Id6b98996458e16c5b6241338ca113c332bef6e70

11 years agoMerge "faster accounting of inc_mv"
Jim Bankoski [Thu, 5 Sep 2013 19:38:56 +0000 (12:38 -0700)]
Merge "faster accounting of inc_mv"

11 years agoMerge "make bsize requirement for SEG_LVL_SKIP explicit"
Yaowu Xu [Thu, 5 Sep 2013 15:15:03 +0000 (08:15 -0700)]
Merge "make bsize requirement for SEG_LVL_SKIP explicit"

11 years agoMerge "Added ClearSystemState in a unit test"
Yaowu Xu [Thu, 5 Sep 2013 15:14:44 +0000 (08:14 -0700)]
Merge "Added ClearSystemState in a unit test"

11 years agoresolve clang warnings : uninitialized vars in vp9_entropy.h
Jim Bankoski [Thu, 5 Sep 2013 01:38:41 +0000 (18:38 -0700)]
resolve clang warnings : uninitialized vars in vp9_entropy.h

This helps clear out some of the warnings

Change-Id: Ie7ccaca8fd92542386a7f1b257398e1bdf2f55dc

11 years agoMerge "wrap non420 loop filter code in macro"
Jim Bankoski [Thu, 5 Sep 2013 00:20:53 +0000 (17:20 -0700)]
Merge "wrap non420 loop filter code in macro"

11 years agoMerge "Attempt to fix speed 4"
Paul Wilkins [Thu, 5 Sep 2013 00:19:22 +0000 (17:19 -0700)]
Merge "Attempt to fix speed 4"

11 years agomake bsize requirement for SEG_LVL_SKIP explicit
Yaowu Xu [Wed, 4 Sep 2013 23:32:26 +0000 (16:32 -0700)]
make bsize requirement for SEG_LVL_SKIP explicit

The segment feature SEG_LVL_SKIP requires the prediction unit size
to be at least BLOCK_8X8. This commit makes the requirement to be
explicit. This is to prevent future encoder implementations from
making wrong choices.

Change-Id: I0127f0bd4c66e130b81f0cb0a8d3dbfe3b2da5c2

11 years agoSpeed up idct8x8 by rearrange instructions.
hkuang [Wed, 4 Sep 2013 22:41:26 +0000 (15:41 -0700)]
Speed up idct8x8 by rearrange instructions.
Speed improve from 264% ~ 270% to 280% ~ 300% base on assembly-perf.

Change-Id: I3e2cc818ec14b432204ff43732f39b6438db685d

11 years agoAdded ClearSystemState in a unit test
Yaowu Xu [Tue, 3 Sep 2013 20:50:17 +0000 (13:50 -0700)]
Added ClearSystemState in a unit test

There is another unit test that has been failing randomly on win32
build. Investigation has shown that the failure was caused by simd
register state is not reset appropriately in the fdct8x8 test. This
commit added ClearSystemState() in the teardown of this test, tests
showed it resolved the random failure issue for win32 build.

Related issue: https://code.google.com/p/webm/issues/detail?id=614

Change-Id: I9381d0c1a6f4b855ccaeef1aca8c417ac8c71ee2

11 years agoMerge "Fixing problem with invalid delta_q reading."
Yaowu Xu [Wed, 4 Sep 2013 21:21:30 +0000 (14:21 -0700)]
Merge "Fixing problem with invalid delta_q reading."

11 years agoMerge "Add neon optimize vp9_short_iht4x4_add."
hkuang [Wed, 4 Sep 2013 20:35:09 +0000 (13:35 -0700)]
Merge "Add neon optimize vp9_short_iht4x4_add."

11 years agoAdd neon optimize vp9_short_iht4x4_add.
hkuang [Tue, 3 Sep 2013 17:19:21 +0000 (10:19 -0700)]
Add neon optimize vp9_short_iht4x4_add.

Change-Id: I42c497b68ae1ee645b59c9968ad805db0a43e37e

11 years agoFixing problem with invalid delta_q reading.
Dmitry Kovalev [Thu, 29 Aug 2013 22:33:06 +0000 (15:33 -0700)]
Fixing problem with invalid delta_q reading.

This is a bitstream change but no currently produces videos should
be affected. https://code.google.com/p/webm/issues/detail?id=610

Change-Id: Ic85a6477df6c201cdf7f70f6bd84607b71f4593c

11 years agoMerge "Replacing init_dequantizer() with setup_plane_dequants()."
Yaowu Xu [Wed, 4 Sep 2013 17:42:12 +0000 (10:42 -0700)]
Merge "Replacing init_dequantizer() with setup_plane_dequants()."

11 years agoMerge "speed up inc_mv_component"
Jim Bankoski [Wed, 4 Sep 2013 17:35:51 +0000 (10:35 -0700)]
Merge "speed up inc_mv_component"

11 years agoMerge "make vp9 postproc a config option"
Jim Bankoski [Wed, 4 Sep 2013 17:35:26 +0000 (10:35 -0700)]
Merge "make vp9 postproc a config option"

11 years agoMerge "Use correct bit cost while static-thresh is on"
Yunqing Wang [Wed, 4 Sep 2013 17:26:37 +0000 (10:26 -0700)]
Merge "Use correct bit cost while static-thresh is on"

11 years agowrap non420 loop filter code in macro
Jim Bankoski [Tue, 3 Sep 2013 02:04:37 +0000 (19:04 -0700)]
wrap non420 loop filter code in macro

Change-Id: I62bca0e7a4bffc1a78b750dbb9df9d2378e92423

11 years agomake vp9 postproc a config option
Jim Bankoski [Wed, 4 Sep 2013 17:02:08 +0000 (10:02 -0700)]
make vp9 postproc a config option

Vp9 postproc is disabled for now as its not been shown to help and
may be merged with vp8.

Change-Id: I25620d6cd34c6e10331b18c7b5ef7482e39c6057

11 years agofaster accounting of inc_mv
Jim Bankoski [Wed, 4 Sep 2013 16:47:57 +0000 (09:47 -0700)]
faster accounting of inc_mv

Moves counting of mv branches to where we have a new mv, instead of after
the whole frame is summed.

Change-Id: I945d9f6d9199ba2443fe816c92d5849340d17bbd

11 years agoReplacing init_dequantizer() with setup_plane_dequants().
Dmitry Kovalev [Wed, 28 Aug 2013 21:49:09 +0000 (14:49 -0700)]
Replacing init_dequantizer() with setup_plane_dequants().

Change-Id: Ib67e996b4a6dcb6f481889f5a0d84811a9e3c5d1

11 years agospeed up inc_mv_component
Jim Bankoski [Tue, 3 Sep 2013 22:49:58 +0000 (15:49 -0700)]
speed up inc_mv_component

Convert mv_class if statements to look up.  re order to avoid ifs...

Change-Id: I76966a21bf517bb1f9a7957c08c476c7bb3e9a63

11 years agoMerge "Fix intermediate height in convolve_c"
James Zern [Tue, 3 Sep 2013 22:50:33 +0000 (15:50 -0700)]
Merge "Fix intermediate height in convolve_c"

11 years agoAttempt to fix speed 4
Paul Wilkins [Fri, 30 Aug 2013 23:39:26 +0000 (00:39 +0100)]
Attempt to fix speed 4

Speed 4 fixed partition size. Use fixed size unless it does not
fit inside image, in which case use the largest size that does.

Change-Id: I250f7a80506750dd82ab355721624a1344247223

11 years agoMerge "Fix 32x32 forward transform SSE2 version"
Jingning Han [Tue, 3 Sep 2013 15:58:03 +0000 (08:58 -0700)]
Merge "Fix 32x32 forward transform SSE2 version"

11 years agoMerge "Improved mb_lpf_horizontal_edge_w_sse2_8"
Scott LaVarnway [Tue, 3 Sep 2013 12:44:01 +0000 (05:44 -0700)]
Merge "Improved mb_lpf_horizontal_edge_w_sse2_8"

11 years agoFix 32x32 forward transform SSE2 version
Jingning Han [Fri, 30 Aug 2013 17:57:23 +0000 (10:57 -0700)]
Fix 32x32 forward transform SSE2 version

This commit fixed the potential overflow issue in the SSE2
implementation of 32x32 forward DCT. It resolved the corrupted
coded frames in the border of scenes.

Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9

11 years agoUse correct bit cost while static-thresh is on
Yunqing Wang [Fri, 30 Aug 2013 22:19:49 +0000 (15:19 -0700)]
Use correct bit cost while static-thresh is on

While static-thresh is on, we only need to transmit skip
flag if skip = 1. The cost of skip bit is added to the
total rate cost.

Change-Id: I64e73e482bc297eba22907026298a15fa8cc3920

11 years agoMerge "Added per pixel inter rd hit count stats"
Paul Wilkins [Fri, 30 Aug 2013 15:56:01 +0000 (08:56 -0700)]
Merge "Added per pixel inter rd hit count stats"

11 years agoMerge "Refactor 16x16 unit tests"
Jingning Han [Fri, 30 Aug 2013 15:53:19 +0000 (08:53 -0700)]
Merge "Refactor 16x16 unit tests"

11 years agoFix intermediate height in convolve_c
Tero Rintaluoma [Thu, 22 Aug 2013 08:29:19 +0000 (11:29 +0300)]
Fix intermediate height in convolve_c

- Intermediate height was not correct i.e. when block size is 4 and
  y_step_q4 is 6. In this case intermediate height was
  (4*6) >> 4 = 1 and vertical interpolation needs two source pixels
  plus 7 extra pixels for taps.
- Also if the current output block is 16x16 and we are using 4x upscaling
  we need only 12 rows after horizontal filtering instead of 16.

  Patch Set 2: Intermediate_height updated after CL 66723
               "Fix bug in convolution functions (filter selection)"

Change-Id: I5a1a1bc2ac9d5edb3a6e0818de618bf318fdd589

11 years agoMerge "rework filter_block_plane"
Jim Bankoski [Fri, 30 Aug 2013 03:11:09 +0000 (20:11 -0700)]
Merge "rework filter_block_plane"

11 years agorework filter_block_plane
Jim Bankoski [Fri, 30 Aug 2013 00:00:05 +0000 (17:00 -0700)]
rework filter_block_plane

Change-Id: I55c3b60c4c0f4910d3dfb70e3edaae00cfa8dc4d

11 years agoRefactor 16x16 unit tests
Jingning Han [Wed, 28 Aug 2013 00:03:46 +0000 (17:03 -0700)]
Refactor 16x16 unit tests

Make the new test module comply to the unit test rules.

Change-Id: Id79ff7f03f870973ffbc74f26d64edb418b75299

11 years agoMerge "Fix overflow issue in SSSE3 32x32 quantization"
Jingning Han [Thu, 29 Aug 2013 23:49:04 +0000 (16:49 -0700)]
Merge "Fix overflow issue in SSSE3 32x32 quantization"