libvpx
11 years agofix a small bug in 16 point forward dct
Yaowu Xu [Thu, 31 Jan 2013 23:39:41 +0000 (15:39 -0800)]
fix a small bug in 16 point forward dct

The commit fixes a minor error in the 16 point fdct, wherein a rotation can
produce a result of -1 instead of 0.

Change-Id: I45aac4a52bcd06225c6d04e643547a13e1c1aade

11 years agoA fix point implementation of 32x32 idct
Yaowu Xu [Wed, 30 Jan 2013 21:01:49 +0000 (13:01 -0800)]
A fix point implementation of 32x32 idct

This commit changes the 32x32 idct to use integer arithmetic only. The
algorithm was taken directly from "A Fast Computational Algorithm for the
Discrete Cosine Transform" by W. Chen, et al., which was published in
IEEE Transactions on Communications, Vol. COM-25, No. 9, 1977. The signal
flow graph in the original paper is for a 32 point forward dct; the
current implementation of the inverse DCT was done by following the graph
in the reverse direction.

With this implementation, the 32 point inverse dct contains a 16 point
inverse dct in its even portion, similarly the 16 point idct further
contains 8 point and 4 point inverse dcts.
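
The nesting described above can be sketched at a much smaller size. The following is a minimal illustration, not the code from this commit: it assumes 14 fractional bits of precision, and the names SKETCH_CONST_BITS, idct2_sketch and idct4_sketch are hypothetical. It shows a 4 point integer-only inverse dct whose even portion is a 2 point inverse dct, in the same way the 32 point version contains the 16 point one.

```c
#include <stdint.h>

/* Assumed precision for this sketch: 14 fractional bits. */
#define SKETCH_CONST_BITS 14
/* Round to nearest, then drop the fractional bits. Right-shifting a
 * negative value is implementation-defined in C; an arithmetic shift
 * is assumed here. */
#define SKETCH_ROUND_SHIFT(x) \
  (((x) + (1 << (SKETCH_CONST_BITS - 1))) >> SKETCH_CONST_BITS)

/* round(cos(k * pi / 16) * 2^14) for k = 4, 2, 6 */
static const int32_t kCos4 = 11585;
static const int32_t kCos2 = 15137;
static const int32_t kCos6 = 6270;

/* 2 point inverse dct; it forms the even portion of the 4 point one. */
static void idct2_sketch(int32_t in0, int32_t in1, int32_t out[2]) {
  out[0] = SKETCH_ROUND_SHIFT((in0 + in1) * kCos4);
  out[1] = SKETCH_ROUND_SHIFT((in0 - in1) * kCos4);
}

/* 4 point inverse dct: even inputs go through the 2 point idct, odd
 * inputs go through a single rotation, then the halves are combined. */
static void idct4_sketch(const int16_t in[4], int16_t out[4]) {
  int32_t even[2];
  int32_t odd0, odd1;
  idct2_sketch(in[0], in[2], even);
  odd0 = SKETCH_ROUND_SHIFT(in[1] * kCos6 - in[3] * kCos2);
  odd1 = SKETCH_ROUND_SHIFT(in[1] * kCos2 + in[3] * kCos6);
  out[0] = (int16_t)(even[0] + odd1);
  out[1] = (int16_t)(even[1] + odd0);
  out[2] = (int16_t)(even[1] - odd0);
  out[3] = (int16_t)(even[0] - odd1);
}
```

Every multiplication is followed by a rounding shift, so no floating point is needed at any stage.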

As of patch 4, encoding tests showed there is no compression loss when
compared against the floating point baseline. Numbers even showed very
small positives (cif: .01%, std-hd: .05%).

Change-Id: I2d2d17a424b0b04b42422ef33ec53f5802b0f378

11 years agoMerge "Simplify Zero bin and zero bin run code." into experimental
Paul Wilkins [Mon, 28 Jan 2013 16:35:36 +0000 (08:35 -0800)]
Merge "Simplify Zero bin and zero bin run code." into experimental

11 years agoSimplify Zero bin and zero bin run code.
Paul Wilkins [Thu, 24 Jan 2013 18:28:33 +0000 (18:28 +0000)]
Simplify Zero bin and zero bin run code.

Simplification to eliminate a number of very large data
structures. All zero run, zbin boosts for different
transform sizes are now limited to a maximum run length
of 15 before they max out the boost.

Some further work still needs to be done to refactor, rationalize
and optimize the multiple quantizer functions.

The simplification coupled with tweaks to the 16 element array
now used for all transform sizes, has minimal effect on quality.
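
As a rough sketch of the idea, assuming a single 16 element table indexed by zero run length: the table values below are placeholders, not the tuned values from this commit, and the names MAX_ZBIN_RUN_SKETCH, zbin_boost_sketch and zbin_boost_for_run are hypothetical.

```c
/* Runs longer than this all receive the same (maximum) boost. */
#define MAX_ZBIN_RUN_SKETCH 15

/* Placeholder values for illustration only; one table shared by all
 * transform sizes. */
static const int zbin_boost_sketch[MAX_ZBIN_RUN_SKETCH + 1] = {
  0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30
};

/* Look up the boost for a zero run, clamping the run to the table. */
static int zbin_boost_for_run(int zero_run) {
  if (zero_run < 0) zero_run = 0;
  if (zero_run > MAX_ZBIN_RUN_SKETCH) zero_run = MAX_ZBIN_RUN_SKETCH;
  return zbin_boost_sketch[zero_run];
}
```

Clamping the index is what removes the need for per-size tables that cover every possible run length.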

Change-Id: I6f3948b8ca0418b60d4db9030ff19026a34ed423

11 years agoTweak DropFramesWithoutRecovery params
John Koleszar [Sat, 26 Jan 2013 01:48:18 +0000 (17:48 -0800)]
Tweak DropFramesWithoutRecovery params

The previous parameters did not pass with VP8.

Change-Id: I1505171fbd713d21029541828ebdbe4c7dea79b8

11 years agoMerge "Remove "update_context" variable from VP9_COMP context." into experimental
Ronald S. Bultje [Fri, 25 Jan 2013 17:43:42 +0000 (09:43 -0800)]
Merge "Remove "update_context" variable from VP9_COMP context." into experimental

11 years agoRemove "update_context" variable from VP9_COMP context.
Ronald S. Bultje [Fri, 25 Jan 2013 00:28:53 +0000 (16:28 -0800)]
Remove "update_context" variable from VP9_COMP context.

The variable is always zero.

Change-Id: Id5cdbecad543bca465a5b1d471badaec7e112c8d

11 years agoMvref speedup
Paul Wilkins [Thu, 24 Jan 2013 12:52:13 +0000 (12:52 +0000)]
Mvref speedup

Quality / decode speed trade off changes.
Simpler insert method without sort. Quality impact small.

Change-Id: Id0c0941bc508d985405abd06a13ffe7489170b62

11 years agoMerge "MSVS compiler error" into experimental
Paul Wilkins [Thu, 24 Jan 2013 17:21:49 +0000 (09:21 -0800)]
Merge "MSVS compiler error" into experimental

11 years agoMSVS compiler error
Paul Wilkins [Thu, 24 Jan 2013 11:42:38 +0000 (11:42 +0000)]
MSVS compiler error

Visual Studio reports an error on the line
  static const int bsize2 = bsize >> 1;

"error C2099: initializer is not a constant"

Also warnings on signed/unsigned mismatch
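
For context, C (unlike C++) does not treat a const-qualified variable as a constant expression, so a static object cannot be initialized from one. The sketch below shows one common workaround using an enum constant. It is not necessarily how this commit resolved the error, and BSIZE_SKETCH, BSIZE2_SKETCH and half_block_size are hypothetical names.

```c
/* Hypothetical block size; the commit message does not show where
 * bsize comes from. */
enum { BSIZE_SKETCH = 16 };

/* An enum constant is a true integer constant expression in C, so it
 * may be used to initialize an object with static storage duration. */
enum { BSIZE2_SKETCH = BSIZE_SKETCH >> 1 };

static int half_block_size(void) {
  static const int bsize2 = BSIZE2_SKETCH; /* constant initializer: OK */
  return bsize2;
}
```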

Change-Id: I76cb06f736bd9c90107a26bfb3c130ec3f6c33de

11 years agoMerge "Intrinsic version of loopfilter now matches C code" into experimental
Scott LaVarnway [Thu, 24 Jan 2013 16:45:22 +0000 (08:45 -0800)]
Merge "Intrinsic version of loopfilter now matches C code" into experimental

11 years agoAdds an error-resilient mode with test
Deb Mukherjee [Tue, 15 Jan 2013 14:43:35 +0000 (06:43 -0800)]
Adds an error-resilient mode with test

Adds an error-resilient mode where frames can continue
to be decoded even when there are errors (due to network losses)
on a prior frame. Specifically, backward updates are turned off
and probabilities of various symbols are reset to defaults at
the beginning of each frame. Further, the last frame's mvs are
not used for the mv reference list, and the sorting of the
initial list based on search on previous frames is turned off
as well.

Also adds a test where an arbitrary set of frames are skipped
from decoding to simulate errors. The test verifies (1) that if
the error frames are droppable - i.e. frame buffer updates have
been turned off - there are no mismatch errors for the remaining
frames after the error frames; and (2) if the error-frames are non
droppable, there are not only no decoding errors but the mismatch
PSNR between the decoder's version of the post-error frames and the
encoder's version is at least 20 dB.

Change-Id: Ie6e2bcd436b1e8643270356d3a930e8989ff52a5

11 years agoMerge "Support multiple codecs in test infrastructure" into experimental
John Koleszar [Wed, 23 Jan 2013 22:52:22 +0000 (14:52 -0800)]
Merge "Support multiple codecs in test infrastructure" into experimental

11 years agoSupport multiple codecs in test infrastructure
John Koleszar [Fri, 18 Jan 2013 19:51:12 +0000 (11:51 -0800)]
Support multiple codecs in test infrastructure

This commit starts to convert the tests to a system where the codec
to be used is provided by a factory object. Currently no tests are
instantiated for VP9 since they all fail for various reasons, but it
was verified that they're called and the correct codec is
instantiated.

Change-Id: Ia7506df2ca3a7651218ba3ca560634f08c9fbdeb

11 years agoMerge "Modifies the comp inter-intra expt" into experimental
Deb Mukherjee [Wed, 23 Jan 2013 17:43:07 +0000 (09:43 -0800)]
Merge "Modifies the comp inter-intra expt" into experimental

11 years agoIntrinsic version of loopfilter now matches C code
Scott LaVarnway [Wed, 23 Jan 2013 17:31:40 +0000 (09:31 -0800)]
Intrinsic version of loopfilter now matches C code

Updated the intrinsic code to match Yaowu's latest loopfilter change.
(I584393906c4f5f948a581d6590959522572743bb)

The decoder performance improved by ~30% for the test clip used.

Change-Id: I026cfc75d5bcb7d8d58be6f0440ac9e126ef39d2

11 years agoMerge changes Ia82cef79,I7324a75a,I7b66daad,I73344451,I91dc210f,I5945b5ce into experi...
John Koleszar [Tue, 22 Jan 2013 16:31:55 +0000 (08:31 -0800)]
Merge changes Ia82cef79,I7324a75a,I7b66daad,I73344451,I91dc210f,I5945b5ce into experimental

* changes:
  Use alt-ref frame context for keyframes
  Preserve the previous golden frame on golden updates
  Generalize and increase frame coding contexts
  Start to anonymize reference frames
  Update encoder to use fb_idx_ref_cnt
  Remove buffer-to-buffer copy logic

11 years agoUse alt-ref frame context for keyframes
John Koleszar [Fri, 18 Jan 2013 22:35:20 +0000 (14:35 -0800)]
Use alt-ref frame context for keyframes

This matches the behavior prior to generalizing the frame context
selection, and intuitively makes sense in that the first forward ref
is immediately after the keyframe, so its quality is improved a bit
by using the keyframe's entropy context rather than the default.

Change-Id: Ia82cef79382b9d8cfafdc44ba0533d4dc3e44053

11 years agoa minor change to a portion of loop filtering
Yaowu Xu [Thu, 17 Jan 2013 21:26:24 +0000 (13:26 -0800)]
a minor change to a portion of loop filtering

The loop filtering used for an MB edge or an internal edge of an MB using the
8x8 transform was reading 5 pixels on each side and writing 3 pixels on each
side. Following a suggestion from Aki and Scott on hardware and software
performance, this commit changes it to read 4 pixels on each side and write
3 pixels on each side.

Change-Id: I584393906c4f5f948a581d6590959522572743bb

11 years agoPreserve the previous golden frame on golden updates
John Koleszar [Wed, 16 Jan 2013 20:19:42 +0000 (12:19 -0800)]
Preserve the previous golden frame on golden updates

This commit restores the quality lost when the buffer-to-buffer copy
logic was removed. Note that this is specific to the current use of
golden frames and will need rework when RTC functionality is added.

Change-Id: I7324a75acd96eafd9e0f9b8633d782e390d5dc21

11 years agoGeneralize and increase frame coding contexts
John Koleszar [Tue, 15 Jan 2013 23:57:11 +0000 (15:57 -0800)]
Generalize and increase frame coding contexts

Previously there were two frame coding contexts tracked, one for normal
frames and one for alt-ref frames. Generalize this by signalling the
context to use in the bitstream, rather than tying it to the alt ref
refresh bit. Also increase the number of contexts available to 4, which
may be useful for temporal scalability.

Change-Id: I7b66daaddd55c535c20cd16713541fab182b1662

11 years agoStart to anonymize reference frames
John Koleszar [Tue, 15 Jan 2013 21:49:44 +0000 (13:49 -0800)]
Start to anonymize reference frames

Remove lst_fb_idx, gld_fb_idx, alt_fb_idx, refresh_last_frame,
refresh_golden_frame, refresh_alt_ref_frame from common. Gold/Alt are
encode side conventions. From the decoder's perspective, we want to be
dealing with numbered references.

Updates to active_ref 2 signal mode context switches, vestigial from
refresh_alt_ref_frame. This needs some clean up to make sense with
increased numbers of reference frames, as well as reimplementing the
swapping of alt/golden which was previously done using the
buffer-to-buffer copy mechanism removed in an earlier commit.

Change-Id: I7334445158b7666f9295d2a2dd22aa03f4485f58

11 years agoUpdate encoder to use fb_idx_ref_cnt
John Koleszar [Mon, 14 Jan 2013 23:52:20 +0000 (15:52 -0800)]
Update encoder to use fb_idx_ref_cnt

Do reference counting the same way on the encoder as the decoder does,
rather than maintaining the 'flags' member of YV12_BUFFER_CONFIG.
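
A minimal sketch of per-buffer reference counting, assuming a small pool of frame buffers with one counter each. The helper names assign_ref_sketch and find_free_buffer_sketch are hypothetical and are not taken from the libvpx sources.

```c
/* Point the reference slot *idx at buffer new_idx: release the buffer
 * it used to name (if any) and take a reference on the new one. */
static void assign_ref_sketch(int *ref_cnt, int *idx, int new_idx) {
  if (*idx >= 0 && ref_cnt[*idx] > 0) ref_cnt[*idx]--;
  *idx = new_idx;
  ref_cnt[new_idx]++;
}

/* Return the first buffer that nothing refers to, or -1 if all are
 * in use. */
static int find_free_buffer_sketch(const int *ref_cnt, int num_bufs) {
  int i;
  for (i = 0; i < num_bufs; ++i) {
    if (ref_cnt[i] == 0) return i;
  }
  return -1;
}
```

With counters like these, a buffer becomes reusable as soon as no reference slot names it, so no separate per-buffer flags are needed.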

Change-Id: I91dc210ffca081acaf9d5c09a06e7461b3c3139c

11 years agoRemove buffer-to-buffer copy logic
John Koleszar [Mon, 14 Jan 2013 22:44:39 +0000 (14:44 -0800)]
Remove buffer-to-buffer copy logic

This is the first in a series of commits to add additional reference
frames to the codec. Each frame will be able to update any of the
available references, but copying between references is not
supported.

Change-Id: I5945b5ce6cc3582c495102b4e7eed4f08c44d5a1

11 years agofix a number issues that cause failures
Yaowu Xu [Tue, 15 Jan 2013 02:03:34 +0000 (18:03 -0800)]
fix a number issues that cause failures

During master jenkins verification process

Change-Id: I3722b8753eaf39f99b45979ce407a8ea0bea0b89

11 years agoModifies the comp inter-intra expt
Deb Mukherjee [Tue, 15 Jan 2013 01:32:26 +0000 (17:32 -0800)]
Modifies the comp inter-intra expt

Uses a single 1D table to implement the weighting of the predictors
for the compound inter-intra experiment.

Change-Id: I204ffbe4f9fc79d5d43b6c724ad253d800461012

11 years agoUse INT64_MAX instead of LLONG_MAX
John Koleszar [Mon, 14 Jan 2013 19:49:30 +0000 (11:49 -0800)]
Use INT64_MAX instead of LLONG_MAX

These variables have the type int64_t, not long long. long long could
be a larger type than 64 bits. Emulate INT64_MAX for older versions of
MSVC, and remove the unreferenced vpx_ports/vpxtypes.h
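
One way such an emulation can look is sketched below. It assumes that MSVC versions before Visual Studio 2010 (_MSC_VER < 1600) lack <stdint.h>; the exact guard used by this commit is not shown in the log, and initial_best_rd is a hypothetical helper.

```c
#if defined(_MSC_VER) && _MSC_VER < 1600
/* Older MSVC: no <stdint.h>, so provide the 64-bit type directly. */
typedef __int64 int64_t;
#else
#include <stdint.h>
#endif

#ifndef INT64_MAX
#define INT64_MAX 9223372036854775807LL /* 2^63 - 1 */
#endif

/* Typical use: seed a "best cost so far" value before a search. */
static int64_t initial_best_rd(void) {
  return INT64_MAX;
}
```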

Change-Id: Ideaca71838fcd3849d816d5ab17aa347c97d03b0

11 years agoRemove compound intra-intra experiment.
Ronald S. Bultje [Mon, 14 Jan 2013 22:37:53 +0000 (14:37 -0800)]
Remove compound intra-intra experiment.

This experiment gives little gains and adds relatively much code
complexity (and it hinders other experiments), so let's get rid of
it.

Change-Id: Id25e79a137a1b8a01138aa27a1fa0ba4a2df274a

11 years agoMerge experiment "subpelrefmv"
Yaowu Xu [Mon, 14 Jan 2013 22:39:54 +0000 (14:39 -0800)]
Merge experiment "subpelrefmv"

Change-Id: Iac7f3d108863552b850c92c727e00c95571c9e96

11 years agoMerge experiment "widerlpf"
Yaowu Xu [Mon, 14 Jan 2013 20:11:06 +0000 (12:11 -0800)]
Merge experiment "widerlpf"

Change-Id: I0c94475075e66e13cfe4c20fab7db6474441ae86

11 years agoMerge "changed UV plane loop filtering for TX_8X8" into experimental
Yaowu Xu [Mon, 14 Jan 2013 22:47:31 +0000 (14:47 -0800)]
Merge "changed UV plane loop filtering for TX_8X8" into experimental

11 years agoMerge "change to evaluate reference mvs using above only" into experimental
Yaowu Xu [Mon, 14 Jan 2013 22:40:38 +0000 (14:40 -0800)]
Merge "change to evaluate reference mvs using above only" into experimental

11 years agochanged UV plane loop filtering for TX_8X8
Yaowu Xu [Mon, 14 Jan 2013 17:28:35 +0000 (09:28 -0800)]
changed UV plane loop filtering for TX_8X8

In commit 9a1d73d, loop filtering was added for UV 4x4 boundaries
when TX_8X8 is used by a MB. This commit further refined the decision
to be based on the actual transform used for the UV planes. When
UV planes use 4x4 transform, i.e. when prediction mode used is either
I8X8_PRED or SPLITMV, UV planes are filtered on 4x4 boundaries, and no
filtering is applied on 4x4 block boundaries when UV planes use 8X8
transform.

Change-Id: Ibb404face0a1d129b4b4abaf67c55d82e8df8bec

11 years agoMerge "Fix compiler warnings" into experimental
Paul Wilkins [Mon, 14 Jan 2013 22:20:57 +0000 (14:20 -0800)]
Merge "Fix compiler warnings" into experimental

11 years agoMerge "Merge prediction filter" into experimental
Adrian Grange [Mon, 14 Jan 2013 22:18:21 +0000 (14:18 -0800)]
Merge "Merge prediction filter" into experimental

11 years agochange to evaluate reference mvs using above only
Yaowu Xu [Thu, 10 Jan 2013 23:38:29 +0000 (15:38 -0800)]
change to evaluate reference mvs using above only

Change-Id: Ibcc342efac0a9be7a21d9b2c09984d9e16bbb225

11 years agoFix compiler warnings
Yaowu Xu [Mon, 14 Jan 2013 21:56:52 +0000 (13:56 -0800)]
Fix compiler warnings

The warnings caused verify failure with gerrit for several commits

Change-Id: I030df8638bd69b8783a3ac58e720ff9f0bfd546c

11 years agoMerge prediction filter
Adrian Grange [Mon, 14 Jan 2013 20:57:07 +0000 (12:57 -0800)]
Merge prediction filter

Removed the experimental flag from around the prediction filter.

Change-Id: Ic1dd2db8fe8ac17ed5129f83094d4c5cdd5527d2

11 years agoReset x->skip for each iteration in the RD loop.
Ronald S. Bultje [Mon, 14 Jan 2013 20:43:12 +0000 (12:43 -0800)]
Reset x->skip for each iteration in the RD loop.

This prevents ill-defined behaviour, such as setting x->skip for a mode
that is excluded because of frame-level flags (e.g. filter selection,
compound prediction selection), then not breaking out of the RD loop
because the mode is not allowed, but keeping the flag on. Whatever mode
is iterated through next in the RD loop will then carry this flag, and
all sort of bad stuff happens, such as x->skip being set on intra pred
modes.
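
The failure mode can be illustrated with a toy loop. This is only a sketch under simplified assumptions: MacroblockSketch, ModeSketch and first_allowed_skip_mode are hypothetical stand-ins, not the encoder's real types or RD loop.

```c
typedef struct {
  int skip;
} MacroblockSketch;

typedef struct {
  int allowed;   /* 0 if excluded by frame-level flags */
  int sets_skip; /* 1 if evaluating this mode sets x->skip */
} ModeSketch;

/* Returns the index of the first allowed mode that ends its iteration
 * with the skip flag set, or -1 if there is none. */
static int first_allowed_skip_mode(MacroblockSketch *x,
                                   const ModeSketch *modes, int num_modes) {
  int i;
  for (i = 0; i < num_modes; ++i) {
    x->skip = 0; /* the fix: start every iteration with a clean flag */
    if (modes[i].sets_skip) x->skip = 1;
    if (!modes[i].allowed) continue; /* excluded mode, flag may be set */
    if (x->skip) return i;
  }
  return -1;
}
```

Without the reset at the top of the loop, a flag set by an excluded mode would leak into the next iteration and be attributed to the wrong mode.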

Change-Id: I5bec46b36e38292174acb1c564b3caf00a9b4b9a

11 years agoFix unused variable warnings
John Koleszar [Mon, 14 Jan 2013 20:10:11 +0000 (12:10 -0800)]
Fix unused variable warnings

Previous commit does not build cleanly on Jenkins with the DWT/DCT
hybrid experiment enabled (--enable-dwtdcthybrid).

Change-Id: Ia67e8f59d17ef2d5200ec6b90dfe6711ed6835a5

11 years agoFurther enhancements/fixes on dct/dwt hybrid txfm
Deb Mukherjee [Wed, 9 Jan 2013 14:26:54 +0000 (06:26 -0800)]
Further enhancements/fixes on dct/dwt hybrid txfm

Fixes some scaling issues. Adds an option to only compute the
dct on the low-low subband for 32x32 and 64x64 blocks using
only a single 16x16 dct after 1 and 2 wavelet decomposition
levels respectively. Also adds an option to use a 8x8 dct
as building block.

Currently with the 2/6 filter and with a single 16x16 dct on
the low low band, the results compared to the full 32x32 dct are
as follows:
derf: -0.15%
yt: -0.29%
std-hd: -0.18%
hd: -0.6%
These are my current recommended settings, since the 2/6 filter
is very simple.

Results with 8x8 dct are about 0.3% worse.

Change-Id: I00100cdc96e32deced591985785ef0d06f325e44

11 years agoMerge "WIP: Added sse2 version of vp9_mb_lpf_horizontal_edge_w" into experimental
Jim Bankoski [Sat, 12 Jan 2013 01:15:41 +0000 (17:15 -0800)]
Merge "WIP: Added sse2 version of vp9_mb_lpf_horizontal_edge_w" into experimental

11 years agoWIP: Added sse2 version of vp9_mb_lpf_horizontal_edge_w
Scott LaVarnway [Sat, 12 Jan 2013 01:11:04 +0000 (17:11 -0800)]
WIP: Added sse2 version of vp9_mb_lpf_horizontal_edge_w

and vp9_mb_lpf_vertical_edge_w_sse2.  This was quickly done so we can
run some tests over the weekend.  Future commits will optimize/refactor these
functions further.

The decoder performance improved by ~17% for the clip used.

Change-Id: I612687cd5a7670ee840a0cbc3c68dc2b84d4af76

11 years agoMerge "Upstream changes from Chromium Android Clang build." into experimental
Jim Bankoski [Sat, 12 Jan 2013 01:06:26 +0000 (17:06 -0800)]
Merge "Upstream changes from Chromium Android Clang build." into experimental

11 years agoMerge "Add loop filtering for UV plane" into experimental
Yaowu Xu [Sat, 12 Jan 2013 00:56:39 +0000 (16:56 -0800)]
Merge "Add loop filtering for UV plane" into experimental

11 years agoAdd loop filtering for UV plane
Yaowu Xu [Fri, 11 Jan 2013 23:53:32 +0000 (15:53 -0800)]
Add loop filtering for UV plane

On block boundary within a MB when 8x8 block boundary only is filtered
for Y.

Change-Id: Ie1c804c877d199e78e2fecd8c2d3f1e114ce9ec1

11 years agoUpstream changes from Chromium Android Clang build.
Frank Galligan [Fri, 11 Jan 2013 23:34:05 +0000 (15:34 -0800)]
Upstream changes from Chromium Android Clang build.

See https://codereview.chromium.org/11875006/

Change-Id: Ied2a17df2b3222635f84aef120eaa9feb53750d2

11 years agoMerge "Initial sse2 version of the wide loopfilters" into experimental
Scott LaVarnway [Fri, 11 Jan 2013 23:34:26 +0000 (15:34 -0800)]
Merge "Initial sse2 version of the wide loopfilters" into experimental

11 years agoInitial sse2 version of the wide loopfilters
Scott LaVarnway [Fri, 11 Jan 2013 22:54:14 +0000 (14:54 -0800)]
Initial sse2 version of the wide loopfilters

Updated the rtcd_defs and used the sse2 uv version
of the loopfilter.  The performance improved by ~8%
for the test clip used.

Change-Id: I5a0bca3b6674198d40ca4a77b8cc722ddde79c36

11 years agoRemove INT64_MAX references.
Paul Wilkins [Fri, 11 Jan 2013 19:41:39 +0000 (19:41 +0000)]
Remove INT64_MAX references.

Replace INT64_MAX references with LLONG_MAX
for windows build.

Change-Id: Ib8b45c1e9c15c043b2f54c27ed83b8682b2be34f

11 years agoMerge "Reduce the usage of widerlpf" into experimental
Yaowu Xu [Fri, 11 Jan 2013 19:15:43 +0000 (11:15 -0800)]
Merge "Reduce the usage of widerlpf" into experimental

11 years agortcd for new wider loop filters
Jim Bankoski [Fri, 11 Jan 2013 17:45:45 +0000 (09:45 -0800)]
rtcd for new wider loop filters

Change-Id: I8826bcdcf72ba6d86bde31cd13902a710399805c

11 years agoReduce the usage of widerlpf
Yaowu Xu [Thu, 10 Jan 2013 18:55:07 +0000 (10:55 -0800)]
Reduce the usage of widerlpf

The commit changes the code to not use the wider lpf within a superblock
when a 32x32 transform is used for the block.

The commit also changes the code to use the shorter version of loop
filtering for UV planes.

Change-Id: I344c1fb9a3be9d1200782a788bcb0b001fedcff8

11 years agoFix wrong pitch argument in dct32x32 unit test.
Ronald S. Bultje [Thu, 10 Jan 2013 16:36:42 +0000 (08:36 -0800)]
Fix wrong pitch argument in dct32x32 unit test.

Change-Id: Id9474a1686daebfa3d004e21823bf1888ec9e534

11 years agoMerge tx32x32 experiment.
Ronald S. Bultje [Thu, 10 Jan 2013 16:23:59 +0000 (08:23 -0800)]
Merge tx32x32 experiment.

Change-Id: I615651e4c7b09e576a341ad425cf80c393637833

11 years agoMerge "Merge superblocks64 experiment." into experimental
Ronald S. Bultje [Thu, 10 Jan 2013 16:18:33 +0000 (08:18 -0800)]
Merge "Merge superblocks64 experiment." into experimental

11 years agoMerge superblocks64 experiment.
Ronald S. Bultje [Thu, 10 Jan 2013 01:21:28 +0000 (17:21 -0800)]
Merge superblocks64 experiment.

Change-Id: If6c88752dffdb566f8d4322f135145270716fb8e

11 years agoexperiment a wider loop filter for MB border
Yaowu Xu [Wed, 19 Dec 2012 23:53:03 +0000 (15:53 -0800)]
experiment a wider loop filter for MB border

when larger transforms are used

Change-Id: I25251442b44bf251df4c25a1c1fcf71fb2ad913b

11 years agoNew prediction filter
Adrian Grange [Tue, 8 Jan 2013 22:14:01 +0000 (14:14 -0800)]
New prediction filter

This patch removes the old pred-filter experiment and replaces it
with one that is implemented using the switchable filter framework.

If the pred-filter experiment is enabled, three interpolation
filters are tested during mode selection; the standard 8-tap
interpolation filter, a sharp 8-tap filter and a (new) 8-tap
smoothing filter.

The 6-tap filter code has been preserved for now and if the
enable-6tap experiment is enabled (in addition to the pred-filter
experiment) the original 6-tap filter replaces the new 8-tap smooth
filter in the switchable mode.

The new experiment applies the prediction filter in cases of a
fractional-pel motion vector. Future patches will apply the filter
where the mv is pel-aligned and also to intra predicted blocks.

Change-Id: I08e8cba978f2bbf3019f8413f376b8e2cd85eba4

11 years agoAdds 64x64 hybrid dct/dwt transform
Deb Mukherjee [Tue, 8 Jan 2013 20:18:16 +0000 (12:18 -0800)]
Adds 64x64 hybrid dct/dwt transform

This is to add to the 64x64 transform experiment as an alternative to
a 64x64 DCT.
Two levels of wavelet decomposition are used on a 64x64 block, followed
by 16x16 DCT on the four lowest subbands. The highest three subbands
are left untransformed after the first level DWT.

Change-Id: I3d48d5800468d655191933894df6b46e15adca56

11 years agoMerge "Merge superblocks (32x32) experiment." into experimental
Ronald S. Bultje [Tue, 8 Jan 2013 21:31:37 +0000 (13:31 -0800)]
Merge "Merge superblocks (32x32) experiment." into experimental

11 years agoMerge "vp9_sub_pixel_variance16x2 SSE2 optimization" into experimental
Yunqing Wang [Tue, 8 Jan 2013 20:59:08 +0000 (12:59 -0800)]
Merge "vp9_sub_pixel_variance16x2 SSE2 optimization" into experimental

11 years agoMerge superblocks (32x32) experiment.
Ronald S. Bultje [Tue, 8 Jan 2013 18:29:22 +0000 (10:29 -0800)]
Merge superblocks (32x32) experiment.

Change-Id: I0df99742029834a85c4933652b0587cf5b6b2587

11 years agovp9_sub_pixel_variance16x2 SSE2 optimization
Yunqing Wang [Tue, 8 Jan 2013 18:44:19 +0000 (10:44 -0800)]
vp9_sub_pixel_variance16x2 SSE2 optimization

About 5% decoder speedup.

Change-Id: Ib6687d337af758a536a0e7e289f400990f1f9794

11 years agoMerge vp9-preview changes into experimental branch
John Koleszar [Tue, 8 Jan 2013 18:11:26 +0000 (10:11 -0800)]
Merge vp9-preview changes into experimental branch

Incorporate vp9-preview changes by merging master branch into experimental.

Conflicts:
test/test.mk
vp9/common/vp9_filter.c
vp9/common/vp9_idctllm.c
vp9/common/vp9_invtrans.h
vp9/common/vp9_mbpitch.c
vp9/common/vp9_rtcd_defs.sh
vp9/common/vp9_systemdependent.h
vp9/common/vp9_type_aliases.h
vp9/common/x86/vp9_asm_stubs.c
vp9/common/x86/vp9_subpixel_mmx.asm
vp9/decoder/vp9_decodframe.c
vp9/decoder/vp9_dequantize.c
vp9/decoder/vp9_dequantize.h
vp9/decoder/vp9_onyxd_int.h
vp9/encoder/vp9_bitstream.c
vp9/encoder/vp9_encodeframe.c
vp9/encoder/vp9_rdopt.c

Change-Id: I17f51c3666d1b59cf1a699f87607cbc5d30a87c5

11 years agoreset segement map on key frame
Yaowu Xu [Fri, 14 Dec 2012 07:53:11 +0000 (23:53 -0800)]
reset segement map on key frame

This is to fix a decoder crash when the decoder skips a number of frames to
continue decoding from a later key frame.

Change-Id: I3ba116eba6c3440e0528a21f53745f694302e4ad

11 years agoMerge "minor loop filter refactoring and cleanup" into experimental
Yaowu Xu [Tue, 8 Jan 2013 16:40:03 +0000 (08:40 -0800)]
Merge "minor loop filter refactoring and cleanup" into experimental

11 years agominor loop filter refactoring and cleanup
Yaowu Xu [Wed, 19 Dec 2012 19:34:49 +0000 (11:34 -0800)]
minor loop filter refactoring and cleanup

This commit did a couple of minor cleanups and refactorings to prepare for
further loop filter experiments. It merged y_only version of loop filter
function into the regular one, which makes sure that same logic is used
for functions for picking level and for actual loop filtering.

Change-Id: Id10c94dccd45f58e5310bacfdf6ee63cbb60b86f

11 years agoRemove a few redundant function arguments in encodeframe.c.
Ronald S. Bultje [Mon, 7 Jan 2013 19:41:49 +0000 (11:41 -0800)]
Remove a few redundant function arguments in encodeframe.c.

Also reindent a block of code that was misindented after addition of
the tx32x32 experiment.

Change-Id: Ic3e4aae3effd8a40136da68c9f382af03632ba08

11 years agoRe-enable support for static_threshold (encode_breakout).
Ronald S. Bultje [Mon, 7 Jan 2013 19:02:14 +0000 (11:02 -0800)]
Re-enable support for static_threshold (encode_breakout).

Change-Id: Ibd7380f478d3127f9db91d0a4fd2fd0dfde961ab

11 years agoDon't use tx32x32 for macroblocks.
Ronald S. Bultje [Fri, 4 Jan 2013 20:05:40 +0000 (12:05 -0800)]
Don't use tx32x32 for macroblocks.

Change-Id: Ib674e0153ca360867ab7a20ba291ac9171a01250

11 years ago64x64 blocksize support.
Ronald S. Bultje [Sun, 6 Jan 2013 02:20:25 +0000 (18:20 -0800)]
64x64 blocksize support.

3.2% gains on std/hd, 1.0% gains on hd.

Change-Id: I481d5df23d8a4fc650a5bcba956554490b2bd200

11 years agoFix mode selection infinite loop bug
Adrian Grange [Fri, 4 Jan 2013 17:00:47 +0000 (09:00 -0800)]
Fix mode selection infinite loop bug

Mode selection for SBs could enter an infinite loop because
the interpolation filter mode index was not being reset
correctly.

Change-Id: I4bbe726f29ef5b6836e94884067c46084713cc11

11 years agoFurther change to mv reference search.
Paul Wilkins [Thu, 3 Jan 2013 15:14:36 +0000 (15:14 +0000)]
Further change to mv reference search.

This experimental change reorders the search so
that all possible references that match the target
reference frame are tested first and these in order
of distance from the current block. These will usually
be the highest scoring candidates.

If we do not find enough good candidates this way
we try non matching cases. These will usually be lower
scoring candidates.

The change in order together with breakouts when
we have found enough candidates should reduce
the computational cost and especially reduce the number
of sort operations.
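
The two-stage order with breakouts can be sketched as follows. This is an illustration only: it assumes the candidate list is already ordered from nearest to farthest, and CandidateSketch and collect_ref_mvs_sketch are hypothetical names rather than the functions changed by this commit.

```c
typedef struct {
  int ref_frame; /* reference frame used by the neighbouring block */
  int mv;        /* its motion vector, reduced to one int for brevity */
} CandidateSketch;

/* Collect up to max_out motion vectors into out[]. Candidates that use
 * target_ref are taken first, in distance order; non-matching ones are
 * considered only if too few matches were found. Returns the count. */
static int collect_ref_mvs_sketch(const CandidateSketch *cands, int n,
                                  int target_ref, int max_out, int *out) {
  int count = 0;
  int i;
  /* Stage 1: matching reference frame; stop once the list is full. */
  for (i = 0; i < n && count < max_out; ++i) {
    if (cands[i].ref_frame == target_ref) out[count++] = cands[i].mv;
  }
  /* Stage 2: non-matching cases, skipped entirely if stage 1 sufficed. */
  for (i = 0; i < n && count < max_out; ++i) {
    if (cands[i].ref_frame != target_ref) out[count++] = cands[i].mv;
  }
  return count;
}
```

Because the output is filled in priority order and both loops break out when it is full, no sort of the result is needed in this simplified version.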

Quality Results:
Std Hd +0.228%, Hd +0.074%, YT +0.046%, derf +0.137%

This effect is probably due to the fact that more distant
weak candidates are now less likely to get "promoted" over
near candidates even if they are repeated.

Change-Id: Iec37e77d88a48ad0ee1f315b14327a95d63f81f6

11 years agoUse seg/ref/mode offsets in loop_filter_partial.
Marco Paniconi [Thu, 3 Jan 2013 23:47:34 +0000 (15:47 -0800)]
Use seg/ref/mode offsets in loop_filter_partial.

Use the proper seg/mode/ref filter offsets when selecting the
frame loop filter level for fast mode (pick_filter_level_fast).

Change-Id: I2473e2131c800ad19755cb6211ad735fecfe2ac0

11 years agoMerge "make cost_coeffs() and tokenize_b() consistent" into experimental
Yaowu Xu [Thu, 3 Jan 2013 17:57:07 +0000 (09:57 -0800)]
Merge "make cost_coeffs() and tokenize_b() consistent" into experimental

11 years agoMerge "Merge cost_coeffs_2x2() into cost_coeffs()" into experimental
Yaowu Xu [Thu, 3 Jan 2013 17:33:21 +0000 (09:33 -0800)]
Merge "Merge cost_coeffs_2x2() into cost_coeffs()" into experimental

11 years agomake cost_coeffs() and tokenize_b() consistent
Yaowu Xu [Thu, 3 Jan 2013 17:00:30 +0000 (09:00 -0800)]
make cost_coeffs() and tokenize_b() consistent

Change-Id: I7cdb5c32a1400f88ec36d08ea982e38b77731602

11 years agoNew interpolation filter selection algorithm
Adrian Grange [Thu, 20 Dec 2012 22:56:19 +0000 (14:56 -0800)]
New interpolation filter selection algorithm

Old Scheme:
When SWITCHABLE filter selection is enabled the encoder
evaluates the use of each interpolation filter type and
selects the best one to use at the MB level. A frame-
level flag can be set to force the use of a particular
filter type for all MBs in a frame if it is more efficient
to encode that way. The logic here involved a Q dependent
threshold that assumed that the second 8-tap filter was
a high-pass filter. However, this requires a trip around
the recode loop. If the frame-level flag indicates use
of a particular filter, the other filters are not
evaluated in the pick_mode loop.

New Scheme:
Each filter type is evaluated at the MB level and a record
of the best filter is kept, irrespective of what filter
is signaled at the frame-level. Once all MBs have been
encoded, a decision is made as to what frame-level mode
to set for the *next* frame. If one filter is used by 80%
or more of the MBs, then this filter is forced since it
is assumed that this will be more efficient if the
next frame has similar characteristics. i.e. there is a
one-frame lag between measuring the filter selection and
setting the frame-level mode to use.
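
The frame-level decision in the new scheme can be sketched like this, assuming three filter types and a per-frame count of how many MBs picked each one. NUM_FILTERS_SKETCH, SWITCHABLE_SKETCH and next_frame_filter_sketch are hypothetical names for illustration.

```c
enum {
  NUM_FILTERS_SKETCH = 3,
  /* Returned when no single filter dominates: keep per-MB selection. */
  SWITCHABLE_SKETCH = NUM_FILTERS_SKETCH
};

/* counts[i] is the number of MBs in the frame just encoded whose best
 * filter was type i. Returns the frame-level mode for the next frame. */
static int next_frame_filter_sketch(const int counts[NUM_FILTERS_SKETCH]) {
  int total = 0;
  int i;
  for (i = 0; i < NUM_FILTERS_SKETCH; ++i) total += counts[i];
  if (total == 0) return SWITCHABLE_SKETCH;
  for (i = 0; i < NUM_FILTERS_SKETCH; ++i) {
    /* counts[i] / total >= 0.8, written without division */
    if (counts[i] * 5 >= total * 4) return i;
  }
  return SWITCHABLE_SKETCH;
}
```

Since the decision uses statistics from the frame that was just encoded, it costs no extra trip around the recode loop.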

Change-Id: I6a7e7ced8f27e120fafb99db2dc9c6293f8d20f7

11 years agoMerge cost_coeffs_2x2() into cost_coeffs()
Yaowu Xu [Thu, 3 Jan 2013 16:00:00 +0000 (08:00 -0800)]
Merge cost_coeffs_2x2() into cost_coeffs()

Remove special case function cost_coeffs_2x2() and change function
cost_coeffs() to handle the 2nd order haar block as it already handles
all other block types.

Change-Id: I2aac6f81ee0ae9e03d6a8da4f8681d69b79ce41f

11 years agoMerge "Switch the order of calculating 2-D inverse transform" into experimental
Yunqing Wang [Wed, 2 Jan 2013 19:45:27 +0000 (11:45 -0800)]
Merge "Switch the order of calculating 2-D inverse transform" into experimental

11 years agoMerge "Skip finding best ref_mvs when the mode is ZEROMV" into experimental
Yunqing Wang [Wed, 2 Jan 2013 19:45:19 +0000 (11:45 -0800)]
Merge "Skip finding best ref_mvs when the mode is ZEROMV" into experimental

11 years agoChange INT64_MAX to LLONG_MAX
Paul Wilkins [Wed, 2 Jan 2013 18:06:00 +0000 (18:06 +0000)]
Change INT64_MAX to LLONG_MAX

This is needed to make the windows build work after
the removal of vp9_type_aliases.h.

Change-Id: I8addf38e9f3c8b864e0e30a8916a26e0264dd02c

11 years agoAdded update-able mv-ref probabilities.
Paul Wilkins [Mon, 10 Dec 2012 12:38:48 +0000 (12:38 +0000)]
Added update-able mv-ref probabilities.

Part of NEW_MVREF experiment.
Added update-able probabilities.

Change-Id: I5a4fcf4aaed1d0d1dac980f69d535639a3d59401

11 years agoSkip finding best ref_mvs when the mode is ZEROMV
Yunqing Wang [Fri, 28 Dec 2012 00:04:44 +0000 (16:04 -0800)]
Skip finding best ref_mvs when the mode is ZEROMV

Read mode before calling vp9_find_best_ref_mvs(). If the mode is
ZEROMV, the best ref_mvs are not needed. Then, we can skip calling
vp9_find_best_ref_mvs().

Change-Id: I5baa3658dd3f1c7107211cbbbcf919b4584be2e2

11 years agoSwitch the order of calculating 2-D inverse transform
Yunqing Wang [Thu, 27 Dec 2012 21:48:17 +0000 (13:48 -0800)]
Switch the order of calculating 2-D inverse transform

The 2-D inverse transform X = M1*Z*Transposed_M2 was calculated
in 2 steps from left to right:
1. Vertical transform: Y = M1*Z
2. Horizontal transform: X = Y*Transposed_M2
In SIMD, a transpose is needed in vertical transform.

Here, switched the calculation order to do it from right to left.
In this way, we could eliminate that transpose by writing the
intermediate results out to their transposed positions.
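
The right-to-left order can be sketched in scalar C, using plain integer matrices in row-major layout. This is not the SIMD code from the commit, and rows_transposed_sketch, inverse_2d_sketch and MAX_N_SKETCH are hypothetical names. Both passes are the same row-wise 1-D transform, and each writes its output to the transposed position, so no separate transpose step is required.

```c
#define MAX_N_SKETCH 32

/* Apply matrix m to every row of src and store each result as a
 * column of dst, i.e. dst = m * transpose(src). All n x n, row-major. */
static void rows_transposed_sketch(const int *m, const int *src, int *dst,
                                   int n) {
  int i, j, k;
  for (i = 0; i < n; ++i) {
    for (j = 0; j < n; ++j) {
      int acc = 0;
      for (k = 0; k < n; ++k) acc += m[j * n + k] * src[i * n + k];
      dst[j * n + i] = acc; /* written to the transposed position */
    }
  }
}

/* Computes x = m1 * z * transpose(m2) for n <= MAX_N_SKETCH. */
static void inverse_2d_sketch(const int *m1, const int *m2, const int *z,
                              int *x, int n) {
  int tmp[MAX_N_SKETCH * MAX_N_SKETCH];
  /* Horizontal pass first: tmp = transpose(z * transpose(m2)). */
  rows_transposed_sketch(m2, z, tmp, n);
  /* Vertical pass, again as a row transform: x = m1 * transpose(tmp). */
  rows_transposed_sketch(m1, tmp, x, n);
}
```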

Change-Id: I34dfe5eb01292f6e363712420d99475e2e81e12c

11 years agoMerge branch 'vp9-preview' of review:webm/libvpx
John Koleszar [Fri, 21 Dec 2012 23:20:41 +0000 (15:20 -0800)]
Merge branch 'vp9-preview' of review:webm/libvpx

Merge the vp9-preview branch into master.

Change-Id: If700b9054676f24bed9deb59050af546c1ca5296

11 years agoBuild fixes to merge vp9-preview into master
John Koleszar [Sun, 23 Dec 2012 15:20:10 +0000 (07:20 -0800)]
Build fixes to merge vp9-preview into master

Various fixups to resolve issues when building vp9-preview under the more stringent
checks placed on the experimental branch.

Change-Id: I21749de83552e1e75c799003f849e6a0f1a35b07

11 years agoFix a warning
Yunqing Wang [Tue, 18 Dec 2012 20:09:46 +0000 (12:09 -0800)]
Fix a warning

Fixed the warning: the size of array ‘intermediate_buffer’ can’t
be evaluated [-Wvla].

Change-Id: Ibcffd6969bd71cee0c10f7cf18960e58cd0bd915

11 years agoRemoved mmx versions of vp9_bilinear_predict filters
Scott LaVarnway [Fri, 21 Dec 2012 22:41:49 +0000 (14:41 -0800)]
Removed mmx versions of vp9_bilinear_predict filters

These filters will not work with VP9.

Change-Id: Ic26c77961084fcea6bfa97f4cd95afdea2282e85

11 years agoUpdate CHANGELOG for v1.2.0 release v1.2.0
John Koleszar [Fri, 21 Dec 2012 22:14:06 +0000 (14:14 -0800)]
Update CHANGELOG for v1.2.0 release

Change-Id: Ie0525d300c94bfde2947b1cc2572bb7d37e17998

11 years agoMerge "add emmintrin_compat.h for builds with gcc < 4" into vp9-preview vp9-preview
John Koleszar [Fri, 21 Dec 2012 22:21:50 +0000 (14:21 -0800)]
Merge "add emmintrin_compat.h for builds with gcc < 4" into vp9-preview

11 years agoMerge "configure: test for -msse4" into vp9-preview
John Koleszar [Fri, 21 Dec 2012 22:20:15 +0000 (14:20 -0800)]
Merge "configure: test for -msse4" into vp9-preview

11 years agofixed sizes of global arrays
Jim Bankoski [Fri, 21 Dec 2012 21:09:04 +0000 (13:09 -0800)]
fixed sizes of global arrays

Change-Id: Ibc077cf1c1da0c86063f88c6d3073c6876989119

11 years agoFix MSVS build for removed vp9/common/vp9_onyxd.h
John Koleszar [Thu, 20 Dec 2012 23:39:43 +0000 (15:39 -0800)]
Fix MSVS build for removed vp9/common/vp9_onyxd.h

Change-Id: I75ad0b4ca5b53b5bf759cc26a484ec196d275279

11 years agoadd emmintrin_compat.h for builds with gcc < 4
James Zern [Thu, 20 Dec 2012 22:46:55 +0000 (14:46 -0800)]
add emmintrin_compat.h for builds with gcc < 4

Change-Id: If7822e6fcd0d3568b934032322b19ba3e401df26

11 years agoconfigure: test for -msse4
James Zern [Thu, 20 Dec 2012 22:05:19 +0000 (14:05 -0800)]
configure: test for -msse4

not present in < GCC 4.2

Change-Id: I7b1a3078fd782644b1c432ce682d2160ffc2263a

11 years agoadd private to assembly files to insure proper chromebuild
Jim Bankoski [Thu, 20 Dec 2012 17:40:18 +0000 (09:40 -0800)]
add private to assembly files to insure proper chromebuild

Change-Id: I6e43ca73f35401a974ed8ee27738d4318f09fd37

11 years agoNew previous coef context experiment
Deb Mukherjee [Tue, 27 Nov 2012 23:51:06 +0000 (15:51 -0800)]
New previous coef context experiment

Adds an experiment to derive the previous context of a coefficient
not just from the previous coefficient in the scan order but from a
combination of several neighboring coefficients previously encountered
in scan order.  A precomputed table of neighbors for each location
for each scan type and block size is used. Currently 5 neighbors are
used.

Results are about 0.2% positive using a strategy where the max coef
magnitude from the 5 neighbors is used to derive the context.
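
A minimal sketch of that strategy, assuming the 5 neighbor positions for the current location have already been read from the precomputed table. The mapping from magnitude to context index below is a hypothetical three-way split, and context_from_neighbors_sketch is not a name from the source tree.

```c
#include <stdlib.h>

#define NUM_NEIGHBORS_SKETCH 5

/* coefs: coefficients decoded so far, indexed by position.
 * neighbors: positions of the 5 previously coded neighbors. */
static int context_from_neighbors_sketch(
    const int *coefs, const int neighbors[NUM_NEIGHBORS_SKETCH]) {
  int max_mag = 0;
  int i;
  for (i = 0; i < NUM_NEIGHBORS_SKETCH; ++i) {
    const int mag = abs(coefs[neighbors[i]]);
    if (mag > max_mag) max_mag = mag;
  }
  /* Hypothetical mapping: 0 -> context 0, 1 -> context 1, larger -> 2. */
  if (max_mag == 0) return 0;
  if (max_mag == 1) return 1;
  return 2;
}
```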

Change-Id: Ie708b54d8e1898af742846ce2d1e2b0d89fd4ad5

11 years agoDisabled x86inc style assembly functions.... part 2
Scott LaVarnway [Wed, 19 Dec 2012 22:13:25 +0000 (14:13 -0800)]
Disabled x86inc style assembly functions.... part 2

Missed a file

Change-Id: I33179de6755bc9eda9ad906e4fec6902ace435a5