granicus.if.org Git - libvpx/log
libvpx
Dmitry Kovalev [Sun, 4 Aug 2013 09:52:22 +0000 (02:52 -0700)]
Merge "Replacing "txfm" with "tx" in identifiers."

Jim Bankoski [Sun, 4 Aug 2013 02:51:56 +0000 (19:51 -0700)]
reworked find_mv_ref

This is an attempt at rewriting vp9_find_mv_refs_idx. I believe that it gains
about 1-2% decode speed.

Change-Id: Ia5359c94ce9bb43b32652890e605e9a385485c1b

Dmitry Kovalev [Sat, 3 Aug 2013 00:28:23 +0000 (17:28 -0700)]
Replacing "txfm" with "tx" in identifiers.

Makes the names consistent with TX_SIZE and TX_MODE.

Change-Id: I79592218bf5a40ace89197a34a06ee7de581ed8d

Dmitry Kovalev [Fri, 2 Aug 2013 23:54:32 +0000 (16:54 -0700)]
Merge "Adding is_inter_block function."

Dmitry Kovalev [Fri, 2 Aug 2013 23:30:06 +0000 (16:30 -0700)]
Merge "Changing function arg type from int_mv* to MV*."

Dmitry Kovalev [Fri, 2 Aug 2013 23:27:05 +0000 (16:27 -0700)]
Merge "Cleanups around allow_high_precision_mv flag."

Dmitry Kovalev [Fri, 2 Aug 2013 23:25:33 +0000 (16:25 -0700)]
Adding is_inter_block function.

Using it instead of the long, unclear check
"mbmi->ref_frame[0] != INTRA_FRAME".

Change-Id: I9c7b4b3797942fa962bf3ba7460fff3084beabe9
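
For illustration only, a minimal sketch of the kind of helper this commit describes. The one-field MB_MODE_INFO struct, the RefFrame typedef, and the LAST_FRAME enumerator are simplified stand-ins assumed here; only the name is_inter_block and the check "mbmi->ref_frame[0] != INTRA_FRAME" come from the commit message.

```c
#include <stdbool.h>

/* Simplified stand-ins (assumptions): the real libvpx types carry far more
 * state than this. */
typedef enum { INTRA_FRAME = 0, LAST_FRAME = 1 } RefFrame;

typedef struct {
  RefFrame ref_frame[2];
} MB_MODE_INFO;

/* Gives a name to the check quoted in the commit message, so call sites read
 * as a question about the block rather than as a field comparison. */
static bool is_inter_block(const MB_MODE_INFO *mbmi) {
  return mbmi->ref_frame[0] != INTRA_FRAME;
}
```

A call site then reads "if (is_inter_block(mbmi))" instead of repeating the comparison.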

Dmitry Kovalev [Fri, 2 Aug 2013 23:22:50 +0000 (16:22 -0700)]
Merge "Cleaning up set_contexts_on_border function."

Yunqing Wang [Fri, 2 Aug 2013 22:55:09 +0000 (15:55 -0700)]
Merge "Add more checking to using_small_partition_info"

Dmitry Kovalev [Fri, 2 Aug 2013 22:52:26 +0000 (15:52 -0700)]
Cleaning up set_contexts_on_border function.

Change-Id: I8f21c18b29f54b277fb1c167f278f109d9f3b996

Dmitry Kovalev [Fri, 2 Aug 2013 22:26:32 +0000 (15:26 -0700)]
Changing function arg type from int_mv* to MV*.

Change-Id: Ic878d31df2ce783a2c9a8c4bc9ed301ec8ffe25e

Dmitry Kovalev [Fri, 2 Aug 2013 18:53:49 +0000 (11:53 -0700)]
Moving struct loop_filter_info from *.h to *.c file.

Change-Id: I3fe90eb40088a5b07bdc7d66d93ffe6ef99943d5

Adrian Grange [Fri, 2 Aug 2013 18:37:47 +0000 (11:37 -0700)]
Merge "Fixed typos and added a few explanatory comments"

Adrian Grange [Fri, 2 Aug 2013 18:36:46 +0000 (11:36 -0700)]
Merge "Changed name of rd_pick_intra4x4mby_modes"

Johann [Fri, 2 Aug 2013 18:28:31 +0000 (11:28 -0700)]
Merge "vp9: neon: convolve: replace some insns with simpler equivalents"

Johann [Fri, 2 Aug 2013 18:28:25 +0000 (11:28 -0700)]
Merge "vp9: neon: convolve: simplify branching to C fallbacks"

Johann [Fri, 2 Aug 2013 18:28:04 +0000 (11:28 -0700)]
Merge "vp9: neon: optimise loads in horiz convolve functions"

Johann [Fri, 2 Aug 2013 18:27:52 +0000 (11:27 -0700)]
Merge "vp9: neon: add vp9_mb_lpf_* functions"

Dmitry Kovalev [Fri, 2 Aug 2013 18:21:16 +0000 (11:21 -0700)]
Cleanups around allow_high_precision_mv flag.

Change-Id: Ic07f5f8ffeaedd5b7513b464871f83afc82dcd5c

Dmitry Kovalev [Fri, 2 Aug 2013 17:48:27 +0000 (10:48 -0700)]
Replacing long block size enum values with shorter ones.

Change-Id: I0e9329490828684a4fd46f540d89114cc68e8407

Yunqing Wang [Fri, 2 Aug 2013 16:58:46 +0000 (09:58 -0700)]
Merge "Comment out 2 unused speed features"

Mans Rullgard [Tue, 30 Jul 2013 17:08:17 +0000 (18:08 +0100)]
vp9: neon: convolve: replace some insns with simpler equivalents

Change-Id: I5d6906772e6e6adf68d7f0fd5b8b5207a64a3a37

Mans Rullgard [Tue, 30 Jul 2013 16:33:52 +0000 (17:33 +0100)]
vp9: neon: convolve: simplify branching to C fallbacks

Change-Id: Ic7cacd02d6dc9243ad8fc85082c5618a9d1e66dc

Mans Rullgard [Tue, 30 Jul 2013 17:11:06 +0000 (18:11 +0100)]
vp9: neon: optimise loads in horiz convolve functions

Loading to single lanes in multiple registers is expensive since
it requires a read and write of each register which saturates
the register file access.  Loading to single registers followed
by a separate transpose reduces this pressure.

Change-Id: I4cc35887ddbca80e5e635b50d2b1d158de9668ee

Mans Rullgard [Thu, 4 Jul 2013 18:58:41 +0000 (14:58 -0400)]
vp9: neon: add vp9_mb_lpf_* functions

Change-Id: I13e0880df234f15abc4cc7c57fe84488d5d46a75

Dmitry Kovalev [Fri, 2 Aug 2013 06:17:17 +0000 (23:17 -0700)]
Merge "Cleanup: replacing xd->seg with seg, and xd->lf with lf."

Dmitry Kovalev [Fri, 2 Aug 2013 06:16:56 +0000 (23:16 -0700)]
Merge "Cleanup: reusing clamp_mv function."

Jingning Han [Thu, 1 Aug 2013 22:41:35 +0000 (15:41 -0700)]
Merge "Remove unused vp9_short_idct10_32x32_add"

Dmitry Kovalev [Thu, 1 Aug 2013 21:53:14 +0000 (14:53 -0700)]
Cleanup: replacing xd->seg with seg, and xd->lf with lf.

Change-Id: I73b59d7699a8e7e7acd3bf8041cb6c98ce9ba4bf

Dmitry Kovalev [Thu, 1 Aug 2013 22:07:12 +0000 (15:07 -0700)]
Merge "Cleanup: removing unused function arguments."

Dmitry Kovalev [Thu, 1 Aug 2013 22:06:34 +0000 (15:06 -0700)]
Cleanup: reusing clamp_mv function.

Change-Id: I8715f08a3554bdb557c5f935f1dfbd671f18e766

Dmitry Kovalev [Thu, 1 Aug 2013 21:50:14 +0000 (14:50 -0700)]
Merge "Nice looking motion vector clamping functions."

Deb Mukherjee [Thu, 1 Aug 2013 21:18:43 +0000 (14:18 -0700)]
Merge "Adds a source variance computation function"

Dmitry Kovalev [Thu, 1 Aug 2013 21:15:53 +0000 (14:15 -0700)]
Merge "vp9_get_pred_context_intra_inter cleanup."

Dmitry Kovalev [Wed, 31 Jul 2013 23:59:15 +0000 (16:59 -0700)]
Cleanup: removing unused function arguments.

Change-Id: I27471768980fc631916069f24bc7c482a5c9ca17

Dmitry Kovalev [Wed, 31 Jul 2013 23:11:03 +0000 (16:11 -0700)]
Nice looking motion vector clamping functions.

Removing assign_and_clamp_mv function, making implementation of clamp_mv
and clamp_mv2 more clear and consistent.

Change-Id: Iecd08e1c1bf0379f8314ebe01811f8253f4ade58

Deb Mukherjee [Thu, 1 Aug 2013 19:56:12 +0000 (12:56 -0700)]
Adds a source variance computation function

Adds a function to compute source variance for various
sb_types to be used for pruning mode and partition searches.
[The existing activity measure function is currently specialized
for only 16x16 MBs and needs to be updated].

Change-Id: I22a41e6f1430184201487326fdbebb9b47e6fc24
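
The commit message does not say how the source variance is computed. As a hedged illustration, one common integer formulation is the sum of squared deviations from the block mean, sse - sum^2 / n. The function name and signature below are hypothetical and are not taken from libvpx.

```c
#include <stdint.h>

/* Sum of squared deviations from the block mean (the variance multiplied by
 * the pixel count), computed as sse - sum^2 / n in integer arithmetic.
 * The division truncates, so the result can be slightly above the exact
 * value. width and height must both be positive. */
static uint64_t block_sum_sq_dev(const uint8_t *src, int stride,
                                 int width, int height) {
  uint64_t sum = 0;
  uint64_t sse = 0;
  for (int r = 0; r < height; ++r) {
    for (int c = 0; c < width; ++c) {
      const uint64_t p = src[r * stride + c];
      sum += p;
      sse += p * p;
    }
  }
  return sse - (sum * sum) / ((uint64_t)width * (uint64_t)height);
}
```

Because width and height are parameters, the same routine covers any block size, which is the property the commit asks for. A flat block yields 0, which is the kind of signal a mode or partition search could use for pruning.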

Jingning Han [Thu, 1 Aug 2013 19:45:16 +0000 (12:45 -0700)]
Remove unused vp9_short_idct10_32x32_add

The inverse 32x32 transform detects all zero entries and skips the
computations accordingly per 8 rows in the first 1-D operation. The
function vp9_short_idct10_32x32_add performs differently and is not
used anywhere, hence removed.

Change-Id: Ic4fad422debbde7b6b6ffed47c69fbd4268a906c

Jingning Han [Thu, 1 Aug 2013 18:53:39 +0000 (11:53 -0700)]
Merge "Optimize 32x32 2D inverse DCT for speed-up"

Yunqing Wang [Thu, 1 Aug 2013 18:37:41 +0000 (11:37 -0700)]
Add more checking to using_small_partition_info

If the partition is out of partition size range, we don't
need to process small partition information.

Change-Id: Ice9bfbbdebe1f2ef79271a3aee17de0ed4608376

Yunqing Wang [Thu, 1 Aug 2013 18:03:34 +0000 (11:03 -0700)]
Comment out 2 unused speed features

use_min_partition_size and use_max_partition_size are not used
currently, and could be added back if needed later.

Change-Id: Ib22a9c06b064567a7c1d6d5445567ed77e0d3acc

Dmitry Kovalev [Thu, 1 Aug 2013 17:19:51 +0000 (10:19 -0700)]
Merge "Adding missing const to vp9_extra_bits array."

Adrian Grange [Thu, 1 Aug 2013 16:04:15 +0000 (09:04 -0700)]
Fixed typos and added a few explanatory comments

Change-Id: Ib4e4b41094b54874ee34343dd77c0c131ceed9d2

Adrian Grange [Wed, 31 Jul 2013 19:58:19 +0000 (12:58 -0700)]
Changed name of rd_pick_intra4x4mby_modes

The function name rd_pick_intra4x4mby_modes is confusing, so
I changed it to rd_pick_intra_sub_8x8_y_modes to better
reflect what the function does. Also added const qualifiers
to some of the input parameters and removed camel-case.

Change-Id: I23d53d4c7af5d79ed8a471acd59a09bbb47add39

Dmitry Kovalev [Thu, 1 Aug 2013 01:51:18 +0000 (18:51 -0700)]
Adding missing const to vp9_extra_bits array.

Change-Id: Icd128ab58719e0b9066bdfa66a5d0d427a84d6df

Dmitry Kovalev [Thu, 1 Aug 2013 01:33:04 +0000 (18:33 -0700)]
vp9_get_pred_context_intra_inter cleanup.

Change-Id: I8beeee4c020425175f7d5ec83be86afa7b95da1a

Jingning Han [Wed, 31 Jul 2013 23:50:34 +0000 (16:50 -0700)]
Optimize 32x32 2D inverse DCT for speed-up

This commit exploits the sparsity of the quantized coefficient matrix.
It checks each 32x8 array and skips the corresponding inverse
transformation if all entries are zero.

For ped1080p at 8000 kbps, this on average reduces the runtime of the
32x32 inverse 2D-DCT SSE2 function from 6256 cycles to 5200
cycles. It makes the overall encoding process about 2% faster at
speed 0. The speed-up is more pronounced for the decoding process.

Change-Id: If20056c3566bd117642a76f8884c83e8bc8efbcf
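
A rough sketch of the zero-detection idea, not the SSE2 code the commit touches. It assumes a row-major 32x32 coefficient buffer and treats each "32x8 array" as a band of 8 consecutive rows; the function and constant names are hypothetical.

```c
#include <stdint.h>

enum { IDCT_SIZE = 32, BAND_ROWS = 8 };

/* Returns 1 if every coefficient in rows [row0, row0 + BAND_ROWS) is zero. */
static int band_is_zero(const int16_t *coeffs, int row0) {
  for (int i = row0 * IDCT_SIZE; i < (row0 + BAND_ROWS) * IDCT_SIZE; ++i) {
    if (coeffs[i] != 0) return 0;
  }
  return 1;
}

/* Counts the bands that still need a first-pass 1-D transform. An all-zero
 * band can be skipped because the transform is linear: zero input gives
 * zero output. */
static int count_nonzero_bands(const int16_t *coeffs) {
  int bands = 0;
  for (int row0 = 0; row0 < IDCT_SIZE; row0 += BAND_ROWS) {
    if (!band_is_zero(coeffs, row0)) ++bands;
  }
  return bands;
}
```

In a real transform the per-band check would guard the 1-D pass itself rather than feed a counter; the counter only makes the effect easy to observe.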

Jingning Han [Thu, 1 Aug 2013 00:02:06 +0000 (17:02 -0700)]
Remove unnecessary arguments in rd_pick_ref_frame

This commit removes redundant arguments passed to
rd_pick_reference_frame. This resolves the clang warnings about
potential use of uninitialized values.

Change-Id: Ic68f949a9f8fcd0a583786b0c75321104ea44739

Dmitry Kovalev [Wed, 31 Jul 2013 22:03:36 +0000 (15:03 -0700)]
vp9_decodemv.c cleanup.

Inlining VP9_NMV_UPDATE_PROB constant, consistent local variable names.

Change-Id: I01692501982568fa535882d6b320e3c692f88abb

Dmitry Kovalev [Wed, 31 Jul 2013 18:58:00 +0000 (11:58 -0700)]
Removing get_mi_{row, col} functions.

Passing mi_row and mi_col parameters to functions explicitly. Removing
unused xd argument from scale_mv function.

Change-Id: Icb4c495ec72d26fb066c14470d3ae0b741fbf18a

Dmitry Kovalev [Wed, 31 Jul 2013 19:03:04 +0000 (12:03 -0700)]
Merge "Removing unused "ishp" arguments."

Dmitry Kovalev [Wed, 31 Jul 2013 19:02:35 +0000 (12:02 -0700)]
Merge "Consistent update for inter_mode probabilities."

Dmitry Kovalev [Wed, 31 Jul 2013 18:27:53 +0000 (11:27 -0700)]
Removing unused "ishp" arguments.

Using different variable names "allow_hp" and "use_hp" instead of "usehp".

Change-Id: I0cd5996ddeb46bd754473b680a993c0aaf8eb879

Jingning Han [Wed, 31 Jul 2013 16:11:37 +0000 (09:11 -0700)]
Merge "Make the use of ref_frame index consistent"

Jingning Han [Tue, 30 Jul 2013 22:47:12 +0000 (15:47 -0700)]
Make the use of ref_frame index consistent

Refactor the frame buffer referencing in choose_partition and make
it consistent with other places. This is meant to prevent potential
issues when we extend the reference frame buffer.

Change-Id: I5ff33ed5f671e1f4cc7049622212769a9b4578d9

Dmitry Kovalev [Wed, 31 Jul 2013 01:06:34 +0000 (18:06 -0700)]
Consistent update for inter_mode probabilities.

Using inter-mode counts instead of inter-mode-tree branch counts inside
FRAME_COUNTS structure.

Change-Id: I60dde13af37d06146d7d15543311c1b5044e9e04

Adrian Grange [Tue, 30 Jul 2013 20:01:14 +0000 (13:01 -0700)]
Merge "Cleanup: remove two stray '+', fix typos."

Adrian Grange [Tue, 30 Jul 2013 19:59:46 +0000 (12:59 -0700)]
Merge "Cleanup typos, remove unnecessary lines, replace switch"

Adrian Grange [Tue, 30 Jul 2013 17:47:46 +0000 (10:47 -0700)]
Cleanup: remove two stray '+', fix typos.

Change-Id: I9c30e3dbedabe4942439a0ee2f691fb9a04cd03b

Adrian Grange [Tue, 30 Jul 2013 17:16:03 +0000 (10:16 -0700)]
Cleanup typos, remove unnecessary lines, replace switch

Removed unnecessary code lines, replaced switch with an if,
fixed spelling errors and formatting.

Change-Id: Ie48aa4604aa0ed48362ca359d792fb21b2ec1dc6

Yaowu Xu [Tue, 30 Jul 2013 16:38:02 +0000 (09:38 -0700)]
Merge "removed duplication"

Yaowu Xu [Tue, 30 Jul 2013 16:09:14 +0000 (09:09 -0700)]
removed duplication

Change-Id: Ica23b66f6664e5a5b168499584f0afffbc54794f

Jingning Han [Tue, 30 Jul 2013 00:06:18 +0000 (17:06 -0700)]
Remove a redundant branching in tokenize_b

The tokenize_b function is only called when the output flag is on, so
the conditional branch on that flag inside it is removed.

Change-Id: Ib709f47f23f39ca05a695faf86fa3377f11f2dd0

Jingning Han [Mon, 29 Jul 2013 23:03:02 +0000 (16:03 -0700)]
Tune tokenization/detokenization flow for speed-up

This commit optimizes the tokenization and detokenization operational
flow for speed-up. It makes the coding process about 0.3% faster at
speed 0.

Change-Id: I28008df7482874e4b5f237f2d418ff82a249dd56

Jingning Han [Mon, 29 Jul 2013 21:54:31 +0000 (14:54 -0700)]
Skip redundant tokenization in rd loop

This commit makes the encoder skip the redundant tokenization process
in the rate-distortion optimization search loop, while updating the
entropy contexts accordingly. It makes the speed 0 encoding process
about 0.5% faster with no performance change.

Change-Id: I34a4155a0b5332afeb45c93a51c7f35a294d685c

Jingning Han [Mon, 29 Jul 2013 22:29:25 +0000 (15:29 -0700)]
Merge "16x16 inverse 2D-DCT with DC only"

John Koleszar [Mon, 29 Jul 2013 22:09:15 +0000 (15:09 -0700)]
Merge "Remove unnecessary 64 byte alignment"

Jingning Han [Mon, 29 Jul 2013 20:55:22 +0000 (13:55 -0700)]
16x16 inverse 2D-DCT with DC only

This commit adds special handling for the 16x16 inverse 2D-DCT when
only the DC coefficient is quantized to a non-zero value.

Change-Id: I7bf71be7fa13384fab453dc8742b5b50e77a277c
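
Why a DC-only block can take a shortcut: the DC basis function of the DCT is constant, so the inverse transform of a DC-only block is the same residual value at every pixel. The sketch below illustrates that idea under simplifying assumptions. The helper names are hypothetical, and the scaling and rounding that turn the DC coefficient into dc_residual are left to the caller because the commit message does not give them.

```c
#include <stdint.h>

/* Clamps a reconstructed value to the 8-bit pixel range. */
static uint8_t clip_pixel(int value) {
  return (uint8_t)(value < 0 ? 0 : (value > 255 ? 255 : value));
}

/* Adds one precomputed residual to every pixel of a size x size block,
 * replacing the full 2-D inverse transform for a DC-only block. */
static void add_dc_only_residual(uint8_t *dest, int stride, int size,
                                 int dc_residual) {
  for (int r = 0; r < size; ++r) {
    for (int c = 0; c < size; ++c) {
      dest[r * stride + c] = clip_pixel(dest[r * stride + c] + dc_residual);
    }
  }
}
```

The same routine covers the 8x8 and 16x16 cases by changing size.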

Dmitry Kovalev [Mon, 29 Jul 2013 21:35:55 +0000 (14:35 -0700)]
Renaming txfm to tx for consistency in some places.

Change-Id: I2a6a646570e2af66315e7c658d00d99f80c4b127

John Koleszar [Mon, 29 Jul 2013 21:02:02 +0000 (14:02 -0700)]
Remove unnecessary 64 byte alignment

Fixes a warning on MSVS 2012 where the alignment of vp9_default_iscan_8x8
didn't match between its declaration and definition.

Change-Id: I1466a15635f4b22594d705d570b7e399bfb6cf21

Dmitry Kovalev [Mon, 29 Jul 2013 20:38:40 +0000 (13:38 -0700)]
Renaming NB_TXFM_MODES constant to TX_MODES.

Change-Id: I10bf06e3a3d5271221ae6a42a36074d01d493039

Dmitry Kovalev [Sat, 27 Jul 2013 00:15:37 +0000 (17:15 -0700)]
Renaming TX_SIZE_MAX_SB to TX_SIZES.

Change-Id: I6aa4191935aa93461a07c41b59fdae1eb5f5f107

Jingning Han [Mon, 29 Jul 2013 18:04:07 +0000 (11:04 -0700)]
Merge "Shortcut 8x8/16x16 inverse 2D-DCT"

Dmitry Kovalev [Sat, 27 Jul 2013 02:05:37 +0000 (19:05 -0700)]
Cleanup: replacing xd->mode_info_context with temp variable.

Change-Id: I5a3e83102784cabb918a5404405fcab99c5bb9b6

Ronald S. Bultje [Wed, 24 Jul 2013 22:13:58 +0000 (15:13 -0700)]
Inverse dimension order in token_cost array.

This allows us to increment the position at the band-level only as
we go from one band to the next; more importantly, that allows us to
use an add instead of multiply instruction, and omit the instruction
altogether if the band doesn't change from one coef to the next, thus
being slightly faster (probably more noticeable on systems where a
multiply is expensive, like arm).

Change-Id: I4343fe35b9f9a47fa00b217bdcbf5f91ff96c381

Dmitry Kovalev [Sat, 27 Jul 2013 00:24:34 +0000 (17:24 -0700)]
Merge "vp9_decodemv.c cleanup."

Ronald S. Bultje [Sat, 27 Jul 2013 00:21:09 +0000 (17:21 -0700)]
Merge "d45 intra prediction SSSE3 optimizations."

Ronald S. Bultje [Sat, 27 Jul 2013 00:20:58 +0000 (17:20 -0700)]
Merge "Save pixels instead of coefficients in intra4x4 RD loop."

Ronald S. Bultje [Sat, 27 Jul 2013 00:20:51 +0000 (17:20 -0700)]
Merge "Add best_rd breakout in intra4x4 RD loop."

Jingning Han [Sat, 27 Jul 2013 00:01:51 +0000 (17:01 -0700)]
Shortcut 8x8/16x16 inverse 2D-DCT

This commit brought back the shortcut implementation of the 8x8/16x16
inverse 2D-DCT. When eob <= 10, it skips the inverse transform
operations on rows 4:7/4:15 in the first pass. For bus_cif at 1000
kbps, this provides about 2% speed-up at speed 0.

Change-Id: I453e2d72956467d75be4ad8c04b4482ab889d572

Dmitry Kovalev [Fri, 26 Jul 2013 23:49:49 +0000 (16:49 -0700)]
vp9_decodemv.c cleanup.

Renaming:
  read_intra_mode_info  -> read_intra_frame_mode_info
  read_inter_mode_info  -> read_inter_frame_mode_info
  read_intra_block_part -> read_intra_block_mode_info
  read_inter_block_part -> read_inter_block_mode_info
  read_ref_frame        -> read_ref_frames
  read_reference_frame  -> read_is_inter_block

Using num_4x4_blocks_{wide, high}_lookup instead of bit shifts.

Change-Id: I83c81573b4ef6f53f2f8d24683895014bebfba61

Jingning Han [Fri, 26 Jul 2013 23:04:14 +0000 (16:04 -0700)]
Merge "Special handle on DC only inverse 8x8 2D-DCT"

Dmitry Kovalev [Fri, 26 Jul 2013 22:47:40 +0000 (15:47 -0700)]
Merge "Making read_inter_mode_info function more clear."

hkuang [Fri, 26 Jul 2013 22:24:28 +0000 (15:24 -0700)]
Merge "Fix some format error and code error in neon code."

Jingning Han [Fri, 26 Jul 2013 21:11:37 +0000 (14:11 -0700)]
Special handle on DC only inverse 8x8 2D-DCT

This commit enables special handling for the 8x8 inverse 2D-DCT
when only the DC coefficient is quantized to a non-zero value. For
bus_cif at 2000 kbps, it provides about 1% speed-up at speed 0.

Change-Id: I2523222359eec26b144cf8fd4c63a4ad63b1b011

hkuang [Fri, 26 Jul 2013 21:10:39 +0000 (14:10 -0700)]
Fix some format error and code error in neon code.

Change-Id: I748dee8938dfb19f417f24eed005f3d216f83a82

Dmitry Kovalev [Fri, 26 Jul 2013 20:59:39 +0000 (13:59 -0700)]
Merge "General cleanups."

Ronald S. Bultje [Wed, 24 Jul 2013 22:11:04 +0000 (15:11 -0700)]
d45 intra prediction SSSE3 optimizations.

Change-Id: Ie48035ff4f93c41f8a9b3023e6444fd10432d8fb

Yaowu Xu [Fri, 26 Jul 2013 19:10:27 +0000 (12:10 -0700)]
Merge "Auto min and max partition size experiment."

Paul Wilkins [Wed, 24 Jul 2013 13:07:37 +0000 (14:07 +0100)]
Auto min and max partition size experiment.

Speed feature experiment to set an upper and lower
partition size limit based on what has been seen
in spatial neighbors.

This seems to give quite reasonable speed gains in local tests
(10-15%), and when used with speed 0 the losses are small
(0.25% derf, 0.35% stdhd). However, for now I am only
enabling it on speed 1 as there may be clashes with the existing
temporal partition selection in speed 2.

Using a tighter min / max around the range derived from the
neighbors increases speed further but at the cost of a
bigger quality loss. However, I think this spatial method could
be combined with data from either the last frame or a variance
method (or both) to refine the range of minimum and maximum
partition size. I.e. consider the min and max from spatial and
temporal neighbors and the variance recommendation.

Change-Id: I1b96bf8b84368d6aad0c7aa600fe141b4f07435f
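
A minimal sketch of the spatial idea described above: derive the partition search range from the block sizes already chosen for neighboring blocks. The reduced BlockSize enum, the PartitionRange struct, and the function name are hypothetical, and which neighbors the real encoder consults is not stated in the commit message.

```c
/* Hypothetical, reduced block-size enum ordered from smallest to largest. */
typedef enum {
  SIZE_4X4,
  SIZE_8X8,
  SIZE_16X16,
  SIZE_32X32,
  SIZE_64X64
} BlockSize;

typedef struct {
  BlockSize min_size;
  BlockSize max_size;
} PartitionRange;

/* Returns the smallest and largest sizes seen among the neighbors. With no
 * neighbors available, the full range is kept so nothing is pruned. */
static PartitionRange range_from_neighbors(const BlockSize *neighbors,
                                           int count) {
  PartitionRange range = {SIZE_4X4, SIZE_64X64};
  if (count <= 0) return range;
  range.min_size = neighbors[0];
  range.max_size = neighbors[0];
  for (int i = 1; i < count; ++i) {
    if (neighbors[i] < range.min_size) range.min_size = neighbors[i];
    if (neighbors[i] > range.max_size) range.max_size = neighbors[i];
  }
  return range;
}
```

The partition search would then consider only sizes inside the returned range. Tightening the range further trades quality for speed, as the commit message notes.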

Yunqing Wang [Fri, 26 Jul 2013 02:17:46 +0000 (19:17 -0700)]
Modify static threshold calculation

Used 3 * standard_deviation in the internal threshold calculation
instead of a fitted curve. This actually approximates the algorithm
better.
Similar tests were run for comparison, and the overall PSNR loss is
less than before:
1. derf set:
when static-thresh = 1, psnr loss is 0.329%;
when static-thresh = 500, psnr loss is 0.970%;
2. stdhd set:
when static-thresh = 1, psnr loss is 0.922%;
when static-thresh = 500, psnr loss is 1.307%;

Similar speedup is achieved. For example,
clip            bitrate  static-thresh psnr    time
akiyo(cif)       500        0          48.952  5.077s(50f)
akiyo            500        500        48.866  4.169s(50f)

parkjoy(1080p)   4000       0          30.388  78.20s(30f)
parkjoy          4000       500        30.367  70.85s(30f)

sunflower(1080p) 4000       0          44.402  74.55s(30f)
sunflower        4000       500        44.414  68.69s(30f)

Change-Id: Ic78833642ce1911dbbd1cb6c899a2d7e2dfcc1f3

Dmitry Kovalev [Thu, 25 Jul 2013 22:30:18 +0000 (15:30 -0700)]
Making read_inter_mode_info function more clear.

Now read_inter_mode_info calls read_intra_block_part (renamed from
read_intra_block_modes) or read_inter_block_part (just added).

Change-Id: I541badea6b663e0ae692ec158665efb90ed20c03

Johann [Thu, 25 Jul 2013 22:10:52 +0000 (15:10 -0700)]
Merge "Add const to vp9_accum_mv_refs parameter"

Yunqing Wang [Thu, 25 Jul 2013 21:58:00 +0000 (14:58 -0700)]
Merge "Add encoding option --static-thresh"

Yunqing Wang [Thu, 11 Jul 2013 18:15:00 +0000 (11:15 -0700)]
Add encoding option --static-thresh

This option exists in VP8, and it was rewritten in VP9 to support
skipping on different partition levels. After prediction is done,
we can check whether the residuals in the partition block will all be
quantized to 0. If so, the skip flag is set, and only the
prediction data are needed in reconstruction. Based on the DCT's energy
conservation property, the skipping check can be estimated in
the spatial domain.

The prediction error is calculated and compared to a threshold.
The threshold is determined by the dequant values, and also
adjusted by partition sizes. To be precise, the DC and AC parts
for the Y, U, and V planes are checked to decide whether to skip.

Tests showed that
1. derf set:
when static-thresh = 1, psnr loss is 0.666%;
when static-thresh = 500, psnr loss is 1.162%;
2. stdhd set:
when static-thresh = 1, psnr loss is 1.249%;
when static-thresh = 500, psnr loss is 1.668%;

For different clips, the encoding speedup ranges from a few
percent to 20+% when static-thresh <= 500. For example,
clip            bitrate  static-thresh psnr    time
akiyo(cif)       500        0          48.923  5.635s(50f)
akiyo            500        500        48.863  4.402s(50f)

parkjoy(1080p)   4000       0          30.380  77.54s(30f)
parkjoy          4000       500        30.384  69.59s(30f)

sunflower(1080p) 4000       0          44.461  85.2s(30f)
sunflower        4000       500        44.418  78.1s(30f)

Higher static-thresh values give larger speedup with larger
quality loss.

Change-Id: I857031ceb466ff314ab580ac5ec5d18542203c53
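
A simplified sketch of the spatial-domain skip test described in this commit. It computes the prediction error energy (SSE) for one plane and compares it with a caller-supplied threshold. The function names are hypothetical. The commit message says the threshold comes from the dequant values and the partition size, and that DC and AC are checked separately for Y, U, and V; none of that is modeled here, since the exact formulas are not given.

```c
#include <stdint.h>

/* Sum of squared differences between the source and its prediction. */
static uint64_t prediction_sse(const uint8_t *src, int src_stride,
                               const uint8_t *pred, int pred_stride,
                               int width, int height) {
  uint64_t sse = 0;
  for (int r = 0; r < height; ++r) {
    for (int c = 0; c < width; ++c) {
      const int diff = src[r * src_stride + c] - pred[r * pred_stride + c];
      sse += (uint64_t)(diff * diff);
    }
  }
  return sse;
}

/* The DCT preserves energy, so a small enough residual energy in the pixel
 * domain suggests every coefficient would quantize to zero. The threshold is
 * an input here; how it is derived is left to the caller. */
static int can_skip_residual(uint64_t sse, uint64_t threshold) {
  return sse < threshold;
}
```

When the test passes, the encoder can set the skip flag and avoid the transform and quantization for that block; a larger threshold skips more blocks at a larger quality cost.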

Johann [Thu, 25 Jul 2013 21:24:49 +0000 (14:24 -0700)]
Add const to vp9_accum_mv_refs parameter

Change-Id: I0625d8ffddf590dfecd1bb8b8d6f57ef64b8bf18

Dmitry Kovalev [Thu, 25 Jul 2013 21:13:44 +0000 (14:13 -0700)]
General cleanups.

Removing unused constants, macros, and function declarations. Using
ROUND_POWER_OF_TWO macro, vp9_zero, vp9_copy where possible. Moving
#include from *.h to *.c. Merging for loops for motion vectors.

Change-Id: Ic3bf841764a2bb177128bb3a6d7aa8f68229cd13

Dmitry Kovalev [Thu, 25 Jul 2013 20:57:28 +0000 (13:57 -0700)]
Merge "Adding lookup table for size group."

Dmitry Kovalev [Wed, 24 Jul 2013 00:57:15 +0000 (17:57 -0700)]
Adding lookup table for size group.

Change-Id: Ia6144d77ebed66e0739b62e4d673e26a95aa9550

Adrian Grange [Thu, 25 Jul 2013 19:58:38 +0000 (12:58 -0700)]
Merge "Simplify handling of sub-partition motion vectors"