granicus.if.org Git - libvpx/log
libvpx
8 years agoMerge "mv vp10_fwd_txfm2d_#x# into vp10_rtcd.h" into nextgenv2
Angie Chiang [Tue, 22 Mar 2016 01:07:56 +0000 (01:07 +0000)]
Merge "mv vp10_fwd_txfm2d_#x# into vp10_rtcd.h" into nextgenv2

8 years agoMerge "Passing TXFM_TYPE instead of func pointer" into nextgenv2
Angie Chiang [Tue, 22 Mar 2016 01:07:42 +0000 (01:07 +0000)]
Merge "Passing TXFM_TYPE instead of func pointer" into nextgenv2

8 years agoMerge "Refactor transform type-size search function" into nextgenv2
Yue Chen [Tue, 22 Mar 2016 00:58:44 +0000 (00:58 +0000)]
Merge "Refactor transform type-size search function" into nextgenv2

8 years agoMerge "Refactor sub8x8 reference motion vector search function" into nextgenv2
Jingning Han [Tue, 22 Mar 2016 00:07:45 +0000 (00:07 +0000)]
Merge "Refactor sub8x8 reference motion vector search function" into nextgenv2

8 years agoMerge "Rework the DRL syntax entropy coding system" into nextgenv2
Jingning Han [Tue, 22 Mar 2016 00:07:35 +0000 (00:07 +0000)]
Merge "Rework the DRL syntax entropy coding system" into nextgenv2

8 years agoRefactor transform type-size search function
Yue Chen [Mon, 21 Mar 2016 18:53:57 +0000 (11:53 -0700)]
Refactor transform type-size search function

Decompose choose_tx_size_from_rd into three functions that determine
the transform coding rd at different stages. Besides the original
function, txfm_yrd() calculates the rd for fixed size and type.
choose_tx_size_fix_type() fixes the type and searches for the size.
It can enable other experiments to do restricted tx searches so as to
reduce the impact on speed.
Similar refactoring is done for select_tx_type_yrd() in VAR_TX.

Performance change in baseline is trivial:
0.014/0.001/-0.020 for lowres/midres/hdres.

Change-Id: I2ecbf6066329be088ec1bfb69013b657b14b8afe
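
The three-stage decomposition described in this commit can be sketched roughly as follows. This is a toy model only: the function names, the struct, and the table-lookup cost are all hypothetical stand-ins, not the libvpx signatures, which this log does not show.

```c
/* Toy model: every name here is hypothetical, not a libvpx function. */
enum { TOY_TX_SIZES = 3, TOY_TX_TYPES = 4 };

typedef struct {
  int type;
  int size;
  int rd;
} toy_tx_choice;

/* Stage 1: RD cost for one fixed (type, size) pair; here a table lookup. */
static int toy_txfm_yrd(const int rd[][TOY_TX_SIZES], int type, int size) {
  return rd[type][size];
}

/* Stage 2: fix the type and search over sizes only. */
static toy_tx_choice toy_choose_size_fix_type(const int rd[][TOY_TX_SIZES],
                                              int type) {
  toy_tx_choice best = { type, 0, toy_txfm_yrd(rd, type, 0) };
  for (int size = 1; size < TOY_TX_SIZES; ++size) {
    const int cost = toy_txfm_yrd(rd, type, size);
    if (cost < best.rd) {
      best.size = size;
      best.rd = cost;
    }
  }
  return best;
}

/* Stage 3: full search over types, reusing stage 2 for each type. */
static toy_tx_choice toy_choose_size_and_type(const int rd[][TOY_TX_SIZES]) {
  toy_tx_choice best = toy_choose_size_fix_type(rd, 0);
  for (int type = 1; type < TOY_TX_TYPES; ++type) {
    const toy_tx_choice c = toy_choose_size_fix_type(rd, type);
    if (c.rd < best.rd) best = c;
  }
  return best;
}
```

An experiment that wants a restricted search would call the stage 2 helper directly with its one allowed type instead of running the full stage 3 loop.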

8 years agoMerge "Properly set rate_nocoef when pallete mode is used" into nextgenv2
Yaowu Xu [Mon, 21 Mar 2016 20:44:17 +0000 (20:44 +0000)]
Merge "Properly set rate_nocoef when pallete mode is used" into nextgenv2

8 years agoMerge "Adds 1D transforms for ADST/FlipADST to make 16" into nextgenv2
Debargha Mukherjee [Mon, 21 Mar 2016 20:40:20 +0000 (20:40 +0000)]
Merge "Adds 1D transforms for ADST/FlipADST to make 16" into nextgenv2

8 years agoWrite MB tokens using the forward buffered ANS writer.
Alex Converse [Wed, 17 Feb 2016 19:07:20 +0000 (11:07 -0800)]
Write MB tokens using the forward buffered ANS writer.

This allows sharing more code paths with the rest of the code and allows
for easier compatibility with the other experiments.

Change-Id: Id288b533805a4d0657ec2f17542f2e6ad23ebdb4

8 years agoMerge "Add a placeholder forward buffered ANS coder." into nextgenv2
Alex Converse [Mon, 21 Mar 2016 18:41:32 +0000 (18:41 +0000)]
Merge "Add a placeholder forward buffered ANS coder." into nextgenv2

8 years agoAdds 1D transforms for ADST/FlipADST to make 16
Debargha Mukherjee [Tue, 15 Mar 2016 05:30:09 +0000 (22:30 -0700)]
Adds 1D transforms for ADST/FlipADST to make 16

Makes a set of 16 transforms total, adding all 1D
combinations of ADST and FlipADST, and removing all DST
transforms.

Lowres and midres both improve by about 0.1%, and hdres changes by
-0.378% in BDRATE, while using fewer transforms that are also
simpler.

Further experiments to continue later.

Change-Id: I7348a4c0e12078fdea5ae3a2d36a89a319ffcc6e
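
One way to arrive at a total of 16 is to pair four 1-D kernels in the vertical and horizontal directions. The kernel set below (DCT, ADST, FlipADST, identity) is an assumption made for illustration; the commit message does not list the 16 types, and all names are hypothetical.

```c
/* Assumed 1-D kernel set; the log does not enumerate the 16 types. */
typedef enum {
  TOY_DCT,
  TOY_ADST,
  TOY_FLIPADST,
  TOY_IDENTITY,
  TOY_KERNELS
} toy_kernel;

typedef struct {
  int two_d;    /* a real transform in both directions */
  int one_d;    /* a real transform in exactly one direction */
  int identity; /* no transform in either direction */
} toy_tx_counts;

/* Classify every (vertical, horizontal) kernel pair. */
static toy_tx_counts toy_count_tx_types(void) {
  toy_tx_counts n = { 0, 0, 0 };
  for (int v = 0; v < TOY_KERNELS; ++v) {
    for (int h = 0; h < TOY_KERNELS; ++h) {
      const int skipped = (v == TOY_IDENTITY) + (h == TOY_IDENTITY);
      if (skipped == 0) {
        ++n.two_d;
      } else if (skipped == 1) {
        ++n.one_d;
      } else {
        ++n.identity;
      }
    }
  }
  return n;
}
```

Under that assumption the 4 x 4 pairs split into 9 two-dimensional types, 6 one-dimensional types, and 1 identity type.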

8 years agoProperly set rate_nocoef when pallete mode is used
Yaowu Xu [Mon, 21 Mar 2016 17:28:29 +0000 (10:28 -0700)]
Properly set rate_nocoef when pallete mode is used

Change-Id: Iff04c82b3d3b5cf2c7700717c3c3d678bbbb9f9b

8 years agomv vp10_fwd_txfm2d_#x# into vp10_rtcd.h
Angie Chiang [Fri, 18 Mar 2016 21:09:06 +0000 (14:09 -0700)]
mv vp10_fwd_txfm2d_#x# into vp10_rtcd.h

Change-Id: Iad7352698786791b0fd7c005a7edfd1724b71599

8 years agoPassing TXFM_TYPE instead of func pointer
Angie Chiang [Thu, 17 Mar 2016 00:15:27 +0000 (17:15 -0700)]
Passing TXFM_TYPE instead of func pointer

This is to facilitate an SSE2 implementation.

Change-Id: Id2f53e83c5508c4445d9b1bba00a649cb4da6b74
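
The general idea of passing a type tag rather than a function pointer can be sketched as below. The enum values and the two toy kernels are hypothetical and are not real transforms; the actual TXFM_TYPE definition is not shown in this log.

```c
#include <stddef.h>

/* Hypothetical type tag standing in for TXFM_TYPE. */
typedef enum { TOY_TXFM_COPY, TOY_TXFM_NEGATE } TOY_TXFM_TYPE;

/* Toy kernels; placeholders for real 1-D transforms. */
static void toy_copy(const int *in, int *out, size_t n) {
  for (size_t i = 0; i < n; ++i) out[i] = in[i];
}

static void toy_negate(const int *in, int *out, size_t n) {
  for (size_t i = 0; i < n; ++i) out[i] = -in[i];
}

/* The caller passes only the tag. Each backend (plain C here, or a SIMD
 * variant elsewhere) maps the tag to its own kernel internally. */
static void toy_txfm_1d(TOY_TXFM_TYPE type, const int *in, int *out,
                        size_t n) {
  switch (type) {
    case TOY_TXFM_COPY: toy_copy(in, out, n); break;
    case TOY_TXFM_NEGATE: toy_negate(in, out, n); break;
  }
}
```

With a tag in the interface, an optimized backend is free to pick a different kernel for the same tag, which a C function pointer baked into the call would not allow.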

8 years agoRefactor sub8x8 reference motion vector search function
Jingning Han [Mon, 14 Mar 2016 22:04:58 +0000 (15:04 -0700)]
Refactor sub8x8 reference motion vector search function

Rework the interface to allow the codec to store the reference motion
vector list information for the coding process.

Change-Id: I47e26587f6c0808655e4626f316ec7614a7ad8ed

8 years agoRework the DRL syntax entropy coding system
Jingning Han [Thu, 17 Mar 2016 18:26:52 +0000 (11:26 -0700)]
Rework the DRL syntax entropy coding system

This commit re-designs the probability model for the syntax elements
of the dynamic motion vector referencing system.

Change-Id: Icfb8203c7e8f64e10e99f5890e25e6f6b15fe5d1

8 years agoMerge "Enable dynamic motion vector referencing for newmv mode" into nextgenv2
Jingning Han [Sat, 19 Mar 2016 00:40:04 +0000 (00:40 +0000)]
Merge "Enable dynamic motion vector referencing for newmv mode" into nextgenv2

8 years agoMerge "Add ext_partition_types config option" into nextgenv2
Debargha Mukherjee [Fri, 18 Mar 2016 23:51:55 +0000 (23:51 +0000)]
Merge "Add ext_partition_types config option" into nextgenv2

8 years agoMerge "Refactor bsse and skip_txfm in MACROBLOCK." into nextgenv2
Debargha Mukherjee [Fri, 18 Mar 2016 23:51:40 +0000 (23:51 +0000)]
Merge "Refactor bsse and skip_txfm in MACROBLOCK." into nextgenv2

8 years agoMerge "Refactor save_context restore_context in rd_pick_partition." into nextgenv2
Debargha Mukherjee [Fri, 18 Mar 2016 23:51:06 +0000 (23:51 +0000)]
Merge "Refactor save_context restore_context in rd_pick_partition." into nextgenv2

8 years agoMerge "Refactor mbmi->inter_tx_size to 2D array." into nextgenv2
Debargha Mukherjee [Fri, 18 Mar 2016 23:50:25 +0000 (23:50 +0000)]
Merge "Refactor mbmi->inter_tx_size to 2D array." into nextgenv2

8 years agoMerge "Remove prune three from speed features" into nextgenv2
Sarah Parker [Fri, 18 Mar 2016 21:29:24 +0000 (21:29 +0000)]
Merge "Remove prune three from speed features" into nextgenv2

8 years agoRemove prune three from speed features
Sarah Parker [Fri, 18 Mar 2016 20:06:40 +0000 (13:06 -0700)]
Remove prune three from speed features

Not getting good results for this feature; will try again
when transforms are frozen.

Change-Id: Id12396786cb9369ad34d0bd845f7beba3a037726

8 years agoAdd a placeholder forward buffered ANS coder.
Alex Converse [Wed, 17 Feb 2016 19:07:20 +0000 (11:07 -0800)]
Add a placeholder forward buffered ANS coder.

This buffered ANS coder supports coding the symbols in forward (decode)
order. Rather than windowing or growing the buffer, right now this
coder merely asserts that the buffer will never overflow.

This approach should allow ANS to be used as a drop in replacement for
other entropy coders rather than requiring complicated reversal logic
throughout the codebase.

Change-Id: I6689271233d0e22fea94c51950415dad5af96598
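
A minimal sketch of the buffering idea, assuming a fixed-capacity buffer and a caller-supplied sink. Every name and the capacity are hypothetical; this is not the libvpx coder and it performs no entropy coding itself, it only shows symbols being accepted in forward order and replayed in reverse for a last-in, first-out coder such as ANS.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical capacity and names; not the libvpx API. */
#define TOY_BUF_CAPACITY 1024

typedef void (*toy_symbol_sink)(void *ctx, int symbol);

struct toy_buffered_writer {
  int symbols[TOY_BUF_CAPACITY];
  size_t count;
};

static void toy_writer_init(struct toy_buffered_writer *w) { w->count = 0; }

/* Accept a symbol in forward (decode) order. As in the commit message,
 * overflow is only asserted against, not handled. */
static void toy_writer_put(struct toy_buffered_writer *w, int symbol) {
  assert(w->count < TOY_BUF_CAPACITY);
  w->symbols[w->count++] = symbol;
}

/* Replay the buffer back to front, so that a coder whose decoder pops
 * symbols in the reverse of encode order yields them in forward order. */
static void toy_writer_flush(struct toy_buffered_writer *w,
                             toy_symbol_sink sink, void *ctx) {
  while (w->count > 0) sink(ctx, w->symbols[--w->count]);
}

/* Example sink that records what it receives. */
struct toy_symbol_log {
  int seen[TOY_BUF_CAPACITY];
  size_t count;
};

static void toy_log_symbol(void *ctx, int symbol) {
  struct toy_symbol_log *l = (struct toy_symbol_log *)ctx;
  assert(l->count < TOY_BUF_CAPACITY);
  l->seen[l->count++] = symbol;
}
```

Callers write symbols exactly as they would with a forward coder; the reversal is confined to the flush step.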

8 years agoTwo minor logic fixes
Yaowu Xu [Fri, 18 Mar 2016 18:38:56 +0000 (11:38 -0700)]
Two minor logic fixes

Change-Id: I1d5624fb2f34f87a55613036851034ec7c2d0b76

8 years agoEnable dynamic motion vector referencing for newmv mode
Jingning Han [Tue, 15 Mar 2016 22:58:03 +0000 (15:58 -0700)]
Enable dynamic motion vector referencing for newmv mode

This commit enables the dynamic motion vector predictor for NEWMV
mode. It allows the codec to select the best motion vector predictor
in a rate-distortion optimization framework for motion vector
residual coding. The compression performance is improved:
lowres  0.14%
midres  0.27%
hdres   0.24%

Change-Id: I6a601c74eb6cb0b71a613336d40363359f2edecd
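
Selecting a motion vector predictor by cost can be sketched as below. The cost model (absolute residual plus the candidate index as a signalling cost) is a made-up stand-in for a real rate-distortion cost, and all names are hypothetical.

```c
#include <stdlib.h>

typedef struct {
  int row;
  int col;
} toy_mv;

/* Toy rate for coding the residual mv - pred: sum of absolute components. */
static int toy_residual_cost(toy_mv mv, toy_mv pred) {
  return abs(mv.row - pred.row) + abs(mv.col - pred.col);
}

/* Return the index of the cheapest predictor, or -1 if there are none.
 * Assumption: signalling candidate i costs i extra units. */
static int toy_pick_predictor(toy_mv mv, const toy_mv *cands, int n) {
  int best = -1;
  int best_cost = 0;
  for (int i = 0; i < n; ++i) {
    const int cost = toy_residual_cost(mv, cands[i]) + i;
    if (best < 0 || cost < best_cost) {
      best = i;
      best_cost = cost;
    }
  }
  return best;
}
```

A later candidate can win despite its higher index cost when it leaves a much smaller residual to code.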

8 years agoRefactor bsse and skip_txfm in MACROBLOCK.
Geza Lore [Wed, 3 Feb 2016 11:42:29 +0000 (11:42 +0000)]
Refactor bsse and skip_txfm in MACROBLOCK.

Simple refactoring to two-dimensional arrays, in preparation for
128-wide superblocks.

Change-Id: I40d447bd9fbd4f755534ea3cc82fc8f4676cea07

8 years agoRefactor mbmi->inter_tx_size to 2D array.
Geza Lore [Wed, 16 Mar 2016 18:03:57 +0000 (18:03 +0000)]
Refactor mbmi->inter_tx_size to 2D array.

This is in preparation for increasing the superblock size.

Change-Id: I9197e397399fbe8aec1178a45ea0337dd90412d7

8 years agoAdd ext_partition_types config option
Julia Robson [Fri, 18 Mar 2016 15:22:42 +0000 (15:22 +0000)]
Add ext_partition_types config option

Change-Id: I91d280a64d1da77be31004ea91d08d1a70529e42

8 years agoMerge "Minor bug fix in ex-intra experiment" into nextgenv2
Hui Su [Fri, 18 Mar 2016 05:01:56 +0000 (05:01 +0000)]
Merge "Minor bug fix in ex-intra experiment" into nextgenv2

8 years agoMinor bug fix in ex-intra experiment
hui su [Thu, 17 Mar 2016 23:26:22 +0000 (16:26 -0700)]
Minor bug fix in ex-intra experiment

No performance change observed from borg tests.

Change-Id: I20c232c5dde8cfc84452a4c7185389bd8b812ffd

8 years agoMerge "Fix hbd segfault for prune_one and prune_two" into nextgenv2
Sarah Parker [Thu, 17 Mar 2016 22:40:47 +0000 (22:40 +0000)]
Merge "Fix hbd segfault for prune_one and prune_two" into nextgenv2

8 years agoRefactor: call inv_txfm_add
Angie Chiang [Tue, 15 Mar 2016 00:38:15 +0000 (17:38 -0700)]
Refactor: call inv_txfm_add

Change-Id: I52c209a5db1b4a6525c04b70291a08ab5a68c6fe

8 years agoRefactor:Merge inverse_transform_block_inter/intra
Angie Chiang [Tue, 15 Mar 2016 00:00:53 +0000 (17:00 -0700)]
Refactor:Merge inverse_transform_block_inter/intra

Merge inverse_transform_block_inter and inverse_transform_block_intra
into inverse_transform_block.

Change-Id: I0f561830e639e3bf8d831c28a7c784cb0c4c8b09

8 years agoMerge "add dct 64x64 transform" into nextgenv2
Angie Chiang [Thu, 17 Mar 2016 18:15:11 +0000 (18:15 +0000)]
Merge "add dct 64x64 transform" into nextgenv2

8 years agoFix hbd segfault for prune_one and prune_two
Sarah Parker [Wed, 16 Mar 2016 23:06:19 +0000 (16:06 -0700)]
Fix hbd segfault for prune_one and prune_two

Change-Id: I71da102550aa7d81961e5f10c71058d5fa8dc6c4

8 years agoRefactor save_context restore_context in rd_pick_partition.
Geza Lore [Wed, 27 Jan 2016 15:29:17 +0000 (15:29 +0000)]
Refactor save_context restore_context in rd_pick_partition.

This is a cosmetic patch that removes a great deal of conditional
compilation around CONFIG_VAR_TX from the partition search function.

Change-Id: I9dcef9d4fe6847b793c77bdf565a5cacbdfacd59

8 years agoMerge "Properly save and restore skip related variables" into nextgenv2
Yaowu Xu [Thu, 17 Mar 2016 14:41:03 +0000 (14:41 +0000)]
Merge "Properly save and restore skip related variables" into nextgenv2

8 years agoadd dct 64x64 transform
Angie Chiang [Mon, 14 Mar 2016 19:02:27 +0000 (12:02 -0700)]
add dct 64x64 transform

Change-Id: I131c4d1216cd156e520b8a91c4438c2d3c6602cb

8 years agoMerge "Optimize HBD up-sampled prediction functions" into nextgenv2
Yunqing Wang [Wed, 16 Mar 2016 23:52:08 +0000 (23:52 +0000)]
Merge "Optimize HBD up-sampled prediction functions" into nextgenv2

8 years agoProperly save and restore skip related variables
Yaowu Xu [Wed, 16 Mar 2016 22:05:04 +0000 (15:05 -0700)]
Properly save and restore skip related variables

Change-Id: Id52744e140585c08d047fb395b75ac9318a0b4c3

8 years agoMerge "remove filter_cache" into nextgenv2
Angie Chiang [Wed, 16 Mar 2016 17:21:53 +0000 (17:21 +0000)]
Merge "remove filter_cache" into nextgenv2

8 years agoFactor out zeroing above and left context.
Geza Lore [Tue, 15 Mar 2016 10:29:00 +0000 (10:29 +0000)]
Factor out zeroing above and left context.

Change-Id: I6e5d8cff869c7415a924f845c9e6ccaabe2b7a9b

8 years agoFix build without supertx.
Geza Lore [Wed, 16 Mar 2016 13:08:13 +0000 (13:08 +0000)]
Fix build without supertx.

Change-Id: Ib60821487710f6cf06aaa8ddcbdd5487ba1cbe4f

8 years agoMerge "Properly set the estimate of rate_nocoef" into nextgenv2
Yaowu Xu [Tue, 15 Mar 2016 23:23:11 +0000 (23:23 +0000)]
Merge "Properly set the estimate of rate_nocoef" into nextgenv2

8 years agoProperly set the estimate of rate_nocoef
Yaowu Xu [Tue, 15 Mar 2016 19:36:21 +0000 (12:36 -0700)]
Properly set the estimate of rate_nocoef

This commit fixes the computation of rate_nocoef for the situation where
rate_y is left uninitialized at INT_MAX when x->skip is true.

Change-Id: If3dde4e4ee16667f4408067d3bb3084f916272f1

8 years agoremove filter_cache
Angie Chiang [Sat, 12 Mar 2016 00:01:46 +0000 (16:01 -0800)]
remove filter_cache

PSNR test
        lowres  hdres
lowbd   -0.013  0.067
highbd  -0.044  0.039

Change-Id: Iefdb1e966bd004b2027456778185b675e8fb9b81

8 years agoMerge "Add "entropy" experiment flag" into nextgenv2
Hui Su [Tue, 15 Mar 2016 20:46:09 +0000 (20:46 +0000)]
Merge "Add "entropy" experiment flag" into nextgenv2

8 years agoMerge "Refactor 1D transforms" into nextgenv2
Debargha Mukherjee [Tue, 15 Mar 2016 19:08:06 +0000 (19:08 +0000)]
Merge "Refactor 1D transforms" into nextgenv2

8 years agoAdd "entropy" experiment flag
hui su [Tue, 15 Mar 2016 18:44:01 +0000 (11:44 -0700)]
Add "entropy" experiment flag

For experiments to improve compression efficiency of entropy coding.

Change-Id: Idf30dc554bd7eea3a79d21d18515c3e6b8f64b26

8 years agoMerge "Fix copy/zero macros." into nextgenv2
Debargha Mukherjee [Tue, 15 Mar 2016 17:45:31 +0000 (17:45 +0000)]
Merge "Fix copy/zero macros." into nextgenv2

8 years agoMerge "Turn off 32x32 transform type selection" into nextgenv2
Jingning Han [Tue, 15 Mar 2016 16:59:37 +0000 (16:59 +0000)]
Merge "Turn off 32x32 transform type selection" into nextgenv2

8 years agoFix copy/zero macros.
Geza Lore [Fri, 11 Mar 2016 18:33:00 +0000 (18:33 +0000)]
Fix copy/zero macros.

Change-Id: I2df3b6ecd35406ee05c2aa4e49be779e73e1bdc6

8 years agoRefactor 1D transforms
Debargha Mukherjee [Tue, 15 Mar 2016 05:30:09 +0000 (22:30 -0700)]
Refactor 1D transforms

In preparation for adding more 1D variants with ADST/FlipADST/etc.

BDRATE actually improves by 0.21% on lowres.

Change-Id: I2fa4720c69fe001fa666119a284dfc6b17fffab2

8 years agoOptimize HBD up-sampled prediction functions
Yunqing Wang [Tue, 15 Mar 2016 01:59:11 +0000 (18:59 -0700)]
Optimize HBD up-sampled prediction functions

Optimized 2 up-sampled reference prediction functions in the high
bit-depth case. This reduced the HBD encoding time by 3%.

Change-Id: I8663ffb5234f5e70168c0fc9ca676309fe8e98f2

8 years agoMerge "Fix typos in unit tests" into nextgenv2
Hui Su [Tue, 15 Mar 2016 01:15:06 +0000 (01:15 +0000)]
Merge "Fix typos in unit tests" into nextgenv2

8 years agoMerge "Speed up rd selection in OBMC experiment" into nextgenv2
Yue Chen [Tue, 15 Mar 2016 00:14:06 +0000 (00:14 +0000)]
Merge "Speed up rd selection in OBMC experiment" into nextgenv2

8 years agoFix typos in unit tests
hui su [Mon, 14 Mar 2016 23:32:41 +0000 (16:32 -0700)]
Fix typos in unit tests

Change-Id: Idff52b337ab2d494c0c26e0d2c71ab3ee8208691

8 years agoSpeed up rd selection in OBMC experiment
Yue Chen [Mon, 14 Mar 2016 19:05:27 +0000 (12:05 -0700)]
Speed up rd selection in OBMC experiment

Instead of testing all interpfilter-BMC/OBMC combinations, we choose
the best interpolation filter based on regular inter prediction.

Reduction in encoding time: ~10%
Drop in performance gain: 0.08% lowres, 0.04% midres

Change-Id: Ifc19097a918ac76b529db9af4c60e2c70e93f7ad
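
The trade-off in this commit can be illustrated with a toy comparison of the two search strategies. Costs come from lookup tables here rather than real encodes, each lookup is counted as one evaluation, and all names are hypothetical.

```c
typedef struct {
  int filter;      /* index of the chosen interpolation filter */
  int use_obmc;    /* 1 if OBMC was chosen over regular prediction */
  int evaluations; /* number of cost evaluations performed */
} toy_choice;

/* Exhaustive: test every filter with both regular prediction and OBMC. */
static toy_choice toy_search_exhaustive(const int bmc_cost[],
                                        const int obmc_cost[],
                                        int num_filters) {
  toy_choice c = { 0, 0, 0 };
  int best = bmc_cost[0];
  for (int f = 0; f < num_filters; ++f) {
    if (bmc_cost[f] < best) {
      best = bmc_cost[f];
      c.filter = f;
      c.use_obmc = 0;
    }
    if (obmc_cost[f] < best) {
      best = obmc_cost[f];
      c.filter = f;
      c.use_obmc = 1;
    }
    c.evaluations += 2;
  }
  return c;
}

/* Pruned: pick the filter on regular prediction only, then try OBMC once
 * with that filter. Requires num_filters >= 1. */
static toy_choice toy_search_pruned(const int bmc_cost[],
                                    const int obmc_cost[], int num_filters) {
  toy_choice c = { 0, 0, 0 };
  for (int f = 1; f < num_filters; ++f) {
    if (bmc_cost[f] < bmc_cost[c.filter]) c.filter = f;
  }
  c.use_obmc = obmc_cost[c.filter] < bmc_cost[c.filter];
  c.evaluations = num_filters + 1;
  return c;
}
```

With N filters the pruned search needs N + 1 evaluations instead of 2N, at the price of sometimes missing a filter that is only best in combination with OBMC.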

8 years agoTurn off 32x32 transform type selection
Jingning Han [Fri, 11 Mar 2016 20:05:18 +0000 (12:05 -0800)]
Turn off 32x32 transform type selection

Temporarily disable transform type selection for 32x32 transform
block size. This speeds up the encoding process. For bus at CIF
150 frames, the encoding time goes from 896s -> 762s (11% faster).
The compression performance for lowres set is improved by 0.15%,
and -0.029% for hdres.

Change-Id: If239b272970eb302150bec13b8cf192fbe045332

8 years agoMerge "Add high-precision sub-pixel search as a speed feature" into nextgenv2
Yunqing Wang [Sat, 12 Mar 2016 02:26:36 +0000 (02:26 +0000)]
Merge "Add high-precision sub-pixel search as a speed feature" into nextgenv2

8 years agoMerge "Fix sub8x8 interpolation full pixel bug" into nextgenv2
Angie Chiang [Sat, 12 Mar 2016 01:45:27 +0000 (01:45 +0000)]
Merge "Fix sub8x8 interpolation full pixel bug" into nextgenv2

8 years agoAdd high-precision sub-pixel search as a speed feature
Yunqing Wang [Thu, 10 Mar 2016 19:07:50 +0000 (11:07 -0800)]
Add high-precision sub-pixel search as a speed feature

Using the up-sampled reference frames in sub-pixel motion search is
enabled as a speed feature for good-quality mode speed 0 and speed 1.

Change-Id: Ieb454bf8c646ddb99e87bd64c8e74dbd78d84a50

8 years agoMerge "Fix an overflow in highbitdepth loop restoration" into nextgenv2
Debargha Mukherjee [Fri, 11 Mar 2016 21:48:36 +0000 (21:48 +0000)]
Merge "Fix an overflow in highbitdepth loop restoration" into nextgenv2

8 years agoMerge "convolve8 sse2 test" into nextgenv2
Angie Chiang [Fri, 11 Mar 2016 19:57:30 +0000 (19:57 +0000)]
Merge "convolve8 sse2 test" into nextgenv2

8 years agoMerge "Complete (mostly) migration of palette mode" into nextgenv2
Hui Su [Fri, 11 Mar 2016 19:52:41 +0000 (19:52 +0000)]
Merge "Complete (mostly) migration of palette mode" into nextgenv2

8 years agoMerge "Fix a bug in ext-intra experiment" into nextgenv2
Hui Su [Fri, 11 Mar 2016 19:52:34 +0000 (19:52 +0000)]
Merge "Fix a bug in ext-intra experiment" into nextgenv2

8 years agoFix an overflow in highbitdepth loop restoration
Debargha Mukherjee [Fri, 11 Mar 2016 06:56:24 +0000 (22:56 -0800)]
Fix an overflow in highbitdepth loop restoration

Change-Id: Ie20cd35a4c96443c0de234d2cf097187a70ec8dd

8 years agoMerge "Fix compiler warnings" into nextgenv2
Hui Su [Fri, 11 Mar 2016 19:47:38 +0000 (19:47 +0000)]
Merge "Fix compiler warnings" into nextgenv2

8 years agoFix a bug in ext-intra experiment
hui su [Fri, 11 Mar 2016 18:23:51 +0000 (10:23 -0800)]
Fix a bug in ext-intra experiment

Change-Id: I6fab352eb1f7d9c5dc783a4d4d878b6b42838ca2

8 years agoFix compiler warnings
hui su [Fri, 11 Mar 2016 18:13:08 +0000 (10:13 -0800)]
Fix compiler warnings

Change-Id: I00314ec296e8368f1239a556b3a55feac9cec7ae

8 years agoMerge "Enable hybrid 1-D/2-D transform coding for highbd setting" into nextgenv2
Jingning Han [Fri, 11 Mar 2016 18:09:11 +0000 (18:09 +0000)]
Merge "Enable hybrid 1-D/2-D transform coding for highbd setting" into nextgenv2

8 years agoComplete (mostly) migration of palette mode
hui su [Tue, 23 Feb 2016 23:22:25 +0000 (15:22 -0800)]
Complete (mostly) migration of palette mode

Coding gain on screen_content is 12.2% (was 6.6%).

Some features, such as frame-level color buffer and adaptive
entropy coding, are coming in future patches.

Change-Id: I2658cf5ec0cbb02cff685475759f3b68c9807697

8 years agoFilling in speed feature functions for ext tx search
Sarah Parker [Mon, 7 Mar 2016 19:00:03 +0000 (11:00 -0800)]
Filling in speed feature functions for ext tx search

Filled in prune one and prune two. Prune three is still
being experimented with.

Change-Id: Ic07f828c448e86cacb0369aa3a9a0feb2edae054

8 years agoMerge "Adds compound wedge prediction modes" into nextgenv2
Debargha Mukherjee [Thu, 10 Mar 2016 17:44:45 +0000 (17:44 +0000)]
Merge "Adds compound wedge prediction modes" into nextgenv2

8 years agoEnable hybrid 1-D/2-D transform coding for highbd setting
Jingning Han [Thu, 10 Mar 2016 00:40:08 +0000 (16:40 -0800)]
Enable hybrid 1-D/2-D transform coding for highbd setting

This commit enables the hybrid 1-D/2-D transform coding scheme for
the high bit-depth setting. It improves the compression performance of
the ext-tx experiment by 0.98% for the lowres_all set.

Change-Id: Ic27f5037f2c36b095a93b9f15dbae34bdcdf00aa

8 years agoAdds compound wedge prediction modes
Debargha Mukherjee [Tue, 1 Mar 2016 00:08:07 +0000 (16:08 -0800)]
Adds compound wedge prediction modes

Incorporates wedge compound prediction modes.

Change-Id: Ie73b54b629105b9dcc5f3763be87f35b09ad2ec7

8 years agoMerge "Fix an assertion condition in transform type search" into nextgenv2
Jingning Han [Thu, 10 Mar 2016 00:20:30 +0000 (00:20 +0000)]
Merge "Fix an assertion condition in transform type search" into nextgenv2

8 years agoMerge "Implemented DST 16x16 SSE2 intrinsics optimization" into nextgenv2
Yi Luo [Wed, 9 Mar 2016 22:27:43 +0000 (22:27 +0000)]
Merge "Implemented DST 16x16 SSE2 intrinsics optimization" into nextgenv2

8 years agoMerge "Add horizontal and vertical scan order for 1-D transform" into nextgenv2
Jingning Han [Wed, 9 Mar 2016 20:47:06 +0000 (20:47 +0000)]
Merge "Add horizontal and vertical scan order for 1-D transform" into nextgenv2

8 years agoFix sub8x8 interpolation full pixel bug
Angie Chiang [Wed, 9 Mar 2016 19:15:19 +0000 (11:15 -0800)]
Fix sub8x8 interpolation full pixel bug

Change-Id: I5df744dc6b21ed9dbbf6ddf38004f2a9e88b7d00

8 years agoFix an assertion condition in transform type search
Jingning Han [Wed, 9 Mar 2016 17:30:17 +0000 (09:30 -0800)]
Fix an assertion condition in transform type search

Change-Id: I442475e559be2acdc1c2a3e5ca021b3de77adda5

8 years agoAdd horizontal and vertical scan order for 1-D transform
Jingning Han [Wed, 9 Mar 2016 16:58:07 +0000 (08:58 -0800)]
Add horizontal and vertical scan order for 1-D transform

This commit enables the 1-D transform to use Manhattan grid vertical
and horizontal scan order for transform coefficient entropy coding.

Enabled in inter prediction mode, the hybrid 1D/2D transform coding
scheme outperforms the 2D-DCT based coding system used in VP9 by
lowres_all  1.7%
hdres_all   1.4%

As one coding option, in addition to the existing 17 other transform
types in ext-tx experiment, the 1D/2D hybrid transform improves
the coding gains:
lowres_all  2.2% -> 3.0%

Change-Id: I9cefa9d9e38224546d0afd67feecd9f8d4a16ab0
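
As a rough illustration of horizontal and vertical scan orders, the sketch below builds plain row-major and column-major orders for a square block. That these match the scan tables added by the commit is an assumption, and the function names are hypothetical.

```c
/* scan[k] is the raster index (row * size + col) of the k-th coefficient
 * visited. Both functions fill size * size entries. */

/* Horizontal order: walk each row left to right, top row first. */
static void toy_horizontal_scan(int *scan, int size) {
  int k = 0;
  for (int row = 0; row < size; ++row) {
    for (int col = 0; col < size; ++col) scan[k++] = row * size + col;
  }
}

/* Vertical order: walk each column top to bottom, left column first. */
static void toy_vertical_scan(int *scan, int size) {
  int k = 0;
  for (int col = 0; col < size; ++col) {
    for (int row = 0; row < size; ++row) scan[k++] = row * size + col;
  }
}
```

For a 4x4 block the vertical order begins 0, 4, 8, 12, 1, 5, and so on, while the horizontal order is simply 0 through 15.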

8 years agoRefactor entropy coding of transform size
hui su [Mon, 7 Mar 2016 23:25:50 +0000 (15:25 -0800)]
Refactor entropy coding of transform size

No performance change.

Change-Id: If35125fed909d89235b303514f77a33183bb36b3

8 years agoImplemented DST 16x16 SSE2 intrinsics optimization
Yi Luo [Tue, 8 Mar 2016 22:10:24 +0000 (14:10 -0800)]
Implemented DST 16x16 SSE2 intrinsics optimization

- Implemented fdst16_sse2(), fdst16_8col() against C version: fdst16().
- Turned on 7 DST related hybrid txfm types in vp10_fht16x16_sse2().
- Replaced vp10_fht16x16_c() with vp10_fht16x16_sse2() in
  fwd_txfm_16x16().
- Added vp10_fht16x16_sse2() unit test against C version:
  vp10_fht16x16_c() (--gtest_filter=*VP10Trans16x16*).
- Unit test passed.
- Speed improvement: 2.4%, 3.2%, 3.2%, for city_cif.y4m, garden_sif.y4m,
  and mobile_cif.y4m.

Change-Id: Ib30a67ce5d5964bef143d588d0f8fa438be8901f

8 years agoMerge "Adds an ext-tile config flag to host new tiling" into nextgenv2
Debargha Mukherjee [Tue, 8 Mar 2016 17:14:44 +0000 (17:14 +0000)]
Merge "Adds an ext-tile config flag to host new tiling" into nextgenv2

8 years agoMerge "Fix several MSVC compiler warning/errors" into nextgenv2
Yaowu Xu [Tue, 8 Mar 2016 16:44:16 +0000 (16:44 +0000)]
Merge "Fix several MSVC compiler warning/errors" into nextgenv2

8 years agoAdds an ext-tile config flag to host new tiling
Debargha Mukherjee [Tue, 8 Mar 2016 15:17:29 +0000 (07:17 -0800)]
Adds an ext-tile config flag to host new tiling

Change-Id: I6c3bf5545c42030b484a8aaf434b63bd409a0487

8 years agoFix several MSVC compiler warning/errors
Yaowu Xu [Tue, 8 Mar 2016 00:23:26 +0000 (16:23 -0800)]
Fix several MSVC compiler warning/errors

Change-Id: Iccaacee9b7a66b016b5747a3902c236888ad4ba1

8 years agoMerge "Added vp10_fht8x8_sse2() unit test" into nextgenv2
Yi Luo [Tue, 8 Mar 2016 00:30:06 +0000 (00:30 +0000)]
Merge "Added vp10_fht8x8_sse2() unit test" into nextgenv2

8 years agoMerge "Calculate the distortion in pixel domain for sub8x8 rd selection" into nextgenv2
Yue Chen [Tue, 8 Mar 2016 00:13:45 +0000 (00:13 +0000)]
Merge "Calculate the distortion in pixel domain for sub8x8 rd selection" into nextgenv2

8 years agoCalculate the distortion in pixel domain for sub8x8 rd selection
Yue Chen [Fri, 4 Mar 2016 17:48:57 +0000 (09:48 -0800)]
Calculate the distortion in pixel domain for sub8x8 rd selection

Pixel domain distortion calculation is enabled for the rd loop of
inter sub8x8 and intra 4x4 cases.

Coding gain: 0.124% derflr, 0.122% derfhd

Change-Id: I43b47fe81b4f5ccc1c66bc626bd310c413a1ed87
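
Pixel-domain distortion is commonly measured as the sum of squared differences between source and reconstructed pixels. The sketch below assumes that metric and 8-bit samples; the function name is hypothetical and the log does not state the exact measure used.

```c
#include <stdint.h>

/* Sum of squared errors between a source block and its reconstruction.
 * Strides are in samples and may exceed the block width. */
static int64_t toy_pixel_sse(const uint8_t *src, int src_stride,
                             const uint8_t *recon, int recon_stride,
                             int width, int height) {
  int64_t sse = 0;
  for (int row = 0; row < height; ++row) {
    for (int col = 0; col < width; ++col) {
      const int diff =
          src[row * src_stride + col] - recon[row * recon_stride + col];
      sse += (int64_t)diff * diff;
    }
  }
  return sse;
}
```

Unlike a transform-domain estimate, this requires actually reconstructing the block, which is why it is usually reserved for small blocks where the estimate is least reliable.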

8 years agoAdded vp10_fht8x8_sse2() unit test
Yi Luo [Mon, 7 Mar 2016 22:25:07 +0000 (14:25 -0800)]
Added vp10_fht8x8_sse2() unit test

- Derived class VP10Trans8x8HT from base class TransformTestBase.
- Employed RunCoeffCheck() to test vp10_fht8x8_sse2() against C reference
  function vp10_fht8x8_c().
- fdst8_sse2() related seven hybrid transform cases are covered in this
  test.
- Test passed (4 test cases w/o EXT_TX; 16 test cases with EXT_TX).

Change-Id: Id9a9b308c707164a120d9ceb2c30e572026fb1d0
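
The pattern of checking an optimized routine against a C reference can be sketched as below. The two toy routines are placeholders rather than real transforms, the pseudo-random generator is a simple fixed-seed LCG chosen for repeatability, and none of the names come from the libvpx test suite.

```c
#include <stdint.h>

#define TOY_N 8

typedef void (*toy_txfm_fn)(const int16_t *in, int32_t *out);

/* Reference: out[i] = in[i] + in[N-1-i], computed the obvious way. */
static void toy_ref(const int16_t *in, int32_t *out) {
  for (int i = 0; i < TOY_N; ++i) out[i] = in[i] + in[TOY_N - 1 - i];
}

/* "Optimized": exploits the symmetry and does half the additions. */
static void toy_opt(const int16_t *in, int32_t *out) {
  for (int i = 0; i < TOY_N / 2; ++i) {
    const int32_t s = in[i] + in[TOY_N - 1 - i];
    out[i] = s;
    out[TOY_N - 1 - i] = s;
  }
}

/* Feed both routines the same pseudo-random inputs and count mismatches. */
static int toy_coeff_check(toy_txfm_fn ref, toy_txfm_fn opt, int trials) {
  uint32_t state = 1;
  int mismatches = 0;
  for (int t = 0; t < trials; ++t) {
    int16_t in[TOY_N];
    int32_t a[TOY_N];
    int32_t b[TOY_N];
    for (int i = 0; i < TOY_N; ++i) {
      state = state * 1103515245u + 12345u;
      in[i] = (int16_t)((int)((state >> 16) & 0x1ffu) - 256);
    }
    ref(in, a);
    opt(in, b);
    for (int i = 0; i < TOY_N; ++i) {
      if (a[i] != b[i]) ++mismatches;
    }
  }
  return mismatches;
}
```

A check of this kind passes only when the optimized routine is bit-exact with the reference on every tested input.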

8 years agoMerge "ANS: Switch from PDFs to CDFs." into nextgenv2
Alex Converse [Mon, 7 Mar 2016 20:51:44 +0000 (20:51 +0000)]
Merge "ANS: Switch from PDFs to CDFs." into nextgenv2

8 years agoMerge "Extend convolution functions to 128x128 for ext-partition." into nextgenv2
Debargha Mukherjee [Mon, 7 Mar 2016 19:54:44 +0000 (19:54 +0000)]
Merge "Extend convolution functions to 128x128 for ext-partition." into nextgenv2

8 years agoMerge "Make sharp filter 10 tap and makes sharp2 sharper" into nextgenv2
Debargha Mukherjee [Mon, 7 Mar 2016 19:51:42 +0000 (19:51 +0000)]
Merge "Make sharp filter 10 tap and makes sharp2 sharper" into nextgenv2

8 years agoMerge "Added vp10_fht4x4_sse2() unit test" into nextgenv2
Yi Luo [Mon, 7 Mar 2016 19:18:33 +0000 (19:18 +0000)]
Merge "Added vp10_fht4x4_sse2() unit test" into nextgenv2

8 years agoMerge "Hybrid 1-D/2-D transform coding" into nextgenv2
Jingning Han [Mon, 7 Mar 2016 19:15:44 +0000 (19:15 +0000)]
Merge "Hybrid 1-D/2-D transform coding" into nextgenv2

8 years agoHybrid 1-D/2-D transform coding
Jingning Han [Sat, 5 Mar 2016 05:23:55 +0000 (21:23 -0800)]
Hybrid 1-D/2-D transform coding

This commit enables a hybrid 1-D/2-D transform coding scheme and
the accompanying entropy coding system. It currently uses hybrid
1-D/2-D DCT transform coding. It provides coding performance gains:

lowres_all  0.55%
hdres_all   0.43%

Change-Id: I2b30dcafd21eb2bb3371f6e854cbab440a4dfa78

8 years agoMerge "Adding speed feature interface for ext tx search" into nextgenv2
Sarah Parker [Mon, 7 Mar 2016 16:32:55 +0000 (16:32 +0000)]
Merge "Adding speed feature interface for ext tx search" into nextgenv2