granicus.if.org Git - libvpx/log

Change to extend full border only when needed

This is a short term optimization till we work out a decoder
implementation requiring no frame border extension.

Change-Id: I02d15bfde4d926b50a4e58b393d8c4062d1be70f

commit | commitdiff | tree

Dmitry Kovalev [Mon, 15 Jul 2013 19:26:58 +0000 (12:26 -0700)]

Removing and moving around constant definitions.

Removing unused and duplicated constants, moving them from *.h to *.c
if possible.

Change-Id: Ief4d6b984a3ca2e9b38504f0d855ed072cf7133f

commit | commitdiff | tree

Dmitry Kovalev [Tue, 16 Jul 2013 02:21:32 +0000 (19:21 -0700)]

Merge "Consistent naming for loop-filter filters."

commit | commitdiff | tree

Johann [Tue, 16 Jul 2013 01:43:41 +0000 (18:43 -0700)]

Merge "Remove print_nmvcounts"

commit | commitdiff | tree

Ronald S. Bultje [Fri, 12 Jul 2013 19:59:19 +0000 (12:59 -0700)]

Increase border size from 96 to 160.

This is required because upon downscaling, if a motion vector points
partially into the UMV (e.g. all minus 1 of 64+7 pixels, i.e. 70),
then we can point up to 140 pixels into the larger-resolution (2x)
reference buffer UMV, which means the UMV for reference buffers in
downscaling needs to be 140 rounded up to the nearest multiple of 32,
i.e. 160.

Longer-term, we should probably handle the UMV differently by detecting
edge coverage on-the-fly and using a temporary buffer for edge extensions
instead of adding 160 pixels on all sides of the image (which means a
CIF image uses 3x its own area size for borders).

Change-Id: I5184443e6731cd6721fc6a5d430a53e7d91b4f7e

commit | commitdiff | tree

Ronald S. Bultje [Thu, 11 Jul 2013 20:01:44 +0000 (13:01 -0700)]

Inline vp9_quantize() in xform_quant().

Cycle times:
4x4: 151 to 131 cycles (15% faster)
8x8: 334 to 306 cycles (9% faster)
16x16: 1401 to 1368 cycles (2.5% faster)
32x32: 7403 to 7367 cycles (0.5% faster)

Total encode time of first 50 frames of bus @ 1500kbps (speed 0)
goes from 1min39.2 to 1min38.6, i.e. a 0.67% overall speedup.

Change-Id: I799a49460e5e3fcab01725564dd49c629bfe935f

commit | commitdiff | tree

Ronald S. Bultje [Tue, 16 Jul 2013 00:29:39 +0000 (17:29 -0700)]

Merge "Inline xform_quant() in encode_block_intra()."

commit | commitdiff | tree

Frank Galligan [Tue, 16 Jul 2013 00:11:55 +0000 (17:11 -0700)]

Merge "Neon: Update mbfilter if all vectors follow one branch."

commit | commitdiff | tree

Dmitry Kovalev [Mon, 15 Jul 2013 23:01:31 +0000 (16:01 -0700)]

Consistent naming for loop-filter filters.

Renaming flatmask4 to flat_mask4, flatmask5 to flat_mask5, hevmask to
hev_mask, filter to filter4, mbfilter to filter8, wide_mbfilter to
filter16.

Change-Id: Ic61c73e59c2eee505257584867aafac99833cea1

commit | commitdiff | tree

Ronald S. Bultje [Thu, 11 Jul 2013 18:35:13 +0000 (11:35 -0700)]

Inline xform_quant() in encode_block_intra().

Also inline some of the block calculations to assist the compiler to
not do silly things like calculating the same offset (or converting
between raster/transform block offset or block, mi and pixel unit)
many, many, many times.

Cycle times:
4x4:     584 ->   505 cycles (16% faster)
8x8:    1651 ->  1560 cycles (6% faster)
16x16:  7897 ->  7704 cycles (2.5% faster)
32x32: 16096 -> 15852 cycles (1.5% faster)

Overall, this saves about 0.5 seconds (1min49.8 -> 1min49.3) on the
first 50 frames of bus (speed 0) @ 1500kbps, i.e. 0.5% overall.

Change-Id: If3dd62453f8e2ab9d4ee616bc4ea956fb8874b80

commit | commitdiff | tree

Dmitry Kovalev [Mon, 15 Jul 2013 21:47:25 +0000 (14:47 -0700)]

Code cleanup inside vp9_decodeframe.c.

Removing unused DEC_DEBUG define and dec_debug variable. Changing function
signatures to eliminate code duplication, renaming function
mb_init_dequantizer to init_dequantizer. Also removing redundant curly
braces, and comments.

Change-Id: Ia56ee1b0be5f24abb0e878581845be8a4773c298

commit | commitdiff | tree

Frank Galligan [Fri, 12 Jul 2013 00:13:03 +0000 (17:13 -0700)]

Neon: Update mbfilter if all vectors follow one branch.

Change the mbfilter Neon code from executing both branches if all
vectors follow only one branch.

The code is about 5% faster when executing only one branch and about
1% slower when executing both branches.

-PS5: Remove local stack space from mbfilter.

Change-Id: I6a23f9b318a9f4568a2718b4c9348db988fe2182

commit | commitdiff | tree

Jingning Han [Mon, 15 Jul 2013 18:28:46 +0000 (11:28 -0700)]

Skip inter-coded block reconstruction in rd loop

Skip the inverse transform and reconstruction of inter-mode coded
blocks in the rate-distortion optimization loop, when skip_encode_sb
feature is turned on. This provides about 1% speed-up at speed 0,
and 1.5% speed-up at speed 1. No performance change in both settings.

Change-Id: I2932718bf4d007163702b61b16b6ff100cf9d007

commit | commitdiff | tree

Jingning Han [Mon, 8 Jul 2013 23:48:47 +0000 (16:48 -0700)]

Skip duplicate block encoding in the rd loop

This speed feature allows the encoder to largely remove the spatial
dependency between blocks inside a 64x64 superblock, thereby removing
the need to repeatedly encode superblocks per partition type in the
rate-distortion optimization loop.

A major challenge lies in the intra modes tested in the rate-distortion
optimization loop. The subsequent blocks do not have access to the
reconstructed boundary pixels without the intermediate coding steps.
This was resolved by using the original pixels for intra prediction
in the rd loop, followed by an appropriately designed distortion
modeling on the quantization parameters. Experiments also suggested
that the performance impact is more discernible at lower bit-rate/psnr
settings. Hence a quantizer dependent threshold is applied to deactivate
skip of block coding.

For bus_cif at 2000 kbps,
speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB
         performance loss.

speed 1: runtime 65312ms  -> 61536ms, (7% speed-up) at 0.04dB
         performance loss.

This operation is currently turned on in settings of speed 1.

Change-Id: Ib689741dfff8dd38365d8c1b92860a3e176f56ec

commit | commitdiff | tree

Dmitry Kovalev [Mon, 15 Jul 2013 17:51:42 +0000 (10:51 -0700)]

Merge "Fixing vp9_get_pred_context_comp_ref_p function."

commit | commitdiff | tree

Jingning Han [Sat, 13 Jul 2013 03:54:14 +0000 (20:54 -0700)]

SSE2 8x8 inverse ADST/DCT transform

This commit enables SSE2 implementation of 8x8 inverse ADST/DCT
transform. The runtime goes from 1216 cycles -> 266 cycles.
For bus_cif at 2000 kbps, the overall runtime reduces from
253707ms -> 248430ms, i.e., 2% speed-up at speed 0.

Change-Id: Ib0372e17e9162d7b11a10d653b1c8be547c878fb

commit | commitdiff | tree

Dmitry Kovalev [Wed, 3 Jul 2013 00:19:16 +0000 (17:19 -0700)]

Using vp9_copy and vp9_zero instead of custom code.

Change-Id: Id9b6ceeddca3f9b34bfada5c499b1e7a2f42c30b

commit | commitdiff | tree

Dmitry Kovalev [Sat, 13 Jul 2013 00:46:02 +0000 (17:46 -0700)]

Fixing vp9_get_pred_context_comp_ref_p function.

Adding missed parenthesis around boolean expressions. Bitstream is changed.
Regenerating test vectors.

Change-Id: I4cc00b761e9473f92f180a9fc3a0c607f0aaae56

commit | commitdiff | tree

Dmitry Kovalev [Sat, 13 Jul 2013 00:08:23 +0000 (17:08 -0700)]

Merge "Removing redundant call to set_mi_row_col."

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 23:25:23 +0000 (16:25 -0700)]

Removing redundant call to set_mi_row_col.

This function is actually called from set_offsets which is called right
before vp9_read_mode_info.

Change-Id: Ibb9d5ad606194bc80eab264fad85b31c9dfd8f77

commit | commitdiff | tree

Johann [Fri, 12 Jul 2013 23:12:58 +0000 (16:12 -0700)]

vp9_convolve8_[horiz|vert]_avg

Super basic conversion from the other implementations. Any changes to
one should be trivial to copy over keep in sync.

Change-Id: I1720b4128e0aba4b2779e3761f6494f8a09d3ea8

commit | commitdiff | tree

Yaowu Xu [Fri, 12 Jul 2013 23:17:22 +0000 (16:17 -0700)]

Merge "Fix a build issue"

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 23:02:09 +0000 (16:02 -0700)]

Merge "Adding struct tx_probs and struct tx_counts to cleanup the code."

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 22:50:02 +0000 (15:50 -0700)]

Merge "Making functions read_{inter, intra}_segment_id more similar."

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 22:41:41 +0000 (15:41 -0700)]

Merge "vp9_postproc: remove useless self-assign"

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 00:36:06 +0000 (17:36 -0700)]

Adding struct tx_probs and struct tx_counts to cleanup the code.

Also removing unused declarations from vp9_entropymode.h file.

Change-Id: Ib9c5826db3584a32f6bb3297a76c522b99d83402

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 22:04:07 +0000 (15:04 -0700)]

Merge "Code cleanup in vp9_pred_common.c"

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 21:50:33 +0000 (14:50 -0700)]

Making functions read_{inter, intra}_segment_id more similar.

Change-Id: I51f9ac910834f2d7aba2be4f7ffbce597e61a144

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 21:17:15 +0000 (14:17 -0700)]

vp9_postproc: remove useless self-assign

Change-Id: I0bc5d2d8c9fec8be18263b0dc2528886bb5b7b61

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 21:11:48 +0000 (14:11 -0700)]

Code cleanup in vp9_pred_common.c

No bitstream changes. Using MB_MODE_INFO temp variables instead of
MODE_INFO variables. Removing redundant curly braces.

Change-Id: Ib9d1bedfbd8af97ecc722ccf697ea8177bbe287c

commit | commitdiff | tree

Yaowu Xu [Fri, 12 Jul 2013 18:38:44 +0000 (11:38 -0700)]

Fix a build issue

Change-Id: I23a75c495ed7ea917d7f312bef0990e20a6b53d9

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 18:37:43 +0000 (11:37 -0700)]

vp9: consistent 'log2' variable naming

lg2 -> log2

Change-Id: I0602ddff49e42c9c40c29c084d04b7592b9f8edf

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 18:10:18 +0000 (11:10 -0700)]

Merge changes I33e76c42,I24aeac1e,If4192b40

* changes:
  vp9_dx_iface: s/vp8/vp9/ where possible
  vp[89]_dx_iface: delete unused function
  vp[89]_dx_iface: factorize vp8_mmap_*()

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 06:16:22 +0000 (23:16 -0700)]

vp9_dx_iface: s/vp8/vp9/ where possible

drop 'vp9_' from most static functions unrelated to the codec interface
itself.

Change-Id: I33e76c425bb7373570a57a61662a56d65ab4bdf3

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 17:59:35 +0000 (10:59 -0700)]

Merge "msvs-build: use msbuild for vs >= 2005"

commit | commitdiff | tree

Deb Mukherjee [Wed, 10 Jul 2013 23:51:07 +0000 (16:51 -0700)]

Some minor cleanups for efficiency

Implements some of the helper functions more efficiently with
lookups rathers than branches. Modeling function is consolidated
to reduce some computations.

Also merged the two enums BLOCK_SIZE_TYPES and BlockSize into
one because there is no need to keep them separate (even though
the semantics are a little different).

No bitstream or output change.

About 0.5% speedup

Change-Id: I7d71a66e8031ddb340744dc493f22976052b8f9f

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 17:22:30 +0000 (10:22 -0700)]

Merge "Removing redundant code mostly from vp9_pred_common.{h, c}."

commit | commitdiff | tree

Paul Wilkins [Fri, 12 Jul 2013 09:14:01 +0000 (02:14 -0700)]

Merge "Speed 2 feature adjustment."

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 06:03:24 +0000 (23:03 -0700)]

vp[89]_dx_iface: delete unused function

static mmap_lkup

Change-Id: I24aeac1eca8453e28d58bc06925e58efc228a0a6

commit | commitdiff | tree

James Zern [Fri, 12 Jul 2013 06:01:26 +0000 (23:01 -0700)]

vp[89]_dx_iface: factorize vp8_mmap_*()

s/vp8/vpx/ -> vpx_codec_internal.h / vpx_codec.c

Change-Id: If4192b40206276a761b01d44e334fe15bcb81128

commit | commitdiff | tree

Jingning Han [Fri, 12 Jul 2013 04:52:39 +0000 (21:52 -0700)]

Merge "Cosmetic changes in 16x16 ADST/DCT unit test"

commit | commitdiff | tree

Jingning Han [Fri, 12 Jul 2013 04:52:27 +0000 (21:52 -0700)]

Merge "Remove unnecessary tx_type branch in encode_block"

commit | commitdiff | tree

Dmitry Kovalev [Fri, 12 Jul 2013 01:39:10 +0000 (18:39 -0700)]

Removing redundant code mostly from vp9_pred_common.{h, c}.

Removing redundant function arguments and curly braces.

Change-Id: I46e02561f33fe02e84a3b19756f03b9504bd6a1b

commit | commitdiff | tree

Johann [Fri, 12 Jul 2013 00:22:03 +0000 (17:22 -0700)]

Remove print_nmvcounts

For some reason iOS builds take a really long time to sort this
function out.

It's not used anywhere so remove it.

Change-Id: Ia5c8513a0d9c7eb32641cca58ca1c1113e2dd9f4

commit | commitdiff | tree

Ronald S. Bultje [Thu, 11 Jul 2013 19:11:45 +0000 (12:11 -0700)]

Remove unused function block_error().

Change-Id: I78a79fc51c2d7cc3c261f35b569155397f3dc0c4

commit | commitdiff | tree

James Zern [Thu, 11 Jul 2013 22:51:39 +0000 (15:51 -0700)]

Merge "vp9: fix peek_si for version==0"

commit | commitdiff | tree

James Zern [Thu, 11 Jul 2013 22:47:11 +0000 (15:47 -0700)]

Merge "small update to peek_si/get_si documentation"

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 22:20:14 +0000 (15:20 -0700)]

Merge "Calling is_inter_mode() instead of custom code."

commit | commitdiff | tree

Jingning Han [Thu, 11 Jul 2013 21:17:23 +0000 (14:17 -0700)]

Merge "SSE2 4x4 invserse ADST/DCT transform"

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 21:14:47 +0000 (14:14 -0700)]

Calling is_inter_mode() instead of custom code.

Change-Id: Iccd4ab95ea51a6d57ed43947f2fd7ad92e8979cf

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 20:58:34 +0000 (13:58 -0700)]

Merge "Making vp9_default_nmv_context static."

commit | commitdiff | tree

James Zern [Thu, 11 Jul 2013 19:23:28 +0000 (12:23 -0700)]

small update to peek_si/get_si documentation

correct a doxygen and function reference

Change-Id: I525371d64969aa60c464d0f6a133bc29895d7991

commit | commitdiff | tree

James Zern [Thu, 11 Jul 2013 01:45:57 +0000 (18:45 -0700)]

vp9: fix peek_si for version==0

Change-Id: I6bfec4fa50dfc1a953edb1a2aa8e97e6e896bed6

commit | commitdiff | tree

Dmitry Kovalev [Wed, 10 Jul 2013 19:29:43 +0000 (12:29 -0700)]

Moving segmentation related vars into separate struct.

Adding segmentation struct to vp9_seg_common.h. Struct members are from
macroblockd and VP9Common structs. Moving segmentation related constants
and enums to vp9_seg_common.h.

Change-Id: I23fabc33f11a359249f5f80d161daf569d02ec03

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 18:57:17 +0000 (11:57 -0700)]

Merge "Adding write_compressed_header function."

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 18:46:06 +0000 (11:46 -0700)]

Merge "Removing unused TOKENEXTRA arg from pick_sb_modes function."

commit | commitdiff | tree

Jingning Han [Thu, 11 Jul 2013 16:37:25 +0000 (09:37 -0700)]

Cosmetic changes in 16x16 ADST/DCT unit test

Change-Id: Ic649e9e47d14d6f8cae0c443a425ea533a97ad8d

commit | commitdiff | tree

Johann [Thu, 23 May 2013 19:50:41 +0000 (12:50 -0700)]

convolve8 optimizations for neon

Independent horizontal and vertical implementations.

Requires that blocks be built from 4x4 and [xy]_step_q4 == 16

6-10% improvement. CIF improved the least.

Change-Id: I137f5ceae4440adc0960bf88e4453e55a618bcda

commit | commitdiff | tree

hkuang [Tue, 9 Jul 2013 19:06:21 +0000 (12:06 -0700)]

Add neon optimize vp9_dc_only_idct_add.

Change-Id: Iae84ab945cc9662a0ddd839aa2b9ca59f2ae5423

commit | commitdiff | tree

Jingning Han [Thu, 11 Jul 2013 16:09:41 +0000 (09:09 -0700)]

Remove unnecessary tx_type branch in encode_block

The function encode_block is called only by inter-prediction modes,
hence removing the transform type branching there.

Change-Id: I34a3172e28ce2388835efd0f8781922211bff857

commit | commitdiff | tree

Jim Bankoski [Thu, 11 Jul 2013 13:44:02 +0000 (06:44 -0700)]

Merge "Wide loopfilter 16 pix at a time"

commit | commitdiff | tree

Paul Wilkins [Wed, 3 Jul 2013 16:54:06 +0000 (17:54 +0100)]

Speed 2 feature adjustment.

With sf->auto_mv_step_size on it is questionable
whether sf->reduce_first_step_size is worthwhile.
At speed 2 it was not having a big impact.

Even at speed 2 sf->optimize_coefficients = 0 is not
having a big speed imapct so for now I have moved it
down into a higher speed setting.

Change-Id: I8a54de76d486ad37aabce76474889da2768b14c1

commit | commitdiff | tree

Jingning Han [Wed, 10 Jul 2013 19:11:09 +0000 (12:11 -0700)]

SSE2 4x4 invserse ADST/DCT transform

Enable SSE2 4x4 inverse ADST/DCT transform. The runtime goes from
292 cycles down to 89 cycles. Running bus_cif at 2000 kbps, the
overall runtime of speed 0 goes from 301s to 295s (2% speed-up).

Change-Id: I24098136e7fee7ab2fbf1c11755bdf2ca37f3628

commit | commitdiff | tree

Jingning Han [Thu, 11 Jul 2013 03:13:25 +0000 (20:13 -0700)]

Merge "Fix tx_type bug in intra4x4 rd loop"

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 18:17:19 +0000 (11:17 -0700)]

Replace copy_memNxM functions with a generic copy/avg function.

Change-Id: I3ce849452ed4f08527de9565a9914d5ee36170aa

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 17:34:58 +0000 (10:34 -0700)]

Remove unused fwalsh/fdct x86 SIMD implementations.

Change-Id: Ia942e56cf322821d42ba06178672791eeee2847e

commit | commitdiff | tree

Dmitry Kovalev [Thu, 11 Jul 2013 00:44:45 +0000 (17:44 -0700)]

Making vp9_default_nmv_context static.

Change-Id: Ia3d5bd45adf288de11ab59c4728266c93c17e275

commit | commitdiff | tree

Ronald S. Bultje [Thu, 11 Jul 2013 00:08:46 +0000 (17:08 -0700)]

Merge "Remove unused iwalsh4x4 MMX/SSE2 functions."

commit | commitdiff | tree

Ronald S. Bultje [Thu, 11 Jul 2013 00:08:43 +0000 (17:08 -0700)]

Merge "Remove unused 16x3/3x16 sad SSE2 functions."

commit | commitdiff | tree

John Koleszar [Wed, 12 Jun 2013 21:37:01 +0000 (14:37 -0700)]

Wide loopfilter 16 pix at a time

Where possible, do the 16 pixel wide filter while doing the horizontal
filtering pass. The same approach can be taken for the mbloop_filter
when that's implemented. Doing so on the vertical pass is a little more
involved, but possible.

Change-Id: I010cb505e623464247ae8f67fa25a0cdac091320

commit | commitdiff | tree

James Zern [Sat, 6 Apr 2013 02:30:15 +0000 (19:30 -0700)]

msvs-build: use msbuild for vs >= 2005

allows concurrent builds via the /m command line option

Change-Id: I668792ba00276e8626dc175c0a44ddab35fc7114

commit | commitdiff | tree

Dmitry Kovalev [Wed, 10 Jul 2013 22:57:28 +0000 (15:57 -0700)]

Removing unused TOKENEXTRA arg from pick_sb_modes function.

Change-Id: I0543e72fa092eef3976b65e16bb597197c364873

commit | commitdiff | tree

Jingning Han [Wed, 10 Jul 2013 22:45:34 +0000 (15:45 -0700)]

Fix tx_type bug in intra4x4 rd loop

This commit fixed the mis-use of the tx_type for inverse transform
in intra4x4 rate-distortion optimization loop. It improves the
overall coding performance.

Change-Id: I7fe9953175b74890357dbcee33c138573766e980

commit | commitdiff | tree

Deb Mukherjee [Wed, 10 Jul 2013 22:37:11 +0000 (15:37 -0700)]

Merge "Prunes out full-rd computation based on modeled rd"

commit | commitdiff | tree

Dmitry Kovalev [Wed, 10 Jul 2013 22:11:08 +0000 (15:11 -0700)]

Merge "Adding read_compressed_header function."

commit | commitdiff | tree

Dmitry Kovalev [Wed, 10 Jul 2013 22:08:34 +0000 (15:08 -0700)]

Adding write_compressed_header function.

Change-Id: Ic5257fa8278e9b6297de230e4fd26a1e23ad2bb7

commit | commitdiff | tree

Jim Bankoski [Wed, 10 Jul 2013 22:07:53 +0000 (15:07 -0700)]

configure with internal stats not working

Change-Id: I5dea4570cb05df27a522abf6e7b695998654284a

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 17:27:42 +0000 (10:27 -0700)]

Remove unused iwalsh4x4 MMX/SSE2 functions.

Change-Id: I2d22577911a37ed7d8c7e08cac20764842267652

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 17:23:41 +0000 (10:23 -0700)]

Remove unused 16x3/3x16 sad SSE2 functions.

Change-Id: I30a597c0cc366e34c9a3e2afe32d70e044f95ca4

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 21:52:23 +0000 (14:52 -0700)]

Merge "SSSE3 assembly for 4x4/8x8/16x16/32x32 H intra prediction."

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 21:52:19 +0000 (14:52 -0700)]

Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 TM intra prediction."

commit | commitdiff | tree

Jim Bankoski [Wed, 10 Jul 2013 21:39:39 +0000 (14:39 -0700)]

Merge "remove warnings when NDEBUG is set"

commit | commitdiff | tree

Jim Bankoski [Wed, 10 Jul 2013 21:27:20 +0000 (14:27 -0700)]

remove warnings when NDEBUG is set

Change-Id: Ie0cb732fdcb98616a422c4463bff80642248d136

commit | commitdiff | tree

Deb Mukherjee [Mon, 8 Jul 2013 23:01:01 +0000 (16:01 -0700)]

Prunes out full-rd computation based on modeled rd

Adds a speed feature to eliminate full-rd computation if the modeled
rd or rd based on a different parameter in the same mode is already
a lot larger than the best rd yet.

Specifically, only search the sharp and smooth filters if the modeled
rd cost based on the  regular filter is within a certain factor of the
best rd cost so far. Also, skip full-rd computation of non splitmv
inter modes if the modeled rd cost based on pred error is within the
same factor of the best rd cost so far.

Also adds some enhancements in the rd search for splitmv mode to
speed things up by early breakouts. Negligible impact on performance.

Resuts on derfraw300:
psnr:    -0.013% with the splitmv enhancements, -0.24% with the rd
         breakout feature on.
speedup: 6% with splitmv enhancements, 20% with also residual breakout
         (tested on football sequence at 600 Kbps)

Change-Id: I37abc308ea9f110c1679ce649b6a7e73ab1ad5fc

commit | commitdiff | tree

James Zern [Wed, 10 Jul 2013 20:02:22 +0000 (13:02 -0700)]

Merge "msvc: set a more useful debug format"

commit | commitdiff | tree

James Zern [Wed, 10 Jul 2013 20:01:52 +0000 (13:01 -0700)]

Merge "test_libvpx: disable pthreads in gtest for win targets"

commit | commitdiff | tree

Jingning Han [Wed, 3 Jul 2013 16:05:01 +0000 (09:05 -0700)]

SSE2 16x16 ADST/DCT hybrid transform

This commit enables 16x16 ADST/DCT forward hybrid transform using SSE2
operations. It reduces the runtime from 5433 cycles to 1621 cycles, at
no compression performance loss.

Change-Id: I75fd7f1984e9e28846af459f810ff0d6ae125230

commit | commitdiff | tree

Dmitry Kovalev [Wed, 10 Jul 2013 18:43:50 +0000 (11:43 -0700)]

Merge "Adding encode_tiles function to vp9_bitstream.c."

commit | commitdiff | tree

Yaowu Xu [Wed, 10 Jul 2013 18:35:47 +0000 (11:35 -0700)]

Merge "Add a feature to reduce chrome intra mode search"

commit | commitdiff | tree

Jingning Han [Wed, 10 Jul 2013 18:16:39 +0000 (11:16 -0700)]

Merge "Add unit test for 16x16 forward ADST/DCT"

commit | commitdiff | tree

Scott LaVarnway [Wed, 10 Jul 2013 18:09:30 +0000 (11:09 -0700)]

Merge "Bug fix: set frame_parallel_decoding_mode"

commit | commitdiff | tree

John Koleszar [Wed, 10 Jul 2013 18:04:40 +0000 (11:04 -0700)]

Merge "Fix intermediate height in convolve"

commit | commitdiff | tree

Dmitry Kovalev [Mon, 8 Jul 2013 18:54:36 +0000 (11:54 -0700)]

Adding read_compressed_header function.

Splitting setup_txfm_mode into read_tx_mode and read_tx_probs.

Change-Id: I5b4fe48698d56490857d32eafcaeb4291f208479

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 17:24:16 +0000 (10:24 -0700)]

Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 V intra prediction."

commit | commitdiff | tree

Ronald S. Bultje [Wed, 10 Jul 2013 17:13:16 +0000 (10:13 -0700)]

Merge "SSE/SSE2 assembly for 4x4/8x8/16x16/32x32 DC intra prediction."

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom