]> granicus.if.org Git - libvpx/log
libvpx
13 years agoMerge "bug fix possible keyframe context divide by zero"
John Koleszar [Mon, 25 Apr 2011 16:35:12 +0000 (09:35 -0700)]
Merge "bug fix possible keyframe context divide by zero"

13 years agobug fix possible keyframe context divide by zero
James Berry [Fri, 22 Apr 2011 15:54:18 +0000 (11:54 -0400)]
bug fix possible keyframe context divide by zero

vp8_adjust_key_frame_context() divides by
estimate_keyframe_frequency() which can
return 0 in the case where --kf-max-dist=0.

Change-Id: Idfc59653478a0073187cd2aa420e98a321103daa

13 years agoMerge "keep values in registers during quantization"
Johann [Mon, 25 Apr 2011 13:52:38 +0000 (06:52 -0700)]
Merge "keep values in registers during quantization"

13 years agoMerge "Removed unnecessary frame type checks"
Scott LaVarnway [Mon, 25 Apr 2011 13:45:43 +0000 (06:45 -0700)]
Merge "Removed unnecessary frame type checks"

13 years agoMerge "Removed dc_diff from MB_MODE_INFO"
Scott LaVarnway [Mon, 25 Apr 2011 13:45:32 +0000 (06:45 -0700)]
Merge "Removed dc_diff from MB_MODE_INFO"

13 years agoMerge "make two compiler options explicit for Visual Studio projects"
Yaowu Xu [Fri, 22 Apr 2011 21:08:08 +0000 (14:08 -0700)]
Merge "make two compiler options explicit for Visual Studio projects"

13 years agoFix overflow in temporal_filter_apply_sse2().
Ronald S. Bultje [Thu, 21 Apr 2011 20:35:02 +0000 (16:35 -0400)]
Fix overflow in temporal_filter_apply_sse2().

The accumulator array is an integer array, so use paddd instead of paddw
to add values to it. Fixes overflows when using large --arnr-maxframes
(>8) values.

Change-Id: Iad83794caa02400a65f3ab5760f2517e082d66ae

13 years agoMerge "Remove unused kf rate variables"
John Koleszar [Thu, 21 Apr 2011 23:54:14 +0000 (16:54 -0700)]
Merge "Remove unused kf rate variables"

13 years agoCorrected format specifiers in debug print statements
Adrian Grange [Thu, 21 Apr 2011 22:45:57 +0000 (15:45 -0700)]
Corrected format specifiers in debug print statements

The arguments to these fprintfs are int not long int so
the format specifier should be "%d" and not "%ld". This
was writing garbage in the linux build.

Change-Id: I3d2aa8a448d52e6dc08858d825bf394929b47cf3

13 years agomake two compiler options explicit for Visual Studio projects
Yaowu Xu [Thu, 21 Apr 2011 20:27:42 +0000 (13:27 -0700)]
make two compiler options explicit for Visual Studio projects

This patch changes the release configuration of MS VS projects to
explicitly use two compiler options "Maximize Speed (/O2)" and
"Favor fast code(/Ot)".

Change-Id: I0bf8343d9ca195851332b91ec69c69ee4e31ce2a

13 years agokeep values in registers during quantization
Johann [Wed, 13 Apr 2011 20:38:02 +0000 (16:38 -0400)]
keep values in registers during quantization

add an sse4 quantizer so we can use pinsrw/pextrw and keep values in xmm
registers instead of proxying through the stack. and as long as we're
bumping up, use some ssse3 instructions in the EOB detection (see ssse3
fast quantizer)
pick up about a percent on 32bit and about two on 64bit.

Change-Id: If15abba0e8b037a1d231c0edf33501545c9d9363

13 years agoRemoved unnecessary frame type checks
Scott LaVarnway [Thu, 21 Apr 2011 18:59:42 +0000 (14:59 -0400)]
Removed unnecessary frame type checks

ref_frame is set to INTRA_FRAME for keyframes.  The B_PRED
mode is only used in intra frames.

Change-Id: I9bac8bec7c736300d47994f3cb570329edf11ec0

13 years agoRemoved dc_diff from MB_MODE_INFO
Scott LaVarnway [Thu, 21 Apr 2011 18:38:36 +0000 (14:38 -0400)]
Removed dc_diff from MB_MODE_INFO

The dc_diff flag is used to skip loopfiltering.  Instead
of setting this flag in the decoder/encoder, we now check
for this condition in the loopfilter.

Change-Id: Ie2b9cdf9e0f4e8b932bbd36e0878c05bffd28931

13 years agoRemoved force_no_skip
Scott LaVarnway [Wed, 20 Apr 2011 19:45:12 +0000 (15:45 -0400)]
Removed force_no_skip

force_no_skip is always set to zero.

Change-Id: I89b61c5e0bee34627a9c07c05f3517e1db76af77

13 years agoRemoved redundant checks of the mode_info_context flags
Scott LaVarnway [Tue, 19 Apr 2011 15:42:15 +0000 (11:42 -0400)]
Removed redundant checks of the mode_info_context flags

Code cleanup.  The build inter predictor functions are
redundantly checking the mode_info_context for either
INTRA_FRAME or SPLITMV.

Change-Id: I4d58c3a5192a4c2cec5c24ab1caf608bf13aebfb

13 years agoRemove unused kf rate variables
John Koleszar [Tue, 19 Apr 2011 20:08:45 +0000 (16:08 -0400)]
Remove unused kf rate variables

Remove tot_key_frame_bits and prior_key_frame_size[] as they were
tracked but never used. Remove intra_frame_target, as it was only
used to initialize prior_key_frame_size.

Refactor vp8_adjust_key_frame_context() some to remove unnecessary
calculations.

Change-Id: Icbc2c83d2b90e184be03e6f9679e678f3a4bce8f

13 years agomodify SAVE_XMM for potential 64bit use
Johann [Fri, 15 Apr 2011 14:05:20 +0000 (10:05 -0400)]
modify SAVE_XMM for potential 64bit use

the win64 abi requires saving and restoring xmm6:xmm15. currently
SAVE_XMM and RESTORE XMM only allow for saving xmm6:xmm7. allow
specifying the highest register used and if the stack is unaligned.

Change-Id: Ica5699622ffe3346d3a486f48eef0206c51cf867

13 years agoMerge "Add save/restore xmm registers in x86 assembly code"
Johann [Tue, 19 Apr 2011 13:32:10 +0000 (06:32 -0700)]
Merge "Add save/restore xmm registers in x86 assembly code"

13 years agoAdd save/restore xmm registers in x86 assembly code
Johann [Thu, 7 Apr 2011 17:17:22 +0000 (13:17 -0400)]
Add save/restore xmm registers in x86 assembly code

Went through the code and fixed it. Verified on Windows.

Where possible, remove dependencies on xmm[67]

Current code relies on pushing rbp to the stack to get 16 byte
alignment. This broke when rbp wasn't pushed
(vp8/encoder/x86/sad_sse3.asm). Work around this by using unaligned
memory accesses. Revisit this and the offsets in
vp8/encoder/x86/sad_sse3.asm in another change to SAVE_XMM.

Change-Id: I5f940994d3ebfd977c3d68446cef20fd78b07877

13 years agoMerge "Use sub-pixel search's SSE in mode selection"
Yunqing Wang [Mon, 18 Apr 2011 20:20:04 +0000 (13:20 -0700)]
Merge "Use sub-pixel search's SSE in mode selection"

13 years agoUse sub-pixel search's SSE in mode selection
Yunqing Wang [Mon, 18 Apr 2011 19:48:34 +0000 (15:48 -0400)]
Use sub-pixel search's SSE in mode selection

Passed SSE from sub-pixel search back to pick_inter_mode
function, which is compared with the encode_breakout to
see if we could skip evaluating the remaining modes.

Change-Id: I4a86442834f0d1b880a19e21ea52d17d505f941d

13 years agoMerge "Handle long delay between video frames in multi-thread decoder(issue 312)"
Yunqing Wang [Mon, 18 Apr 2011 17:11:41 +0000 (10:11 -0700)]
Merge "Handle long delay between video frames in multi-thread decoder(issue 312)"

13 years agoMerge "store quant_shift as an unsigned char"
Johann [Mon, 18 Apr 2011 17:03:40 +0000 (10:03 -0700)]
Merge "store quant_shift as an unsigned char"

13 years agoMerge "Add spin-wait pause intrinsic for Windows x64 platform."
Yaowu Xu [Mon, 18 Apr 2011 16:53:26 +0000 (09:53 -0700)]
Merge "Add spin-wait pause intrinsic for Windows x64 platform."

13 years agoMerge "fixed an overflow in ssim calculation"
Yaowu Xu [Mon, 18 Apr 2011 14:44:33 +0000 (07:44 -0700)]
Merge "fixed an overflow in ssim calculation"

13 years agoRemoved unused timers
Scott LaVarnway [Fri, 15 Apr 2011 19:09:07 +0000 (15:09 -0400)]
Removed unused timers

Change-Id: I209803b9dbed2b2f6d02258fd7a3963a6645f4ab

13 years agoMerge "added -fomit-frame-pointer flag for gcc builds"
John Koleszar [Mon, 18 Apr 2011 13:07:57 +0000 (06:07 -0700)]
Merge "added -fomit-frame-pointer flag for gcc builds"

13 years agoHandle long delay between video frames in multi-thread decoder(issue 312)
Yunqing Wang [Fri, 15 Apr 2011 16:57:15 +0000 (12:57 -0400)]
Handle long delay between video frames in multi-thread decoder(issue 312)

This is reported by m...@hesotech.de (see issue 312):
"The decoder causes an access violation
when you decode the first frame, then make a pause of about
60 seconds and then decode further frames. But only if
vpx_codec_dec_cfg_t.threads> 1.

This is caused by a timeout of WaitForSingleObject.
When I change the definition of VPXINFINITE to INFINITE(0xFFFFFFFF),
the problem is solved."

Reproduced the crash and verified the changes on Windows platform.
This brings the behavior inline with the other platforms using sem_wait().

Change-Id: I27b32f90bce05846ef2684b50f7a88f292299da1

13 years agoMerge "remove dead code, add missing RESTORE_XMM"
Johann [Fri, 15 Apr 2011 20:32:54 +0000 (13:32 -0700)]
Merge "remove dead code, add missing RESTORE_XMM"

13 years agoadded -fomit-frame-pointer flag for gcc builds
Scott LaVarnway [Fri, 15 Apr 2011 19:59:21 +0000 (15:59 -0400)]
added -fomit-frame-pointer flag for gcc builds

According to the docs, this should have been enabled, but
the disassembled output shows otherwise.  This improved
the encode/decode performance.

Change-Id: I45ad7e6d299b89ac3166d7ef7da75b74994344c6

13 years agoremove executable bit
Johann [Fri, 15 Apr 2011 17:43:24 +0000 (13:43 -0400)]
remove executable bit

source files are not executable

Change-Id: Id2c7294695a22217468426423979f68f02d82340

13 years agoMerge "Fix usage of value returned by vp8_pick_intra4x4mby_modes"
Adrian Grange [Fri, 15 Apr 2011 15:37:19 +0000 (08:37 -0700)]
Merge "Fix usage of value returned by vp8_pick_intra4x4mby_modes"

13 years agoMerge "Reduce unnecessary distortion computation"
Yunqing Wang [Fri, 15 Apr 2011 15:17:03 +0000 (08:17 -0700)]
Merge "Reduce unnecessary distortion computation"

13 years agoremove dead code, add missing RESTORE_XMM
Johann [Fri, 15 Apr 2011 14:11:53 +0000 (10:11 -0400)]
remove dead code, add missing RESTORE_XMM

vp8_filter_block1d16_h4_ssse3 was never called

because UNSHADOW_ARGS moves the stack by 'mov rsp, rbp', the issue was
masked. however, if/when win64 used those registers for persistant data,
issues could/will arise.

Change-Id: I56d6effca0aeba1f86082689771cb10145d39651

13 years agoFix off-by-one in copy_and_extend_plane
John Koleszar [Fri, 15 Apr 2011 12:44:39 +0000 (08:44 -0400)]
Fix off-by-one in copy_and_extend_plane

Should only copy h lines, not h+1.

Change-Id: I802a85686635900459c6dc79596189033e5298d8

13 years agoReduce unnecessary distortion computation
Yunqing Wang [Thu, 14 Apr 2011 19:53:33 +0000 (15:53 -0400)]
Reduce unnecessary distortion computation

In vp8_pick_inter_mode(), for NEWMV mode, use the error result got
from motion search as distortion. This helps performance in real-
time mode.

Change-Id: I398c4e46cc5381f7d874e748cf78827ef0e0860c

13 years agoMerge "Refactor lookahead ring buffer"
John Koleszar [Thu, 14 Apr 2011 19:35:01 +0000 (12:35 -0700)]
Merge "Refactor lookahead ring buffer"

13 years agoMerge "Use consistent delimiters."
Fritz Koenig [Thu, 14 Apr 2011 18:56:18 +0000 (11:56 -0700)]
Merge "Use consistent delimiters."

13 years agoFix usage of value returned by vp8_pick_intra4x4mby_modes
Adrian Grange [Thu, 14 Apr 2011 17:31:56 +0000 (10:31 -0700)]
Fix usage of value returned by vp8_pick_intra4x4mby_modes

The value of distortion2 returned by vp8_pick_intra4x4mby_modes
was being overwritten by the value returned by get16x16prederror
before it was tested.

Change-Id: If00e80332b272c5545c3a7e381c8041e8319b41a

13 years agoMerge "update configure for ios sdk 4.3"
Johann [Thu, 14 Apr 2011 15:55:22 +0000 (08:55 -0700)]
Merge "update configure for ios sdk 4.3"

13 years agoupdate configure for ios sdk 4.3
Joshua Bleecher Snyder [Thu, 14 Apr 2011 13:28:49 +0000 (09:28 -0400)]
update configure for ios sdk 4.3

update for the latest version of the ios sdk. adding
usr/lib/system fixes a missing libcache.dylib issue

make isysroot path more DRY

Change-Id: Ib748ef3dac3cac2e4848fbffa1e9a0112eac826b

13 years agoUse consistent delimiters.
Fritz Koenig [Wed, 13 Apr 2011 22:06:17 +0000 (15:06 -0700)]
Use consistent delimiters.

opsnr.stt file was using \t for delimiters on everything
except between VPXSSIM and Time.

Change-Id: I6284c4e40c05ff642bf4b0170dca062c279a42df

13 years agoFixed use of early breakout in vp8_pick_intra4x4mby_modes
Adrian Grange [Wed, 13 Apr 2011 19:56:46 +0000 (12:56 -0700)]
Fixed use of early breakout in vp8_pick_intra4x4mby_modes

Index i is used to detect early breakout from the first loop, but
its value is lost due to reuse in the second for loop. I moved
the position of the second loop and did some format cleanup.

Change-Id: I02780eae1bd89df4b6c000fb8a018b0837aac2e5

13 years agoRefactor lookahead ring buffer
John Koleszar [Wed, 13 Apr 2011 18:00:18 +0000 (14:00 -0400)]
Refactor lookahead ring buffer

This patch cleans up the source buffer storage and copy mechanism to
allow access through a standard push/pop/peek interface. This approach
also avoids an extra copy in the case where the source is not a
multiple of 16, fixing issue #102.

Change-Id: I05808c39f5743625cb4c7af54cc841b9b10fdbd9

13 years agostore quant_shift as an unsigned char
Johann [Wed, 13 Apr 2011 17:45:58 +0000 (13:45 -0400)]
store quant_shift as an unsigned char

in encodframe.c, quant_shift is set to 0 or 1 in vp8cx_invert_quant

only use 8 bits to store this, instead of 16. will allow saving an
xmm register in an updated version of the regular quantize

Change-Id: Ie88c47fe2aff5af0283dab1147fb2791e4b12f90

13 years agoMerge "Bugfix for error accumulator stats"
John Koleszar [Tue, 12 Apr 2011 13:59:00 +0000 (06:59 -0700)]
Merge "Bugfix for error accumulator stats"

13 years agoBugfix for error accumulator stats
John Koleszar [Mon, 11 Apr 2011 17:05:08 +0000 (13:05 -0400)]
Bugfix for error accumulator stats

Previous to commit de4e9e3, there was an early return in the alt-ref
case that was inadvertantly removed when the function was refactored
to return void. This patch restores the prior behavior.

Change-Id: I783ffd594a4690297e2742f99526fd7ad67698b2

13 years agoMerge "Fix encoder range check for frame width and height"
John Koleszar [Tue, 12 Apr 2011 12:34:12 +0000 (05:34 -0700)]
Merge "Fix encoder range check for frame width and height"

13 years agoFix encoder range check for frame width and height
Attila Nagy [Tue, 12 Apr 2011 12:01:22 +0000 (15:01 +0300)]
Fix encoder range check for frame width and height

14 bits available in the bistream => valid range [1..16383]
Removed unused local vars.

Change-Id: Icf3385e47a9fa13af70053129c2248671f285583

13 years agoSet cpu_used range to [-16, 16] in real-time mode
Yunqing Wang [Mon, 11 Apr 2011 19:55:04 +0000 (15:55 -0400)]
Set cpu_used range to [-16, 16] in real-time mode

Remove encoding speed limitation in real-time mode.

Change-Id: Ib5e35d8bb522b2a25f3e4ad5cfe2788ebebb3617

13 years agoDefine RDCOST only once
Yunqing Wang [Thu, 7 Apr 2011 20:57:25 +0000 (16:57 -0400)]
Define RDCOST only once

Clean up the code.

Change-Id: I7db048efa4d972b528d553a7921bc45979621129

13 years agoRemove unused files
John Koleszar [Mon, 11 Apr 2011 14:34:40 +0000 (10:34 -0400)]
Remove unused files

Change-Id: I36ca3f2f4620358033da34daf764f0b388dacd08

13 years agoFix input MV for full search
Yunqing Wang [Fri, 8 Apr 2011 20:15:26 +0000 (16:15 -0400)]
Fix input MV for full search

Input MV needs to be modified to full-pixel precision.

Change-Id: Ic5d78e41bf27077e325024332b9fe89f76c44f0c

13 years agoMerge "use asm_offsets with vp8_fast_quantize_b_sse3"
Johann Koenig [Fri, 8 Apr 2011 17:05:47 +0000 (10:05 -0700)]
Merge "use asm_offsets with vp8_fast_quantize_b_sse3"

13 years agoMerge "Error accumulator stats bug."
John Koleszar [Fri, 8 Apr 2011 15:20:32 +0000 (08:20 -0700)]
Merge "Error accumulator stats bug."

13 years agoError accumulator stats bug.
Paul Wilkins [Fri, 8 Apr 2011 13:21:36 +0000 (14:21 +0100)]
Error accumulator stats bug.

The error accumulator stats values cpi->prediction_error and
cpi->intra_error were being populated with rd values not
distortion values.

These are only "currently" used in a limited way for RT compress
key frame detection.

Change-Id: I2702ba1cab6e49ab8dc096ba75b6b34ab3573021

13 years agofixed an overflow in ssim calculation
Jim Bankoski [Mon, 28 Mar 2011 23:39:05 +0000 (16:39 -0700)]
fixed an overflow in ssim calculation

This commit fixed an overflow in ssim calculation, added register
save and restore to make sure assembly code working for x64 platform.
It also changed the sampling points to every 4x4 instead of 8x8 and
adjusted the constants in SSIM calculation to match the scale of
previous VPXSSIM.

Change-Id: Ia4dbb8c69eac55812f4662c88ab4653b6720537b

13 years agouse asm_offsets with vp8_fast_quantize_b_sse3
Johann Koenig [Thu, 7 Apr 2011 20:40:05 +0000 (16:40 -0400)]
use asm_offsets with vp8_fast_quantize_b_sse3

on the same order as the sse2 fast quantize change: ~2%
except for 32bit. only a slight improvment there.

Change-Id: Iff80e5f1ce7e646eebfdc8871405458ff911986b

13 years agoUse correct 32 bit comparisons for SAD breakout.
James Berry [Thu, 7 Apr 2011 19:03:49 +0000 (15:03 -0400)]
Use correct 32 bit comparisons for SAD breakout.

Rax updated to eax to avoid uninitialized memory
usage.

Change-Id: Iedb953f104329ede2a786fc648a47f1be2f3798a

13 years agoMerge "use asm_offsets with vp8_fast_quantize_b_sse2"
Johann [Wed, 6 Apr 2011 17:53:55 +0000 (10:53 -0700)]
Merge "use asm_offsets with vp8_fast_quantize_b_sse2"

13 years agoMerge "Minor modification"
Yunqing Wang [Wed, 6 Apr 2011 13:12:13 +0000 (06:12 -0700)]
Merge "Minor modification"

13 years agoMinor modification
Yunqing Wang [Wed, 6 Apr 2011 13:08:47 +0000 (09:08 -0400)]
Minor modification

A small change.

Change-Id: I2e7726e58370a95d0319361f4f6ad231138d1328

13 years agouse asm_offsets with vp8_fast_quantize_b_sse2
Johann [Thu, 24 Mar 2011 17:31:10 +0000 (13:31 -0400)]
use asm_offsets with vp8_fast_quantize_b_sse2

on the same order as the regular quantize change: ~2%

Change-Id: I5c9eec18e89ae7345dd96945cb740e6f349cee86

13 years agoFixed unused variable warnings for firstpass.c
Scott LaVarnway [Mon, 4 Apr 2011 18:18:31 +0000 (14:18 -0400)]
Fixed unused variable warnings for firstpass.c

Change-Id: I8378a9a541ade2f098359a7b20fa08e6c1596d80

13 years agoMerge "Slightly simplify vp8_decode_mb_tokens."
John Koleszar [Mon, 4 Apr 2011 15:58:25 +0000 (08:58 -0700)]
Merge "Slightly simplify vp8_decode_mb_tokens."

13 years agoMerge "tweak vp8_regular_quantize_b_sse2"
Johann [Mon, 4 Apr 2011 15:56:25 +0000 (08:56 -0700)]
Merge "tweak vp8_regular_quantize_b_sse2"

13 years agoSlightly simplify vp8_decode_mb_tokens.
Gaute Strokkenes [Mon, 4 Apr 2011 15:47:22 +0000 (16:47 +0100)]
Slightly simplify vp8_decode_mb_tokens.

Change-Id: I0058ba7dcfc50a3374b712197639ac337f8726be

13 years agoMerge "Use full-pixel MV in mvsadcost calculation"
Yunqing Wang [Mon, 4 Apr 2011 15:40:51 +0000 (08:40 -0700)]
Merge "Use full-pixel MV in mvsadcost calculation"

13 years agoMerge "support obj_int_extract on cygwin"
John Koleszar [Mon, 4 Apr 2011 15:29:50 +0000 (08:29 -0700)]
Merge "support obj_int_extract on cygwin"

13 years agoUse full-pixel MV in mvsadcost calculation
Yunqing Wang [Fri, 1 Apr 2011 20:41:58 +0000 (16:41 -0400)]
Use full-pixel MV in mvsadcost calculation

MV sad cost error is only used in full-pixel motion search,
which only need full-pixel resolution instead of quarter-pixel
resolution. This change reduced mvsadcost table size, and
removed unneccessary pamameter passing since this table is
constant once it is generated.

Change-Id: I9f931e55f6abc3c99011321f1dfb2f3562e6f6b0

13 years agosupport obj_int_extract on cygwin
Johann [Thu, 31 Mar 2011 20:35:22 +0000 (16:35 -0400)]
support obj_int_extract on cygwin

cygwin doesn't support _sopen. drop down to the lowest common
denominator and merge main for all platforms. this also opens the door
for supporting multiple object formats with a single binary.

Change-Id: I7cd45091639d447434e6d5db2e19cfc9988f8630

13 years agoMerge "vpxenc: die on realloc failures"
John Koleszar [Fri, 1 Apr 2011 14:55:55 +0000 (07:55 -0700)]
Merge "vpxenc: die on realloc failures"

13 years agotweak vp8_regular_quantize_b_sse2
Johann [Fri, 1 Apr 2011 13:58:23 +0000 (09:58 -0400)]
tweak vp8_regular_quantize_b_sse2

rather than look up rc in the zig zag table, embed it in the macro. this
also allows us to shuffle some values in the macro and keep *d in rsi

gains of about the same order as the obj_int_extract implementation: ~2%

Change-Id: Ib7252dd10eee66e0af8b0e567426122781dc053d

13 years agoMerge "Wrapper function removed from vp8_subtract_b_neon function call"
Johann [Fri, 1 Apr 2011 12:47:21 +0000 (05:47 -0700)]
Merge "Wrapper function removed from vp8_subtract_b_neon function call"

13 years agoWrapper function removed from vp8_subtract_b_neon function call
Tero Rintaluoma [Wed, 30 Mar 2011 10:45:59 +0000 (13:45 +0300)]
Wrapper function removed from vp8_subtract_b_neon function call

Address calculations moved from encodemb_arm.c file to neon
optimized assembly function to save cycles in function calls.
 - vp8_subtract_b_neon_func replaced with vp8_subtract_b_neon
   that contains all needed address calculations
 - unnecessary file encodemb_arm.c removed
 - consistent with ARMv6 optimized version

Change-Id: I6cbc1a2670b56c2077f59995fcf8f70786b4990b

13 years agoMerge "ARMv6 optimized subtract functions"
Johann [Thu, 31 Mar 2011 15:40:10 +0000 (08:40 -0700)]
Merge "ARMv6 optimized subtract functions"

13 years agoMerge changes I4e32a8fb,Ic6a9d4c5
John Koleszar [Thu, 31 Mar 2011 13:21:11 +0000 (06:21 -0700)]
Merge changes I4e32a8fb,Ic6a9d4c5

* changes:
  Generate a vpx.pc file for pkg-config.
  Export the version string as a makefile variable.

13 years agoGenerate a vpx.pc file for pkg-config.
Ralph Giles [Mon, 28 Mar 2011 19:04:51 +0000 (12:04 -0700)]
Generate a vpx.pc file for pkg-config.

Rules are added to libs.mk to generate a vpx.pc, which is
installed as pkgconfig/vpx.pc under the target library directory.
This also requires the install path prefix be exported directly
in config.mk.

Some systems use a tool called pkg-config to query information
about intalled libraries or other resources, based on database
files provided by the packages themselves at install time.

Providing such a file for libvpx simplifies integration with
other build systems, and provides an easy avenue for developers
to test against their own builds of the library.

Change-Id: I4e32a8fbb53fc331aa95eb207c63dd70a76d18ed

13 years agoExport the version string as a makefile variable.
Ralph Giles [Mon, 28 Mar 2011 18:36:53 +0000 (11:36 -0700)]
Export the version string as a makefile variable.

The configure script exports the major/minor/patch version
numbers, but didn't make the full version string available
to Makefile recipes and rules, the way it is available to
C code from vpx_version.h.

Change-Id: Ic6a9d4c574a6ea66a50c928f4eedeb91d7668eb5

13 years agoFix: lpf semaphore was signaled in single threaded run
Attila Nagy [Wed, 30 Mar 2011 12:50:17 +0000 (15:50 +0300)]
Fix: lpf semaphore was signaled in single threaded run

After picking filter level, post the loopfilter semaphore
just when multiple threads are in use.

Change-Id: If7bfb64601d906adef703f454dafc25e978b93c6

13 years agovpxenc: die on realloc failures
John Koleszar [Wed, 30 Mar 2011 01:44:19 +0000 (21:44 -0400)]
vpxenc: die on realloc failures

Identified as a possible cause of issue #308, the code was silently
ignoring realloc failures, which would lead to corruption, memory
leaks, and likely a crash. The best we can do in this case is die
gracefully.

Change-Id: Ie5f6a853d367015be5b9712bd742778f3baeefd9

13 years agoMerge "Half pixel variance further optimized for ARMv6"
Johann [Tue, 29 Mar 2011 19:14:54 +0000 (12:14 -0700)]
Merge "Half pixel variance further optimized for ARMv6"

13 years agoMerge "Fix a crash while enabling shared (--enable-shared)"
Yunqing Wang [Tue, 29 Mar 2011 16:04:22 +0000 (09:04 -0700)]
Merge "Fix a crash while enabling shared (--enable-shared)"

13 years agoFix a crash while enabling shared (--enable-shared)
Yunqing Wang [Tue, 29 Mar 2011 15:31:06 +0000 (11:31 -0400)]
Fix a crash while enabling shared (--enable-shared)

Fixed a bug in SSSE3 sub-pixel filter functions.

Change-Id: I2e2126652970eb78307ffcefcace1efd5966fb0a

13 years agouse GLOBAL correctly on 32bit shared libraries
Johann [Tue, 29 Mar 2011 14:28:30 +0000 (10:28 -0400)]
use GLOBAL correctly on 32bit shared libraries

http://code.google.com/p/webm/issues/detail?id=309

Change-Id: I6fce9e2f74bc09a9f258df7f91ab599812324e8c

13 years agoMerge "configure: enable unused variable warnings"
John Koleszar [Tue, 29 Mar 2011 14:38:04 +0000 (07:38 -0700)]
Merge "configure: enable unused variable warnings"

13 years agoARMv6 optimized subtract functions
Tero Rintaluoma [Tue, 29 Mar 2011 13:52:00 +0000 (16:52 +0300)]
ARMv6 optimized subtract functions

Adds following ARMv6 optimized functions to encoder:
  - vp8_subtract_b_armv6
  - vp8_subtract_mby_armv6
  - vp8_subtract_mbuv_armv6

Gives 1-5% speed-up depending on input sequence and encoding
parameters. Functions have one stall cycle inside the loop body
on Cortex pipeline.

Change-Id: I19cca5408b9861b96f378e818eefeb3855238639

13 years agoadd asm_enc_offsets.c for all targets
Johann [Thu, 24 Mar 2011 22:12:21 +0000 (18:12 -0400)]
add asm_enc_offsets.c for all targets

now that we need asm_enc_offsets.c for x86 and arm and it is
harmless to build it for other targets, add it unconditionally

Change-Id: I320c5220afd94fee2b98bda9ff4e5e34c67062f3

13 years agoHalf pixel variance further optimized for ARMv6
Tero Rintaluoma [Mon, 28 Mar 2011 06:51:51 +0000 (09:51 +0300)]
Half pixel variance further optimized for ARMv6

Half pixel interpolations optimized in variance calculations. Separate
function calls to vp8_filter_block2d_bil_x_pass_armv6 are avoided.On
average, performance improvement is 6-7% for VGA@30fps sequences.

Change-Id: Idb5f118a9d51548e824719d2cfe5be0fa6996628

13 years agoMerge "use asm_offsets with vp8_regular_quantize_b_sse2"
Johann [Thu, 24 Mar 2011 18:06:36 +0000 (11:06 -0700)]
Merge "use asm_offsets with vp8_regular_quantize_b_sse2"

13 years agouse asm_offsets with vp8_regular_quantize_b_sse2
Johann [Thu, 10 Feb 2011 19:57:43 +0000 (14:57 -0500)]
use asm_offsets with vp8_regular_quantize_b_sse2

remove helper function and avoid shadowing all the arguments to the
stack on 64bit systems

when running with --good --cpu-used=0:
~2% on linux x86 and x86_64
~2% on win32 x86 msys and visual studio
more on darwin10 x86_64
significantly more on
x86_64-win64-vs9

Change-Id: Ib7be12edf511fbf2922f191afd5b33b19a0c4ae6

13 years agoMerge "ARMv6 optimized fdct4x4"
Johann [Wed, 23 Mar 2011 14:52:51 +0000 (07:52 -0700)]
Merge "ARMv6 optimized fdct4x4"

13 years agoMerge "Allow specifying --end-usage by enum name"
John Koleszar [Mon, 21 Mar 2011 19:29:11 +0000 (12:29 -0700)]
Merge "Allow specifying --end-usage by enum name"

13 years agoMerge "vpx_codec_dec_init: check that the iface is a decoder"
John Koleszar [Mon, 21 Mar 2011 16:12:58 +0000 (09:12 -0700)]
Merge "vpx_codec_dec_init: check that the iface is a decoder"

13 years agovpx_codec_dec_init: check that the iface is a decoder
John Koleszar [Mon, 21 Mar 2011 15:46:15 +0000 (11:46 -0400)]
vpx_codec_dec_init: check that the iface is a decoder

Make sure the given interface is actually a decoder interface before
initializing it.

Change-Id: Ie48d737f2956cc2f0891666de5ea87251e96bc49

13 years agoMerge "Fix multithreaded encoding for 1 MB wide frame"
Yunqing Wang [Mon, 21 Mar 2011 14:41:31 +0000 (07:41 -0700)]
Merge "Fix multithreaded encoding for 1 MB wide frame"

13 years agoRemove unused vp8_get4x4sse_cs_mmx declaration
John Koleszar [Fri, 11 Mar 2011 18:01:02 +0000 (13:01 -0500)]
Remove unused vp8_get4x4sse_cs_mmx declaration

This declaration did not match the prototype_sad() prototype, but was
unused in this translation unit, so it is removed instead. Fixes
issue 290.

Change-Id: I168854f88a85f73ca9aaf61d1e5dc0f43fc3fdb3

13 years agoMerge "Increase static linkage, remove unused functions"
John Koleszar [Mon, 21 Mar 2011 11:51:51 +0000 (04:51 -0700)]
Merge "Increase static linkage, remove unused functions"

13 years agoAllow specifying --end-usage by enum name
John Koleszar [Mon, 21 Mar 2011 11:50:42 +0000 (07:50 -0400)]
Allow specifying --end-usage by enum name

Map an enum to the --end-usage values, so you can specify
--end-usage=cq instead of --end-usage=2. The numerical values still
work for historical scripts, etc, but this is more user friendly.

Change-Id: I445ecd9638f801f5924a71eabf449bee293cdd34

13 years agoARMv6 optimized fdct4x4
Tero Rintaluoma [Mon, 21 Mar 2011 11:33:45 +0000 (13:33 +0200)]
ARMv6 optimized fdct4x4

Optimized fdct4x4 (8x4) for ARMv6 instruction set.
  - No interlocks in Cortex-A8 pipeline
  - One interlock cycle in ARM11 pipeline
  - About 2.16 times faster than current C-code compiled with -O3

Change-Id: I60484ecd144365da45bb68a960d30196b59952b8