granicus.if.org Git - libx264/log

]> granicus.if.org Git - libx264/log

Rafaël Carré [Mon, 16 Apr 2012 01:20:14 +0000 (21:20 -0400)]

Update config.guess and config.sub
Adds support for a bunch of targets, including:
aarch64 (armv8)
arm-linux-androideabi

commit | commitdiff | tree

Alexander Prikhodko [Sat, 31 Mar 2012 08:33:41 +0000 (11:33 +0300)]

configure: correct use of RC variable and add --extra-rcflags

commit | commitdiff | tree

Steven Walters [Thu, 29 Mar 2012 01:15:04 +0000 (21:15 -0400)]

ICL/MSVS: Fix shared library generation and usage
MSVS requires exported variables to be declared with the DATA keyword, and requires that imported variables be declared with dllimport.
This does not fix x264 cli being unable to use a shared library built by ICL however.

commit | commitdiff | tree

Kieran Kunhya [Tue, 27 Mar 2012 16:38:56 +0000 (17:38 +0100)]

Fix intra-refresh + hrd

commit | commitdiff | tree

Anton Mitrofanov [Sun, 25 Mar 2012 13:34:24 +0000 (17:34 +0400)]

Fix frame input colorspace check

commit | commitdiff | tree

Fiona Glaser [Thu, 22 Mar 2012 20:56:50 +0000 (13:56 -0700)]

Fix comment in deblock.c
The code does, in fact, handle CAVLC+8x8dct correctly already.

commit | commitdiff | tree

Fiona Glaser [Tue, 13 Mar 2012 21:37:26 +0000 (14:37 -0700)]

Fix sliced-threads ratecontrol bug
Was using qp instead of qscale; could cause NANs (not to mention less accurate results).

commit | commitdiff | tree

Anton Mitrofanov [Mon, 12 Mar 2012 06:08:18 +0000 (23:08 -0700)]

Fix clobbering of mutex/cvs
Regression in r2183.
Bizarrely seemed to work on many platforms, but crashed on win64 and may have been slower.
Only affected sliced threads during encoding, but could cause crashes on x264 encoder close even without sliced threads.

commit | commitdiff | tree

Fiona Glaser [Fri, 24 Feb 2012 21:34:39 +0000 (13:34 -0800)]

Sliced-threads: do hpel and deblock after returning
Lowers encoding latency around 14% in sliced threads mode with preset superfast.
Additionally, even if there is no waiting time between frames, this improves parallelism, because hpel+deblock are done during the (singlethreaded) lookahead.
For ease of debugging, dump-yuv forces all of the threads to wait and finish instead of setting b_full_recon.

commit | commitdiff | tree

Fiona Glaser [Fri, 24 Feb 2012 21:16:52 +0000 (13:16 -0800)]

Add full-recon API option
Fully reconstruct frames even without dump-yuv.

commit | commitdiff | tree

Fiona Glaser [Wed, 22 Feb 2012 21:33:36 +0000 (13:33 -0800)]

x86inc: switch to amdnops
Recent AMD CPUs' instruction decoders choke horribly on extremely long nops (i.e. with 4 prefixes).
Won't affect much, since we don't use ALIGN much.

commit | commitdiff | tree

Fiona Glaser [Wed, 15 Feb 2012 00:54:03 +0000 (16:54 -0800)]

BMI1 decimate functions
Intel was nice enough to make tzcnt equal to "rep bsf", which is backwards-compatible.
This means we don't actually have to add new functions to make it work.

commit | commitdiff | tree

Fiona Glaser [Tue, 14 Feb 2012 23:07:10 +0000 (15:07 -0800)]

Minor asm changes

commit | commitdiff | tree

Fiona Glaser [Thu, 9 Feb 2012 22:23:52 +0000 (14:23 -0800)]

Add row-reencoding support to VBV for improved accuracy
Extremely accurate, possibly 100% so (I can't get it to fail even with difficult VBVs).
Does not yet support rows split on slice boundaries (occurs often with slice-max-size/mbs).
Still inaccurate with sliced threads, but better than before.

commit | commitdiff | tree

Fiona Glaser [Thu, 9 Feb 2012 20:38:44 +0000 (12:38 -0800)]

Abstract bitstream backup/restore functions
Required for row re-encoding.

commit | commitdiff | tree

Anton Mitrofanov [Thu, 9 Feb 2012 23:27:53 +0000 (15:27 -0800)]

Add an small per-MB cost penalty for lowres
Helps avoid VBV predictors going nuts with very low-cost MBs.
One particular case this fixes is zero-cost MBs: adaptive quantization decreases the QP a lot, but (before this patch), no cost penalty gets factored in for this, because anything times zero is zero.

commit | commitdiff | tree

Fiona Glaser [Tue, 14 Feb 2012 02:31:51 +0000 (18:31 -0800)]

Remove explicit run calculation from coeff_level_run
Not necessary with the CAVLC lookup table for zero run codes.

commit | commitdiff | tree

Fiona Glaser [Mon, 13 Feb 2012 21:20:06 +0000 (13:20 -0800)]

Export PSNR/SSIM in x264 API

commit | commitdiff | tree

Ronald S. Bultje [Wed, 8 Feb 2012 21:10:31 +0000 (13:10 -0800)]

x86inc: support yasm -f win64
Not necessary for x264, as -m amd64 already does the right thing, but used by external users of x86inc.

commit | commitdiff | tree

Henrik Gramner [Wed, 1 Feb 2012 22:52:48 +0000 (23:52 +0100)]

Fix incorrect zero-extension assumptions in x86_64 asm
Some x264 asm assumed that the high 32 bits of registers containing "int" values would be zero.
This is almost always the case, and it seems to work with gcc, but it is *not* guaranteed by the ABI.
As a result, it breaks with some other compilers, like Clang, that take advantage of this in optimizations.
Accordingly, fix all x86 code by using intptr_t instead of int or using movsxd where neccessary.
Also add checkasm hack to detect when assembly functions incorrectly assumes that 32-bit integers are zero-extended to 64-bit.

commit | commitdiff | tree

Fiona Glaser [Thu, 23 Feb 2012 17:11:23 +0000 (09:11 -0800)]

Fix possible alignment crash when linking from MSVC
x264_cavlc_init needs to be stack-aligned now.

commit | commitdiff | tree

Anton Mitrofanov [Tue, 21 Feb 2012 20:58:22 +0000 (12:58 -0800)]

Fix rare overflow in 10-bit intra_satd_x3_16x16 asm

commit | commitdiff | tree

Steven Walters [Sun, 12 Feb 2012 03:56:43 +0000 (22:56 -0500)]

ICL: fix out of tree building and resource file usage on Windows

commit | commitdiff | tree

Oka Motofumi [Sun, 5 Feb 2012 21:07:34 +0000 (06:07 +0900)]

Add error handling for out-of-tree build

commit | commitdiff | tree

Anton Mitrofanov [Tue, 6 Mar 2012 13:34:02 +0000 (17:34 +0400)]

Fix RGB colorspace input
BGR/BGRA input was correct.

commit | commitdiff | tree

Fiona Glaser [Tue, 14 Feb 2012 00:40:32 +0000 (16:40 -0800)]

Fix interlaced + extremal slice-max-size
Broke if the first macroblock in the slice exceeded the set slice-max-size.

commit | commitdiff | tree

Henrik Gramner [Sun, 5 Feb 2012 19:43:09 +0000 (20:43 +0100)]

Fix regression in r2141
Broke register preservation in x264_cpu_cpuid and x264_cpu_xgetbv.
Did not cause any problems.

commit | commitdiff | tree

Fiona Glaser [Thu, 19 Jan 2012 22:56:54 +0000 (14:56 -0800)]

TBM, AVX2, FMA3, BMI1, and BMI2 CPU detection support
TBM and BMI1 are supported by Trinity/Piledriver.
The others (and BMI1) will probably appear in Intel's upcoming Haswell.
Also update x86inc with AVX2 stuff.

commit | commitdiff | tree

Loren Merritt [Fri, 3 Feb 2012 06:27:18 +0000 (06:27 +0000)]

x86inc: add TAIL_CALL macro to abstract a common asm idiom

commit | commitdiff | tree

Fiona Glaser [Thu, 26 Jan 2012 00:44:38 +0000 (16:44 -0800)]

Minor asm optimizations/cleanup

commit | commitdiff | tree

Fiona Glaser [Wed, 25 Jan 2012 03:03:58 +0000 (19:03 -0800)]

Clean up and optimize weightp, plus enable SSSE3 weight on SB/BDZ
Also remove unused AVX cruft.

commit | commitdiff | tree

Fiona Glaser [Tue, 24 Jan 2012 02:57:58 +0000 (18:57 -0800)]

XOP frame_init_lowres
Covers both 8-bit and 16-bit, ~5-10% faster on Bulldozer.

commit | commitdiff | tree

Fiona Glaser [Tue, 17 Jan 2012 23:25:10 +0000 (15:25 -0800)]

XOP 8x8 zigzags
Field: 35(mmx) ->16(xop) cycles
Frame: 32(ssse3)->20(xop) cycles

commit | commitdiff | tree

Fiona Glaser [Mon, 23 Jan 2012 23:09:38 +0000 (15:09 -0800)]

AVX 32-bit hpel_filter_h
Faster on Sandy Bridge.
Also add details on unsuccessful optimizations in these functions.

commit | commitdiff | tree

Fiona Glaser [Sat, 28 Jan 2012 00:29:30 +0000 (16:29 -0800)]

x86inc: add high halfword register support
Might be useful in a few cases.

commit | commitdiff | tree

Ronald S. Bultje [Wed, 25 Jan 2012 05:53:59 +0000 (13:53 +0800)]

Change %ifdef directives to %if directives in *.asm files
This allows combining multiple conditionals in a single statement.

commit | commitdiff | tree

Anton Mitrofanov [Sun, 22 Jan 2012 18:13:52 +0000 (22:13 +0400)]

Use TV range algorithm for bit-depth conversions
Such sources are more common, so better to be correct for the common case.
This also produces less error for the case of full range than the previous algorithm produced for the case of TV range.

commit | commitdiff | tree

Hii [Wed, 25 Jan 2012 08:29:22 +0000 (16:29 +0800)]

Bump dates to 2012

commit | commitdiff | tree

Henrik Gramner [Sat, 28 Jan 2012 20:38:27 +0000 (21:38 +0100)]

Add Windows resource file
Displays version info in Windows Explorer.

commit | commitdiff | tree

Sergey Radionov [Mon, 16 Jan 2012 21:22:44 +0000 (13:22 -0800)]

Fix win32 pthread_cond_signal
Isn't used by x264 currently, so didn't cause a problem.
Fix backported from libav.

commit | commitdiff | tree

Mans Rullgard [Wed, 1 Feb 2012 23:55:25 +0000 (15:55 -0800)]

ARM: align asm functions to 4 bytes.
Some linkers apparently fail to correctly align ARM functions when mixing with Thumb code.

commit | commitdiff | tree

Anton Mitrofanov [Sun, 22 Jan 2012 09:00:23 +0000 (13:00 +0400)]

Fix normalization of colorspace when input is packed YUV 4:2:2

commit | commitdiff | tree

Fiona Glaser [Sat, 21 Jan 2012 20:54:40 +0000 (12:54 -0800)]

Force keyint-min 1 with Blu-ray
Fixes an issue with referencing across I-frames that's prohibited in Blu-ray for some godforsaken reason.

commit | commitdiff | tree

Oka Motofumi [Sun, 29 Jan 2012 11:34:41 +0000 (20:34 +0900)]

Fix crash in --demuxer y4m with unsupported colorspace

commit | commitdiff | tree

Anton Mitrofanov [Mon, 16 Jan 2012 22:02:53 +0000 (14:02 -0800)]

Fix overread/possible crash with intra refresh + VBV

commit | commitdiff | tree

Loren Merritt [Wed, 18 Jan 2012 23:47:07 +0000 (15:47 -0800)]

Fix trellis 2 + subme >= 8
Trellis didn't return a boolean value as it was supposed to.
Regression in r2143-5.

commit | commitdiff | tree

Loren Merritt [Fri, 6 Jan 2012 15:53:29 +0000 (15:53 +0000)]

CABAC trellis opts part 4: x86_64 asm
Another 20% faster.
18k->12k codesize.

This patch series may have a large impact on encoding speed.
For example, 24% faster at --preset slower --crf 23 with 720p parkjoy.
Overall speed increase is proportional to the cost of trellis (which is proportional to bitrate, and much more with --trellis 2).

commit | commitdiff | tree

Loren Merritt [Fri, 6 Jan 2012 15:53:04 +0000 (15:53 +0000)]

CABAC trellis opts part 3: make some arrays non-static

commit | commitdiff | tree

Loren Merritt [Thu, 22 Dec 2011 17:56:06 +0000 (17:56 +0000)]

CABAC trellis opts part 2: C optimizations

Hoist the branch on coef value out of the loop over node contexts.
Special cases for each possible coef value (0,1,n).
Special case for dc-only blocks.
Template the main loop for two common subsets of nodes, to avoid a bunch of branches about which nodes are live.
Use the nonupdating version of cabac_size_decision in more cases, and omit those bins from the node struct.
CABAC offsets are now compile-time constants.
Change TRELLIS_SCORE_MAX from a specific constant to anything negative, which is cheaper to test.
Remove dct_weight2_zigzag[], since trellis has to lookup zigzag[] anyway.

60% faster on x86_64.
25k->18k codesize.

commit | commitdiff | tree

Loren Merritt [Thu, 22 Dec 2011 17:55:06 +0000 (17:55 +0000)]

CABAC trellis opts part 1: minor change in output
Due to different tie-break order.

commit | commitdiff | tree

Henrik Gramner [Sun, 8 Jan 2012 03:14:10 +0000 (04:14 +0100)]

x86inc improvements for 64-bit

Add support for all x86-64 registers
Prefer caller-saved register over callee-saved on WIN64
Support up to 15 function arguments

commit | commitdiff | tree

Ilia Valiakhmetov [Sun, 15 Jan 2012 10:47:58 +0000 (04:47 -0600)]

High bit depth SSE2/AVX add8x8_idct8 and add16x16_idct8
From Google Code-In.

commit | commitdiff | tree

Edward Wang [Wed, 4 Jan 2012 23:35:54 +0000 (15:35 -0800)]

MMX/SSE2/AVX predict_8x16_p, high bit depth fdct8
From Google Code-In.

commit | commitdiff | tree

Fiona Glaser [Thu, 22 Dec 2011 22:03:15 +0000 (14:03 -0800)]

XOP 8-bit fDCT
Use integer MAC for one of the SUMSUB passes. About a dozen cycles faster for 16x16.

commit | commitdiff | tree

Cristian Militaru [Wed, 4 Jan 2012 20:38:08 +0000 (12:38 -0800)]

High bit depth intra_sad_x3_4x4
From Google Code-In.

commit | commitdiff | tree

Fiona Glaser [Thu, 8 Dec 2011 21:45:41 +0000 (13:45 -0800)]

Use a large LUT for CAVLC zero-run bit codes
Helps the most with trellis and RD, but also helps with bitstream writing.
Seems at worst neutral even in the extreme case of a CPU with small L2 cache (e.g. ARM Cortex A8).

commit | commitdiff | tree

Matt Habel [Sat, 17 Dec 2011 07:16:09 +0000 (23:16 -0800)]

High bit depth intra_sad_x3_8x8, intra_satd_x3_4x4/8x8c/16x16
Also add an ACCUM macro to handle accumulator-induced add-or-swap more concisely.

commit | commitdiff | tree

Shitiz Garg [Sat, 3 Dec 2011 23:34:57 +0000 (15:34 -0800)]

MMX 10-bit predict_8x8c_h and predict_8x16c_h
From Google Code-In.

commit | commitdiff | tree

Aaron Schmitz [Wed, 30 Nov 2011 06:15:45 +0000 (00:15 -0600)]

Some MBAFF x86 assembly functions.
deblock_chroma_420_mbaff, plus 422/422_intra_mbaff implemented using existing functions.
From Google Code-In.

commit | commitdiff | tree

George Stephanos [Fri, 2 Dec 2011 00:53:45 +0000 (16:53 -0800)]

More ARM NEON assembly functions
predict_8x8_v, predict_4x4_dc_top, predict_8x8_ddl, predict_8x8_ddr, predict_8x8_vl, predict_8x8_vr, predict_8x8_hd, predict_8x8_hu.
From Google Code-In.

commit | commitdiff | tree

Ilia [Mon, 28 Nov 2011 13:20:09 +0000 (05:20 -0800)]

More 4:2:2 asm functions
High bit depth version of deblock_h_chroma_422.
Regular and high bit depth versions of deblock_h_chroma_intra_422.
High bit depth pixel_vsad.
SSE2 high bit depth and MMX 8-bit predict_8x8_vl.
Our first GCI patch this year!

commit | commitdiff | tree

Henrik Gramner [Thu, 8 Dec 2011 15:14:35 +0000 (16:14 +0100)]

SSE2 and SSSE3 versions of sub8x16_dct_dc
Also slightly faster sub8x8_dct_dc

commit | commitdiff | tree

Steven Walters [Mon, 5 Dec 2011 13:46:34 +0000 (08:46 -0500)]

Resize filter updates
Use AVPixFmtDescriptors to pick the most compatible x264 csp for any pixel format.
Fix deprecated use of av_set_int.
Now requires libavutil >= 51.19.0

commit | commitdiff | tree

Oka Motofumi [Thu, 5 Jan 2012 22:23:50 +0000 (14:23 -0800)]

Add out-of-tree build support

commit | commitdiff | tree

Anton Mitrofanov [Fri, 16 Dec 2011 14:17:00 +0000 (18:17 +0400)]

Limit SSIM to 100db
Avoids floating point error for infinite SSIM (lossless).

commit | commitdiff | tree

Reynaldo H. Verdejo Pinochet [Wed, 4 Jan 2012 16:16:12 +0000 (13:16 -0300)]

Fix wrong conditional inclusion of inttypes.h
inttypes.h is required by encoder/ratecontrol.c for SCNxxx macros, and HAVE_STDINT_H does not imply having inttypes.h.
stdint.h is a subset of inttypes.h, but this isn't enough for x264.
This change fixes building x264 with Android's toolchain.

commit | commitdiff | tree

Anton Mitrofanov [Wed, 21 Dec 2011 07:08:56 +0000 (11:08 +0400)]

Fix crash with sliced threads and input height <= 112

commit | commitdiff | tree

Phillip Blucas [Mon, 19 Dec 2011 23:43:41 +0000 (17:43 -0600)]

Fix loading custom 8x8 chroma quant matrices in 4:4:4

commit | commitdiff | tree

Anton Mitrofanov [Thu, 15 Dec 2011 21:48:07 +0000 (01:48 +0400)]

Fix PCM cost overflow

commit | commitdiff | tree

Anton Mitrofanov [Thu, 8 Dec 2011 21:54:22 +0000 (01:54 +0400)]

Fix overflow in 8-bit x86 vsad asm function

commit | commitdiff | tree

Anton Mitrofanov [Wed, 7 Dec 2011 15:14:52 +0000 (19:14 +0400)]

Fix crash in --fullhelp when compiled against recent ffmpeg
Don't assume all pixel formats have a description.

commit | commitdiff | tree

Fiona Glaser [Tue, 6 Dec 2011 22:39:21 +0000 (14:39 -0800)]

Fix regression in r2118
Broke trellis with i16x16 macroblocks.

commit | commitdiff | tree

Fiona Glaser [Wed, 30 Nov 2011 21:02:12 +0000 (13:02 -0800)]

Modify MBAFF chroma deblock functions to handle U/V at the same time
Allows for more convenient asm implementations.

commit | commitdiff | tree

Fiona Glaser [Fri, 11 Nov 2011 00:16:13 +0000 (16:16 -0800)]

CABAC trellis optimizations: use SIMD quant
Significant speed increase, minor change in output due to rounding.

commit | commitdiff | tree

Steven Walters [Sun, 6 Nov 2011 17:48:30 +0000 (09:48 -0800)]

YUV range detection and support for x264CLI
Two new options: --input-range and --range.
--input-range forces the range of the input in case of misdetection; auto by default.
-- range sets the range of the output; x264cli will convert if necessary, TV by default.
--fullrange is now removed as a CLI option (but the libx264 API is unchanged).

commit | commitdiff | tree

Kieran Kunhya [Fri, 4 Nov 2011 20:09:13 +0000 (20:09 +0000)]

Pass through user data

commit | commitdiff | tree

Fiona Glaser [Thu, 27 Oct 2011 21:05:56 +0000 (14:05 -0700)]

Remove unpredictable branch in CABAC dqp

commit | commitdiff | tree

Loren Merritt [Sun, 23 Oct 2011 23:15:11 +0000 (23:15 +0000)]

x86inc: AVX symmetry optimization
3-arg AVX ops with a memory arg can only have it in src2,
whereas SSE emulation of 3-arg prefers to have it in src1 (i.e. the move).
So, if the op is symmetric and the wrong one is memory, swap them.
Eliminates redundant moves in some cases when using 3-operand without AVX with memory arguments.
Also fix movss and movsd in some cases, and flag shufps correctly as float.

commit | commitdiff | tree

Anton Mitrofanov [Tue, 29 Nov 2011 21:45:13 +0000 (13:45 -0800)]

checkasm: shut up gcc warnings, fix some naming of functions in results

commit | commitdiff | tree

Mans Rullgard [Tue, 29 Nov 2011 00:29:12 +0000 (16:29 -0800)]

checkasm: fix build on ARM
Because of how ALIGNED_ARRAY_16 is defined on ARM, array initialisers cannot be used here. Use memset() instead.

commit | commitdiff | tree

Anton Mitrofanov [Fri, 11 Nov 2011 21:31:49 +0000 (01:31 +0400)]

Improve makefile rules
Remove the need for "make clean" after most reconfigures.

commit | commitdiff | tree

Anton Mitrofanov [Fri, 11 Nov 2011 20:47:48 +0000 (00:47 +0400)]

Mark some local functions as static, cosmetics

commit | commitdiff | tree

Anton Mitrofanov [Fri, 11 Nov 2011 19:19:02 +0000 (23:19 +0400)]

Fix crash if timecode file opening fails

commit | commitdiff | tree

Fabian Greffrath [Fri, 11 Nov 2011 21:25:43 +0000 (13:25 -0800)]

Configure: force PIC for shared build on PARISC and MIPS

commit | commitdiff | tree

Anton Mitrofanov [Sat, 22 Oct 2011 15:41:07 +0000 (19:41 +0400)]

Improve yasm version check
Previous check allowed certain earlier versions that weren't fully compatible.

commit | commitdiff | tree

Fiona Glaser [Tue, 18 Oct 2011 21:30:26 +0000 (14:30 -0700)]

Add fenc prefetching to adaptive quant
Many fewer cache misses, faster adaptive quant.

commit | commitdiff | tree

Fiona Glaser [Tue, 18 Oct 2011 21:14:03 +0000 (14:14 -0700)]

Split prefetch_fenc between colorspaces
Add 4:2:2 version.

commit | commitdiff | tree

Fiona Glaser [Wed, 12 Oct 2011 00:04:32 +0000 (17:04 -0700)]

Some more 4:2:2 x86 asm
coeff_last8, coeff_level_run8, var2_8x16, predict_8x16c_dc, satd_4x16, intra_mbcmp_8x16c_x3, deblock_h_chroma_422

commit | commitdiff | tree

Loren Merritt [Tue, 11 Oct 2011 18:12:43 +0000 (18:12 +0000)]

Remove obsolete versions of intra_mbcmp_x3
intra_mbcmp_x3 is unnecessary if x9 exists (SSSE3 and onwards).

commit | commitdiff | tree

Loren Merritt [Mon, 10 Oct 2011 05:42:36 +0000 (05:42 +0000)]

SSSE3/SSE4/AVX 9-way fully merged i8x8 analysis (sa8d_x9)
x86_64 only for now, due to register requirements (like sa8d_x3).

i8x8 analysis cycles (per partition):
penryn sandybridge bulldozer
616->600  482->374  418->356  preset=faster
892->632  725->387  598->373  preset=medium
948->650  789->409  673->383  preset=slower

commit | commitdiff | tree

Fiona Glaser [Sat, 1 Oct 2011 02:09:19 +0000 (19:09 -0700)]

SSSE3/SSE4/AVX 9-way fully merged i8x8 analysis (sad_x9)
~3 times faster than current analysis, plus (like intra_sad_x9_4x4) analyzes all modes without shortcuts.

commit | commitdiff | tree

Loren Merritt [Wed, 5 Oct 2011 20:29:21 +0000 (13:29 -0700)]

Merge i4x4 prediction with intra_mbcmp_x9_4x4
Avoids a redundant prediction after analysis.

commit | commitdiff | tree

Fiona Glaser [Wed, 5 Oct 2011 20:17:31 +0000 (13:17 -0700)]

Inline i4x4/i8x8 encode into intra analysis
Larger code size, but faster.

commit | commitdiff | tree

Fiona Glaser [Thu, 22 Sep 2011 00:12:10 +0000 (17:12 -0700)]

Initial XOP and FMA4 support on AMD Bulldozer
~10% faster Hadamard functions (SATD/SA8D/hadamard_ac) plus other improvements.

commit | commitdiff | tree

Mans Rullgard [Tue, 27 Sep 2011 17:14:14 +0000 (21:14 +0400)]

ARM: update NEON chroma deblock functions to NV12 pixel format

commit | commitdiff | tree

Sean McGovern [Mon, 17 Oct 2011 19:45:15 +0000 (12:45 -0700)]

Add /usr/lib/{64/}values-xpg6.o to $LDFLAGS on Solaris
This is required for POSIX.1-2001 compliance.

commit | commitdiff | tree

Sean McGovern [Mon, 17 Oct 2011 19:44:03 +0000 (12:44 -0700)]

Fix linker test for -Bsymbolic
The Solaris linker only accepts -Bsymbolic for objects compiled in dynamic mode (i.e. shared objects), so pass -shared to gcc.
Additionally, for x86_32 unresolved textrels cause a linker error so mark the .text section as 'impure'.

commit | commitdiff | tree

Sean McGovern [Mon, 17 Oct 2011 19:43:28 +0000 (12:43 -0700)]

Add $SOFLAGS to exported SOFLAGS make variable

commit | commitdiff | tree

Henrik Gramner [Sat, 24 Sep 2011 13:56:08 +0000 (15:56 +0200)]

Allow setting a chroma format at compile time
Gives a slight speed increase and significant binary size reduction when only one chroma format is needed.

commit | commitdiff | tree

Harfe Leier [Fri, 30 Sep 2011 19:49:33 +0000 (12:49 -0700)]

Improve profile help
List high422/high444 profiles, and don't show non-high-bit-depth profiles in high bit depth builds.

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom