granicus.if.org Git - libx264/log

]> granicus.if.org Git - libx264/log

projects / libx264 / log

commit | commitdiff | tree

Fiona Glaser [Thu, 21 Jan 2010 18:00:07 +0000 (10:00 -0800)]

Merge nnz_backup with scratch buffer
Slightly less memory usage.

commit | commitdiff | tree

Steven Walters [Wed, 20 Jan 2010 17:00:54 +0000 (09:00 -0800)]

Use cross-prefix properly with pkg-config for cross-compiling

commit | commitdiff | tree

Fiona Glaser [Tue, 19 Jan 2010 04:29:33 +0000 (20:29 -0800)]

Various performance optimizations
Simplify and compact storage of direct motion vectors, faster --direct auto.
Shrink various arrays to save a bit of cache.
Simplify and reorganize B macroblock type writing in CABAC.
Add some missing ALIGNED macros.

commit | commitdiff | tree

Fiona Glaser [Mon, 18 Jan 2010 23:50:06 +0000 (15:50 -0800)]

Fix crash on new AMD M300 and similar CPUs
Apparently these CPUs have SSE4a, but not misaligned SSE.

commit | commitdiff | tree

Fiona Glaser [Mon, 18 Jan 2010 00:11:05 +0000 (19:11 -0500)]

Fix intra refresh with subme < 6
Also improve the quality of intra masking.

commit | commitdiff | tree

Fiona Glaser [Sun, 17 Jan 2010 01:11:29 +0000 (20:11 -0500)]

Add support for multiple --tune options
Tunes apply in the order they are listed in the case of conflicts.
Psy tunings, i.e. film/animation/grain/psnr/ssim, cannot be combined.
Also clarify --profile, which forces the limits of a profile, not the profile itself.

commit | commitdiff | tree

Fiona Glaser [Sat, 16 Jan 2010 07:50:15 +0000 (02:50 -0500)]

Various bugfixes and tweaks in analysis
Fix the oldest-ever bug in x264: b16x8 analysis used the wrong width for predict_mv.
Fix cache_ref calls for slightly better MV prediction in bsub16x16 analysis.
Make B-partition analysis consider reference frame costs.
Various other minor changes.
Overall very slightly improved mode decision and motion search in B-frames.

commit | commitdiff | tree

Loren Merritt [Thu, 14 Jan 2010 19:52:12 +0000 (14:52 -0500)]

More --me tesa optimizations

commit | commitdiff | tree

Fiona Glaser [Thu, 14 Jan 2010 15:39:10 +0000 (10:39 -0500)]

Fix typo in configure

commit | commitdiff | tree

Fiona Glaser [Thu, 14 Jan 2010 05:07:30 +0000 (00:07 -0500)]

Make --fps force CFR mode

commit | commitdiff | tree

Fiona Glaser [Thu, 14 Jan 2010 01:21:31 +0000 (20:21 -0500)]

Eliminate intentional array overflow in quant matrix handling
While it probably never caused problems, it was incredibly ugly and evil.

commit | commitdiff | tree

Fiona Glaser [Thu, 14 Jan 2010 01:16:13 +0000 (20:16 -0500)]

Faster --me tesa

commit | commitdiff | tree

Anton Mitrofanov [Wed, 13 Jan 2010 20:44:00 +0000 (15:44 -0500)]

Fix static pthreads + dynamically linked x264 on win32
Add the necessary static pthread initialization code to a new DLLmain function.

commit | commitdiff | tree

Steven Walters [Wed, 13 Jan 2010 03:55:10 +0000 (22:55 -0500)]

Add getopt_long to the included getopt.c
Fixes option handling on OSs that have a nonworking/missing getopt (e.g. Solaris).

commit | commitdiff | tree

Fiona Glaser [Wed, 13 Jan 2010 01:14:35 +0000 (20:14 -0500)]

Faster psy-trellis init
Remove some unncessary zigzags.

commit | commitdiff | tree

Fiona Glaser [Wed, 13 Jan 2010 00:19:07 +0000 (19:19 -0500)]

Simplfy intra mode availability handling
Slightly faster, 1.5kb smaller binary size, less code.

commit | commitdiff | tree

Fiona Glaser [Sun, 10 Jan 2010 20:14:02 +0000 (15:14 -0500)]

Fix free callback, add x264_encoder_parameters function
x264 would try to use the passed param struct after freeing if the param_free callback was set.
Probably didn't cause any issues, as probably no programs used the callback in this location yet.

A new x264_encoder_parameters function is now available in the API.
This function lets the calling application grab the current state of the encoder's parameters.
Use this in x264cli to ensure that the param struct used for set_param is updated with whatever changes x264_encoder_open has made to it.

Patch partially by Anton Mitrofanov <BugMaster@narod.ru>.

commit | commitdiff | tree

David Conrad [Sat, 9 Jan 2010 06:52:33 +0000 (01:52 -0500)]

Fix x264 compilation on Apple GCC
Apple's GCC stupidly ignores the ARM ABI and doesn't give any stack alignment beyond 4.

commit | commitdiff | tree

Fiona Glaser [Sat, 2 Jan 2010 08:27:46 +0000 (03:27 -0500)]

Faster weightp motion search
For blind-weight dupes, copy the motion vector from the main search and qpel-refine instead of doing a full search.
Fix the p8x8 early termination, which had unexpected results when combined with blind weighting.
Overall, marginally reduces compression but should potentially improve speed by over 5%.

commit | commitdiff | tree

Fiona Glaser [Thu, 31 Dec 2009 18:45:27 +0000 (13:45 -0500)]

More correct padding constants for lowres planes
Since lowres analysis isn't interlace-aware, we don't need to double the vertical padding for interlaced video.

commit | commitdiff | tree

Fiona Glaser [Thu, 31 Dec 2009 07:57:45 +0000 (02:57 -0500)]

Fix some invalid reads caught by valgrind
Temporal predictor calculation was misled by invalid reference counts for I-frames.

commit | commitdiff | tree

Fiona Glaser [Tue, 22 Dec 2009 23:59:29 +0000 (18:59 -0500)]

Periodic intra refresh
Uses SEI recovery points, a moving vertical "bar" of intra blocks, and motion vector restrictions to eliminate keyframes.
Attempt to hide the visual appearance of the intra bar when --no-psy isn't set.
Enabled with --intra-refresh.
The refresh interval is controlled using keyint, but won't exceed the number of macroblock columns in the frame.
Greatly benefits low-latency streaming by making it possible to achieve constant framesize without intra-only encoding.
Combined with slice-max size for one slice per packet, tests suggest effective resiliance against packet loss as high as 25%.
x264 is now the best free software low-latency video encoder in the world.

Accordingly, change the API to add b_keyframe to the parameters present in output pictures.
Calling applications should check this to see if a frame is seekable, not the frame type.

Also make x264's motion estimation strictly abide by horizontal MV range limits in order for PIR to work.
Also fix a major bug in sliced-threads VBV handling.
Also change "auto" threads for sliced threads to "cores" instead of "1.5*cores" after performance testing.
Also simplify ratecontrol's checking of first pass options.
Also some minor tweaks to row-based VBV that should improve VBV accuracy on small frames.

commit | commitdiff | tree

Kieran Kunhya [Mon, 28 Dec 2009 15:42:17 +0000 (10:42 -0500)]

LAVF/FFMS input support, native VFR timestamp handling
libx264 now takes three new API parameters.
b_vfr_input tells x264 whether or not the input is VFR, and is 1 by default.
i_timebase_num and i_timebase_den pass the timebase to x264.

x264_picture_t now returns the DTS of each frame: the calling app need not calculate it anymore.

Add libavformat and FFMS2 input support: requires libav* and ffms2 libraries respectively.
FFMS2 is _STRONGLY_ preferred over libavformat: we encourage all distributions to compile with FFMS2 support if at all possible.
FFMS2 can be found at http://code.google.com/p/ffmpegsource/.
--index, a new x264cli option, allows the user to store (or load) an FFMS2 index file for future use, to avoid re-indexing in the future.

Overhaul the muxers to pass through timestamps instead of assuming CFR.
Also overhaul muxers to correctly use b_annexb and b_repeat_headers to simplify the code.
Remove VFW input support, since it's now pretty much redundant with native AVS support and LAVF support.
Finally, overhaul a large part of the x264cli internals.

--force-cfr, a new x264cli option, allows the user to force the old method of timestamp handling. May be useful in case of a source with broken timestamps.
Avisynth, YUV, and Y4M input are all still CFR. LAVF or FFMS2 must be used for VFR support.

Do note that this patch does *not* add VFR ratecontrol yet.
Support for telecined input is also somewhat dubious at the moment.

Large parts of this patch by Mike Gurlitz <mike.gurlitz@gmail.com>, Steven Walters <kemuri9@gmail.com>, and Yusuke Nakamura <muken.the.vfrmaniac@gmail.com>.

commit | commitdiff | tree

Fiona Glaser [Wed, 16 Dec 2009 00:59:00 +0000 (16:59 -0800)]

More help typo fixes

commit | commitdiff | tree

Loren Merritt [Thu, 14 Jan 2010 03:07:30 +0000 (03:07 +0000)]

Fix x264_clz on inputs > 1<<31
(though x264 never generates such inputs)

commit | commitdiff | tree

Fiona Glaser [Sun, 13 Dec 2009 11:16:04 +0000 (03:16 -0800)]

Don't do sum/ssd analysis if weightp == 1
Typo fixes in comments and help.

commit | commitdiff | tree

Fiona Glaser [Sat, 12 Dec 2009 01:22:18 +0000 (17:22 -0800)]

Fix two bugs in 2-pass ratecontrol
last_qscale_for wasn't set during the 2pass init code.
abr_buffer was way too small in the case of multiple threads, so accordingly increase its buffer size based on the number of threads.
May significantly increase quality with many threads in 2-pass mode, especially in cases with extremely large I-frames, such as anime.

commit | commitdiff | tree

Steven Walters [Fri, 11 Dec 2009 03:48:51 +0000 (19:48 -0800)]

Avisynth-MT and 2.6 compatibility fixes
Explain to the user why YV12 conversion is forced with Avisynth 2.6.
Fix encoding with Avisynth-MT scripts by inserting the necessary Distributor() call; speeds such scripts back up to expected levels.

commit | commitdiff | tree

Steven Walters [Thu, 10 Dec 2009 00:03:19 +0000 (16:03 -0800)]

Fix zone parsing on mingw
Due to MinGW evidently being in the hands of a pack of phenomenal idiots, MinGW does not have strtok_r, a basic string function.
As such, remove the dependency on strtok_r in zone parsing.
Previously, using zones for anything other than ratecontrol failed.

commit | commitdiff | tree

Fiona Glaser [Wed, 9 Dec 2009 23:03:44 +0000 (15:03 -0800)]

More lookahead optimizations
Under subme 1, don't do any qpel search at all and round temporal MVs accordingly.
Drop internal subme with subme 1 to do fullpel predictor checks only.
Other minor optimizations.

commit | commitdiff | tree

Fiona Glaser [Wed, 9 Dec 2009 13:56:35 +0000 (05:56 -0800)]

Various minor missing changes from previous commits
Boolify sliced threads too
Remove unused constants from dct-a.asm
Fix a few typos/minor errors in preset documentation

commit | commitdiff | tree

Fiona Glaser [Fri, 11 Dec 2009 00:52:39 +0000 (16:52 -0800)]

Fix regression in direct=auto/temporal in r1364
Bug caused rare race condition in frame reference handling.
This resulted in invalid bitstreams in some B-frames and, very rarely, crashes.

commit | commitdiff | tree

Fiona Glaser [Wed, 9 Dec 2009 01:46:55 +0000 (17:46 -0800)]

Add fast pskip to x264 SEI info header

commit | commitdiff | tree

Steven Walters [Tue, 8 Dec 2009 19:36:25 +0000 (11:36 -0800)]

Minor seeking fix with Avisynth input
Seeking past the end of the input with --seek would result in the same frame being repeated over and over.

commit | commitdiff | tree

Fiona Glaser [Tue, 8 Dec 2009 11:08:17 +0000 (03:08 -0800)]

Add support for MB-tree + B-pyramid
Modify B-adapt 2 to consider pyramid in its calculations.
Generally results in many more B-frames being used when pyramid is on.
Modify MB-tree statsfile reading to handle the reordering necessary.
Make differing keyint or pyramid between passes into a fatal error.

commit | commitdiff | tree

Fiona Glaser [Tue, 8 Dec 2009 02:34:05 +0000 (18:34 -0800)]

Use aliasing-avoidance macros in array_non_zero

commit | commitdiff | tree

Cleo Saulnier [Mon, 7 Dec 2009 20:40:14 +0000 (12:40 -0800)]

MMX version of 8x8 interlaced zigzag
Just as fast as SSSE3 on Nehalem (and faster on Conroe/Penryn), so remove the SSSE3 version.

commit | commitdiff | tree

Fiona Glaser [Mon, 7 Dec 2009 08:49:41 +0000 (00:49 -0800)]

Bring back slice-based threading support
Enabled with --sliced-threads
Unlike normal threading, adds no encoding latency.
Less efficient than normal threading, both performance and compression-wise.
Useful for low-latency encoding environments where performance is still important, such as HD videoconferencing.
Add --tune zerolatency, which eliminates all x264 encoder-side latency (no delayed frames at all).
Some tweaks to VBV ratecontrol and lookahead (in addition to those required by sliced threading).
Commit sponsored by a media streaming company that wishes to remain anonymous.

commit | commitdiff | tree

Alex Jurkiewicz [Tue, 8 Dec 2009 02:17:29 +0000 (18:17 -0800)]

Add more detailed help for presets/tunes/profiles
Shows what options they represent.

commit | commitdiff | tree

Fiona Glaser [Sat, 5 Dec 2009 11:19:44 +0000 (03:19 -0800)]

qpel RD no longer needs mbcmp_unaligned

commit | commitdiff | tree

Loren Merritt [Wed, 9 Dec 2009 00:37:09 +0000 (00:37 +0000)]

ensure that all boolean options are {0,1} so they print consistently in the options SEI

commit | commitdiff | tree

Fiona Glaser [Sat, 5 Dec 2009 10:27:30 +0000 (02:27 -0800)]

Actually do r1356
Somehow commit r1356 got lost in the ether. I'm not sure how, but now it's fixed.

commit | commitdiff | tree

Steven Walters [Fri, 4 Dec 2009 20:17:56 +0000 (12:17 -0800)]

Remove some unused code from x264.c

commit | commitdiff | tree

Fiona Glaser [Thu, 3 Dec 2009 23:36:52 +0000 (15:36 -0800)]

SSSE3 version of zigzag_8x8_field
Slightly faster interlaced encoding with 8x8dct.
Helps most on Nehalem, somewhat disappointing on Conroe/Penryn.

commit | commitdiff | tree

Fiona Glaser [Thu, 3 Dec 2009 03:55:45 +0000 (19:55 -0800)]

Fix crash in interlaced with >8 refs
Crash introduced in weightp.

commit | commitdiff | tree

Fiona Glaser [Wed, 2 Dec 2009 00:15:15 +0000 (16:15 -0800)]

Significantly faster qpel-RD
Cache the results of MC, like in bidir-RD.
Slightly changes output due to the necessary reordering of satd/RD calls.
5-10% faster qpel-RD.

commit | commitdiff | tree

David Conrad [Tue, 1 Dec 2009 20:23:09 +0000 (12:23 -0800)]

Add x264 prefix to functions with ffmpeg equivalents
Not important now, but will be when we add libav* input support.

commit | commitdiff | tree

Fiona Glaser [Mon, 30 Nov 2009 09:41:24 +0000 (01:41 -0800)]

10L in r1353
Broke mp4 output.

commit | commitdiff | tree

Steven Walters [Fri, 27 Nov 2009 06:37:18 +0000 (22:37 -0800)]

Enhanced Avisynth input support
Requires avisynth_c.h from the Avisynth API headers.
Reports errors properly from Avisynth script input.
Automatically construct input scripts for almost any input file.
Tries ffmpegsource2, DSS2, directshowsource, and many other sourcing methods, based on the input file extension.
Automatically converts to YV12.

commit | commitdiff | tree

Fiona Glaser [Wed, 25 Nov 2009 18:40:08 +0000 (10:40 -0800)]

Much faster weightp
Move sum/ssd calculation out of lookahead and do it only once per frame.
Also various minor optimizations, cosmetics, and cleanups.

commit | commitdiff | tree

Kieran Kunhya [Wed, 25 Nov 2009 09:26:02 +0000 (01:26 -0800)]

Fix bugs in fps/timestamp handling in FLV muxer

commit | commitdiff | tree

Fiona Glaser [Wed, 25 Nov 2009 06:37:02 +0000 (22:37 -0800)]

Fix bug in weightp analysis
Weights weren't reset upon early terminations, so old (wrong) weights could stick around.
Small compression improvement.

commit | commitdiff | tree

Fiona Glaser [Wed, 25 Nov 2009 04:24:14 +0000 (20:24 -0800)]

Minor deblocking optimization, update comments

commit | commitdiff | tree

Fiona Glaser [Wed, 25 Nov 2009 00:21:07 +0000 (16:21 -0800)]

Fix weightb with delta_poc_bottom
Has no effect yet, but will be required once we add TFF/BFF signalling support in interlaced mode.
Gives 0.5-0.7% better compression with proper TFF/BFF signalling.

commit | commitdiff | tree

Fiona Glaser [Sat, 21 Nov 2009 07:27:51 +0000 (23:27 -0800)]

Give more meaningful error if 1st/2nd pass resolution differ

commit | commitdiff | tree

Steven Walters [Fri, 20 Nov 2009 20:04:13 +0000 (12:04 -0800)]

Fix extremely rare deadlock with sync-lookahead
Patch partially by Anton Mitrofanov.

commit | commitdiff | tree

Fiona Glaser [Fri, 20 Nov 2009 16:04:28 +0000 (08:04 -0800)]

Only print weightp stats if there were P-frames

commit | commitdiff | tree

Fiona Glaser [Wed, 18 Nov 2009 21:47:04 +0000 (13:47 -0800)]

Faster lookahead with subme=1
If it hasn't been clear already, don't use subme=1 as a "fast first pass" option.
Use subme=2 instead; 1 and below now enable a fast (lower quality) lookahead mode.

commit | commitdiff | tree

Fiona Glaser [Mon, 16 Nov 2009 23:23:58 +0000 (15:23 -0800)]

Faster weightp analysis
Modify pixel_var slightly to return the necessary information and use it for weight analysis instead of sad/ssd.
Various minor cosmetics.

commit | commitdiff | tree

Dylan Yudaken [Mon, 16 Nov 2009 00:14:50 +0000 (16:14 -0800)]

Fix two issues in weightp
If analysis decided on an offset of -128, x264 would create non-compliant streams.
Fix some cases with nearly all intra blocks where analysis could pick very weird weights.
Also add some asserts to check compliancy.

commit | commitdiff | tree

Alexander Strange [Sun, 15 Nov 2009 06:16:18 +0000 (22:16 -0800)]

Allow compilation with non-Apple GCC on OS X

commit | commitdiff | tree

Alexander Strange [Sun, 15 Nov 2009 06:13:28 +0000 (22:13 -0800)]

Use __attribute__((may_alias)) for type-punning
GCC thinks pointer casts to unions aren't valid with strict aliasing.
See http://gcc.gnu.org/onlinedocs/gcc-4.4.2/gcc/Optimize-Options.html#Type_002dpunning.
Also use M32() in y4m.c.
Enable -Wstrict-aliasing again since all such warnings are fixed.

commit | commitdiff | tree

Fiona Glaser [Sun, 15 Nov 2009 03:58:46 +0000 (19:58 -0800)]

100l in deadlock fix

commit | commitdiff | tree

Kieran Kunhya [Sun, 15 Nov 2009 03:01:09 +0000 (19:01 -0800)]

FLV muxing support

commit | commitdiff | tree

Fiona Glaser [Sun, 15 Nov 2009 02:40:22 +0000 (18:40 -0800)]

Fix rare deadlock introduced in weightp

commit | commitdiff | tree

Fiona Glaser [Thu, 12 Nov 2009 20:40:40 +0000 (12:40 -0800)]

Actually add -Wno-strict-aliasing to configure

commit | commitdiff | tree

Dylan Yudaken [Thu, 12 Nov 2009 15:03:46 +0000 (07:03 -0800)]

Various weightp fixes
Make weightp results match in threaded vs non-threaded mode.
Fix two-pass with slow-firstpass.

commit | commitdiff | tree

Fiona Glaser [Thu, 12 Nov 2009 13:25:32 +0000 (05:25 -0800)]

Fix all aliasing violations
New type-punning macros perform write/read-combining without aliasing violations per the second-to-last part of 6.5.7 in the C99 specification.
GCC 4.4, however, doesn't seem to have read this part of the spec and still warns about the violations.
Regardless, it seems to fix all known aliasing miscompilations, so perhaps the GCC warning generator is just broken.
As such, add -Wno-strict-aliasing to CFLAGS.

commit | commitdiff | tree

David Conrad [Thu, 12 Nov 2009 04:53:49 +0000 (20:53 -0800)]

Fix 10l in weightp on ARM

commit | commitdiff | tree

Fiona Glaser [Tue, 10 Nov 2009 05:22:41 +0000 (21:22 -0800)]

Fix one (of possibly many) miscompilations in weightp
Use NOINLINE and some emms calls to fix emms reordering issues.
This issue occurred with some GCC versions if threads > 1 and the phase of the moon was right.
Also a cosmetic in x264.c.

commit | commitdiff | tree

Fiona Glaser [Mon, 9 Nov 2009 17:18:03 +0000 (09:18 -0800)]

Fix pixel_ssd on win64
Didn't preserve XMM registers, may or may not have caused problems.

commit | commitdiff | tree

Steven Walters [Mon, 9 Nov 2009 06:18:35 +0000 (22:18 -0800)]

Fix weightp logfile parsing on MinGW

commit | commitdiff | tree

Loren Merritt [Mon, 9 Nov 2009 05:27:29 +0000 (05:27 +0000)]

cosmetics

commit | commitdiff | tree

David Conrad [Mon, 9 Nov 2009 04:12:54 +0000 (20:12 -0800)]

Fix weightp on ARM + PPC
No ARM or PPC assembly yet though.

commit | commitdiff | tree

Dylan Yudaken [Mon, 9 Nov 2009 01:59:08 +0000 (17:59 -0800)]

Weighted P-frame prediction
Merge Dylan's Google Summer of Code 2009 tree.
Detect fades and use weighted prediction to improve compression and quality.
"Blind" mode provides a small overall quality increase by using a -1 offset without doing any analysis, as described in JVT-AB033.
"Smart", the default mode, also performs fade detection and decides weights accordingly.
MB-tree takes into account the effects of "smart" analysis in lookahead, even further improving quality in fades.
If psy is on, mbtree is on, interlaced is off, and weightp is off, fade detection will still be performed.
However, it will be used to adjust quality instead of create actual weights.
This will improve quality in fades when encoding in Baseline profile.

Doesn't add support for interlaced encoding with weightp yet.
Only adds support for luma weights, not chroma weights.
Internal code for chroma weights is in, but there's no analysis yet.
Baseline profile requires that weightp be off.
All weightp modes may cause minor breakage in non-compliant decoders that take shortcuts in deblocking reference frame checks.
"Smart" may cause serious breakage in non-compliant decoders that take shortcuts in handling of duplicate reference frames.

Thanks to Google for sponsoring our most successful Summer of Code yet!

commit | commitdiff | tree

Steven Walters [Sun, 8 Nov 2009 19:53:48 +0000 (11:53 -0800)]

Fix assert failure in the case of forced i-frames
Note that this applies to non-IDR i-frames, not IDR-frames.
This fix is also required for future open-gop.

commit | commitdiff | tree

Steven Walters [Sun, 8 Nov 2009 01:07:28 +0000 (17:07 -0800)]

Fix issues relating to input/output files being pipes/FIFOs

commit | commitdiff | tree

David Conrad [Sat, 7 Nov 2009 17:25:18 +0000 (09:25 -0800)]

Various ARM-related fixes
Fix comment for mc_copy_neon.
Fix memzero_aligned_neon prototype.
Update NEON (i)dct_dc prototypes.
Duplicate x86 behavior for global+hidden functions.

commit | commitdiff | tree

Fiona Glaser [Wed, 4 Nov 2009 08:03:14 +0000 (00:03 -0800)]

Fix miscompilation with gcc 4.3 on ARM
Aliasing violation in spatial prediction caused nasty artifacts.
Shut up two other GCC warnings while we're at it.

commit | commitdiff | tree

Fiona Glaser [Wed, 4 Nov 2009 07:15:35 +0000 (23:15 -0800)]

Fix extremely rare infinite loop in 2-pass VBV
Implicit conversion from double->float lost enough precision to cause the loop termination condition to never trigger.
Bug report by Tal Aloni.

commit | commitdiff | tree

Anton Mitrofanov [Sun, 1 Nov 2009 02:51:14 +0000 (19:51 -0700)]

Fix large file support, broken in r1302

commit | commitdiff | tree

Fiona Glaser [Sat, 31 Oct 2009 01:58:03 +0000 (18:58 -0700)]

Dramatically reduce size of pixel_ssd_* asm functions
~10k of code size eliminated.

commit | commitdiff | tree

Loren Merritt [Sat, 7 Nov 2009 06:09:47 +0000 (06:09 +0000)]

fix bottom-right pixel of lowres planes, which was uninitialized.
weirdly, valgrind reported this only with --no-asm.

commit | commitdiff | tree

Fiona Glaser [Thu, 29 Oct 2009 19:28:37 +0000 (12:28 -0700)]

Further reduce code size in bime
~7-8 kilobytes saved, ~0.6% faster subme 9.

commit | commitdiff | tree

Anton Mitrofanov [Wed, 28 Oct 2009 19:57:11 +0000 (12:57 -0700)]

Fix case in which MB-tree didn't propagate all data correctly
Should improve quality in all cases.
Also some minor cosmetic improvements.

commit | commitdiff | tree

Fiona Glaser [Tue, 27 Oct 2009 23:01:46 +0000 (16:01 -0700)]

Take into account chroma MV offset during interlaced motion search
Small improvement in interlaced compression.

commit | commitdiff | tree

Fiona Glaser [Tue, 27 Oct 2009 22:08:37 +0000 (15:08 -0700)]

Slightly faster ssse3 width4 chroma MC
Cacheline-aware in the same fashion as width8, but not conditional.

commit | commitdiff | tree

Fiona Glaser [Tue, 27 Oct 2009 21:01:46 +0000 (14:01 -0700)]

Eliminate some rare cases where MB-tree gave incorrect results in B-frames
Also get rid of some unnecessary memcpies.

commit | commitdiff | tree

Anton Mitrofanov [Tue, 27 Oct 2009 19:28:07 +0000 (12:28 -0700)]

Fix cases in which b-adapt 1 could result in AUTO-type frames.
This didn't actually cause any issues, but it removes the need for the fixing-up code that prevented said issues.

commit | commitdiff | tree

Fiona Glaser [Mon, 26 Oct 2009 19:53:07 +0000 (12:53 -0700)]

Motion compensation optimizations
Turning off inlining saves a whole boatload of code size for near-zero speed cost.
Simplify offset calculation.
Various other optimizations.

commit | commitdiff | tree

Fiona Glaser [Mon, 26 Oct 2009 02:41:10 +0000 (19:41 -0700)]

Minor CAVLC optimizations

commit | commitdiff | tree

Loren Merritt [Sun, 25 Oct 2009 19:34:12 +0000 (19:34 +0000)]

cosmetics

commit | commitdiff | tree

Fiona Glaser [Sun, 25 Oct 2009 16:14:27 +0000 (09:14 -0700)]

ISC-license x86inc.asm
As the assembly abstraction layer is very useful in non-x264 projects, it is now ISC (simplified BSD) so that others, even in commercial projects, can use it as well.

commit | commitdiff | tree

Fiona Glaser [Fri, 23 Oct 2009 23:20:39 +0000 (16:20 -0700)]

Various minor CABAC optimizations

commit | commitdiff | tree

Lamont Alston [Fri, 23 Oct 2009 18:01:13 +0000 (11:01 -0700)]

Fix bug in b-pyramid strict
Bug caused invalid streams in some situations.

commit | commitdiff | tree

Fiona Glaser [Fri, 23 Oct 2009 09:34:49 +0000 (02:34 -0700)]

Remove non-mod16 warning
Compression only "suffers" by an extremely marginal amount and too many people misinterpret the warning.

commit | commitdiff | tree

Fiona Glaser [Fri, 23 Oct 2009 05:38:32 +0000 (22:38 -0700)]

Fix two warnings + some minor optimizations