granicus.if.org Git - libx264/log

Steven Walters [Fri, 4 Jun 2010 20:44:55 +0000 (13:44 -0700)]

Preprocessing cosmetics
Unify input/output defines to HAVE_* format.
Define values as 1 to simplify conditionals.
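
As a rough illustration of why defining flags as 1 simplifies conditionals: a flag that is defined as 1 can be tested with a plain `#if`, and several flags can be combined in one expression, because an undefined identifier evaluates to 0 inside `#if`. The macro and function names below are hypothetical and do not come from the x264 sources.

```c
/* Hypothetical feature flags for illustration only; a configure
 * script would normally emit these into a generated header. */
#define HAVE_EXAMPLE_INPUT 1
/* HAVE_EXAMPLE_OUTPUT is deliberately left undefined:
 * inside #if it evaluates to 0. */

/* Returns a bitmask of the features compiled in. */
static int example_features(void)
{
    int mask = 0;
#if HAVE_EXAMPLE_INPUT
    mask |= 1;                      /* input support is present */
#endif
#if HAVE_EXAMPLE_INPUT && HAVE_EXAMPLE_OUTPUT
    mask |= 2;                      /* both are present */
#endif
    return mask;
}
```

With `#ifdef`-style flags the second test would need nested `#ifdef` blocks or `defined(...)` expressions.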

Fiona Glaser [Fri, 4 Jun 2010 04:31:10 +0000 (21:31 -0700)]

Take more shortcuts in i4x4/i8x8 analysis
Based on the scores of the H and V modes, rule out modes which are unlikely.
Small compression loss (0.1-0.5%) and large speed gain (10-30% faster intra analysis).
Not enabled in slower encoding modes.

Also make C versions of the merged SATD functions in order to eliminate branches based on their availability.

Fiona Glaser [Wed, 2 Jun 2010 22:47:26 +0000 (15:47 -0700)]

Display SSIM measurement in dB as well

Anton Mitrofanov [Mon, 7 Jun 2010 21:03:03 +0000 (01:03 +0400)]

Make version.sh indicate "M" for local commits too

Alex Jurkiewicz [Sun, 6 Jun 2010 07:21:12 +0000 (15:21 +0800)]

Add error message for invalid [de]muxer selection

Nathan Caldwell [Sun, 6 Jun 2010 20:19:41 +0000 (14:19 -0600)]

Deduplicate the ALIGN macro, move it to common.h
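
For readers unfamiliar with this kind of macro, a typical round-up-to-alignment definition looks like the sketch below. It assumes the alignment is a power of two; the exact definition in common.h may differ, and `padded_stride` is a hypothetical helper added only to show usage.

```c
/* Round x up to the next multiple of a.
 * Only valid when a is a power of two. */
#define ALIGN(x, a) (((x) + ((a) - 1)) & ~((a) - 1))

/* Hypothetical helper: pad a row width to a 16-byte boundary. */
static int padded_stride(int width)
{
    return ALIGN(width, 16);
}
```

Keeping a single definition in a shared header avoids subtle differences between copies, such as missing parentheses around the arguments.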

David Conrad [Thu, 3 Jun 2010 23:02:24 +0000 (19:02 -0400)]

Fix a use of ALIGNED_ARRAY_16 on ARM

Fiona Glaser [Tue, 8 Jun 2010 22:41:17 +0000 (15:41 -0700)]

Add missing emms after nal_encode
Caused random, bizarre failures with some calling applications.

Fiona Glaser [Tue, 8 Jun 2010 22:38:32 +0000 (15:38 -0700)]

Fix crash in fake-interlaced at some resolutions

Yusuke Nakamura [Wed, 2 Jun 2010 13:27:57 +0000 (22:27 +0900)]

Fix no-mbtree + aq-mode=0

Regression in r1618.

Fiona Glaser [Wed, 2 Jun 2010 08:07:44 +0000 (01:07 -0700)]

Add API function to fix x264_picture_t initialization
Calling applications that do not use x264_picture_alloc need to use x264_picture_init to initialize x264_picture_t structures.
Previously, if the calling application didn't zero x264_picture_t, Bad Things could happen.
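
The pattern behind this change can be sketched without linking against libx264. The struct and function below are hypothetical stand-ins for x264_picture_t and x264_picture_init, and the field names are invented for illustration; the point is only that an init function gives every field a defined value, so a stack-allocated structure never carries garbage into the encoder.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a picture structure; the real
 * x264_picture_t has different and more numerous fields. */
typedef struct {
    int   type;      /* 0 = let the encoder choose */
    int   qp_plus1;  /* 0 = automatic */
    void *opaque;    /* user data passed through the encoder */
} example_picture_t;

/* Give every field a defined default value. */
static void example_picture_init(example_picture_t *pic)
{
    memset(pic, 0, sizeof(*pic));
    pic->opaque = NULL;   /* explicit, not relying on all-bits-zero */
}
```

A caller that fills in its own plane pointers would call the init function first and then set only the fields it cares about.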

Yusuke Nakamura [Wed, 2 Jun 2010 08:02:31 +0000 (17:02 +0900)]

Fix Avisynth input
Regression in r1624. A more permanent solution to the problem will be committed later.

Oskar Arvidsson [Wed, 2 Jun 2010 00:08:45 +0000 (02:08 +0200)]

Convert to a unified "dctcoeff" type for DCT data
Necessary for future high bit-depth support.

Oskar Arvidsson [Tue, 1 Jun 2010 23:35:38 +0000 (01:35 +0200)]

Convert to a unified "pixel" type for pixel data
Necessary for future high bit-depth support.
Various macros and extra types have been introduced to make operations on variable-size pixels more convenient.

Fiona Glaser [Fri, 28 May 2010 21:27:22 +0000 (14:27 -0700)]

Add API tool to apply arbitrary quantizer offsets
The calling application can now pass a "map" of quantizer offsets to apply to each frame.
An optional callback to free the map can also be included.
This allows all kinds of flexible region-of-interest coding and similar.

Fiona Glaser [Thu, 27 May 2010 21:27:32 +0000 (14:27 -0700)]

x86 assembly code for NAL escaping
Up to ~10x faster than C depending on CPU.
Helps the most at very high bitrates (e.g. lossless).
Also make the C code faster and simpler.
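
For context, NAL escaping in H.264 inserts an emulation prevention byte (0x03) whenever two zero bytes would otherwise be followed by a byte of value 3 or less, so that start codes cannot appear inside a payload. The following is a minimal, unoptimized sketch of that rule, not the code from this commit; the function name is hypothetical, and the caller is assumed to provide a destination buffer of at least len + len / 2 bytes.

```c
#include <stddef.h>
#include <stdint.h>

/* Copy src to dst, inserting 0x03 after any run of two zero bytes
 * that is followed by a byte <= 0x03. Returns the escaped length.
 * dst must hold at least len + len / 2 bytes. */
static size_t example_nal_escape(uint8_t *dst, const uint8_t *src, size_t len)
{
    size_t out = 0;
    int zeros = 0;                 /* consecutive zero bytes written */

    for (size_t i = 0; i < len; i++) {
        if (zeros >= 2 && src[i] <= 0x03) {
            dst[out++] = 0x03;     /* emulation prevention byte */
            zeros = 0;
        }
        dst[out++] = src[i];
        zeros = (src[i] == 0) ? zeros + 1 : 0;
    }
    return out;
}
```

The byte-at-a-time loop makes it plausible that this step becomes noticeable at very high bitrates, where nearly all of the output passes through it.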

Fiona Glaser [Fri, 28 May 2010 21:30:07 +0000 (14:30 -0700)]

Re-enable i8x8 merged SATD
Accidentally got disabled when intra_sad_x3 was added.

Henrik Gramner [Sun, 30 May 2010 20:45:14 +0000 (22:45 +0200)]

Some deblocking-related optimizations

Henrik Gramner [Thu, 27 May 2010 20:18:38 +0000 (22:18 +0200)]

Optimize out some x264_scan8 reads

Fiona Glaser [Thu, 27 May 2010 17:42:15 +0000 (10:42 -0700)]

Add fast skip in lookahead motion search
Helps speed very significantly on motionless blocks.

Fiona Glaser [Wed, 26 May 2010 19:55:35 +0000 (12:55 -0700)]

Merge some of adaptive quant and weightp
Eliminate redundant work; both of them were calculating variance of the frame.

Fiona Glaser [Thu, 27 May 2010 19:31:41 +0000 (12:31 -0700)]

Fix omission in libx264 tuning documentation

Fiona Glaser [Sun, 30 May 2010 16:42:53 +0000 (09:42 -0700)]

Fix ultrafast to actually turn off weightb

Anton Mitrofanov [Mon, 31 May 2010 18:36:50 +0000 (22:36 +0400)]

Fix crash with MP4-muxing if zero frames were encoded

Fiona Glaser [Mon, 31 May 2010 18:14:22 +0000 (11:14 -0700)]

Fix cavlc+deblock+8x8dct (regression in r1612)
Add cavlc+8x8dct munging to new deblock system.
May have caused minor visual artifacts.

Fiona Glaser [Wed, 26 May 2010 19:40:31 +0000 (12:40 -0700)]

Fix 10L in r1612
Stats need to be calculated before deblock strength, not after.
Broke ref stats in x264cli (no effect on actual output).

Fiona Glaser [Tue, 25 May 2010 19:42:44 +0000 (12:42 -0700)]

Overhaul deblocking again
Move deblock strength calculation to immediately after encoding to take advantage of the data that's already in cache.
Keep the deblocking itself as per-row.

Fiona Glaser [Tue, 25 May 2010 23:13:59 +0000 (16:13 -0700)]

Detect Atom CPU, enable appropriate asm functions
I'm not going to actually optimize for this pile of garbage unless someone pays me.
But it can't hurt to at least enable the correct functions based on benchmarks.

Also save some cache on Intel CPUs that don't need the decimate LUT due to having fast bsr/bsf.

Fiona Glaser [Mon, 24 May 2010 18:13:22 +0000 (11:13 -0700)]

Slightly faster mbtree asm

Fiona Glaser [Fri, 21 May 2010 22:39:38 +0000 (15:39 -0700)]

Faster deblock strength asm on conroe/penryn

Fiona Glaser [Fri, 21 May 2010 21:32:13 +0000 (14:32 -0700)]

Avoid an extra var2 in chroma encoding if possible
Also remove a redundant if.

Fiona Glaser [Fri, 21 May 2010 20:07:12 +0000 (13:07 -0700)]

Avoid a redundant qpel check in lookahead with subme <= 1.

Anton Mitrofanov [Tue, 25 May 2010 15:11:42 +0000 (19:11 +0400)]

Fix ABR rate control calculations
Incorrect frame numbers were used, resulting in slightly inaccurate ratecontrol.

Anton Mitrofanov [Tue, 25 May 2010 14:45:16 +0000 (18:45 +0400)]

Fix calculation of total bitrate printed after stop by CTRL+C

Kieran Kunhya [Sat, 22 May 2010 13:32:53 +0000 (14:32 +0100)]

Fix typo in fake-interlaced documentation

Fiona Glaser [Wed, 26 May 2010 00:49:07 +0000 (17:49 -0700)]

Fix CABAC+PCM, regression in r1592
Changes to queue in CABAC didn't get propagated to PCM code.

Henrik Gramner [Fri, 21 May 2010 13:30:26 +0000 (15:30 +0200)]

Fix performance regression in r1582
Set the correct compiler flags.

Fiona Glaser [Tue, 18 May 2010 23:48:00 +0000 (16:48 -0700)]

Rewrite deblock strength calculation, add asm
Rewrite is significantly slower, but is necessary to make asm possible.
Similar concept to ffmpeg's deblock strength asm.
Roughly one order of magnitude faster than C.
Overall, with the asm, saves ~100-300 clocks in deblocking per MB.

Anton Mitrofanov [Fri, 21 May 2010 06:33:45 +0000 (10:33 +0400)]

Fix different output with differing sync-lookahead
Also reduce memory consumption.

Anton Mitrofanov [Tue, 18 May 2010 18:26:59 +0000 (22:26 +0400)]

Mark Win32 executable as large address aware

Kieran Kunhya [Thu, 20 May 2010 16:45:16 +0000 (17:45 +0100)]

Add "Fake interlaced" option
This encodes all frames progressively yet flags the stream as interlaced.
This makes it possible to encode valid 25p and 30p Blu-ray streams.
Also put the pulldown help section in a more appropriate place.

Alex Jurkiewicz [Thu, 20 May 2010 07:01:37 +0000 (15:01 +0800)]

Modify version.sh to output to stdout.
Update configure to match.

Henrik Gramner [Wed, 19 May 2010 21:09:58 +0000 (23:09 +0200)]

Set correct filesystem permissions for various files

Anton Mitrofanov [Wed, 19 May 2010 17:07:03 +0000 (21:07 +0400)]

Fix regression in r1566
Intra stats need to be kept track of for fast intra decision.

Fiona Glaser [Tue, 18 May 2010 18:53:32 +0000 (11:53 -0700)]

Fix rc-lookahead in encoding options SEI in 2-pass with VBV

Loren Merritt [Mon, 17 May 2010 21:08:37 +0000 (14:08 -0700)]

Reduce memory usage in 2-pass with b-adapt 2

Fiona Glaser [Sat, 15 May 2010 21:48:58 +0000 (14:48 -0700)]

Overhaul CABAC: faster, less cache usage
Horribly munge up the CABAC tables to allow deduplication of some data.
Saves 256 bytes of L1d cache in non-RD, 512 bytes in RD.
Add asm versions of bypass and terminal; save L1i cache by re-using putbyte code.
Further optimize encode_decision.
All 3 primary CABAC functions fit in under 256 bytes of code total on x86_64.

Kieran Kunhya [Thu, 13 May 2010 18:13:35 +0000 (19:13 +0100)]

Fix typo in pulldown

Anton Mitrofanov [Wed, 12 May 2010 18:05:34 +0000 (22:05 +0400)]

Fix bitrate calculation in progress status
Was slightly incorrect due to using pts, which is out of order.

Anton Mitrofanov [Tue, 11 May 2010 21:57:38 +0000 (01:57 +0400)]

Fix crash with sliced-threads on Phenom

Fiona Glaser [Tue, 11 May 2010 05:59:12 +0000 (22:59 -0700)]

Fix condition for printing rc=cbr in options SEI
Also fix crf-max formatting.

Henrik Gramner [Mon, 10 May 2010 21:27:36 +0000 (23:27 +0200)]

Shrink even more constant arrays

Fiona Glaser [Sat, 8 May 2010 19:07:13 +0000 (12:07 -0700)]

Add API function to trigger intra refresh
Useful for interactive applications where the encoder knows that packet loss has occurred on the client.
Full documentation is in x264.h.

Fiona Glaser [Sat, 8 May 2010 18:58:22 +0000 (11:58 -0700)]

Fix intra refresh behavior with I-frames
Intra refresh still allows I-frames (for scenecuts/etc).
Now I-frames count as a full refresh, as opposed to instantly triggering a refresh.

Anton Mitrofanov [Thu, 6 May 2010 17:03:31 +0000 (10:03 -0700)]

More cosmetics

Fiona Glaser [Thu, 6 May 2010 07:53:20 +0000 (00:53 -0700)]

Fix unresolved symbol in r1573
GNU ld didn't complain, but some other linkers did.

Steven Walters [Wed, 5 May 2010 23:54:04 +0000 (19:54 -0400)]

Remove unnecessary --enable options
Change --enable-visualize to actually check for X11 support.

Fiona Glaser [Tue, 4 May 2010 04:27:16 +0000 (21:27 -0700)]

Don't force row QPs to integer values with VBV
VBV should no longer raise the bitrate of the video. That is, at a given quality level or average bitrate, turning on VBV should only lower the bitrate.
This isn't quite true if adaptive quant is off, but nobody should be doing that anyways.
Also may result in slightly more accurate per-row VBV ratecontrol.

James Darnley [Sun, 2 May 2010 23:30:50 +0000 (16:30 -0700)]

Add field-order detection to y4m demuxer

Fiona Glaser [Sun, 2 May 2010 18:45:15 +0000 (11:45 -0700)]

Fix sliced-threads + interlaced
Broken in r1546.

Fiona Glaser [Sun, 2 May 2010 18:41:36 +0000 (11:41 -0700)]

Improve temporal MV prediction
Predict based on the results of p16x16 search, not final MVs.
This lets us get predictions even if mode decision chose intra.
Also improves cache coherency.

Fiona Glaser [Sun, 2 May 2010 02:34:14 +0000 (19:34 -0700)]

More accurate MV prediction on edges in lookahead

Fiona Glaser [Sun, 2 May 2010 02:32:01 +0000 (19:32 -0700)]

Error out on invalid input stride
Might catch some crashes due to buggy calling applications.

Fiona Glaser [Sat, 1 May 2010 07:18:01 +0000 (00:18 -0700)]

Remove unnecessary debugging assert
Shouldn't have been in r1568 to begin with.

Fiona Glaser [Fri, 30 Apr 2010 20:45:50 +0000 (13:45 -0700)]

Shrink some more constant arrays

Fiona Glaser [Fri, 30 Apr 2010 18:36:19 +0000 (11:36 -0700)]

Deduplicate asm constants, automate name prefixing
Auto-prefix global constants with x264_ in cextern.
Eliminate x264_ prefix from asm files; automate it in cglobal.
Deduplicate asm constants wherever possible to save data cache (move them to a new const-a.asm).
Remove x264_emms() entirely on non-x86 (don't even call an empty function).
Add cextern_naked for a non-prefixed cextern (used in checkasm).

Fiona Glaser [Fri, 30 Apr 2010 16:57:55 +0000 (09:57 -0700)]

Shrink a few x86 asm functions
Add a few more instructions to cut down on the use of the 4-byte addressing mode.

Fiona Glaser [Fri, 30 Apr 2010 02:53:59 +0000 (19:53 -0700)]

Make options SEI use weight* instead of wpred*
More intuitive and maps more reasonably to the CLI options.
Breaks statsfile backwards-compatibility.

Loren Merritt [Thu, 29 Apr 2010 17:35:25 +0000 (17:35 +0000)]

r1548 broke subme < 3 + p8x8/b8x8
Caused significantly worse compression. Preset-wise, only affected veryfast.
Fixed by not modifying mvc in-place.

Henrik Gramner [Mon, 26 Apr 2010 23:44:33 +0000 (01:44 +0200)]

More write-combining

Fiona Glaser [Mon, 26 Apr 2010 22:10:11 +0000 (15:10 -0700)]

Reduce lookahead memory usage, cache misses
Merge lowres_types with lowres_costs.

Fiona Glaser [Sun, 25 Apr 2010 21:54:29 +0000 (14:54 -0700)]

Fix build on x86 with asm on but SSE off

Fiona Glaser [Sat, 24 Apr 2010 20:55:51 +0000 (13:55 -0700)]

Don't calculate ref/partition stats if not necessary

Fiona Glaser [Sat, 24 Apr 2010 20:07:18 +0000 (13:07 -0700)]

Split out MV prediction into mvpred.c
Make common/macroblock.c a bit less gigantic.

Loren Merritt [Sat, 24 Apr 2010 16:22:14 +0000 (16:22 +0000)]

Fix mv predictor clipping on non-x86 (regression in r1548)

Anton Mitrofanov [Fri, 23 Apr 2010 20:26:13 +0000 (00:26 +0400)]

Move getopt.c to x264cli sources from libx264
Only affects builds on systems without getopt.c.

Fiona Glaser [Thu, 22 Apr 2010 19:53:07 +0000 (12:53 -0700)]

Move deblocking code to a separate file
Should clean up frame.c a bit.

Steven Walters [Tue, 20 Apr 2010 23:48:02 +0000 (19:48 -0400)]

Fix ffms demuxer to support input timebase values > 2^31

Fiona Glaser [Tue, 20 Apr 2010 23:53:06 +0000 (16:53 -0700)]

Fix 10l in cache_load changes
Broke constrained intra pred, probably not anything else.

Fiona Glaser [Tue, 20 Apr 2010 23:50:13 +0000 (16:50 -0700)]

Faster fullpel predictor checking
Also shave a few instructions off dia/hex motion estimation loops.

Loren Merritt [Tue, 20 Apr 2010 09:40:49 +0000 (09:40 +0000)]

Fix checkasm's generation of deblock inputs (regression in r1517)

Loren Merritt [Tue, 20 Apr 2010 09:17:18 +0000 (09:17 +0000)]

Fix printing of bitrate when timestamps aren't available
Doesn't affect x264cli, but was broken in some other apps in CFR mode.

Fiona Glaser [Tue, 20 Apr 2010 07:46:29 +0000 (00:46 -0700)]

Don't check mv0 twice
One less SAD in motion estimation.
Also rename bmv -> pmv; more accurate naming.

Fiona Glaser [Mon, 19 Apr 2010 18:02:27 +0000 (11:02 -0700)]

Remove reordering restrictions from weightp
Apparently the spec does allow two consecutive copies of the same frame in the reference list.
This involves an incredibly ugly hack to wrap around the frame number.
Very slight compression improvement.

Fiona Glaser [Tue, 20 Apr 2010 06:34:03 +0000 (23:34 -0700)]

Print intra chroma pred modes in stats

Fiona Glaser [Mon, 19 Apr 2010 05:54:48 +0000 (22:54 -0700)]

Add mv0 special case in pskip chroma MC
Significantly faster pskip MC.

Francois Cartegnie [Sun, 18 Apr 2010 20:04:59 +0000 (13:04 -0700)]

Fix build scripts to work with non-GNU tools

Fiona Glaser [Sat, 17 Apr 2010 03:04:13 +0000 (20:04 -0700)]

Faster deblock reference frame checks
Use a lookup table to simplify logic.

Henrik Gramner [Fri, 16 Apr 2010 20:39:45 +0000 (22:39 +0200)]

Faster chroma CBP handling

Fiona Glaser [Fri, 16 Apr 2010 18:36:43 +0000 (11:36 -0700)]

Fix issues with extremely large timebases
With timebase denominators >= 2^30, x264 would silently overflow and cause odd issues.
Now x264 will explicitly fail with timebase denominators >= 2^31 and work with timebase denominators 2^31 > x >= 2^30.
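
The accepted range described here amounts to a simple bounds check. The sketch below is illustrative only: the function name is hypothetical, and rejecting a zero denominator is an added assumption rather than something stated in the message.

```c
#include <stdint.h>

/* Returns 1 if the timebase denominator is acceptable, 0 otherwise.
 * Values of 2^31 and above are rejected; values in [2^30, 2^31)
 * are accepted. Rejecting 0 is an assumption of this sketch. */
static int example_timebase_den_ok(uint64_t den)
{
    return den > 0 && den < (UINT64_C(1) << 31);
}
```

Failing loudly at 2^31 replaces the earlier behaviour in which large denominators overflowed without any diagnostic.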

Fiona Glaser [Fri, 16 Apr 2010 19:06:07 +0000 (12:06 -0700)]

MMX code for predictor rounding/clipping
Faster predictor checking at subme < 3.

Fiona Glaser [Fri, 16 Apr 2010 10:06:46 +0000 (03:06 -0700)]

Fix four minor bugs found by Clang

Fiona Glaser [Thu, 15 Apr 2010 23:32:31 +0000 (16:32 -0700)]

Move deblocking/hpel into sliced threads
Instead of doing both as a separate pass, do them during the main encode.
This requires disabling deblocking between slices (disable_deblock_idc == 2).
Overall performance gain is about 11% on --preset superfast with sliced threads.
Doesn't reduce the amount of actual computation done: only better parallelizes it.

Fiona Glaser [Wed, 14 Apr 2010 21:43:25 +0000 (14:43 -0700)]

Prefetch MB data in cache_load
Dramatically reduces L1 cache misses.
~10% faster cache_load.

Fiona Glaser [Fri, 23 Apr 2010 19:09:37 +0000 (19:09 +0000)]

Fix a ton of pessimization caused by aliasing in cache_save and cache_load

Fiona Glaser [Fri, 23 Apr 2010 19:09:18 +0000 (19:09 +0000)]

Add CP128/M128 macros using SSE

Fiona Glaser [Sun, 11 Apr 2010 20:36:50 +0000 (13:36 -0700)]

Fix various early terminations with slices
Neighbouring type values (type_top, etc) are now loaded even if the MB isn't available for prediction.
Significant overall performance increase (as high as 5-10%+) with lots of slices (e.g. with slice-max-size).

Anton Mitrofanov [Tue, 13 Apr 2010 17:25:42 +0000 (21:25 +0400)]

Enable --fast-pskip on fast firstpass

Steven Walters [Tue, 13 Apr 2010 12:44:37 +0000 (08:44 -0400)]

Make interlaced detection in avisynth only apply to field-based input
Fixes improper flagging of progressive sources.

Anton Mitrofanov [Tue, 13 Apr 2010 15:55:12 +0000 (19:55 +0400)]

Set psy=0 in lossless mode
Doesn't actually affect output, just what's written in the SEI.