]>
granicus.if.org Git - libx264/log
Fiona Glaser [Sun, 27 Apr 2008 09:10:28 +0000 (03:10 -0600)]
faster probe_skip
Loren Merritt [Tue, 22 Apr 2008 23:16:25 +0000 (17:16 -0600)]
drop support for pre-SSE3 assemblers
Loren Merritt [Fri, 25 Apr 2008 06:33:12 +0000 (00:33 -0600)]
s/x264_cpu_restore/x264_emms/
no point in giving it a generic name when it's not generic
Fiona Glaser [Sun, 27 Apr 2008 08:37:37 +0000 (02:37 -0600)]
faster cabac_mb_cbp_luma
ported from ffmpeg
Fiona Glaser [Sat, 26 Apr 2008 03:41:40 +0000 (21:41 -0600)]
remove some redundant nnz counts
move some nnz counts from macroblock_encode to cavlc if cabac doesn't need them
Fiona Glaser [Sat, 26 Apr 2008 02:43:57 +0000 (20:43 -0600)]
compute missing nnz count in subme7 cavlc
Fiona Glaser [Fri, 25 Apr 2008 07:47:47 +0000 (01:47 -0600)]
remove a division in macroblock-level bookkeeping
Fiona Glaser [Fri, 25 Apr 2008 00:55:30 +0000 (18:55 -0600)]
omit P/B-skip mc from macroblock_encode if the pixels haven't been overwritten since probe_skip
Loren Merritt [Thu, 24 Apr 2008 11:17:04 +0000 (05:17 -0600)]
earlier termination in SEA if mvcost exceeds residual
Loren Merritt [Tue, 22 Apr 2008 10:00:24 +0000 (04:00 -0600)]
remove void* arithmetic from r821
Guillaume Poirier [Fri, 25 Apr 2008 09:29:09 +0000 (11:29 +0200)]
Fix define of illegal function identifiers (as defined in section "7.1.3 Reserved identiers" of C99 spec)
Guillaume Poirier [Fri, 25 Apr 2008 08:50:48 +0000 (10:50 +0200)]
Fix define of illegal identifier (as defined in section "7.1.3 Reserved identiers" of C99 spec) "__UNUSED__", and use the one defined in common/osdep.h, i.e. "UNUSED"
based on a patch by Diego Biurrun
Guillaume Poirier [Thu, 24 Apr 2008 12:46:11 +0000 (14:46 +0200)]
more consistent include name (in line with other PPC includes)
Guillaume Poirier [Thu, 24 Apr 2008 12:44:24 +0000 (14:44 +0200)]
fix illegal identifiers in multiple inclusion guards
patch by Diego Biurrun % diego A biurrun P de %
Fiona Glaser [Tue, 22 Apr 2008 06:38:37 +0000 (00:38 -0600)]
AQ now treats perfectly flat blocks as low energy, rather than retaining previous block's QP.
fixes occasional blocking in fades.
Loren Merritt [Sun, 20 Apr 2008 18:19:46 +0000 (12:19 -0600)]
checkasm cabac
Loren Merritt [Sun, 20 Apr 2008 08:39:31 +0000 (02:39 -0600)]
s/movdqa/movaps/g
Loren Merritt [Mon, 21 Apr 2008 00:25:53 +0000 (18:25 -0600)]
--asm to allow testing of different versions of asm without recompile
Loren Merritt [Sat, 12 Apr 2008 07:40:28 +0000 (01:40 -0600)]
copy left neighbor pixels directly from previous mb instead of main plane
Fiona Glaser [Wed, 9 Apr 2008 22:30:34 +0000 (16:30 -0600)]
cacheline split workaround for mc_luma
Guillaume Poirier [Wed, 16 Apr 2008 08:46:15 +0000 (10:46 +0200)]
add "SECTION_RODATA" before "SECTION .text" to setup the fakegot label used in macho binaries.
This fixes compilation with --enable-pic
Requires Yasm 0.7.0 or newer
Patch by Dave Lee % davelee P com A gmail P com %
Loren Merritt [Sun, 13 Apr 2008 16:29:15 +0000 (10:29 -0600)]
more hpel fixes
Gabriel Bouvigne [Thu, 10 Apr 2008 14:59:19 +0000 (08:59 -0600)]
update msvc projectfile
Loren Merritt [Sat, 12 Apr 2008 00:48:30 +0000 (18:48 -0600)]
r810 borked hpel_filter_sse2 on unaligned buffers
Loren Merritt [Thu, 10 Apr 2008 09:17:53 +0000 (03:17 -0600)]
threads=auto on multicore now implies thread input, just like explicit thread numbers already did
Loren Merritt [Wed, 9 Apr 2008 02:16:50 +0000 (20:16 -0600)]
dct4 sse2
Loren Merritt [Tue, 8 Apr 2008 18:19:23 +0000 (12:19 -0600)]
faster x86_32 dct8
Loren Merritt [Mon, 7 Apr 2008 16:22:03 +0000 (10:22 -0600)]
macros to deal with macros that permute their arguments
Loren Merritt [Mon, 7 Apr 2008 14:24:40 +0000 (08:24 -0600)]
mmx cachesplit sad of non-square sizes checked height instead of width
Loren Merritt [Fri, 4 Apr 2008 07:07:40 +0000 (01:07 -0600)]
sfence after nontemporal stores
Loren Merritt [Wed, 2 Apr 2008 17:22:43 +0000 (11:22 -0600)]
simplify hpel filter asm (move control flow to C) and add sse2, ssse3 versions
Loren Merritt [Fri, 4 Apr 2008 02:46:36 +0000 (20:46 -0600)]
more mmx/xmm macros (mova, movu, movh)
Fiona Glaser [Wed, 2 Apr 2008 11:06:02 +0000 (05:06 -0600)]
improve handling of cavlc dct coef overflows
support large coefs in high profile, and clip to allowed range in baseline/main
Guillaume Poirier [Mon, 31 Mar 2008 08:50:45 +0000 (10:50 +0200)]
fix shared libs on MacOSX
based on a patch by İsmail Dönmez
Loren Merritt [Mon, 31 Mar 2008 08:27:53 +0000 (02:27 -0600)]
typo in r803
Loren Merritt [Mon, 31 Mar 2008 00:08:28 +0000 (18:08 -0600)]
fix a crash on mp4 muxing with invalid params
Fiona Glaser [Sun, 30 Mar 2008 23:58:41 +0000 (17:58 -0600)]
variance-based psy adaptive quantization
new options: --aq-mode --aq-strength
AQ is enabled by default
Zuxy Meng [Sun, 30 Mar 2008 00:04:23 +0000 (18:04 -0600)]
fix naming of .dll on mingw
Loren Merritt [Sat, 29 Mar 2008 23:53:36 +0000 (17:53 -0600)]
don't distinguish between mingw and cygwin
Fiona Glaser [Sat, 29 Mar 2008 22:27:54 +0000 (16:27 -0600)]
remove a memset
Loren Merritt [Sat, 29 Mar 2008 22:27:08 +0000 (16:27 -0600)]
typo. don't evaluate rd pskip when p16x16 found ref>0.
Loren Merritt [Sun, 30 Mar 2008 02:42:51 +0000 (20:42 -0600)]
r784 borked lossless dc zigzag
Loren Merritt [Tue, 25 Mar 2008 13:31:51 +0000 (07:31 -0600)]
fix an arithmetic overflow that disabled SEA threshold after finding a mv with SAD < mvcost.
Guillaume Poirier [Tue, 25 Mar 2008 16:30:04 +0000 (16:30 +0000)]
fix hpel_filter_altivec picked up by checkasm
Patch by Manuel %maaanuuu A gmx.net % and Noboru Asai % noboru P asai A gmail P com %
Loren Merritt [Tue, 25 Mar 2008 06:59:50 +0000 (00:59 -0600)]
faster residual
Loren Merritt [Tue, 25 Mar 2008 03:31:46 +0000 (21:31 -0600)]
nasm doesn't like align(nop) in structs
Loren Merritt [Tue, 25 Mar 2008 01:25:19 +0000 (19:25 -0600)]
reduce the size of some cabac arrays
Fiona Glaser [Tue, 25 Mar 2008 01:21:24 +0000 (19:21 -0600)]
use cabac context transition table from trellis in normal residual coding too
Fiona Glaser [Tue, 25 Mar 2008 01:12:07 +0000 (19:12 -0600)]
rearrange cabac struct to reduce code size
Fiona Glaser [Mon, 24 Mar 2008 09:25:25 +0000 (03:25 -0600)]
higher precision RD lambda
improves quality at QP<=12.
Loren Merritt [Mon, 24 Mar 2008 07:56:31 +0000 (01:56 -0600)]
faster cabac_encode_ue_bypass
Loren Merritt [Mon, 24 Mar 2008 04:14:18 +0000 (22:14 -0600)]
cabac asm.
mostly because gcc refuses to use cmov.
28% faster than c on core2, 11% on k8, 6% on p4.
Loren Merritt [Mon, 24 Mar 2008 04:08:07 +0000 (22:08 -0600)]
cosmetics in cabac
Loren Merritt [Sun, 23 Mar 2008 02:25:06 +0000 (20:25 -0600)]
inline cabac_size_decision
Loren Merritt [Sat, 22 Mar 2008 09:25:03 +0000 (03:25 -0600)]
cosmetics in DECLARE_ALIGNED
Loren Merritt [Sat, 22 Mar 2008 09:06:18 +0000 (03:06 -0600)]
don't distinguish between luma4x4 and luma4x4ac
Loren Merritt [Sat, 22 Mar 2008 08:46:31 +0000 (02:46 -0600)]
faster lossless zigzag
Loren Merritt [Sat, 22 Mar 2008 09:14:33 +0000 (03:14 -0600)]
more alignment
Loren Merritt [Sat, 22 Mar 2008 07:49:52 +0000 (01:49 -0600)]
add tesa and lossless to fprofile
Loren Merritt [Sat, 22 Mar 2008 07:46:43 +0000 (01:46 -0600)]
cosmetics in residual_write
Loren Merritt [Sat, 22 Mar 2008 05:24:33 +0000 (23:24 -0600)]
remove unused bitstream reader
Loren Merritt [Sat, 22 Mar 2008 00:58:46 +0000 (18:58 -0600)]
cosmetics in quant asm
Loren Merritt [Sat, 22 Mar 2008 00:46:29 +0000 (18:46 -0600)]
special case dequant for flat matrix
Loren Merritt [Fri, 21 Mar 2008 06:04:46 +0000 (00:04 -0600)]
faster dequant
Loren Merritt [Fri, 21 Mar 2008 04:08:07 +0000 (22:08 -0600)]
simplify hpel_filter_c
Loren Merritt [Fri, 21 Mar 2008 01:35:54 +0000 (19:35 -0600)]
use x264_mc_copy_w16_sse2 in mc.copy, it was previously only in mc_luma
Loren Merritt [Thu, 20 Mar 2008 20:00:08 +0000 (14:00 -0600)]
new ssd_8x*_sse2
align ssd_16x*_sse2
unroll ssd_4x*_mmx
Manuel Rommel [Thu, 20 Mar 2008 19:21:16 +0000 (13:21 -0600)]
update altivec zigzags
Loren Merritt [Thu, 20 Mar 2008 16:41:50 +0000 (10:41 -0600)]
r768 borked cavlc
Loren Merritt [Thu, 20 Mar 2008 06:52:11 +0000 (00:52 -0600)]
cosmetics in intra predict
Fiona Glaser [Thu, 20 Mar 2008 06:31:42 +0000 (00:31 -0600)]
faster intra predict 8x8 hu/hd
Loren Merritt [Thu, 20 Mar 2008 05:43:19 +0000 (23:43 -0600)]
reduce zigzag arrays from int to int16_t
Loren Merritt [Thu, 20 Mar 2008 05:42:20 +0000 (23:42 -0600)]
reduce the size of some arrays
Fiona Glaser [Wed, 19 Mar 2008 21:01:05 +0000 (15:01 -0600)]
skip intra pred+dct+quant in cases where it's redundant (analyse vs encode)
large speedup with trellis=2, small speedup with trellis=0 and/or subme>=6
Loren Merritt [Wed, 19 Mar 2008 20:03:34 +0000 (14:03 -0600)]
cosmetics in asm
Fiona Glaser [Wed, 19 Mar 2008 20:00:34 +0000 (14:00 -0600)]
satd_4x4_ssse3
Fiona Glaser [Wed, 19 Mar 2008 19:40:41 +0000 (13:40 -0600)]
get_ref_sse2
Fiona Glaser [Wed, 19 Mar 2008 01:17:22 +0000 (19:17 -0600)]
continue instead of crash when the threading mv constraint is violated.
doesn't fix the underlying bug, but hopefully less annoying until we find it.
Loren Merritt [Wed, 19 Mar 2008 00:24:01 +0000 (18:24 -0600)]
remove remaining reference to clip1.h
Loren Merritt [Tue, 18 Mar 2008 18:34:10 +0000 (12:34 -0600)]
fix name mangling again.
apparently it's not just a convention, dll build fails if you try to export a non-prefixed name.
Gabriel Bouvigne [Mon, 17 Mar 2008 21:44:40 +0000 (15:44 -0600)]
update msvc projectfile
Loren Merritt [Mon, 17 Mar 2008 21:41:59 +0000 (15:41 -0600)]
missing #ifdef HAVE_SSE3
Loren Merritt [Mon, 17 Mar 2008 21:41:30 +0000 (15:41 -0600)]
don't define offsetof since it's standard
Loren Merritt [Mon, 17 Mar 2008 07:23:35 +0000 (01:23 -0600)]
shut up gcc warning in offsetof
Håkan Hjort [Mon, 17 Mar 2008 07:20:02 +0000 (01:20 -0600)]
increase alignment of mv arrays
Fiona Glaser [Mon, 17 Mar 2008 05:58:04 +0000 (23:58 -0600)]
memcpy_aligned_sse2
Loren Merritt [Mon, 17 Mar 2008 04:40:43 +0000 (22:40 -0600)]
checkasm check whether callee-saved regs are correctly saved
x86_32 only for now since x86_64 varargs are annoying
Loren Merritt [Mon, 17 Mar 2008 04:28:20 +0000 (22:28 -0600)]
fix x86_32 ads which failed to preserve a register
Loren Merritt [Sun, 16 Mar 2008 22:34:41 +0000 (16:34 -0600)]
fix some name mangling issues introduced by the merge
Loren Merritt [Sun, 16 Mar 2008 21:30:40 +0000 (15:30 -0600)]
remove x264_mc_clip1.
it's wrong for sufficiently perverse inputs, and clip_uint8 is faster anyway.
Loren Merritt [Sun, 16 Mar 2008 19:54:58 +0000 (13:54 -0600)]
merge x86_32 and x86_64 asm, with macros to abstract calling convention and register names
Loren Merritt [Sun, 9 Mar 2008 11:58:55 +0000 (05:58 -0600)]
git compatible version script
Loren Merritt [Mon, 3 Mar 2008 00:53:01 +0000 (17:53 -0700)]
check for broken versions of yasm
Loren Merritt [Mon, 3 Mar 2008 00:27:38 +0000 (17:27 -0700)]
increase the alignment of the i8x8 edge cache, needed for sse2 intra prediction.
patch by Alexander Strange.
Loren Merritt [Sun, 2 Mar 2008 23:12:57 +0000 (16:12 -0700)]
.gitignore
Loren Merritt [Sun, 2 Mar 2008 03:04:07 +0000 (03:04 +0000)]
pic macros now keep track of which register holds the GOT, so variable access doesn't have to care
git-svn-id: svn://svn.videolan.org/x264/trunk@745
df754926 -b1dd-0310-bc7b-
ec298dee348c
Loren Merritt [Sun, 2 Mar 2008 02:27:45 +0000 (02:27 +0000)]
remove x86_64 predict_8x8_ddl_mmxext because sse2 is faster even on amd
git-svn-id: svn://svn.videolan.org/x264/trunk@744
df754926 -b1dd-0310-bc7b-
ec298dee348c
Loren Merritt [Sun, 2 Mar 2008 02:26:00 +0000 (02:26 +0000)]
cosmetics in dsp init
git-svn-id: svn://svn.videolan.org/x264/trunk@743
df754926 -b1dd-0310-bc7b-
ec298dee348c
Loren Merritt [Sun, 2 Mar 2008 02:11:12 +0000 (02:11 +0000)]
sse2 16x16 intra pred.
port the remaining intra pred functions from x86_64 to x86_32.
patch by Fiona Glaser.
git-svn-id: svn://svn.videolan.org/x264/trunk@742
df754926 -b1dd-0310-bc7b-
ec298dee348c
Loren Merritt [Sat, 1 Mar 2008 13:47:05 +0000 (13:47 +0000)]
some simplifications to mmx intra pred that should have been done way back when we switched to constant fdec_stride.
and remove pic spills in functions that have a free caller-saved reg.
patch partly by Fiona Glaser.
git-svn-id: svn://svn.videolan.org/x264/trunk@741
df754926 -b1dd-0310-bc7b-
ec298dee348c