granicus.if.org Git - libvpx/log

]> granicus.if.org Git - libvpx/log

projects / libvpx / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Marco Paniconi [Sat, 14 Jan 2023 03:46:10 +0000 (19:46 -0800)]

Fix to segfault for external resize test in vp9

Failure occurs for 1 pass non-realtime mode at speed 0.
Due to speed feautre rd_ml_partition.var_pruning, which
doesn't check for scaled reference in simple_motion_search().

Bug: webm:1768

Change-Id: Iddcb56033bac042faebb5196eed788317590b23f

commit | commitdiff | tree

James Zern [Tue, 10 Jan 2023 21:49:15 +0000 (13:49 -0800)]

build: replace egrep with grep -E

avoids a warning on some platforms:
egrep: warning: egrep is obsolescent; using grep -E

Bug: webm:1786
Change-Id: Ia434297731303aacb0b02cf3dcbfd8e03936485d
Fixed: webm:1786

commit | commitdiff | tree

Jonathan Wright [Thu, 5 Jan 2023 15:04:53 +0000 (15:04 +0000)]

Use Neon load/store helper functions consistently

Define all Neon load/store helper functions in mem_neon.h and use
them consistently in Neon convolution functions.

Change-Id: I57905bc0a3574c77999cf4f4a73442c3420fa2be

commit | commitdiff | tree

Jonathan Wright [Thu, 5 Jan 2023 12:20:03 +0000 (12:20 +0000)]

Use lane-referencing intrinsics in Neon convolution kernels

The Neon convolution helper functions take a pointer to a filter and
load the 8 values into a single Neon register. For some reason,
filter values 3 and 4 are then duplicated into their own separate
registers.

This patch modifies these helper functions so that they access filter
values 3 and 4 via the lane-referencing versions of the various Neon
multiply instructions. This reduces register pressure and tidies up
the source code quite a bit.

Change-Id: Ia4aeee8b46fe218658fb8577dc07ff04a9324b3e

commit | commitdiff | tree

Jerome Jiang [Wed, 21 Dec 2022 16:13:40 +0000 (11:13 -0500)]

Remove references to deprecated NumPy type aliases

This change replaces references to a number of deprecated NumPy type
aliases (np.bool, np.int, np.float, np.complex, np.object, np.str)
with their recommended replacement
(bool, int, float, complex, object, str).

NumPy 1.24 drops the deprecated aliases
so we must remove uses before updating NumPy.

Change-Id: I9f5dfcbb11fe6534fce358054f210c7653f278c3

commit | commitdiff | tree

Scott LaVarnway [Tue, 20 Dec 2022 23:43:44 +0000 (15:43 -0800)]

[x86]: Add vpx_highbd_comp_avg_pred_sse2().

C vs SSE2

4x4: 3.38x
8x8: 3.45x
16x16: 2.06x
32x32: 2.19x
64x64: 1.39x

Change-Id: I46638fe187b49a78fee554114fac51c485d74474

commit | commitdiff | tree

Scott LaVarnway [Fri, 16 Dec 2022 18:21:00 +0000 (10:21 -0800)]

Add vpx_highbd_comp_avg_pred_c() test.

Change-Id: I6b2c3379c49a62e56e5ac56fd4782a50b3c4e12a

commit | commitdiff | tree

Marco Paniconi [Wed, 14 Dec 2022 17:08:21 +0000 (17:08 +0000)]

Merge "rc-svc: Add tests for dynamic svc in external RC" into main

commit | commitdiff | tree

Marco Paniconi [Wed, 7 Dec 2022 08:17:22 +0000 (00:17 -0800)]

rc-svc: Add tests for dynamic svc in external RC

Test to verify RC for going down and back up in
spatial layers. Going back up has an issue so added
a TODO.

Make the test more flexible to handle dynamic layers.
Test for dyanmic change in temporal layers to follow.

Change-Id: Ic5542f7b274135277429e116f56ba54e682e96a0

commit | commitdiff | tree

Anton Venema [Tue, 13 Dec 2022 18:27:37 +0000 (10:27 -0800)]

Add additional ARM targets for Visual Studio.

configure: Add an armv7-win32-vs16 target
configure: Add an armv7-win32-vs17 target
configure: Add an arm64-win64-vs16 target
configure: Add an arm64-win64-vs17 target

Change-Id: I11d6cd6e51f7703939d6fd3fc6a7469591e3b09d

commit | commitdiff | tree

Cheng Chen [Tue, 13 Dec 2022 01:24:00 +0000 (01:24 +0000)]

Merge "L2E: Add a new interface to control rdmult" into main

commit | commitdiff | tree

Scott LaVarnway [Tue, 6 Dec 2022 21:13:30 +0000 (13:13 -0800)]

[x86]: Add vpx_highbd_subtract_block_avx2().

Up to 4x faster than "sse2 vectorized C".

Change-Id: Ie9b3c12a437c5cddf92c4d5349c4f659ca6b82ea

commit | commitdiff | tree

Scott LaVarnway [Tue, 6 Dec 2022 22:18:03 +0000 (14:18 -0800)]

Add vpx highbd subtract test.

Change-Id: I069ae0fe22bfc82ad5083df85a7fdf9058a285eb

commit | commitdiff | tree

Cheng Chen [Sat, 3 Dec 2022 02:04:32 +0000 (18:04 -0800)]

L2E: Add a new interface to control rdmult

Allow external model to control frame rdmult.

A function is called per frame to get the value of rdmult from
the external model.

The external rdmult will overwrite libvpx's default rdmult unless
a reserved value is selected.

A unit test is added to test when the default rdmult value is set.

Change-Id: I2f17a036c188de66dc00709beef4bf2ed86a919a

commit | commitdiff | tree

Marco Paniconi [Mon, 5 Dec 2022 22:30:40 +0000 (14:30 -0800)]

rc-rtc: Test for periodic key in SVC external RC

This test catches the fix merged in here:
https://chromium-review.googlesource.com/c/webm/libvpx/+/4022904

Change-Id: Ib68fbcba694b5d465a9faf3ca7d6880bfe8eabb3

commit | commitdiff | tree

Marco Paniconi [Mon, 5 Dec 2022 19:54:33 +0000 (11:54 -0800)]

rc-rtc: Remove frame_flags_ change in svc ratectril rtc test

SVC test is only in CBR and the frame_flags are
set by the SVC pattern, so we shouldn't undo them
for svc mode.

Change-Id: I5ffa65dd58a7b47f287d124d9e71ba1dc7c5a549

commit | commitdiff | tree

Marco Paniconi [Fri, 18 Nov 2022 04:16:26 +0000 (04:16 +0000)]

Merge "vp9/rate_ctrl_rtc: Improve get cyclic refresh data" into main

commit | commitdiff | tree

Hirokazu Honda [Thu, 17 Nov 2022 07:05:28 +0000 (16:05 +0900)]

vp9/rate_ctrl_rtc: Improve get cyclic refresh data

A client of the vp9 rate controller needs to know whether the
segmentation is enabled and the size of delta_q. It is also nicer to
know the size of map. This CL changes the interface to achieve these.

Bug: b:259487065
Test: Build

Change-Id: If05854530f97e1430a7b97788910f277ab673a87

commit | commitdiff | tree

Marco Paniconi [Tue, 15 Nov 2022 21:45:07 +0000 (21:45 +0000)]

Merge "vp9-svc: Fixes to make SVC work with VBR" into main

commit | commitdiff | tree

Marco Paniconi [Tue, 15 Nov 2022 06:11:19 +0000 (22:11 -0800)]

vp9-svc: Fixes to make SVC work with VBR

Prior to this CL SVC with VBR mode was broken.
Fixes made here to make VBR rate control work for SVC.
Rename is_one_pass_cbr_svc() --> is_one_pass_svc(),
as it can be used now for both CBR and VBR.

Added rate targetting unittest for (2SL, 3TL).

Bug: chromium:1375111
Change-Id: I5a62ffe7fbea29dc5949c88a284768386b1907a9

commit | commitdiff | tree

James Zern [Tue, 15 Nov 2022 19:19:43 +0000 (19:19 +0000)]

Merge "[NEON] Optimize FHT functions, add highbd FHT 4x4" into main

commit | commitdiff | tree

Johann [Mon, 14 Nov 2022 08:59:45 +0000 (17:59 +0900)]

quantize: remove vp9_regular_quantize_b_4x4

This was just a helper function which called vpx_quantize_b or
vpx_highbd_quantize_b. It also checked for skip_block, which was
necessary when webm:1439 was filed but does not appear to be
necessary now.

Removes a quantize variant and makes subsequent cleanups easier.

Change-Id: Ibe545eccd19370f07ff26c8e151f290c642efd2a

commit | commitdiff | tree

Konstantinos Margaritis [Wed, 9 Nov 2022 09:30:58 +0000 (09:30 +0000)]

[NEON] Optimize FHT functions, add highbd FHT 4x4

Refactor & optimize FHT functions further, use new butterfly functions
4x4 5% faster, 8x8 & 16x16 10% faster than previous versions.
Highbd 4x4 FHT version 2.27x faster than C version for --rt.

Change-Id: I3ebcd26010f6c5c067026aa9353cde46669c5d94

commit | commitdiff | tree

Marco Paniconi [Fri, 11 Nov 2022 02:50:19 +0000 (18:50 -0800)]

vp9-rc: Fix key frame setting in external RC

Bug: b/257368998

Change-Id: I03e35915ac99b50cb6bdf7bce8b8f9ec5aef75b7

commit | commitdiff | tree

James Zern [Mon, 7 Nov 2022 21:48:50 +0000 (21:48 +0000)]

Merge "Add Neon implementation of vpx_hadamard_32x32" into main

commit | commitdiff | tree

Sam James [Sun, 6 Nov 2022 04:11:59 +0000 (04:11 +0000)]

build: fix -Wimplicit-int (Clang 16)

Clang 16 will make -Wimplicit-int error by default which can, in addition to
other things, lead to some configure tests silently failing/returning the wrong result.

Fixes this error:
```
+/var/tmp/portage/media-libs/libvpx-1.12.0/temp/vpx-conf-1802-30624.c:1:15: error: type specifier missing, defaults to 'int'; ISO C99 and later do not support implicit int [-Wimplicit-int]
```

For more information, see LWN.net [0] or LLVM's Discourse [1], gentoo-dev@ [2],
or the (new) c-std-porting mailing list [3].

[0] https://lwn.net/Articles/913505/
[1] https://discourse.llvm.org/t/configure-script-breakage-with-the-new-werror-implicit-function-declaration/65213
[2] https://archives.gentoo.org/gentoo-dev/message/dd9f2d3082b8b6f8dfbccb0639e6e240
[3] hosted at lists.linux.dev.

Bug: https://bugs.gentoo.org/879705
Change-Id: Id73a98944ab3c99a368b9da7a5e902ddff9d937f
Signed-off-by: Sam James <sam@gentoo.org>

commit | commitdiff | tree

Andrew Salkeld [Thu, 13 Oct 2022 15:28:41 +0000 (16:28 +0100)]

Add Neon implementation of vpx_hadamard_32x32

Add an Arm Neon implementation of vpx_hadamard_32x32 and use it
instead of the scalar C implementation.

Also add test coverage for the new Neon implementation.

Change-Id: Iccc018eec4dbbe629fb0c6f8ad6ea8554e7a0b13

commit | commitdiff | tree

Konstantinos Margaritis [Wed, 26 Oct 2022 22:09:32 +0000 (22:09 +0000)]

[NEON] Optimize highbd 32x32 DCT

For --best quality, resulting function
vpx_highbd_fdct32x32_rd_neon takes 0.27% of cpu time in
profiling, vs 6.27% for the sum of scalar functions:
vpx_fdct32, vpx_fdct32.constprop.0, vpx_fdct32x32_rd_c for rd.
For --rt quality, the function takes 0.19% vs 4.57% for the scalar
version.
Overall, this improves encoding time by ~6% compared for highbd
for --best and ~9% for --rt.

Change-Id: I1ce4bbef6e364bbadc76264056aa3f86b1a8edc5

commit | commitdiff | tree

James Zern [Wed, 2 Nov 2022 02:21:18 +0000 (02:21 +0000)]

Merge "[NEON] Optimize and homogenize Butterfly DCT functions" into main

commit | commitdiff | tree

Konstantinos Margaritis [Wed, 26 Oct 2022 21:37:31 +0000 (21:37 +0000)]

[NEON] Optimize and homogenize Butterfly DCT functions

Provide a set of commonly used Butterfly DCT functions for use in
DCT 4x4, 8x8, 16x16, 32x32 functions. These are provided in various
forms, using vqrdmulh_s16/vqrdmulh_s32 for _fast variants, which
unfortunately are only usable in pass1 of most DCTs, as they do not
provide the necessary precision in pass2.
This gave a performance gain ranging from 5% to 15% in 16x16 case.
Also, for 32x32, the loads were rearranged, along with the butterfly
optimizations, this gave 10% gain in 32x32_rd function.
This refactoring was necessary to allow easier porting of highbd
32x32 functions -follows this patchset.

Change-Id: I6282e640b95a95938faff76c3b2bace3dc298bc3

commit | commitdiff | tree

Johann Koenig [Thu, 27 Oct 2022 08:38:48 +0000 (08:38 +0000)]

Merge "MacOS 13 is darwin22" into main

commit | commitdiff | tree

Johann Koenig [Thu, 27 Oct 2022 08:38:18 +0000 (08:38 +0000)]

Merge "rtcd: allow disabling neon on armv8" into main

commit | commitdiff | tree

Johann [Thu, 27 Oct 2022 02:40:19 +0000 (11:40 +0900)]

MacOS 13 is darwin22

Bug: webm:1783
Change-Id: I97d94ab8c8aebe13aedb58e280dc37474814ad5d

commit | commitdiff | tree

Johann [Wed, 26 Oct 2022 23:49:37 +0000 (08:49 +0900)]

rtcd: allow disabling neon on armv8

Change-Id: Idef943775456eb95b46be5c92c114c1d215f38d7

commit | commitdiff | tree

Johann [Wed, 26 Oct 2022 08:14:21 +0000 (17:14 +0900)]

mailmap: add johann@duck.com

Change-Id: I3b48951e69ba1f4a9fafdbb81fac48f79587a342

commit | commitdiff | tree

James Zern [Tue, 25 Oct 2022 19:16:46 +0000 (19:16 +0000)]

Merge changes I36545ff4,Id1aa29da into main

* changes:
vp9_highbd_quantize_fp*_neon: normalize fn param name
highbd_sad_avx2: normalize function param names

commit | commitdiff | tree

James Zern [Tue, 25 Oct 2022 19:16:08 +0000 (19:16 +0000)]

Merge "SAD*Test: mark virtual Run() as overridden" into main

commit | commitdiff | tree

Johann Koenig [Tue, 25 Oct 2022 13:26:37 +0000 (13:26 +0000)]

Merge "quantize: consolidate sse2 conditionals" into main

commit | commitdiff | tree

Johann Koenig [Tue, 25 Oct 2022 13:26:22 +0000 (13:26 +0000)]

Merge "vp9 quantize: rewrite ssse3 in intrinsics" into main

commit | commitdiff | tree

James Zern [Mon, 24 Oct 2022 22:37:26 +0000 (15:37 -0700)]

SAD*Test: mark virtual Run() as overridden

this comes from AbstractBench

Change-Id: Ie0b5a26a68bfbffd80f132125d15a1bdfc990c22

commit | commitdiff | tree

James Zern [Mon, 24 Oct 2022 22:28:47 +0000 (15:28 -0700)]

vp9_highbd_quantize_fp*_neon: normalize fn param name

count -> n_coeffs. aligns the name with the rtcd header; clears a
clang-tidy warning

Change-Id: I36545ff479df92b117c95e494f16002e6990f433

commit | commitdiff | tree

James Zern [Mon, 24 Oct 2022 22:24:51 +0000 (15:24 -0700)]

highbd_sad_avx2: normalize function param names

(src|ref)8_ptr -> (src|ref)_ptr. aligns the names with the rtcd header;
clears some clang-tidy warnings

Change-Id: Id1aa29da8c0fa5860b46ac902f5b2620c0d3ff54

commit | commitdiff | tree

Marco Paniconi [Tue, 18 Oct 2022 05:36:25 +0000 (22:36 -0700)]

Fix to VP8 external RC for buffer levels

On a dynamic change of temporal layers:
starting/maimum/optimal were being set twice,
causing incorrect large values.

Bug: b/253927937
Change-Id: I204e885cff92530336a9ed9a4363c486c5bf80ae

commit | commitdiff | tree

Johann [Mon, 17 Oct 2022 07:22:23 +0000 (16:22 +0900)]

quantize: consolidate sse2 conditionals

Change-Id: I43de579e30f2967b97064063e29676e0af1a498f

commit | commitdiff | tree

Johann [Sat, 1 Oct 2022 02:47:05 +0000 (11:47 +0900)]

vp9 quantize: rewrite ssse3 in intrinsics

Change-Id: I3177251a5935453a23a23c39ea5f6fd41254775e

commit | commitdiff | tree

Marco Paniconi [Sat, 15 Oct 2022 01:56:46 +0000 (01:56 +0000)]

Merge "Fix to VP8 external RC for dynamic update of layers" into main

commit | commitdiff | tree

Marco Paniconi [Wed, 12 Oct 2022 07:10:47 +0000 (00:10 -0700)]

Fix to VP8 external RC for dynamic update of layers

On change/update of rc_cfg: when number of temporal
layers change call vp8_reset_temporal_layer_change(),
which in turn will call vp8_init_temporal_layer_context()
only for the new layers.

Bug:b/249644737

Change-Id: Ib20d746c7eacd10b78806ca6a5362c750d9ca0b3

commit | commitdiff | tree

Konstantinos Margaritis [Thu, 13 Oct 2022 15:19:46 +0000 (15:19 +0000)]

[NEON] fix clang compile warnings

Change-Id: Ib7ce7a774ec89ba51169ea64d24c878109ef07d1

commit | commitdiff | tree

Scott LaVarnway [Thu, 13 Oct 2022 11:31:51 +0000 (11:31 +0000)]

Merge "Add vpx_highbd_sad64x{64,32}_avg_avx2." into main

commit | commitdiff | tree

Konstantinos Margaritis [Fri, 7 Oct 2022 15:13:29 +0000 (15:13 +0000)]

[NEON] Add highbd FDCT 16x16 function

90-95% faster than C version in best/rt profiles

Change-Id: I41d5e9acdc348b57153637ec736498a25ed84c25

commit | commitdiff | tree

James Zern [Wed, 12 Oct 2022 20:07:51 +0000 (20:07 +0000)]

Merge "[NEON] Add highbd FDCT 8x8 function" into main

commit | commitdiff | tree

Scott LaVarnway [Wed, 12 Oct 2022 19:50:55 +0000 (19:50 +0000)]

Merge "Add vpx_highbd_sad32x{64,32,16}_avg_avx2." into main

commit | commitdiff | tree

Scott LaVarnway [Wed, 12 Oct 2022 19:44:44 +0000 (19:44 +0000)]

Merge "Add vpx_highbd_sad16x{32,16,8}_avg_avx2." into main

commit | commitdiff | tree

Konstantinos Margaritis [Thu, 6 Oct 2022 16:00:43 +0000 (16:00 +0000)]

[NEON] Add highbd FDCT 8x8 function

50% faster than C version in best/rt profiles

Change-Id: I0f9504ed52b5d5f7722407e91108ed4056d66bc2

commit | commitdiff | tree

Scott LaVarnway [Wed, 12 Oct 2022 17:26:43 +0000 (10:26 -0700)]

Add vpx_highbd_sad64x{64,32}_avg_avx2.

~2.8x faster than the sse2 version.

Bug: b/245917257

Change-Id: Ib727ba8a8c8fa4df450bafdde30ed99fd283f06d

commit | commitdiff | tree

Konstantinos Margaritis [Thu, 6 Oct 2022 14:53:56 +0000 (14:53 +0000)]

[NEON] Add highbd FDCT 4x4 function

~80% faster than C version for both best/rt profiles.

Change-Id: Ibb3c8e1862131d2a020922420d53c66b31d5c2c3

commit | commitdiff | tree

Scott LaVarnway [Wed, 12 Oct 2022 13:05:46 +0000 (06:05 -0700)]

Add vpx_highbd_sad32x{64,32,16}_avg_avx2.

2.1x to 2.8x faster than the sse2 version.

Bug: b/245917257

Change-Id: I1aaffa4a1debbe5559784e854b8fc6fba07e5000

commit | commitdiff | tree

Scott LaVarnway [Mon, 10 Oct 2022 15:38:44 +0000 (08:38 -0700)]

Add vpx_highbd_sad16x{32,16,8}_avg_avx2.

1.6x to 2.1x faster than the sse2 version.

Bug: b/245917257

Change-Id: I56c467a850297ae3abcca4b4843302bb8d5d0ac1

commit | commitdiff | tree

Konstantinos Margaritis [Thu, 6 Oct 2022 13:05:01 +0000 (13:05 +0000)]

[NEON] Move helper functions for reuse

Move all butterfly functions to fdct_neon.h
Slightly optimize load/scale/cross functions
in fdct 16x16.
These will be reused in highbd variants.

Change-Id: I28b6e0cc240304bab6b94d9c3f33cca77b8cb073

commit | commitdiff | tree

Scott LaVarnway [Mon, 10 Oct 2022 20:34:02 +0000 (20:34 +0000)]

Merge "SADavgTest: Add speed test." into main

commit | commitdiff | tree

Scott LaVarnway [Mon, 10 Oct 2022 19:20:37 +0000 (12:20 -0700)]

SADavgTest: Add speed test.

Change-Id: Ie14c0f6d15f410adf749f7ab74cf9f2bf35f3d5f

commit | commitdiff | tree

Konstantinos Margaritis [Thu, 6 Oct 2022 10:58:27 +0000 (10:58 +0000)]

[NEON] move transpose_8x8 to reuse

Change-Id: I3915b6c9971aedaac9c23f21fdb88bc271216208

commit | commitdiff | tree

James Zern [Mon, 10 Oct 2022 18:37:05 +0000 (18:37 +0000)]

Merge "[NEON] highbd partial DCT functions" into main

commit | commitdiff | tree

Konstantinos Margaritis [Thu, 6 Oct 2022 10:26:05 +0000 (10:26 +0000)]

[NEON] highbd partial DCT functions

Change-Id: I7dd4e698469562f5b1f948cc36f8403b490dcb6a

commit | commitdiff | tree

Scott LaVarnway [Fri, 7 Oct 2022 12:53:50 +0000 (05:53 -0700)]

Add vpx_highbd_sad64x{64,32}_avx2.

~2.8x faster than the sse2 version.

Bug: b/245917257

Change-Id: Ibc8e5d030ec145c9a9b742fff98fbd9131c9ede4

commit | commitdiff | tree

Johann Koenig [Fri, 7 Oct 2022 08:17:03 +0000 (08:17 +0000)]

Merge "vp9 quantize: change index" into main

commit | commitdiff | tree

Scott LaVarnway [Wed, 5 Oct 2022 21:03:55 +0000 (14:03 -0700)]

Add vpx_highbd_sad32x{64,32,16}_avx2.

2.7x to 3.1x faster than the sse2 version.

Bug: b/245917257

Change-Id: Idff3284932f7ee89d036f38893205bf622a159a3

commit | commitdiff | tree

Scott LaVarnway [Wed, 5 Oct 2022 14:04:27 +0000 (07:04 -0700)]

Add vpx_highbd_sad16x{32,16,8}_avx2.

1.9x to 2.4x faster than the sse2 version.

Bug: b/245917257

Change-Id: I686452772f9b72233930de2207af36a0cd72e0bb

commit | commitdiff | tree

Cheng Chen [Tue, 4 Oct 2022 16:15:49 +0000 (16:15 +0000)]

Merge "L2E: Rework recode decisions for external max frame size and q" into main

commit | commitdiff | tree

Johann [Sat, 1 Oct 2022 02:18:09 +0000 (11:18 +0900)]

vp9 quantize: change index

In assembly it made sense to iterate using n_coeffs.
In intrinsics it's just as fast to use index and
easier to read.

Change-Id: I403c959709309dad68123d0a3d0efe183874543d

commit | commitdiff | tree

Scott LaVarnway [Mon, 19 Sep 2022 12:09:23 +0000 (05:09 -0700)]

vpx_subpixel_8t_intrin_avx2.c: quiet -Wuninitialized

warning: ‘s2[3]’ may be used uninitialized
and
warning: ‘s1[3]’ may be used uninitialized

The warnings exposed unused code.

Change-Id: I75cf1f9db75e811cb42e2f143be1ad76f3e4dee9

commit | commitdiff | tree

Scott LaVarnway [Mon, 26 Sep 2022 23:18:04 +0000 (23:18 +0000)]

Merge "vp9_rd.c quiet -Wstringop-overflow" into main

commit | commitdiff | tree

Johann [Sat, 24 Sep 2022 01:53:05 +0000 (10:53 +0900)]

quantize: standardize vp9_quantize_fp_sse2

Match style for vpx_quantize_b_sse2 and prepare to rewrite
ssse3 version in intrinsics.

Need to evaluate the value of threshold breakout before
going further.

Change-Id: I9cfceb1bb0dc237cd6b73fc8d41d78bba444a15b

commit | commitdiff | tree

Scott LaVarnway [Fri, 23 Sep 2022 16:17:18 +0000 (09:17 -0700)]

vp9_rd.c quiet -Wstringop-overflow

../libvpx/vp9/encoder/vp9_rd.c:594:20: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=]
  594 |         t_above[i] = !!*(const uint32_t *)&above[i];
      |         ~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../libvpx/vp9/encoder/vp9_rd.c:572:47: note: at offset [64, 254] into destination object ‘t_above’ of size [0, 16]
  572 |                               ENTROPY_CONTEXT t_above[16],
      |                               ~~~~~~~~~~~~~~~~^~~~~~~~~~~

Change-Id: Ie9ef24e685af417cdd35f6aa7284805e422b6ae2

commit | commitdiff | tree

Johann [Sat, 24 Sep 2022 01:55:52 +0000 (10:55 +0900)]

quantize: add untested function

vp9_quantize_fp_sse2 was only tested in non-hbd
configuration. Missed when fixing this for
vpx_quantize_b_sse2.

Change-Id: Ide346e5727d74281c774f605c90d280050e0bf62

commit | commitdiff | tree

Johann [Fri, 16 Sep 2022 23:47:28 +0000 (08:47 +0900)]

quantize: increase iscan by 1

All of the assembly adds 1 to iscan to convert from
a 0 based array to the EOB value.

Add 1 to all iscan values and remove the extra
instructions from the assembly.

Change-Id: I219dd7f2bd10533ab24b206289565703176dc5e9

commit | commitdiff | tree

Scott LaVarnway [Wed, 21 Sep 2022 23:41:42 +0000 (23:41 +0000)]

Merge "resize_test.cc: quiet -Wmaybe-uninitialized" into main

commit | commitdiff | tree

Scott LaVarnway [Wed, 21 Sep 2022 19:15:16 +0000 (12:15 -0700)]

resize_test.cc: quiet -Wmaybe-uninitialized

warning: ‘expected_w’ may be used uninitialized
Change-Id: I915efd82d3263250cea90391345f7683c1330fc8

commit | commitdiff | tree

Scott LaVarnway [Wed, 21 Sep 2022 20:53:07 +0000 (20:53 +0000)]

Merge "post_proc_sse2.c: quiet -Wuninitialized" into main

commit | commitdiff | tree

Scott LaVarnway [Wed, 21 Sep 2022 18:37:04 +0000 (11:37 -0700)]

post_proc_sse2.c: quiet -Wuninitialized

In file included from ../libvpx/vpx_dsp/x86/post_proc_sse2.c:12:
In function ‘_mm_add_epi16’,
    inlined from ‘vpx_mbpost_proc_down_sse2’ at ../libvpx/vpx_dsp/x86/post_proc_sse2.c:88:13:
/usr/lib/gcc/x86_64-linux-gnu/12/include/emmintrin.h:1060:35: warning: ‘below_context’ may be used uninitialized [-Wmaybe-uninitialized]
1060 |   return (__m128i) ((__v8hu)__A + (__v8hu)__B);
      |                                   ^~~~~~~~~~~
../libvpx/vpx_dsp/x86/post_proc_sse2.c: In function ‘vpx_mbpost_proc_down_sse2’:
../libvpx/vpx_dsp/x86/post_proc_sse2.c:39:13: note: ‘below_context’ was declared here
   39 |     __m128i below_context;

Change-Id: I2fc592f121c4e85d0aff1640014c3444f5eb09fd

commit | commitdiff | tree

James Zern [Tue, 20 Sep 2022 23:24:44 +0000 (23:24 +0000)]

Merge "CHECK_MEM_ERROR: add an assert for a valid jmp target" into main

commit | commitdiff | tree

Johann Koenig [Tue, 20 Sep 2022 00:12:13 +0000 (00:12 +0000)]

Merge "quantize: test lowbd in highbd builds" into main

commit | commitdiff | tree

Johann [Sun, 18 Sep 2022 01:26:00 +0000 (10:26 +0900)]

quantize: test lowbd in highbd builds

Change-Id: I7af273e979415a8b8cafb7494728d2736862f4a5

commit | commitdiff | tree

Johann [Fri, 16 Sep 2022 22:54:40 +0000 (07:54 +0900)]

fwd_txfm: remove avx2 file from non-hbd

Resolves warning on OS X:
file: libvpx_g.a(fwd_txfm_avx2.c.o) has no symbols

Change-Id: Ie8b290bb3ed329656beb883d552c98353f1ed5e5

commit | commitdiff | tree

Cheng Chen [Wed, 14 Sep 2022 18:40:50 +0000 (11:40 -0700)]

L2E: Rework recode decisions for external max frame size and q

Allow to handle external q and external max frame size separately.
Rely on libvpx's decision to catch overshoot/undershoot and recode frames.

Previously, when external max frame size is set, we didn't handle
undershoot cases, and now we fall back to libvpx's decision to
recode a frame if overshoot/undershoot is seen.

Change-Id: Ic3eee042cfe104b528c5f2c6c82b98dd5d8fa8ca

commit | commitdiff | tree

Scott LaVarnway [Wed, 14 Sep 2022 10:36:46 +0000 (03:36 -0700)]

Add vpx_highbd_sad64x{64,32}x4d_avx2.

~2x faster than the sse2 version.

Bug: b/245917257

Change-Id: I4742950ab7b90d7f09e8d4687e1e967138acee39

commit | commitdiff | tree

Scott LaVarnway [Mon, 12 Sep 2022 14:40:39 +0000 (07:40 -0700)]

Add vpx_highbd_sad32x{64,32,16}x4d_avx2.

~2.4x faster than the sse2 version.

Bug: b/245917257

Change-Id: I6df2bd62b46e5e175c8ad80daa6de3a1c313db0f

commit | commitdiff | tree

James Zern [Sat, 28 May 2022 04:53:49 +0000 (21:53 -0700)]

CHECK_MEM_ERROR: add an assert for a valid jmp target

callers of CHECK_MEM_ERROR() expect failures to not return

tested with:
configure --enable-debug --enable-vp9-postproc --enable-postproc \
--enable-multi-res-encoding --enable-vp9-temporal-denoising \
--enable-error-concealment

--enable-internal-stats has unrelated assertion failures currently

Change-Id: Ic12073b1ae80a6f434f14d24f652e64d30f63eea

commit | commitdiff | tree

Scott LaVarnway [Mon, 12 Sep 2022 12:18:19 +0000 (12:18 +0000)]

Merge "Add vpx_highbd_sad16x{32,16,8}x4d_avx2." into main

commit | commitdiff | tree

Wan-Teh Chang [Thu, 8 Sep 2022 22:35:13 +0000 (15:35 -0700)]

Update third_party/googletest to v1.12.1

See https://github.com/google/googletest/releases/tag/release-1.12.1.

Modeled after https://aomedia-review.googlesource.com/c/aom/+/162601.

Change-Id: If0ced3097b4c8490985e3381aaac9b3266d52ae7

commit | commitdiff | tree

Scott LaVarnway [Thu, 8 Sep 2022 20:05:55 +0000 (13:05 -0700)]

Add vpx_highbd_sad16x{32,16,8}x4d_avx2.

1.98x to 2.3x faster than the sse2 version.

Bug: b/245917257

Change-Id: Ie4f9bb942ffaf4af7d395fb5a5978b41aabfc93c

commit | commitdiff | tree

James Zern [Thu, 8 Sep 2022 01:41:13 +0000 (18:41 -0700)]

vp8_decode: declare 2 variables volatile

fixes -Wclobbered warnings with gcc 12.1.0:
vp8/vp8_dx_iface.c|278 col 16| warning: variable 'w' might be clobbered
by 'longjmp' or 'vfork' [-Wclobbered]
vp8/vp8_dx_iface.c|278 col 19| warning: variable 'h' might be clobbered
by 'longjmp' or 'vfork' [-Wclobbered]

Change-Id: Ib2c606a3450188d7869c066cacaf5615d9746181

commit | commitdiff | tree

James Zern [Tue, 6 Sep 2022 22:23:30 +0000 (22:23 +0000)]

Merge "x86,cosmetics: prefer _mm_setzero_si128/_mm256_setzero_si256" into main

commit | commitdiff | tree

James Zern [Fri, 2 Sep 2022 23:55:43 +0000 (16:55 -0700)]

sad_neon: enable UDOT implementation w/aarch32

Change-Id: Ia28305ec5c61518b732cbacbd102acd2cb7f9d82

commit | commitdiff | tree

James Zern [Fri, 2 Sep 2022 23:44:14 +0000 (16:44 -0700)]

variance_neon.cc: simplify __ARM_FEATURE_DOTPROD check

missed in
447e27588 vpx_dsp,neon: simplify __ARM_FEATURE_DOTPROD check

+ fix #if comments

only check that the macro is defined, the value doesn't have any effect.

from https://arm-software.github.io/acle/main/acle.html:

5.5.7.7.  Dot Product extension
  __ARM_FEATURE_DOTPROD is defined if the dot product data manipulation
  instructions are supported and the vector intrinsics are available.
  Note that this implies:
    - __ARM_NEON == 1

Change-Id: I098b96421b7de5928bb3b11612ca1f32e7b6cbc4

commit | commitdiff | tree

James Zern [Fri, 2 Sep 2022 23:17:52 +0000 (16:17 -0700)]

x86,cosmetics: prefer _mm_setzero_si128/_mm256_setzero_si256

over *_set1_*(0)

Change-Id: I136e1798a2ce286480ebb9418db67a2f1e92b9a2

commit | commitdiff | tree

James Zern [Fri, 2 Sep 2022 19:17:20 +0000 (12:17 -0700)]

vpx_dsp,neon: simplify __ARM_FEATURE_DOTPROD check

only check that the macro is defined, the value doesn't have any effect.

from https://arm-software.github.io/acle/main/acle.html:

5.5.7.7.  Dot Product extension
  __ARM_FEATURE_DOTPROD is defined if the dot product data manipulation
  instructions are supported and the vector intrinsics are available.
  Note that this implies:
    - __ARM_NEON == 1

Change-Id: I164fe121ccefda99050a9b6a99738a2b518520f3

commit | commitdiff | tree

James Zern [Fri, 2 Sep 2022 01:47:50 +0000 (18:47 -0700)]

neon,load_unaligned_*: use dup for lane 0

this produces better assembly with gcc (11.3.0-3); no change in assembly
using clang from the r24 android sdk (Android (8075178, based on
r437112b) clang version 14.0.1
(https://android.googlesource.com/toolchain/llvm-project
8671348b81b95fc603505dfc881b45103bee1731)

Change-Id: Ifec252d4f499f23be1cd94aa8516caf6b3fbbc11

commit | commitdiff | tree

James Zern [Wed, 31 Aug 2022 23:35:08 +0000 (16:35 -0700)]

test/*,cosmetics: normalize void parameter lists

replace (void) with (); use of this synonym is more common in C++ code.

Change-Id: I9813e82234dc9caa7115918a0491b0040f6afaf4

commit | commitdiff | tree

Yaowu Xu [Tue, 30 Aug 2022 16:04:58 +0000 (09:04 -0700)]

Remove const for pass-by-value parameters

This also fixes MSVC compiler warnings.

Change-Id: I20dc9ac821275ba95598f3016fc6b23e884e13b7

Unnamed repository; edit this file 'description' to name the repository.