]> granicus.if.org Git - llvm/log
llvm
5 years ago[GlobalISel] Import patterns containing SUBREG_TO_REG
Jessica Paquette [Wed, 28 Aug 2019 20:12:31 +0000 (20:12 +0000)]
[GlobalISel] Import patterns containing SUBREG_TO_REG

Reuse the logic for INSERT_SUBREG to also import SUBREG_TO_REG patterns.

- Split `inferSuperRegisterClass` into two functions, one which tries to use
  an existing TreePatternNode (`inferSuperRegisterClassForNode`), and one that
  doesn't. SUBREG_TO_REG doesn't have a node to leverage, which is the cause
  for the split.

- Rename GlobalISelEmitterInsertSubreg.td to GlobalISelEmitterSubreg.td and
  update it.

- Update impacted tests in the AArch64 and X86 backends.

This is kind of a hit/miss for code size improvements/regressions. E.g. in
add-ext.ll, we now get some identity copies. This isn't really anything the
importer can handle, since it's caused by a later pass introducing the copy for
the sake of correctness.

Differential Revision: https://reviews.llvm.org/D66769

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370254 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agogn build: Merge r370249
Nico Weber [Wed, 28 Aug 2019 19:38:59 +0000 (19:38 +0000)]
gn build: Merge r370249

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370251 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[AMDGPU] Fix bug when calculating user_spgr_count for Code Object V3 assembler
Scott Linder [Wed, 28 Aug 2019 19:38:15 +0000 (19:38 +0000)]
[AMDGPU] Fix bug when calculating user_spgr_count for Code Object V3 assembler

Stop counting explicitly disabled user_spgr's in the user_sgpr_count field of the kernel descriptor.

Differential Revision: https://reviews.llvm.org/D66900

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370250 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[InstCombine] clean up wrap propagation for reassociated ops; NFCI
Sanjay Patel [Wed, 28 Aug 2019 18:58:06 +0000 (18:58 +0000)]
[InstCombine] clean up wrap propagation for reassociated ops; NFCI

Always true/false checks were flagged by static analysis;
https://bugs.llvm.org/show_bug.cgi?id=43143

I have not confirmed the logic difference in propagating nsw vs. nuw,
but presumably we would have noticed a bug by now if that was wrong.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370248 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[ValueMapper] NFC: Remove dead code to pause metadata mapping
Pirama Arumuga Nainar [Wed, 28 Aug 2019 17:43:14 +0000 (17:43 +0000)]
[ValueMapper] NFC: Remove dead code to pause metadata mapping

Summary:
This functionality was added when Mapper::mapMetadata was recursive.  It
is no longer needed after r265456, which switched it to be iterative.

Reviewers: dexonsmith, srhines

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66860

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370236 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[X86][ReleaseNotes] Add a note about the switch to widening legalization for narrow...
Craig Topper [Wed, 28 Aug 2019 17:18:56 +0000 (17:18 +0000)]
[X86][ReleaseNotes] Add a note about the switch to widening legalization for narrow vectors.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370233 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Attributor] Regularly clear dependences to remove spurious ones
Johannes Doerfert [Wed, 28 Aug 2019 16:58:52 +0000 (16:58 +0000)]
[Attributor] Regularly clear dependences to remove spurious ones

As dependences between abstract attributes can become stale, e.g., if
one was sufficient to imply another one at some point but it has since
been wakened to the point it is not usable for the formerly implied one.
To weed out spurious dependences, and thereby eliminate unneeded
updates, we introduce an option to determine how often the dependence
cache is cleared and recomputed during the fixpoint iteration.

Note that the initial value was determined such that we see a positive
result on our tests.

Differential Revision: https://reviews.llvm.org/D63315

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370230 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[FPEnv] Add fptosi and fptoui constrained intrinsics.
Kevin P. Neal [Wed, 28 Aug 2019 16:33:36 +0000 (16:33 +0000)]
[FPEnv] Add fptosi and fptoui constrained intrinsics.

This implements constrained floating point intrinsics for FP to signed and
unsigned integers.

Quoting from D32319:
The purpose of the constrained intrinsics is to force the optimizer to
respect the restrictions that will be necessary to support things like the
STDC FENV_ACCESS ON pragma without interfering with optimizations when
these restrictions are not needed.

Reviewed by: Andrew Kaylor, Craig Topper, Hal Finkel, Cameron McInally, Roman Lebedev, Kit Barton
Approved by: Craig Topper
Differential Revision: http://reviews.llvm.org/D63782

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370228 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[AArch64][GlobalISel] Fall back when translating musttail calls
Jessica Paquette [Wed, 28 Aug 2019 16:19:01 +0000 (16:19 +0000)]
[AArch64][GlobalISel] Fall back when translating musttail calls

These are currently translated as normal functions calls in AArch64.

Until we have proper tail call lowering, we shouldn't translate these.

Differential Revision: https://reviews.llvm.org/D66842

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370225 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoReduce scope of variable only used in a local pattern match. NFCI.
Simon Pilgrim [Wed, 28 Aug 2019 16:06:33 +0000 (16:06 +0000)]
Reduce scope of variable only used in a local pattern match. NFCI.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370224 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Added more tests for D66651
David Bolvansky [Wed, 28 Aug 2019 16:00:15 +0000 (16:00 +0000)]
[NFC] Added more tests for D66651

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370222 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[InstCombine] Disable recursion in foldGEPICmp for vector pointer GEPs
Craig Topper [Wed, 28 Aug 2019 15:40:34 +0000 (15:40 +0000)]
[InstCombine] Disable recursion in foldGEPICmp for vector pointer GEPs

Due to missing vector support in this function, recursion can
generate worse code in some cases.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370221 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoFix uninitialized variable warning in cppcheck. NFCI.
Simon Pilgrim [Wed, 28 Aug 2019 15:19:49 +0000 (15:19 +0000)]
Fix uninitialized variable warning in cppcheck. NFCI.

InstCombiner::MaxArraySizeForCombine is set outside the constructor so we need to ensure it has a default initialization value.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370220 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Added a comment to avoid possible confusion
David Bolvansky [Wed, 28 Aug 2019 15:04:48 +0000 (15:04 +0000)]
[NFC] Added a comment to avoid possible confusion

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370217 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[AMDGPU] Adjust number of SGPRs available in Calling Convention
Ryan Taylor [Wed, 28 Aug 2019 15:00:45 +0000 (15:00 +0000)]
[AMDGPU] Adjust number of SGPRs available in Calling Convention

This reduces the number of SGPRs due to some concerns about running
out of SGPRs if you make all the SGPRs that aren't reserved available
for the calling convention.

Change-Id: Idb4ca4dc72f5b6808cb524ff7270915a8de5b4c1

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370215 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRemove duplicate 'BitWidth' variable. NFCI.
Simon Pilgrim [Wed, 28 Aug 2019 14:37:44 +0000 (14:37 +0000)]
Remove duplicate 'BitWidth' variable. NFCI.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370212 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Attributor] Restrict liveness and return information to functions
Johannes Doerfert [Wed, 28 Aug 2019 14:09:14 +0000 (14:09 +0000)]
[Attributor] Restrict liveness and return information to functions

Summary:
Until we have proper call-site information we should not recompute
liveness and return information for each call site. This patch directly
uses the function versions and introduces TODOs at the usage sites.

The required iterations to get to the fixpoint are most of the time
reduced by this change and we always avoid work duplication.

Reviewers: sstefan1, uenoku

Subscribers: hiraditya, bollu, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66562

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370208 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoInstCombiner::visitSelectInst - rename Pred to MinMaxPred to stop shadow variable...
Simon Pilgrim [Wed, 28 Aug 2019 14:05:38 +0000 (14:05 +0000)]
InstCombiner::visitSelectInst - rename Pred to MinMaxPred to stop shadow variable warning. NFCI.

We have a lot of Predicate variables, all similarly named....

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370207 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoReland "[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same...
Vlad Tsyrklevich [Wed, 28 Aug 2019 14:04:09 +0000 (14:04 +0000)]
Reland "[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same time."

This relands this commit, I mistakenly reverted the original change
thinking it was the cause of the observed MSan failures but it was not.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370206 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[SelectionDAG] Don't generate libcalls for wide shifts on Windows (PR42711)
Hans Wennborg [Wed, 28 Aug 2019 13:55:10 +0000 (13:55 +0000)]
[SelectionDAG] Don't generate libcalls for wide shifts on Windows (PR42711)

Neither libgcc or compiler-rt are usually used on Windows, so these
functions can't be called.

Differential revision: https://reviews.llvm.org/D66880

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370204 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[X86] Add test for rotate combining when add X, X is used instead of shl X, 1. NFC
Amaury Sechet [Wed, 28 Aug 2019 13:52:32 +0000 (13:52 +0000)]
[X86] Add test for rotate combining when add X, X is used instead of shl X, 1. NFC

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370203 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert "[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same...
Vlad Tsyrklevich [Wed, 28 Aug 2019 13:15:08 +0000 (13:15 +0000)]
Revert "[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same time."

This reverts commit r370032, it was causing check-llvm failures on
sanitizer-x86_64-linux-bootstrap-msan

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370198 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[DAGCombine] Fix cppcheck shadow variable warning. NFCI.
Simon Pilgrim [Wed, 28 Aug 2019 12:48:41 +0000 (12:48 +0000)]
[DAGCombine] Fix cppcheck shadow variable warning. NFCI.

We already have an outer Ops variable.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370197 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[mips] Use less registers to load address of TargetExternalSymbol
Simon Atanasyan [Wed, 28 Aug 2019 12:35:53 +0000 (12:35 +0000)]
[mips] Use less registers to load address of TargetExternalSymbol

There is no pattern matched `add hi, (MipsLo texternalsym)`. As a result,
loading an address of 32-bit symbol requires two registers and one more
additional instruction:
```
addiu $1, $zero, %lo(foo)
lui   $2, %hi(foo)
addu  $25, $2, $1
```

This patch adds the missed pattern and enables generation more effective
set of instructions:
```
lui   $1, %hi(foo)
addiu $25, $1, %lo(foo)
```

Differential Revision: https://reviews.llvm.org/D66771

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370196 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[TargetLowering] Add buildLegalVectorShuffle facility to help build legal shuffles
Amaury Sechet [Wed, 28 Aug 2019 12:00:06 +0000 (12:00 +0000)]
[TargetLowering] Add buildLegalVectorShuffle facility to help build legal shuffles

Summary: There are at least 2 ways to express the same shuffle. Various pieces of code explicit check for both option, but other places do not when they would benefit from doing it. This patches refactor the codebase to use buildLegalVectorShuffle in order to make that behavior more consistent.

Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri

Subscribers: javed.absar, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66804

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370190 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[DAGCombine] Remove LoadedSlice::Cost default 'ForCodeSize' constructor arguments...
Simon Pilgrim [Wed, 28 Aug 2019 11:50:36 +0000 (11:50 +0000)]
[DAGCombine] Remove LoadedSlice::Cost default 'ForCodeSize' constructor arguments. NFCI.

These were always being passed in and it allowed me to add the explicit tag to stop a cppcheck warning about 1 argument constructors.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370189 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agogn build: Merge r370187
Nico Weber [Wed, 28 Aug 2019 11:42:20 +0000 (11:42 +0000)]
gn build: Merge r370187

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370188 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[ARM] Move MVEVPTBlockPass to a separate file. NFC
David Green [Wed, 28 Aug 2019 11:37:31 +0000 (11:37 +0000)]
[ARM] Move MVEVPTBlockPass to a separate file. NFC

This just pulls the MVEVPTBlockPass into a separate file, as opposed to being
wrapped up in Thumb2ITBlockPass.

Differential revision: https://reviews.llvm.org/D66579

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370187 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[MVE] VMOVX patterns
David Green [Wed, 28 Aug 2019 10:13:23 +0000 (10:13 +0000)]
[MVE] VMOVX patterns

This adds fp16 VMOVX patterns, using the same patterns as rL362482 with some
adjustments for MVE. It allows us to move fp16 registers without going into and
out of gprs.

VMOVX is able to move the top bits from a fp16 in a fp reg into the bottom bits
of another register, zeroing the rest. This can be used for odd MVE register
lanes. The top bits are not read by fp16 instructions, so no move is required
there if we are dealing with even lanes.

Differential revision: https://reviews.llvm.org/D66793

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370184 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[LLVM-C] Fix ByVal Attribute crashing
Hans Wennborg [Wed, 28 Aug 2019 09:21:56 +0000 (09:21 +0000)]
[LLVM-C] Fix ByVal Attribute crashing

With the introduction of the typed byval attribute change there was no
way that the LLVM-C API could create the correct class Attribute. If a
program that uses the C API creates a ByVal attribute and annotates a
function with that attribute LLVM will crash when it assembles or write
that module containing the function out as bitcode.

This change is a minimal fix to at least allow code to work, this is
because the byval change is on the 9.0 and I don't want to introduce new
LLVM-C API this late in the release cycle.

By Jakob Bornecrantz!

Differential revision: https://reviews.llvm.org/D66144

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370176 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[LV] Fold tail by masking - handle reductions
Ayal Zaks [Wed, 28 Aug 2019 09:02:23 +0000 (09:02 +0000)]
[LV] Fold tail by masking - handle reductions

Allow vectorizing loops that have reductions when tail is folded by masking.
A select is introduced in VPlan, choosing between the last value carried by the
loop-exit/live-out instruction of the reduction, and the penultimate value
carried by the reduction phi, according to the "i < n" mask of fold-tail.
This select replaces the last value as the live-out value of the loop.

Differential Revision: https://reviews.llvm.org/D66720

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370173 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[ARM][ParallelDSP] Change search for muls
Sam Parker [Wed, 28 Aug 2019 08:51:13 +0000 (08:51 +0000)]
[ARM][ParallelDSP] Change search for muls

rL369567 reverted a couple of recent changes made to ARMParallelDSP
because of a miscompilation error: PR43073.

The issue stemmed from an underlying bug that was caused by adding
muls into a reduction before it was proved that they could be executed
in parallel with another mul.

Most of the changes here are from the previously reverted commits.
The additional changes have been made area:
1) The Search function now doesn't insert any muls into the Reduction
   object. That now happens once the search has successfully finished.
2) For any muls added into the reduction but that weren't paired, we
   accumulate their values as an input into the smlad.

Differential Revision: https://reviews.llvm.org/D66660

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370171 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Unbreak tests
David Bolvansky [Wed, 28 Aug 2019 08:42:40 +0000 (08:42 +0000)]
[NFC] Unbreak tests

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370170 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Updated test
David Bolvansky [Wed, 28 Aug 2019 08:40:45 +0000 (08:40 +0000)]
[NFC] Updated test

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370169 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAnnotate return values of allocation functions with dereferenceable_or_null
David Bolvansky [Wed, 28 Aug 2019 08:28:20 +0000 (08:28 +0000)]
Annotate return values of allocation functions with dereferenceable_or_null

Summary:
Example
define dso_local noalias i8* @_Z6maixxnv() local_unnamed_addr #0 {
entry:
  %call = tail call noalias dereferenceable_or_null(64) i8* @malloc(i64 64) #6
  ret i8* %call
}

Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: aaron.ballman, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66651

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370168 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[llvm-objdump] Add the missing ARMv8 subarch detection
Yi Kong [Wed, 28 Aug 2019 06:37:22 +0000 (06:37 +0000)]
[llvm-objdump] Add the missing ARMv8 subarch detection

Differential Revision: https://reviews.llvm.org/D66849

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370163 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[LoopFusion] Fix another -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off build
Fangrui Song [Wed, 28 Aug 2019 03:12:40 +0000 (03:12 +0000)]
[LoopFusion] Fix another -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off build

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370156 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU/GlobalISel: Fix constraining scalar and/or/xor
Matt Arsenault [Wed, 28 Aug 2019 02:11:03 +0000 (02:11 +0000)]
AMDGPU/GlobalISel: Fix constraining scalar and/or/xor

If the result register already had a register class assigned, the
sources may not have been properly constrained.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370150 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert r370105 - Update two x86 datalayouts for r370083, looks like racing commits
Richard Trieu [Wed, 28 Aug 2019 01:55:06 +0000 (01:55 +0000)]
Revert r370105 - Update two x86 datalayouts for r370083, looks like racing commits

r370083 has been reverted, which this change depends on.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370147 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[test] Speculative fix for r369966 on llvm-clang-x86_64-win
Vedant Kumar [Wed, 28 Aug 2019 01:39:36 +0000 (01:39 +0000)]
[test] Speculative fix for r369966 on llvm-clang-x86_64-win

Run the MIR pipeline in this test to completion to try and avoid a "Bad
machine code" error.

Build failure:
http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190826/688338.html

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370145 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert "Change the X86 datalayout to add three address spaces for 32 bit signed,"
Vlad Tsyrklevich [Wed, 28 Aug 2019 01:08:54 +0000 (01:08 +0000)]
Revert "Change the X86 datalayout to add three address spaces for 32 bit signed,"

This reverts commit r370083 because it caused check-lld failures on
sanitizer-x86_64-linux-fast.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370142 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU/GlobalISel: Implement addrspacecast for 32-bit constant addrspace
Matt Arsenault [Wed, 28 Aug 2019 00:58:24 +0000 (00:58 +0000)]
AMDGPU/GlobalISel: Implement addrspacecast for 32-bit constant addrspace

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370140 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Assert preconditions and merge all users into one codepath in Loads.cpp
Philip Reames [Tue, 27 Aug 2019 23:36:31 +0000 (23:36 +0000)]
[NFC] Assert preconditions and merge all users into one codepath in Loads.cpp

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370128 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[InstCombine] Disable some portions of foldGEPICmp for GEPs that return a vector...
Craig Topper [Tue, 27 Aug 2019 21:38:56 +0000 (21:38 +0000)]
[InstCombine] Disable some portions of foldGEPICmp for GEPs that return a vector of pointers. Fix other portions.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370114 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[RISCV] Implement RISCVRegisterInfo::getPointerRegClass
Luis Marques [Tue, 27 Aug 2019 21:37:57 +0000 (21:37 +0000)]
[RISCV] Implement RISCVRegisterInfo::getPointerRegClass

Fixes bug 43041

Differential Revision: https://reviews.llvm.org/D66752

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370113 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Analysis] Improve EmitGEPOffset handling of vector GEPs with scalar indices.
Craig Topper [Tue, 27 Aug 2019 21:31:44 +0000 (21:31 +0000)]
[Analysis] Improve EmitGEPOffset handling of vector GEPs with scalar indices.

This patch splats the scalar index if necessary before using it
in any integer casts or other arithmetic.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370112 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoUpdate two x86 datalayouts for r370083, looks like racing commits
Reid Kleckner [Tue, 27 Aug 2019 19:55:10 +0000 (19:55 +0000)]
Update two x86 datalayouts for r370083, looks like racing commits

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370105 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[GlobalISel] Replace hard coded dynamic alloca handling with G_DYN_STACKALLOC.
Amara Emerson [Tue, 27 Aug 2019 19:54:27 +0000 (19:54 +0000)]
[GlobalISel] Replace hard coded dynamic alloca handling with G_DYN_STACKALLOC.

This change moves the actual stack pointer manipulation into the legalizer,
available to targets via lower(). The codegen is slightly different because
we're using explicit masks instead of G_PTRMASK, and using G_SUB rather than
adding a negative amount via G_GEP.

Differential Revision: https://reviews.llvm.org/D66678

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370104 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Loads/SROA] Remove blatantly incorrect code and fix a bug revealed in the process
Philip Reames [Tue, 27 Aug 2019 19:34:43 +0000 (19:34 +0000)]
[Loads/SROA] Remove blatantly incorrect code and fix a bug revealed in the process

The code we had isSafeToLoadUnconditionally was blatantly wrong. This function takes a "Size" argument which is supposed to describe the span loaded from. Instead, the code use the size of the pointer passed (which may be unrelated!) and only checks that span. For any Size > LoadSize, this can and does lead to miscompiles.

Worse, the generic code just a few lines above correctly handles the cases which *are* valid. So, let's delete said code.

Removing this code revealed two issues:
1) As noted by jdoerfert the removed code incorrectly handled external globals.  The test update in SROA is to stop testing incorrect behavior.
2) SROA was confusing bytes and bits, but this wasn't obvious as the Size parameter was being essentially ignored anyway.  Fixed.

Differential Revision: https://reviews.llvm.org/D66778

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370102 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoDAG: computeNumSignBits for MUL
Matt Arsenault [Tue, 27 Aug 2019 19:05:33 +0000 (19:05 +0000)]
DAG: computeNumSignBits for MUL

Copied directly from the IR version.

Most of the testcases I've added for this are somewhat problematic
because they really end up testing the yet to be implemented version
for MUL_I24/MUL_U24.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370099 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU: Add baseline test for num sign bits of mul
Matt Arsenault [Tue, 27 Aug 2019 19:01:02 +0000 (19:01 +0000)]
AMDGPU: Add baseline test for num sign bits of mul

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370098 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[XCOFF][AIX] Generate symbol table entries with llvm-readobj
Jason Liu [Tue, 27 Aug 2019 18:54:46 +0000 (18:54 +0000)]
[XCOFF][AIX] Generate symbol table entries with llvm-readobj

Summary:

This patch implements main entry and auxiliary entries of symbol table generation for llvm-readobj on AIX.
The source code of aix_xcoff_xlc_test8.o (compile with xlc) is:

-bash-4.2$ cat test8.c
extern int i;
extern int TestforXcoff;
extern int fun(int i);
static int static_i;
char* p="abcd";
int fun1(int j) {
  static_i++;
  j++;
  j=j+*p;
  return j;
}
int main() {
  i++;
  fun(i);
  return fun1(i);
}

Patch provided by DiggerLin

Differential Revision: https://reviews.llvm.org/D65240

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370097 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert Autogenerate the shebang lines for tools/opt-viewer
Reid Kleckner [Tue, 27 Aug 2019 18:31:29 +0000 (18:31 +0000)]
Revert Autogenerate the shebang lines for tools/opt-viewer

This reverts r369486 (git commit 8d18384809957cc923752e10a86adab129e3df48)

The opt-viewer tests don't pass after this change, and fixing them isn't
trivial. opt-viewer.py imports optmap, which requires adjusting
pythonpath, which is more work than I'm willing to do to fix forward.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370095 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[ORCv2] - New Speculate Query Implementation
Praveen Velliengiri [Tue, 27 Aug 2019 18:23:36 +0000 (18:23 +0000)]
[ORCv2] - New Speculate Query Implementation

Summary:
This patch introduces, SequenceBBQuery - new heuristic to find likely next callable functions it tries to find the blocks with calls in order of execution sequence of Blocks.

It still uses BlockFrequencyAnalysis to find high frequency blocks. For a handful of hottest blocks (plan to customize), the algorithm traverse and discovered the caller blocks along the way to Entry Basic Block and Exit Basic Block. It uses Block Hint, to stop traversing the already visited blocks in both direction. It implicitly assumes that once the block is visited during discovering entry or exit nodes, revisiting them again does not add much. It also branch probability info (cached result) to traverse only hot edges (planned to customize) from hot blocks. Without BPI, the algorithm mostly return's all the blocks in the CFG with calls.

It also changes the heuristic queries, so they don't maintain states. Hence it is safe to call from multiple threads.

It also implements, new instrumentation to avoid jumping into JIT on every call to the function with the help _orc_speculate.decision.block and _orc_speculate.block.

"Speculator Registration Mechanism is also changed" - kudos to @lhames

Open to review, mostly looking to change implementation of SequeceBBQuery heuristics with good data structure choices.

Reviewers: lhames, dblaikie

Reviewed By: lhames

Subscribers: mgorny, hiraditya, mgrang, llvm-commits, lhames

Tags: #speculative_compilation_in_orc, #llvm

Differential Revision: https://reviews.llvm.org/D66399

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370092 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Tblgen][MCA] Add the ability to mark groups as LoadQueue and StoreQueue. NFCI
Andrea Di Biagio [Tue, 27 Aug 2019 18:20:34 +0000 (18:20 +0000)]
[Tblgen][MCA] Add the ability to mark groups as LoadQueue and StoreQueue. NFCI

Before this patch, users were not allowed to optionally mark processor resource
groups as load/store queues. That is because tablegen class MemoryQueue was
originally declared as expecting a ProcResource template argument (instead of a
more generic ProcResourceKind).

That was an oversight, since the original intention from D54957 was to let user
mark any processor resource as either load/store queue.  This patch adds the
ability to use processor resource groups in MemoryQueue definitions. This is not
a user visible change.

Differential Revision: https://reviews.llvm.org/D66810

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370091 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU: Add amdgpu-32bit-address-high-bits to MIR serialization
Matt Arsenault [Tue, 27 Aug 2019 18:18:38 +0000 (18:18 +0000)]
AMDGPU: Add amdgpu-32bit-address-high-bits to MIR serialization

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370089 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[JITLink] Fix bogus TimerGroup constructor call.
Lang Hames [Tue, 27 Aug 2019 18:05:05 +0000 (18:05 +0000)]
[JITLink] Fix bogus TimerGroup constructor call.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370088 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU: Fix crash from inconsistent register types for v3i16/v3f16
Matt Arsenault [Tue, 27 Aug 2019 17:51:56 +0000 (17:51 +0000)]
AMDGPU: Fix crash from inconsistent register types for v3i16/v3f16

This is something of a workaround since computeRegisterProperties
seems to be doing the wrong thing.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370086 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[ORC] NFC remove unimplemented query
Praveen Velliengiri [Tue, 27 Aug 2019 17:51:50 +0000 (17:51 +0000)]
[ORC] NFC remove unimplemented query

Summary: CFGWalk Query is unimplemented for valid reasons. But the declaration got included in commit file.

Reviewers: lhames, dblaikie

Reviewed By: dblaikie

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66289

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370085 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRecommit "[GlobalISel] Import patterns containing INSERT_SUBREG"
Jessica Paquette [Tue, 27 Aug 2019 17:47:06 +0000 (17:47 +0000)]
Recommit  "[GlobalISel] Import patterns containing INSERT_SUBREG"

I thought `llvm::sort` was stable for some reason but it's not.

Use `llvm::stable_sort` in `CodeGenTarget::getSuperRegForSubReg`.

Original patch: https://reviews.llvm.org/D66498

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370084 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoChange the X86 datalayout to add three address spaces for 32 bit signed,
Amy Huang [Tue, 27 Aug 2019 17:46:53 +0000 (17:46 +0000)]
Change the X86 datalayout to add three address spaces for 32 bit signed,
32 bit unsigned, and 64 bit pointers.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370083 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert "[GlobalISel] Import patterns containing INSERT_SUBREG"
Jessica Paquette [Tue, 27 Aug 2019 17:26:44 +0000 (17:26 +0000)]
Revert "[GlobalISel] Import patterns containing INSERT_SUBREG"

When EXPENSIVE_CHECKS are enabled, GlobalISelEmitterSubreg.td doesn't get
stable output.

Reverting while I debug it.

See: https://reviews.llvm.org/D66498

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370080 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[X86] Remove encoding information from the TAILJMP instructions that are lowered...
Craig Topper [Tue, 27 Aug 2019 17:24:23 +0000 (17:24 +0000)]
[X86] Remove encoding information from the TAILJMP instructions that are lowered by MCInstLowering. Fix LowerPATCHABLE_TAIL_CALL to also convert them to regular JMP/JCC instructions

There are 5 instructions here that are converted from TAILJMP opcodes to regular JMP/JCC opcodes during MCInstLowering. So normally there encoding information isn't used. The exception being when XRay wraps them in PATCHABLE_TAIL_CALL.

For the ones that weren't already handled in MCInstLowering, add handling for those and remove their encoding information.

This patch fixes PATCHABLE_TAIL_CALL to do the same opcode conversion as the regular lowering patch. Then removes the encoding information.

Differential Revision: https://reviews.llvm.org/D66561

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370079 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[JITLink] Add timers and -show-times option to llvm-jitlink.
Lang Hames [Tue, 27 Aug 2019 15:51:19 +0000 (15:51 +0000)]
[JITLink] Add timers and -show-times option to llvm-jitlink.

The timers track time spent loading objects, linking, and (if applicable)
running JIT-link'd code.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370075 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[JITLink][ORC] Track eh-frame section size for registration/deregistration.
Lang Hames [Tue, 27 Aug 2019 15:50:32 +0000 (15:50 +0000)]
[JITLink][ORC] Track eh-frame section size for registration/deregistration.

On MachO, processing of the eh-frame section should stop if the end of the
__eh_frame section is reached, regardless of whether or not there is a null CFI
length field at the end of the section. This patch tracks the eh-frame section
size and threads it through the appropriate APIs so that processing can be
terminated correctly.

No testcase yet: This patch is all API plumbing (rather than modification of
linked memory) which the existing infrastructure does not provide a way of
testing. Committing without a testcase until I have an idea of how to write
one.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370074 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[JITLink] Don't under-align zero-fill sections.
Lang Hames [Tue, 27 Aug 2019 15:22:23 +0000 (15:22 +0000)]
[JITLink] Don't under-align zero-fill sections.

If content sections have lower alignment than zero-fill sections then bump the
overall segment alignment to avoid under-aligning the zero-fill sections.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370072 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[DAGCombiner] cancel fnegs from multiplied operands of FMA
Sanjay Patel [Tue, 27 Aug 2019 15:17:46 +0000 (15:17 +0000)]
[DAGCombiner] cancel fnegs from multiplied operands of FMA

(-X) * (-Y) + Z --> X * Y + Z

This is a missing optimization that shows up as a potential regression in D66050,
so we should solve it first. We appear to be partly missing this fold in IR as well.

We do handle the simpler case already:
(-X) * (-Y) --> X * Y

And it might be beneficial to make the constraint less conservative (eg, if both
operands are cheap, but not necessarily cheaper), but that causes infinite looping
for the existing fmul transform.

Differential Revision: https://reviews.llvm.org/D66755

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370071 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoHandle local commons for XCOFF object file writing
Jason Liu [Tue, 27 Aug 2019 15:14:45 +0000 (15:14 +0000)]
Handle local commons for XCOFF object file writing

Summary:
Adds support for emitting common local global symbols to an XCOFF object file.
Local commons are emitted into the .bss section with a storage class of
C_HIDEXT.

Patch by: daltenty

Reviewers: sfertile, hubert.reinterpretcast

Differential Revision: https://reviews.llvm.org/D66097

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370070 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert "[CodeGen] Do the Simple Early Return in block-placement pass to optimize...
Jinsong Ji [Tue, 27 Aug 2019 14:59:08 +0000 (14:59 +0000)]
Revert "[CodeGen] Do the Simple Early Return in block-placement pass to optimize the blocks"

This reverts commit b3d258fc44b588f06eb35f8e4b9a6d1fc859acec.

@skatkov is reporting crash in D63972#1646303
Contacted @ZhangKang, and revert the commit on behalf of him.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370069 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[MIPS GlobalISel] ClampScalar G_SHL, G_ASHR and G_LSHR
Petar Avramovic [Tue, 27 Aug 2019 14:41:44 +0000 (14:41 +0000)]
[MIPS GlobalISel] ClampScalar G_SHL, G_ASHR and G_LSHR

ClampScalar G_SHL, G_ASHR and G_LSHR to s32 for MIPS32.

Differential Revision: https://reviews.llvm.org/D66533

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370067 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[GlobalISel] Factor narrowScalar for G_ASHR and G_LSHR. NFC
Petar Avramovic [Tue, 27 Aug 2019 14:33:05 +0000 (14:33 +0000)]
[GlobalISel] Factor narrowScalar for G_ASHR and G_LSHR. NFC

Main difference is in the way Hi for Long shift (HiL) is made.
G_LSHR fills HiL with zeros, while G_ASHR fills HiL with sign bit value.

Differential Revision: https://reviews.llvm.org/D66589

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370064 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[GlobalISel] Fix narrowScalar for shifts to match algorithm from SDAG
Petar Avramovic [Tue, 27 Aug 2019 14:22:32 +0000 (14:22 +0000)]
[GlobalISel] Fix narrowScalar for shifts to match algorithm from SDAG

Fix typos. Use Hi and Lo prefixes for Or instead of LHS and RHS
to match names of surrounding variables.

Differential Revision: https://reviews.llvm.org/D66587

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370062 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[DAGCombiner] Add node to the worklist in topological order in parallelizeChainedStores
Amaury Sechet [Tue, 27 Aug 2019 13:27:57 +0000 (13:27 +0000)]
[DAGCombiner] Add node to the worklist in topological order in parallelizeChainedStores

Summary: As per title.

Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66659

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370056 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[X86][AVX] Add SimplifyDemandedVectorElts support for KSHIFTL/KSHIFTR
Simon Pilgrim [Tue, 27 Aug 2019 13:13:17 +0000 (13:13 +0000)]
[X86][AVX] Add SimplifyDemandedVectorElts support for KSHIFTL/KSHIFTR

Differential Revision: https://reviews.llvm.org/D66527

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370055 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[IntrinsicEmitter] Support scalable vectors in intrinsics
Cullen Rhodes [Tue, 27 Aug 2019 12:57:09 +0000 (12:57 +0000)]
[IntrinsicEmitter] Support scalable vectors in intrinsics

Summary:
This patch adds support for scalable vectors in intrinsics, enabling
intrinsics such as the following to be defined:

    declare <vscale x 4 x i32> @llvm.something.nxv4i32(<vscale x 4 x i32>)

Support for this is implemented by defining a new type descriptor for
scalable vectors and adding mangling support for scalable vector types
in the name mangling scheme used by 'any' types in intrinsic signatures.

Tests have been added for IRBuilder to test scalable vectors work as
expected when using intrinsics through this interface. This required
implementing an intrinsic that is explicitly defined with scalable
vectors, e.g.  LLVMType<nxv4i32>, an SVE floating-point convert
intrinsic was used for this.  The behaviour of the overloaded type
LLVMScalarOrSameVectorWidth with scalable vectors is tested using the
existing masked load intrinsic. Also added an .ll test to test the
Verifier catches a bad intrinsic argument when passing a fixed-width
predicate (mask) to the masked.load intrinsic where a scalable is
expected.

Patch by Paul Walker

Reviewed By: sdesmalen

Differential Revision: https://reviews.llvm.org/D65930

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370053 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Added tests for D66651
David Bolvansky [Tue, 27 Aug 2019 11:41:03 +0000 (11:41 +0000)]
[NFC] Added tests for D66651

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370046 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAdd error handling to the DataExtractor class
Pavel Labath [Tue, 27 Aug 2019 11:24:08 +0000 (11:24 +0000)]
Add error handling to the DataExtractor class

Summary:
This is motivated by D63591, where we realized that there isn't a really
good way of telling whether a DataExtractor is reading actual data, or
is it just returning default values because it reached the end of the
buffer.

This patch resolves that by providing a new "Cursor" class. A Cursor
object encapsulates two things:
- the current position/offset in the DataExtractor
- an error object

Storing the error object inside the Cursor enables one to use the same
pattern as the std::{io}stream API, where one can blindly perform a
sequence of reads and only check for errors once at the end of the
operation. Similarly to the stream API, as soon as we encounter one
error, all of the subsequent operations are skipped (return default
values) too, even if the would suceed with clear error state. Unlike the
std::stream API (but in line with other llvm APIs), we force the error
state to be checked through usage of llvm::Error.

Reviewers: probinson, dblaikie, JDevlieghere, aprantl, echristo

Subscribers: kristina, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D63713

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370042 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[DAGCombiner] Add node to the worklist in topological order after relegalization.
Amaury Sechet [Tue, 27 Aug 2019 11:06:09 +0000 (11:06 +0000)]
[DAGCombiner] Add node to the worklist in topological order after relegalization.

Summary: As per title.

Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66702

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370040 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[InstCombine] Fold select with ctlz to cttz
David Bolvansky [Tue, 27 Aug 2019 10:22:40 +0000 (10:22 +0000)]
[InstCombine] Fold select with ctlz to cttz

Summary:
Handle pattern [0]:

int ctz(unsigned int a)
{
  int c = __clz(a & -a);
  return a ? 31 - c : c;
}

In reality, the compiler can generate much better code for cttz, so fold away this pattern.

https://godbolt.org/z/c5kPtV

 [0] https://community.arm.com/community-help/f/discussions/2114/count-trailing-zeros

Reviewers: spatel, nikic, lebedev.ri, dmgreen, hfinkel

Reviewed By: hfinkel

Subscribers: hfinkel, javed.absar, kristof.beyls, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66308

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370037 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAArch64: avoid creating cycle in DAG for post-increment NEON ops.
Tim Northover [Tue, 27 Aug 2019 10:21:11 +0000 (10:21 +0000)]
AArch64: avoid creating cycle in DAG for post-increment NEON ops.

Inserting a value into Visited has the effect of terminating a search for
predecessors if that node is seen. This is legitimate for the base address, and
acts as a slight performance optimization, but the vector-building node can be
paert of a legitimate cycle so we shouldn't stop searching there.

PR43056.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370036 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[llvm-objdump] - Remove one overload of reportError. NFCI.
George Rimar [Tue, 27 Aug 2019 10:03:45 +0000 (10:03 +0000)]
[llvm-objdump] - Remove one overload of reportError. NFCI.

There is a problem with reportError we have.
Declaration says we have ArchiveName
that follows the FileName:

reportError(Error E, StringRef FileName, StringRef ArchiveName,...

Though implementation have them reversed. I cleaned it up and
removed an excessive reportError(Error E, StringRef File) version.

Rebased on top of D66418.

Differential revision: https://reviews.llvm.org/D66517

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370034 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same time.
George Rimar [Tue, 27 Aug 2019 09:58:39 +0000 (09:58 +0000)]
[yaml2obj] - Don't allow setting StOther and Other/Visibility at the same time.

This is a follow up discussed in the comments of D66583.

Currently, if for example, we have both StOther and Other set in YAML document for a symbol,
then yaml2obj reports an "unknown key 'Other'" error.
It happens because 'mapOptional()' is never called for 'Other/Visibility' in this case,
leaving those unhandled.

This message does not describe the reason of the error well. This patch fixes it.

Differential revision: https://reviews.llvm.org/D66642

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370032 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[SelectionDAGBuilder] Hide existence of ConstantDataVector vector from visitGetElemen...
Craig Topper [Tue, 27 Aug 2019 06:39:50 +0000 (06:39 +0000)]
[SelectionDAGBuilder] Hide existence of ConstantDataVector vector from visitGetElementPtr.

ConstantDataVector is a specialized verison of ConstantVector
that stores data in a packed array of bits instead of as
individual pointers to other Constants. But we really shouldn't
expose that if we can void it. And we should handle regular
ConstantVector equally well.

This removes a dyn_cast to ConstantDataVector and just calls
getSplatValue directly on a Constant* if the type is a vector.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370018 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[SelectionDAGBuilder] Fix typo in comment. NFC
Craig Topper [Tue, 27 Aug 2019 06:38:51 +0000 (06:38 +0000)]
[SelectionDAGBuilder] Fix typo in comment. NFC

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370017 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[ValueTracking] Add AllowNonInbounds parameter to GetPointerBaseWithConstantOffset...
Hideto Ueno [Tue, 27 Aug 2019 06:30:33 +0000 (06:30 +0000)]
[ValueTracking] Add AllowNonInbounds parameter to GetPointerBaseWithConstantOffset function

This commit was part of D65402.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370016 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Attributor] Clamp operator to extend known state
Hideto Ueno [Tue, 27 Aug 2019 06:02:56 +0000 (06:02 +0000)]
[Attributor] Clamp operator to extend known state

Summary:
Similar to `^=` operator for IntegerState, this patch introduces a `+=` operator to "clamp" known information.

Reviewers: jdoerfert, sstefan1

Reviewed By: jdoerfert

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66635

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370015 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Attributor] Introduce an API to delete stuff
Johannes Doerfert [Tue, 27 Aug 2019 04:57:54 +0000 (04:57 +0000)]
[Attributor] Introduce an API to delete stuff

Summary:
During the fixpoint iteration, including the manifest stage, we should
not delete stuff as other abstract attributes might have a reference to
the value. Through the API this can now be done safely at the very end.

Reviewers: uenoku, sstefan1

Subscribers: hiraditya, bollu, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66779

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370014 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[NFC] Replace the FIXME I added in rL369989 with a comment clarifying the current...
Philip Reames [Tue, 27 Aug 2019 04:52:35 +0000 (04:52 +0000)]
[NFC] Replace the FIXME I added in rL369989 with a comment clarifying the current code

The current approach is restrictive (as all of geps must be multiples of the alignment), but correct.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370013 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoRevert r369927 - [DAGCombiner] Remove a bunch of redundant AddToWorklist calls.
Richard Trieu [Tue, 27 Aug 2019 02:04:11 +0000 (02:04 +0000)]
Revert r369927 - [DAGCombiner] Remove a bunch of redundant AddToWorklist calls.

This change causes instrumented builds of Clang to have a fatal error in the
backend.  https://reviews.llvm.org/D66537 has the details.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370006 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[WinEH] Allocate space in funclets stack to save XMM CSRs
Pengfei Wang [Tue, 27 Aug 2019 01:53:24 +0000 (01:53 +0000)]
[WinEH] Allocate space in funclets stack to save XMM CSRs

Summary:
This is an alternate approach to D63396

Currently funclets reuse the same stack slots that are used in the
parent function for saving callee-saved xmm registers. If the parent
function modifies a callee-saved xmm register before an excpetion is
thrown, the catch handler will overwrite the original saved value.

This patch allocates space in funclets stack for saving callee-saved xmm
registers and uses RSP instead RBP to access memory.

Signed-off-by: Pengfei Wang <pengfei.wang@intel.com>
Reviewers: rnk, RKSimon, craig.topper, annita.zhang, LuoYuanke, andrew.w.kaylor

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66596

Signed-off-by: Pengfei Wang <pengfei.wang@intel.com>
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370005 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[Analysis] In EmitGEPOffset, use Constant::getUniqueInteger to handle struct indices...
Craig Topper [Tue, 27 Aug 2019 01:07:37 +0000 (01:07 +0000)]
[Analysis] In EmitGEPOffset, use Constant::getUniqueInteger to handle struct indices in vector GEPs.

We previously called getSplatValue if the index had a vector type,
but getSplatValue returns null for non-splats. This would cause
a nullptr dereference if it wasn't a splat.

Using getUniqueInteger gives us an assert if its a vector type,
but the value isn't a splat. This is what is used in
SelectionDAGBuilder's code that expands GEPs as well.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@370001 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[MemorySSA] Fix insertUse.
Alina Sbirlea [Tue, 27 Aug 2019 00:34:47 +0000 (00:34 +0000)]
[MemorySSA] Fix insertUse.

Actually call the renamePass on inserted Phis.
Fixes PR42940.

Subscribers: llvm-commits

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369997 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAdd PhaseOrdering/lifetime-sanitizer.ll tests
Vitaly Buka [Tue, 27 Aug 2019 00:18:28 +0000 (00:18 +0000)]
Add PhaseOrdering/lifetime-sanitizer.ll tests

Reviewers: lebedev.ri

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66761

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369996 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU: Combine directly on mul24 intrinsics
Matt Arsenault [Tue, 27 Aug 2019 00:18:09 +0000 (00:18 +0000)]
AMDGPU: Combine directly on mul24 intrinsics

The problem these are supposed to work around can occur before the
intrinsics are lowered into the nodes. Try to directly simplify them
so they are matched before the bit assert operations can be optimized
out.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369994 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoAMDGPU: Run AMDGPUCodeGenPrepare after scalar opts
Matt Arsenault [Tue, 27 Aug 2019 00:08:31 +0000 (00:08 +0000)]
AMDGPU: Run AMDGPUCodeGenPrepare after scalar opts

The mul24 matching could interfere with SLSR and the other addressing
mode related passes. This probably is not the optimal placement, but
is an intermediate step. This should probably be moved after all the
generic IR passes, particularly LSR. Moving this after LSR seems to
help in some cases, and hurts others.

As-is in this patch, in idiv-licm, it saves 1-2 instructions inside
some of the loop bodies, but increases the number in others. Moving
this later helps these loops. In the new lsr tests in
mul24-pass-ordering, the intrinsic prevents introducing more
instructions in the loop preheader, so moving this later ends up
hurting them. This shouldn't be any worse than before the intrinsics
were introduced in r366094, and LSR should probably be smarter. I
think it's because it doesn't know the and inside the loop will be
folded away.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369991 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agoReorganize code and add a fixme to point out a bug in existing code [NFC]
Philip Reames [Mon, 26 Aug 2019 23:57:27 +0000 (23:57 +0000)]
Reorganize code and add a fixme to point out a bug in existing code [NFC]

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369989 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[mips] Fix indentation. NFC
Simon Atanasyan [Mon, 26 Aug 2019 22:40:34 +0000 (22:40 +0000)]
[mips] Fix indentation. NFC

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369983 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[mips] clang-format the code. NFC
Simon Atanasyan [Mon, 26 Aug 2019 22:40:28 +0000 (22:40 +0000)]
[mips] clang-format the code. NFC

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369982 91177308-0d34-0410-b5e6-96231b3b80d8

5 years ago[X86] Delay combineIncDecVector until after op legalization.
Craig Topper [Mon, 26 Aug 2019 22:17:54 +0000 (22:17 +0000)]
[X86] Delay combineIncDecVector until after op legalization.

Probably better to keep add over sub in early DAG combines.

It might make sense to push this to lowering or delay it all
the way to isel. But this was the simplest change.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369981 91177308-0d34-0410-b5e6-96231b3b80d8

5 years agomsan, codegen, instcombine: Keep more lifetime markers used for msan
Vitaly Buka [Mon, 26 Aug 2019 22:15:50 +0000 (22:15 +0000)]
msan, codegen, instcombine: Keep more lifetime markers used for msan

Reviewers: eugenis

Subscribers: hiraditya, cfe-commits, #sanitizers, llvm-commits

Tags: #clang, #sanitizers, #llvm

Differential Revision: https://reviews.llvm.org/D66695

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@369979 91177308-0d34-0410-b5e6-96231b3b80d8