[IR] redefine 'UnsafeAlgebra' / 'reassoc' fast-math-flags and add 'trans' fast-math...

author Sanjay Patel <spatel@rotateright.com>

Mon, 6 Nov 2017 16:27:15 +0000 (16:27 +0000)

committer Sanjay Patel <spatel@rotateright.com>

Mon, 6 Nov 2017 16:27:15 +0000 (16:27 +0000)
author Sanjay Patel <spatel@rotateright.com>
Mon, 6 Nov 2017 16:27:15 +0000 (16:27 +0000)
committer Sanjay Patel <spatel@rotateright.com>
Mon, 6 Nov 2017 16:27:15 +0000 (16:27 +0000)
diff --git a/docs/LangRef.rst b/docs/LangRef.rst

index f6256e5143aa2ddd0d6b6ed8761ee379e3792328..8cbed7d87d16945be6643d7037a78f5e27c9f469 100644 (file)
--- a/docs/LangRef.rst
+++ b/docs/LangRef.rst
@@ -2272,11 +2272,11 @@ seq\_cst total orderings of other operations that are not marked
  Fast-Math Flags
  ---------------
  
-LLVM IR floating-point binary ops (:ref:`fadd <i_fadd>`,
+LLVM IR floating-point operations (:ref:`fadd <i_fadd>`,
  :ref:`fsub <i_fsub>`, :ref:`fmul <i_fmul>`, :ref:`fdiv <i_fdiv>`,
  :ref:`frem <i_frem>`, :ref:`fcmp <i_fcmp>`) and :ref:`call <i_call>`
-instructions have the following flags that can be set to enable
-otherwise unsafe floating point transformations.
+may use the following flags to enable otherwise unsafe 
+floating-point transformations.
  
  ``nnan``
     No NaNs - Allow optimizations to assume the arguments and result are not
@@ -2300,10 +2300,17 @@ otherwise unsafe floating point transformations.
     Allow floating-point contraction (e.g. fusing a multiply followed by an
     addition into a fused multiply-and-add).
  
+``afn``
+   Approximate functions - Allow substitution of approximate calculations for
+   functions (sin, log, sqrt, etc). See floating-point intrinsic definitions 
+   for places where this can apply to LLVM's intrinsic math functions. 
+
+``reassoc``
+   Allow reassociation transformations for floating-point instructions. 
+   This may dramatically change results in floating point.
+
  ``fast``
-   Fast - Allow algebraically equivalent transformations that may
-   dramatically change results in floating point (e.g. reassociate). This
-   flag implies all the others.
+   This flag implies all of the others.
  
  .. _uselistorder:
  
@@ -10483,7 +10490,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.sqrt`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10497,20 +10504,22 @@ all types however.
  Overview:
  """""""""
  
-The '``llvm.sqrt``' intrinsics return the square root of the specified value,
-returning the same value as the libm '``sqrt``' functions would, but without
-trapping or setting ``errno``.
+The '``llvm.sqrt``' intrinsics return the square root of the specified value.
  
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the square root of the operand if it is a nonnegative
-floating point number.
+Return the same value as a corresponding libm '``sqrt``' function but without
+trapping or setting ``errno``. For types specified by IEEE-754, the result 
+matches a conforming libm implementation.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.powi.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10557,7 +10566,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.sin`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10576,14 +10585,16 @@ The '``llvm.sin.*``' intrinsics return the sine of the operand.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the sine of the specified operand, returning the
-same values as the libm ``sin`` functions would, and handles error
-conditions in the same way.
+Return the same value as a corresponding libm '``sin``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.cos.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10592,7 +10603,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.cos`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10611,14 +10622,16 @@ The '``llvm.cos.*``' intrinsics return the cosine of the operand.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the cosine of the specified operand, returning the
-same values as the libm ``cos`` functions would, and handles error
-conditions in the same way.
+Return the same value as a corresponding libm '``cos``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.pow.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10627,7 +10640,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.pow`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10647,15 +10660,16 @@ specified (positive or negative) power.
  Arguments:
  """"""""""
  
-The second argument is a floating point power, and the first is a value
-to raise to that power.
+The arguments and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the first value raised to the second power,
-returning the same values as the libm ``pow`` functions would, and
-handles error conditions in the same way.
+Return the same value as a corresponding libm '``pow``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.exp.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10664,7 +10678,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.exp`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10684,13 +10698,16 @@ value.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the same values as the libm ``exp`` functions
-would, and handles error conditions in the same way.
+Return the same value as a corresponding libm '``exp``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.exp2.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10699,7 +10716,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.exp2`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10719,13 +10736,16 @@ specified value.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the same values as the libm ``exp2`` functions
-would, and handles error conditions in the same way.
+Return the same value as a corresponding libm '``exp2``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.log.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10734,7 +10754,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.log`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10754,13 +10774,16 @@ value.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the same values as the libm ``log`` functions
-would, and handles error conditions in the same way.
+Return the same value as a corresponding libm '``log``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.log10.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10769,7 +10792,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.log10`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10789,13 +10812,16 @@ specified value.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the same values as the libm ``log10`` functions
-would, and handles error conditions in the same way.
+Return the same value as a corresponding libm '``log10``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.log2.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10804,7 +10830,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.log2`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10824,13 +10850,16 @@ value.
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same type.
+The argument and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the same values as the libm ``log2`` functions
-would, and handles error conditions in the same way.
+Return the same value as a corresponding libm '``log2``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.fma.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -10839,7 +10868,7 @@ Syntax:
  """""""
  
  This is an overloaded intrinsic. You can use ``llvm.fma`` on any
-floating point or vector of floating point type. Not all targets support
+floating-point or vector of floating-point type. Not all targets support
  all types however.
  
  ::
@@ -10853,20 +10882,21 @@ all types however.
  Overview:
  """""""""
  
-The '``llvm.fma.*``' intrinsics perform the fused multiply-add
-operation.
+The '``llvm.fma.*``' intrinsics perform the fused multiply-add operation.
  
  Arguments:
  """"""""""
  
-The argument and return value are floating point numbers of the same
-type.
+The arguments and return value are floating-point numbers of the same type.
  
  Semantics:
  """"""""""
  
-This function returns the same values as the libm ``fma`` functions
-would, and does not set errno.
+Return the same value as a corresponding libm '``fma``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated 
+using a less accurate calculation.
  
  '``llvm.fabs.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
diff --git a/include/llvm/IR/Instruction.h b/include/llvm/IR/Instruction.h

index 66b1e7e01fe423f5ced3f4c6d8ee4800d2193415..41f379b87c232e78d1fbd5bbbf517069c5c19f44 100644 (file)
--- a/include/llvm/IR/Instruction.h
+++ b/include/llvm/IR/Instruction.h
@@ -308,10 +308,15 @@ public:
    /// Determine whether the exact flag is set.
    bool isExact() const;
  
-  /// Set or clear the unsafe-algebra flag on this instruction, which must be an
+  /// Set or clear all fast-math-flags on this instruction, which must be an
    /// operator which supports this flag. See LangRef.html for the meaning of
    /// this flag.
-  void setHasUnsafeAlgebra(bool B);
+  void setFast(bool B);
+
+  /// Set or clear the reassociation flag on this instruction, which must be
+  /// an operator which supports this flag. See LangRef.html for the meaning of
+  /// this flag.
+  void setHasAllowReassoc(bool B);
  
    /// Set or clear the no-nans flag on this instruction, which must be an
    /// operator which supports this flag. See LangRef.html for the meaning of
@@ -333,6 +338,11 @@ public:
    /// this flag.
    void setHasAllowReciprocal(bool B);
  
+  /// Set or clear the approximate-math-functions flag on this instruction,
+  /// which must be an operator which supports this flag. See LangRef.html for
+  /// the meaning of this flag.
+  void setHasApproxFunc(bool B);
+
    /// Convenience function for setting multiple fast-math flags on this
    /// instruction, which must be an operator which supports these flags. See
    /// LangRef.html for the meaning of these flags.
@@ -343,8 +353,11 @@ public:
    /// LangRef.html for the meaning of these flags.
    void copyFastMathFlags(FastMathFlags FMF);
  
-  /// Determine whether the unsafe-algebra flag is set.
-  bool hasUnsafeAlgebra() const;
+  /// Determine whether all fast-math-flags are set.
+  bool isFast() const;
+
+  /// Determine whether the allow-reassociation flag is set.
+  bool hasAllowReassoc() const;
  
    /// Determine whether the no-NaNs flag is set.
    bool hasNoNaNs() const;
@@ -361,6 +374,9 @@ public:
    /// Determine whether the allow-contract flag is set.
    bool hasAllowContract() const;
  
+  /// Determine whether the approximate-math-functions flag is set.
+  bool hasApproxFunc() const;
+
    /// Convenience function for getting all the fast-math flags, which must be an
    /// operator which supports these flags. See LangRef.html for the meaning of
    /// these flags.
diff --git a/include/llvm/IR/Operator.h b/include/llvm/IR/Operator.h

index ae9255174a3151d956a1266600c419e57a1ed50d..01746e4b6a2919e6fa7831f52a85c4ac1f73a399 100644 (file)
--- a/include/llvm/IR/Operator.h
+++ b/include/llvm/IR/Operator.h
@@ -163,52 +163,61 @@ private:
  
    unsigned Flags = 0;
  
-  FastMathFlags(unsigned F) : Flags(F) { }
+  FastMathFlags(unsigned F) {
+    // If all 7 bits are set, turn this into -1. If the number of bits grows,
+    // this must be updated. This is intended to provide some forward binary
+    // compatibility insurance for the meaning of 'fast' in case bits are added.
+    if (F == 0x7F) Flags = ~0U;
+    else Flags = F;
+  }
  
  public:
-  /// This is how the bits are used in Value::SubclassOptionalData so they
-  /// should fit there too.
+  // This is how the bits are used in Value::SubclassOptionalData so they
+  // should fit there too.
+  // WARNING: We're out of space. SubclassOptionalData only has 7 bits. New
+  // functionality will require a change in how this information is stored.
    enum {
-    UnsafeAlgebra   = (1 << 0),
+    AllowReassoc    = (1 << 0),
      NoNaNs          = (1 << 1),
      NoInfs          = (1 << 2),
      NoSignedZeros   = (1 << 3),
      AllowReciprocal = (1 << 4),
-    AllowContract   = (1 << 5)
+    AllowContract   = (1 << 5),
+    ApproxFunc      = (1 << 6)
    };
  
    FastMathFlags() = default;
  
-  /// Whether any flag is set
    bool any() const { return Flags != 0; }
+  bool none() const { return Flags == 0; }
+  bool all() const { return Flags == ~0U; }
  
-  /// Set all the flags to false
    void clear() { Flags = 0; }
+  void set()   { Flags = ~0U; }
  
    /// Flag queries
+  bool allowReassoc() const    { return 0 != (Flags & AllowReassoc); }
    bool noNaNs() const          { return 0 != (Flags & NoNaNs); }
    bool noInfs() const          { return 0 != (Flags & NoInfs); }
    bool noSignedZeros() const   { return 0 != (Flags & NoSignedZeros); }
    bool allowReciprocal() const { return 0 != (Flags & AllowReciprocal); }
-  bool allowContract() const { return 0 != (Flags & AllowContract); }
-  bool unsafeAlgebra() const   { return 0 != (Flags & UnsafeAlgebra); }
+  bool allowContract() const   { return 0 != (Flags & AllowContract); }
+  bool approxFunc() const      { return 0 != (Flags & ApproxFunc); }
+  /// 'Fast' means all bits are set.
+  bool isFast() const          { return all(); }
  
    /// Flag setters
+  void setAllowReassoc()    { Flags |= AllowReassoc; }
    void setNoNaNs()          { Flags |= NoNaNs; }
    void setNoInfs()          { Flags |= NoInfs; }
    void setNoSignedZeros()   { Flags |= NoSignedZeros; }
    void setAllowReciprocal() { Flags |= AllowReciprocal; }
+  // TODO: Change the other set* functions to take a parameter?
    void setAllowContract(bool B) {
      Flags = (Flags & ~AllowContract) | B * AllowContract;
    }
-  void setUnsafeAlgebra() {
-    Flags |= UnsafeAlgebra;
-    setNoNaNs();
-    setNoInfs();
-    setNoSignedZeros();
-    setAllowReciprocal();
-    setAllowContract(true);
-  }
+  void setApproxFunc()      { Flags |= ApproxFunc; }
+  void setFast()            { set(); }
  
    void operator&=(const FastMathFlags &OtherFlags) {
      Flags &= OtherFlags.Flags;
@@ -221,18 +230,21 @@ class FPMathOperator : public Operator {
  private:
    friend class Instruction;
  
-  void setHasUnsafeAlgebra(bool B) {
+  /// 'Fast' means all bits are set.
+  void setFast(bool B) {
+    setHasAllowReassoc(B);
+    setHasNoNaNs(B);
+    setHasNoInfs(B);
+    setHasNoSignedZeros(B);
+    setHasAllowReciprocal(B);
+    setHasAllowContract(B);
+    setHasApproxFunc(B);
+  }
+
+  void setHasAllowReassoc(bool B) {
      SubclassOptionalData =
-      (SubclassOptionalData & ~FastMathFlags::UnsafeAlgebra) |
-      (B * FastMathFlags::UnsafeAlgebra);
-
-    // Unsafe algebra implies all the others
-    if (B) {
-      setHasNoNaNs(true);
-      setHasNoInfs(true);
-      setHasNoSignedZeros(true);
-      setHasAllowReciprocal(true);
-    }
+    (SubclassOptionalData & ~FastMathFlags::AllowReassoc) |
+    (B * FastMathFlags::AllowReassoc);
    }
  
    void setHasNoNaNs(bool B) {
@@ -265,6 +277,12 @@ private:
          (B * FastMathFlags::AllowContract);
    }
  
+  void setHasApproxFunc(bool B) {
+    SubclassOptionalData =
+        (SubclassOptionalData & ~FastMathFlags::ApproxFunc) |
+        (B * FastMathFlags::ApproxFunc);
+  }
+
    /// Convenience function for setting multiple fast-math flags.
    /// FMF is a mask of the bits to set.
    void setFastMathFlags(FastMathFlags FMF) {
@@ -278,42 +296,53 @@ private:
    }
  
  public:
-  /// Test whether this operation is permitted to be
-  /// algebraically transformed, aka the 'A' fast-math property.
-  bool hasUnsafeAlgebra() const {
-    return (SubclassOptionalData & FastMathFlags::UnsafeAlgebra) != 0;
+  /// Test if this operation allows all non-strict floating-point transforms.
+  bool isFast() const {
+    return ((SubclassOptionalData & FastMathFlags::AllowReassoc) != 0 &&
+            (SubclassOptionalData & FastMathFlags::NoNaNs) != 0 &&
+            (SubclassOptionalData & FastMathFlags::NoInfs) != 0 &&
+            (SubclassOptionalData & FastMathFlags::NoSignedZeros) != 0 &&
+            (SubclassOptionalData & FastMathFlags::AllowReciprocal) != 0 &&
+            (SubclassOptionalData & FastMathFlags::AllowContract) != 0 &&
+            (SubclassOptionalData & FastMathFlags::ApproxFunc) != 0);
+  }
+
+  /// Test if this operation may be simplified with reassociative transforms.
+  bool hasAllowReassoc() const {
+    return (SubclassOptionalData & FastMathFlags::AllowReassoc) != 0;
    }
  
-  /// Test whether this operation's arguments and results are to be
-  /// treated as non-NaN, aka the 'N' fast-math property.
+  /// Test if this operation's arguments and results are assumed not-NaN.
    bool hasNoNaNs() const {
      return (SubclassOptionalData & FastMathFlags::NoNaNs) != 0;
    }
  
-  /// Test whether this operation's arguments and results are to be
-  /// treated as NoN-Inf, aka the 'I' fast-math property.
+  /// Test if this operation's arguments and results are assumed not-infinite.
    bool hasNoInfs() const {
      return (SubclassOptionalData & FastMathFlags::NoInfs) != 0;
    }
  
-  /// Test whether this operation can treat the sign of zero
-  /// as insignificant, aka the 'S' fast-math property.
+  /// Test if this operation can ignore the sign of zero.
    bool hasNoSignedZeros() const {
      return (SubclassOptionalData & FastMathFlags::NoSignedZeros) != 0;
    }
  
-  /// Test whether this operation is permitted to use
-  /// reciprocal instead of division, aka the 'R' fast-math property.
+  /// Test if this operation can use reciprocal multiply instead of division.
    bool hasAllowReciprocal() const {
      return (SubclassOptionalData & FastMathFlags::AllowReciprocal) != 0;
    }
  
-  /// Test whether this operation is permitted to
-  /// be floating-point contracted.
+  /// Test if this operation can be floating-point contracted (FMA).
    bool hasAllowContract() const {
      return (SubclassOptionalData & FastMathFlags::AllowContract) != 0;
    }
  
+  /// Test if this operation allows approximations of math library functions or
+  /// intrinsics.
+  bool hasApproxFunc() const {
+    return (SubclassOptionalData & FastMathFlags::ApproxFunc) != 0;
+  }
+
    /// Convenience function for getting all the fast-math flags
    FastMathFlags getFastMathFlags() const {
      return FastMathFlags(SubclassOptionalData);
diff --git a/include/llvm/Transforms/Utils/LoopUtils.h b/include/llvm/Transforms/Utils/LoopUtils.h

index 650224610ad269e42851c4d3a85ffb7f52d06066..a59b188f8d6c99b9ed81f8816b60455e2047191f 100644 (file)
--- a/include/llvm/Transforms/Utils/LoopUtils.h
+++ b/include/llvm/Transforms/Utils/LoopUtils.h
@@ -331,15 +331,13 @@ public:
    /// not have the "fast-math" property. Such operation requires a relaxed FP
    /// mode.
    bool hasUnsafeAlgebra() {
-    return InductionBinOp &&
-      !cast<FPMathOperator>(InductionBinOp)->hasUnsafeAlgebra();
+    return InductionBinOp && !cast<FPMathOperator>(InductionBinOp)->isFast();
    }
  
    /// Returns induction operator that does not have "fast-math" property
    /// and requires FP unsafe mode.
    Instruction *getUnsafeAlgebraInst() {
-    if (!InductionBinOp ||
-        cast<FPMathOperator>(InductionBinOp)->hasUnsafeAlgebra())
+    if (!InductionBinOp || cast<FPMathOperator>(InductionBinOp)->isFast())
        return nullptr;
      return InductionBinOp;
    }
diff --git a/lib/AsmParser/LLLexer.cpp b/lib/AsmParser/LLLexer.cpp

index 50b391fdf73f2ca1616e370dc6af4138bdb44764..b8b56d79c827193ef4e8e737c146575c1d832b7e 100644 (file)
--- a/lib/AsmParser/LLLexer.cpp
+++ b/lib/AsmParser/LLLexer.cpp
@@ -552,6 +552,8 @@ lltok::Kind LLLexer::LexIdentifier() {
    KEYWORD(nsz);
    KEYWORD(arcp);
    KEYWORD(contract);
+  KEYWORD(reassoc);
+  KEYWORD(afn);
    KEYWORD(fast);
    KEYWORD(nuw);
    KEYWORD(nsw);
diff --git a/lib/AsmParser/LLParser.h b/lib/AsmParser/LLParser.h

index dcc3f22e03b9197eac30df72aa695861bc16b653..94e4c1ae96d59be6a34879c5458d7dced7b03952 100644 (file)
--- a/lib/AsmParser/LLParser.h
+++ b/lib/AsmParser/LLParser.h
@@ -193,7 +193,7 @@ namespace llvm {
        FastMathFlags FMF;
        while (true)
          switch (Lex.getKind()) {
-        case lltok::kw_fast: FMF.setUnsafeAlgebra();   Lex.Lex(); continue;
+        case lltok::kw_fast: FMF.setFast();            Lex.Lex(); continue;
          case lltok::kw_nnan: FMF.setNoNaNs();          Lex.Lex(); continue;
          case lltok::kw_ninf: FMF.setNoInfs();          Lex.Lex(); continue;
          case lltok::kw_nsz:  FMF.setNoSignedZeros();   Lex.Lex(); continue;
@@ -202,6 +202,8 @@ namespace llvm {
            FMF.setAllowContract(true);
            Lex.Lex();
            continue;
+        case lltok::kw_reassoc: FMF.setAllowReassoc(); Lex.Lex(); continue;
+        case lltok::kw_afn:     FMF.setApproxFunc();   Lex.Lex(); continue;
          default: return FMF;
          }
        return FMF;
diff --git a/lib/AsmParser/LLToken.h b/lib/AsmParser/LLToken.h

index db0de6c0d5a98378174ec0749c1a3277070405c9..0c5cf6b5d45531237371dbfe2f72e4965c992d83 100644 (file)
--- a/lib/AsmParser/LLToken.h
+++ b/lib/AsmParser/LLToken.h
@@ -102,6 +102,8 @@ enum Kind {
    kw_nsz,
    kw_arcp,
    kw_contract,
+  kw_reassoc,
+  kw_afn,
    kw_fast,
    kw_nuw,
    kw_nsw,
diff --git a/lib/Bitcode/Reader/BitcodeReader.cpp b/lib/Bitcode/Reader/BitcodeReader.cpp

index d0f11db8f61f344c8acaa29ce4a003de35242f68..3e0a39c099b230aa5ec715870bf3ef64c68ceecf 100644 (file)
--- a/lib/Bitcode/Reader/BitcodeReader.cpp
+++ b/lib/Bitcode/Reader/BitcodeReader.cpp
@@ -1046,8 +1046,8 @@ static Comdat::SelectionKind getDecodedComdatSelectionKind(unsigned Val) {
  
  static FastMathFlags getDecodedFastMathFlags(unsigned Val) {
    FastMathFlags FMF;
-  if (0 != (Val & FastMathFlags::UnsafeAlgebra))
-    FMF.setUnsafeAlgebra();
+  if (0 != (Val & FastMathFlags::AllowReassoc))
+    FMF.setAllowReassoc();
    if (0 != (Val & FastMathFlags::NoNaNs))
      FMF.setNoNaNs();
    if (0 != (Val & FastMathFlags::NoInfs))
@@ -1058,6 +1058,8 @@ static FastMathFlags getDecodedFastMathFlags(unsigned Val) {
      FMF.setAllowReciprocal();
    if (0 != (Val & FastMathFlags::AllowContract))
      FMF.setAllowContract(true);
+  if (0 != (Val & FastMathFlags::ApproxFunc))
+    FMF.setApproxFunc();
    return FMF;
  }
  
diff --git a/lib/Bitcode/Writer/BitcodeWriter.cpp b/lib/Bitcode/Writer/BitcodeWriter.cpp

index c5d376c9426b8012abe7798dc91449b5cb1b49d4..03a77c9734e4a9456a928e29ac37148b0a57336b 100644 (file)
--- a/lib/Bitcode/Writer/BitcodeWriter.cpp
+++ b/lib/Bitcode/Writer/BitcodeWriter.cpp
@@ -1321,8 +1321,8 @@ static uint64_t getOptimizationFlags(const Value *V) {
      if (PEO->isExact())
        Flags |= 1 << bitc::PEO_EXACT;
    } else if (const auto *FPMO = dyn_cast<FPMathOperator>(V)) {
-    if (FPMO->hasUnsafeAlgebra())
-      Flags |= FastMathFlags::UnsafeAlgebra;
+    if (FPMO->hasAllowReassoc())
+      Flags |= FastMathFlags::AllowReassoc;
      if (FPMO->hasNoNaNs())
        Flags |= FastMathFlags::NoNaNs;
      if (FPMO->hasNoInfs())
@@ -1333,6 +1333,8 @@ static uint64_t getOptimizationFlags(const Value *V) {
        Flags |= FastMathFlags::AllowReciprocal;
      if (FPMO->hasAllowContract())
        Flags |= FastMathFlags::AllowContract;
+    if (FPMO->hasApproxFunc())
+      Flags |= FastMathFlags::ApproxFunc;
    }
  
    return Flags;
diff --git a/lib/CodeGen/ExpandReductions.cpp b/lib/CodeGen/ExpandReductions.cpp

index 70dca3b74b2f30a13485ffd14d43743477526252..abf487a4f198c5a6cc1b140e3b8227694ca31c45 100644 (file)
--- a/lib/CodeGen/ExpandReductions.cpp
+++ b/lib/CodeGen/ExpandReductions.cpp
@@ -95,7 +95,7 @@ bool expandReductions(Function &F, const TargetTransformInfo *TTI) {
        // and it can't be handled by generating this shuffle sequence.
        // TODO: Implement scalarization of ordered reductions here for targets
        // without native support.
-      if (!II->getFastMathFlags().unsafeAlgebra())
+      if (!II->getFastMathFlags().isFast())
          continue;
        Vec = II->getArgOperand(1);
        break;
diff --git a/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp b/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

index c8abe25b7c6b128bc1b5c1c3c347161f2068132e..f45e264f6e245d6e455d8d9959d39564a4180ac0 100644 (file)
--- a/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
+++ b/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
@@ -2585,7 +2585,7 @@ static bool isVectorReductionOp(const User *I) {
    case Instruction::FAdd:
    case Instruction::FMul:
      if (const FPMathOperator *FPOp = dyn_cast<const FPMathOperator>(Inst))
-      if (FPOp->getFastMathFlags().unsafeAlgebra())
+      if (FPOp->getFastMathFlags().isFast())
          break;
      LLVM_FALLTHROUGH;
    default:
@@ -2631,7 +2631,7 @@ static bool isVectorReductionOp(const User *I) {
  
        if (Inst->getOpcode() == OpCode || isa<PHINode>(U)) {
          if (const FPMathOperator *FPOp = dyn_cast<const FPMathOperator>(Inst))
-          if (!isa<PHINode>(FPOp) && !FPOp->getFastMathFlags().unsafeAlgebra())
+          if (!isa<PHINode>(FPOp) && !FPOp->getFastMathFlags().isFast())
              return false;
          UsersToVisit.push_back(U);
        } else if (const ShuffleVectorInst *ShufInst =
@@ -2725,7 +2725,7 @@ void SelectionDAGBuilder::visitBinary(const User &I, unsigned OpCode) {
    Flags.setNoInfs(FMF.noInfs());
    Flags.setNoNaNs(FMF.noNaNs());
    Flags.setNoSignedZeros(FMF.noSignedZeros());
-  Flags.setUnsafeAlgebra(FMF.unsafeAlgebra());
+  Flags.setUnsafeAlgebra(FMF.isFast());
  
    SDValue BinNodeValue = DAG.getNode(OpCode, getCurSDLoc(), Op1.getValueType(),
                                       Op1, Op2, Flags);
@@ -7959,13 +7959,13 @@ void SelectionDAGBuilder::visitVectorReduce(const CallInst &I,
  
    switch (Intrinsic) {
    case Intrinsic::experimental_vector_reduce_fadd:
-    if (FMF.unsafeAlgebra())
+    if (FMF.isFast())
        Res = DAG.getNode(ISD::VECREDUCE_FADD, dl, VT, Op2);
      else
        Res = DAG.getNode(ISD::VECREDUCE_STRICT_FADD, dl, VT, Op1, Op2);
      break;
    case Intrinsic::experimental_vector_reduce_fmul:
-    if (FMF.unsafeAlgebra())
+    if (FMF.isFast())
        Res = DAG.getNode(ISD::VECREDUCE_FMUL, dl, VT, Op2);
      else
        Res = DAG.getNode(ISD::VECREDUCE_STRICT_FMUL, dl, VT, Op1, Op2);
diff --git a/lib/IR/AsmWriter.cpp b/lib/IR/AsmWriter.cpp

index 3f41a1dc0663042a110e86da4b3a83a6cf1f5a07..0fafe82404e4db8f2036a55bcafb157a27a2cada 100644 (file)
--- a/lib/IR/AsmWriter.cpp
+++ b/lib/IR/AsmWriter.cpp
@@ -1108,10 +1108,12 @@ static void writeAtomicRMWOperation(raw_ostream &Out,
  
  static void WriteOptimizationInfo(raw_ostream &Out, const User *U) {
    if (const FPMathOperator *FPO = dyn_cast<const FPMathOperator>(U)) {
-    // Unsafe algebra implies all the others, no need to write them all out
-    if (FPO->hasUnsafeAlgebra())
+    // 'Fast' is an abbreviation for all fast-math-flags.
+    if (FPO->isFast())
        Out << " fast";
      else {
+      if (FPO->hasAllowReassoc())
+        Out << " reassoc";
        if (FPO->hasNoNaNs())
          Out << " nnan";
        if (FPO->hasNoInfs())
@@ -1122,6 +1124,8 @@ static void WriteOptimizationInfo(raw_ostream &Out, const User *U) {
          Out << " arcp";
        if (FPO->hasAllowContract())
          Out << " contract";
+      if (FPO->hasApproxFunc())
+        Out << " afn";
      }
    }
  
diff --git a/lib/IR/Instruction.cpp b/lib/IR/Instruction.cpp

index ceb521c4c48b5acf60124113ab7263c70f869e09..ffc3a30e6a1039fae375e1e0cbda2e512b7fbae1 100644 (file)
--- a/lib/IR/Instruction.cpp
+++ b/lib/IR/Instruction.cpp
@@ -146,9 +146,14 @@ bool Instruction::isExact() const {
    return cast<PossiblyExactOperator>(this)->isExact();
  }
  
-void Instruction::setHasUnsafeAlgebra(bool B) {
+void Instruction::setFast(bool B) {
    assert(isa<FPMathOperator>(this) && "setting fast-math flag on invalid op");
-  cast<FPMathOperator>(this)->setHasUnsafeAlgebra(B);
+  cast<FPMathOperator>(this)->setFast(B);
+}
+
+void Instruction::setHasAllowReassoc(bool B) {
+  assert(isa<FPMathOperator>(this) && "setting fast-math flag on invalid op");
+  cast<FPMathOperator>(this)->setHasAllowReassoc(B);
  }
  
  void Instruction::setHasNoNaNs(bool B) {
@@ -171,6 +176,11 @@ void Instruction::setHasAllowReciprocal(bool B) {
    cast<FPMathOperator>(this)->setHasAllowReciprocal(B);
  }
  
+void Instruction::setHasApproxFunc(bool B) {
+  assert(isa<FPMathOperator>(this) && "setting fast-math flag on invalid op");
+  cast<FPMathOperator>(this)->setHasApproxFunc(B);
+}
+
  void Instruction::setFastMathFlags(FastMathFlags FMF) {
    assert(isa<FPMathOperator>(this) && "setting fast-math flag on invalid op");
    cast<FPMathOperator>(this)->setFastMathFlags(FMF);
@@ -181,9 +191,14 @@ void Instruction::copyFastMathFlags(FastMathFlags FMF) {
    cast<FPMathOperator>(this)->copyFastMathFlags(FMF);
  }
  
-bool Instruction::hasUnsafeAlgebra() const {
+bool Instruction::isFast() const {
    assert(isa<FPMathOperator>(this) && "getting fast-math flag on invalid op");
-  return cast<FPMathOperator>(this)->hasUnsafeAlgebra();
+  return cast<FPMathOperator>(this)->isFast();
+}
+
+bool Instruction::hasAllowReassoc() const {
+  assert(isa<FPMathOperator>(this) && "getting fast-math flag on invalid op");
+  return cast<FPMathOperator>(this)->hasAllowReassoc();
  }
  
  bool Instruction::hasNoNaNs() const {
@@ -211,6 +226,11 @@ bool Instruction::hasAllowContract() const {
    return cast<FPMathOperator>(this)->hasAllowContract();
  }
  
+bool Instruction::hasApproxFunc() const {
+  assert(isa<FPMathOperator>(this) && "getting fast-math flag on invalid op");
+  return cast<FPMathOperator>(this)->hasApproxFunc();
+}
+
  FastMathFlags Instruction::getFastMathFlags() const {
    assert(isa<FPMathOperator>(this) && "getting fast-math flag on invalid op");
    return cast<FPMathOperator>(this)->getFastMathFlags();
@@ -579,7 +599,7 @@ bool Instruction::isAssociative() const {
    switch (Opcode) {
    case FMul:
    case FAdd:
-    return cast<FPMathOperator>(this)->hasUnsafeAlgebra();
+    return cast<FPMathOperator>(this)->isFast();
    default:
      return false;
    }
diff --git a/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp b/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp

index 156f7bc6512bd2db2a8e919077583218b16b90d9..b17b67167666f07bf4fb76ea70ba74be028e6050 100644 (file)
--- a/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
+++ b/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
@@ -400,7 +400,7 @@ bool AMDGPUCodeGenPrepare::visitFDiv(BinaryOperator &FDiv) {
      return false;
  
    FastMathFlags FMF = FPOp->getFastMathFlags();
-  bool UnsafeDiv = HasUnsafeFPMath || FMF.unsafeAlgebra() ||
+  bool UnsafeDiv = HasUnsafeFPMath || FMF.isFast() ||
                                        FMF.allowReciprocal();
  
    // With UnsafeDiv node will be optimized to just rcp and mul.
diff --git a/lib/Target/AMDGPU/AMDGPULibCalls.cpp b/lib/Target/AMDGPU/AMDGPULibCalls.cpp

index 97cce266e8e80a0cdbc17a6b16313aa0681c9c09..714c60a7446bcb51a0f47ea3ae48129e908012f6 100644 (file)
--- a/lib/Target/AMDGPU/AMDGPULibCalls.cpp
+++ b/lib/Target/AMDGPU/AMDGPULibCalls.cpp
@@ -487,7 +487,7 @@ bool AMDGPULibCalls::parseFunctionName(const StringRef& FMangledName,
  
  bool AMDGPULibCalls::isUnsafeMath(const CallInst *CI) const {
    if (auto Op = dyn_cast<FPMathOperator>(CI))
-    if (Op->hasUnsafeAlgebra())
+    if (Op->isFast())
        return true;
    const Function *F = CI->getParent()->getParent();
    Attribute Attr = F->getFnAttribute("unsafe-fp-math");
diff --git a/lib/Transforms/InstCombine/InstCombineAddSub.cpp b/lib/Transforms/InstCombine/InstCombineAddSub.cpp

index 18b246b5d99f39a983d6ad79a49104dfc8518a3f..d28d615f47ea6f23fcfb3650fbe826fc84eedcc2 100644 (file)
--- a/lib/Transforms/InstCombine/InstCombineAddSub.cpp
+++ b/lib/Transforms/InstCombine/InstCombineAddSub.cpp
@@ -482,7 +482,7 @@ Value *FAddCombine::performFactorization(Instruction *I) {
      return nullptr;
  
    FastMathFlags Flags;
-  Flags.setUnsafeAlgebra();
+  Flags.setFast();
    if (I0) Flags &= I->getFastMathFlags();
    if (I1) Flags &= I->getFastMathFlags();
  
@@ -511,7 +511,7 @@ Value *FAddCombine::performFactorization(Instruction *I) {
  }
  
  Value *FAddCombine::simplify(Instruction *I) {
-  assert(I->hasUnsafeAlgebra() && "Should be in unsafe mode");
+  assert(I->isFast() && "Expected 'fast' instruction");
  
    // Currently we are not able to handle vector type.
    if (I->getType()->isVectorTy())
@@ -1386,7 +1386,7 @@ Instruction *InstCombiner::visitFAdd(BinaryOperator &I) {
    if (Value *V = SimplifySelectsFeedingBinaryOp(I, LHS, RHS))
      return replaceInstUsesWith(I, V);
  
-  if (I.hasUnsafeAlgebra()) {
+  if (I.isFast()) {
      if (Value *V = FAddCombine(Builder).simplify(&I))
        return replaceInstUsesWith(I, V);
    }
@@ -1736,7 +1736,7 @@ Instruction *InstCombiner::visitFSub(BinaryOperator &I) {
    if (Value *V = SimplifySelectsFeedingBinaryOp(I, Op0, Op1))
      return replaceInstUsesWith(I, V);
  
-  if (I.hasUnsafeAlgebra()) {
+  if (I.isFast()) {
      if (Value *V = FAddCombine(Builder).simplify(&I))
        return replaceInstUsesWith(I, V);
    }
diff --git a/lib/Transforms/InstCombine/InstCombineCalls.cpp b/lib/Transforms/InstCombine/InstCombineCalls.cpp

index 7a4abc9aca0cd5cd6900c81048d3abd849ebbb48..a00e6f73ab8ce941fb2a736e05dc5c6d21bebe06 100644 (file)
--- a/lib/Transforms/InstCombine/InstCombineCalls.cpp
+++ b/lib/Transforms/InstCombine/InstCombineCalls.cpp
@@ -2017,7 +2017,7 @@ Instruction *InstCombiner::visitCallInst(CallInst &CI) {
    }
    case Intrinsic::fmuladd: {
      // Canonicalize fast fmuladd to the separate fmul + fadd.
-    if (II->hasUnsafeAlgebra()) {
+    if (II->isFast()) {
        BuilderTy::FastMathFlagGuard Guard(Builder);
        Builder.setFastMathFlags(II->getFastMathFlags());
        Value *Mul = Builder.CreateFMul(II->getArgOperand(0),
diff --git a/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp b/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp

index e6b9753826715b245aa16595bdb6957266c5805e..87666360c1a08dfe2ff192d23a98fb60f336d211 100644 (file)
--- a/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
+++ b/lib/Transforms/InstCombine/InstCombineMulDivRem.cpp
@@ -487,7 +487,7 @@ static void detectLog2OfHalf(Value *&Op, Value *&Y, IntrinsicInst *&Log2) {
    IntrinsicInst *II = dyn_cast<IntrinsicInst>(Op);
    if (!II)
      return;
-  if (II->getIntrinsicID() != Intrinsic::log2 || !II->hasUnsafeAlgebra())
+  if (II->getIntrinsicID() != Intrinsic::log2 || !II->isFast())
      return;
    Log2 = II;
  
@@ -498,7 +498,8 @@ static void detectLog2OfHalf(Value *&Op, Value *&Y, IntrinsicInst *&Log2) {
    Instruction *I = dyn_cast<Instruction>(OpLog2Of);
    if (!I)
      return;
-  if (I->getOpcode() != Instruction::FMul || !I->hasUnsafeAlgebra())
+
+  if (I->getOpcode() != Instruction::FMul || !I->isFast())
      return;
  
    if (match(I->getOperand(0), m_SpecificFP(0.5)))
@@ -601,7 +602,7 @@ Value *InstCombiner::foldFMulConst(Instruction *FMulOrDiv, Constant *C,
    }
  
    if (R) {
-    R->setHasUnsafeAlgebra(true);
+    R->setFast(true);
      InsertNewInstWith(R, *InsertBefore);
    }
  
@@ -622,7 +623,7 @@ Instruction *InstCombiner::visitFMul(BinaryOperator &I) {
                                    SQ.getWithInstruction(&I)))
      return replaceInstUsesWith(I, V);
  
-  bool AllowReassociate = I.hasUnsafeAlgebra();
+  bool AllowReassociate = I.isFast();
  
    // Simplify mul instructions with a constant RHS.
    if (isa<Constant>(Op1)) {
@@ -1341,7 +1342,7 @@ Instruction *InstCombiner::visitFDiv(BinaryOperator &I) {
        if (Instruction *R = FoldOpIntoSelect(I, SI))
          return R;
  
-  bool AllowReassociate = I.hasUnsafeAlgebra();
+  bool AllowReassociate = I.isFast();
    bool AllowReciprocal = I.hasAllowReciprocal();
  
    if (Constant *Op1C = dyn_cast<Constant>(Op1)) {
diff --git a/lib/Transforms/Scalar/Reassociate.cpp b/lib/Transforms/Scalar/Reassociate.cpp

index a44ca333fee69162f218bf149232ab138e53c941..1f32f9f24aac442a42de22855b42b28ef33d0883 100644 (file)
--- a/lib/Transforms/Scalar/Reassociate.cpp
+++ b/lib/Transforms/Scalar/Reassociate.cpp
@@ -145,8 +145,7 @@ XorOpnd::XorOpnd(Value *V) {
  static BinaryOperator *isReassociableOp(Value *V, unsigned Opcode) {
    if (V->hasOneUse() && isa<Instruction>(V) &&
        cast<Instruction>(V)->getOpcode() == Opcode &&
-      (!isa<FPMathOperator>(V) ||
-       cast<Instruction>(V)->hasUnsafeAlgebra()))
+      (!isa<FPMathOperator>(V) || cast<Instruction>(V)->isFast()))
      return cast<BinaryOperator>(V);
    return nullptr;
  }
@@ -156,8 +155,7 @@ static BinaryOperator *isReassociableOp(Value *V, unsigned Opcode1,
    if (V->hasOneUse() && isa<Instruction>(V) &&
        (cast<Instruction>(V)->getOpcode() == Opcode1 ||
         cast<Instruction>(V)->getOpcode() == Opcode2) &&
-      (!isa<FPMathOperator>(V) ||
-       cast<Instruction>(V)->hasUnsafeAlgebra()))
+      (!isa<FPMathOperator>(V) || cast<Instruction>(V)->isFast()))
      return cast<BinaryOperator>(V);
    return nullptr;
  }
@@ -565,7 +563,7 @@ static bool LinearizeExprTree(BinaryOperator *I,
        assert((!isa<Instruction>(Op) ||
                cast<Instruction>(Op)->getOpcode() != Opcode
                || (isa<FPMathOperator>(Op) &&
-                  !cast<Instruction>(Op)->hasUnsafeAlgebra())) &&
+                  !cast<Instruction>(Op)->isFast())) &&
               "Should have been handled above!");
        assert(Op->hasOneUse() && "Has uses outside the expression tree!");
  
@@ -2017,8 +2015,8 @@ void ReassociatePass::OptimizeInst(Instruction *I) {
    if (I->isCommutative())
      canonicalizeOperands(I);
  
-  // Don't optimize floating point instructions that don't have unsafe algebra.
-  if (I->getType()->isFPOrFPVectorTy() && !I->hasUnsafeAlgebra())
+  // Don't optimize floating-point instructions unless they are 'fast'.
+  if (I->getType()->isFPOrFPVectorTy() && !I->isFast())
      return;
  
    // Do not reassociate boolean (i1) expressions.  We want to preserve the
diff --git a/lib/Transforms/Utils/LoopUtils.cpp b/lib/Transforms/Utils/LoopUtils.cpp

index 13c0bfbcb2e95ee134408f1582edc8da8ecaac8f..0de6924e635480ace2c365dda1298a27303edaee 100644 (file)
--- a/lib/Transforms/Utils/LoopUtils.cpp
+++ b/lib/Transforms/Utils/LoopUtils.cpp
@@ -432,7 +432,7 @@ RecurrenceDescriptor::isRecurrenceInstr(Instruction *I, RecurrenceKind Kind,
                                          InstDesc &Prev, bool HasFunNoNaNAttr) {
    bool FP = I->getType()->isFloatingPointTy();
    Instruction *UAI = Prev.getUnsafeAlgebraInst();
-  if (!UAI && FP && !I->hasUnsafeAlgebra())
+  if (!UAI && FP && !I->isFast())
      UAI = I; // Found an unsafe (unvectorizable) algebra instruction.
  
    switch (I->getOpcode()) {
@@ -660,11 +660,11 @@ Value *RecurrenceDescriptor::createMinMaxOp(IRBuilder<> &Builder,
      break;
    }
  
-  // We only match FP sequences with unsafe algebra, so we can unconditionally
+  // We only match FP sequences that are 'fast', so we can unconditionally
    // set it on any generated instructions.
    IRBuilder<>::FastMathFlagGuard FMFG(Builder);
    FastMathFlags FMF;
-  FMF.setUnsafeAlgebra();
+  FMF.setFast();
    Builder.setFastMathFlags(FMF);
  
    Value *Cmp;
@@ -768,7 +768,7 @@ Value *InductionDescriptor::transform(IRBuilder<> &B, Value *Index,
  
      // Floating point operations had to be 'fast' to enable the induction.
      FastMathFlags Flags;
-    Flags.setUnsafeAlgebra();
+    Flags.setFast();
  
      Value *MulExp = B.CreateFMul(StepValue, Index);
      if (isa<Instruction>(MulExp))
@@ -1338,7 +1338,7 @@ Optional<unsigned> llvm::getLoopEstimatedTripCount(Loop *L) {
  static Value *addFastMathFlag(Value *V) {
    if (isa<FPMathOperator>(V)) {
      FastMathFlags Flags;
-    Flags.setUnsafeAlgebra();
+    Flags.setFast();
      cast<Instruction>(V)->setFastMathFlags(Flags);
    }
    return V;
@@ -1401,7 +1401,7 @@ Value *llvm::createSimpleTargetReduction(
    RD::MinMaxRecurrenceKind MinMaxKind = RD::MRK_Invalid;
    // TODO: Support creating ordered reductions.
    FastMathFlags FMFUnsafe;
-  FMFUnsafe.setUnsafeAlgebra();
+  FMFUnsafe.setFast();
  
    switch (Opcode) {
    case Instruction::Add:
diff --git a/lib/Transforms/Utils/SimplifyLibCalls.cpp b/lib/Transforms/Utils/SimplifyLibCalls.cpp

index 33117659489cf40bb05be44928673e431e6dd67a..a29b83717f35c86dddea436cf17c278bd792caa6 100644 (file)
--- a/lib/Transforms/Utils/SimplifyLibCalls.cpp
+++ b/lib/Transforms/Utils/SimplifyLibCalls.cpp
@@ -1111,7 +1111,7 @@ Value *LibCallSimplifier::optimizePow(CallInst *CI, IRBuilder<> &B) {
    // Example: x = 1000, y = 0.001.
    // pow(exp(x), y) = pow(inf, 0.001) = inf, whereas exp(x*y) = exp(1).
    auto *OpC = dyn_cast<CallInst>(Op1);
-  if (OpC && OpC->hasUnsafeAlgebra() && CI->hasUnsafeAlgebra()) {
+  if (OpC && OpC->isFast() && CI->isFast()) {
      LibFunc Func;
      Function *OpCCallee = OpC->getCalledFunction();
      if (OpCCallee && TLI->getLibFunc(OpCCallee->getName(), Func) &&
@@ -1136,7 +1136,7 @@ Value *LibCallSimplifier::optimizePow(CallInst *CI, IRBuilder<> &B) {
                        LibFunc_sqrtl)) {
      // If -ffast-math:
      // pow(x, -0.5) -> 1.0 / sqrt(x)
-    if (CI->hasUnsafeAlgebra()) {
+    if (CI->isFast()) {
        IRBuilder<>::FastMathFlagGuard Guard(B);
        B.setFastMathFlags(CI->getFastMathFlags());
  
@@ -1157,7 +1157,7 @@ Value *LibCallSimplifier::optimizePow(CallInst *CI, IRBuilder<> &B) {
                        LibFunc_sqrtl)) {
  
      // In -ffast-math, pow(x, 0.5) -> sqrt(x).
-    if (CI->hasUnsafeAlgebra()) {
+    if (CI->isFast()) {
        IRBuilder<>::FastMathFlagGuard Guard(B);
        B.setFastMathFlags(CI->getFastMathFlags());
  
@@ -1196,7 +1196,7 @@ Value *LibCallSimplifier::optimizePow(CallInst *CI, IRBuilder<> &B) {
      return B.CreateFDiv(ConstantFP::get(CI->getType(), 1.0), Op1, "powrecip");
  
    // In -ffast-math, generate repeated fmul instead of generating pow(x, n).
-  if (CI->hasUnsafeAlgebra()) {
+  if (CI->isFast()) {
      APFloat V = abs(Op2C->getValueAPF());
      // We limit to a max of 7 fmul(s). Thus max exponent is 32.
      // This transformation applies to integer exponents only.
@@ -1284,9 +1284,9 @@ Value *LibCallSimplifier::optimizeFMinFMax(CallInst *CI, IRBuilder<> &B) {
  
    IRBuilder<>::FastMathFlagGuard Guard(B);
    FastMathFlags FMF;
-  if (CI->hasUnsafeAlgebra()) {
-    // Unsafe algebra sets all fast-math-flags to true.
-    FMF.setUnsafeAlgebra();
+  if (CI->isFast()) {
+    // If the call is 'fast', then anything we create here will also be 'fast'.
+    FMF.setFast();
    } else {
      // At a minimum, no-nans-fp-math must be true.
      if (!CI->hasNoNaNs())
@@ -1317,13 +1317,13 @@ Value *LibCallSimplifier::optimizeLog(CallInst *CI, IRBuilder<> &B) {
    if (UnsafeFPShrink && hasFloatVersion(Name))
      Ret = optimizeUnaryDoubleFP(CI, B, true);
  
-  if (!CI->hasUnsafeAlgebra())
+  if (!CI->isFast())
      return Ret;
    Value *Op1 = CI->getArgOperand(0);
    auto *OpC = dyn_cast<CallInst>(Op1);
  
-  // The earlier call must also be unsafe in order to do these transforms.
-  if (!OpC || !OpC->hasUnsafeAlgebra())
+  // The earlier call must also be 'fast' in order to do these transforms.
+  if (!OpC || !OpC->isFast())
      return Ret;
  
    // log(pow(x,y)) -> y*log(x)
@@ -1333,7 +1333,7 @@ Value *LibCallSimplifier::optimizeLog(CallInst *CI, IRBuilder<> &B) {
  
    IRBuilder<>::FastMathFlagGuard Guard(B);
    FastMathFlags FMF;
-  FMF.setUnsafeAlgebra();
+  FMF.setFast();
    B.setFastMathFlags(FMF);
  
    LibFunc Func;
@@ -1365,11 +1365,11 @@ Value *LibCallSimplifier::optimizeSqrt(CallInst *CI, IRBuilder<> &B) {
                                    Callee->getIntrinsicID() == Intrinsic::sqrt))
      Ret = optimizeUnaryDoubleFP(CI, B, true);
  
-  if (!CI->hasUnsafeAlgebra())
+  if (!CI->isFast())
      return Ret;
  
    Instruction *I = dyn_cast<Instruction>(CI->getArgOperand(0));
-  if (!I || I->getOpcode() != Instruction::FMul || !I->hasUnsafeAlgebra())
+  if (!I || I->getOpcode() != Instruction::FMul || !I->isFast())
      return Ret;
  
    // We're looking for a repeated factor in a multiplication tree,
@@ -1391,8 +1391,7 @@ Value *LibCallSimplifier::optimizeSqrt(CallInst *CI, IRBuilder<> &B) {
      Value *OtherMul0, *OtherMul1;
      if (match(Op0, m_FMul(m_Value(OtherMul0), m_Value(OtherMul1)))) {
        // Pattern: sqrt((x * y) * z)
-      if (OtherMul0 == OtherMul1 &&
-          cast<Instruction>(Op0)->hasUnsafeAlgebra()) {
+      if (OtherMul0 == OtherMul1 && cast<Instruction>(Op0)->isFast()) {
          // Matched: sqrt((x * x) * z)
          RepeatOp = OtherMul0;
          OtherOp = Op1;
@@ -1437,8 +1436,8 @@ Value *LibCallSimplifier::optimizeTan(CallInst *CI, IRBuilder<> &B) {
    if (!OpC)
      return Ret;
  
-  // Both calls must allow unsafe optimizations in order to remove them.
-  if (!CI->hasUnsafeAlgebra() || !OpC->hasUnsafeAlgebra())
+  // Both calls must be 'fast' in order to remove them.
+  if (!CI->isFast() || !OpC->isFast())
      return Ret;
  
    // tan(atan(x)) -> x
@@ -2167,10 +2166,10 @@ Value *LibCallSimplifier::optimizeCall(CallInst *CI) {
  
    // Command-line parameter overrides instruction attribute.
    // This can't be moved to optimizeFloatingPointLibCall() because it may be
-  // used by the intrinsic optimizations. 
+  // used by the intrinsic optimizations.
    if (EnableUnsafeFPShrink.getNumOccurrences() > 0)
      UnsafeFPShrink = EnableUnsafeFPShrink;
-  else if (isa<FPMathOperator>(CI) && CI->hasUnsafeAlgebra())
+  else if (isa<FPMathOperator>(CI) && CI->isFast())
      UnsafeFPShrink = true;
  
    // First, check for intrinsics.
diff --git a/lib/Transforms/Vectorize/LoopVectorize.cpp b/lib/Transforms/Vectorize/LoopVectorize.cpp

index ca2f5a178e06b690c7bb02a998ae31a68d53b3b0..ed29ca0b5731eb48f52a1578f8b1b0a70fb97992 100644 (file)
--- a/lib/Transforms/Vectorize/LoopVectorize.cpp
+++ b/lib/Transforms/Vectorize/LoopVectorize.cpp
@@ -385,7 +385,7 @@ static unsigned getReciprocalPredBlockProb() { return 2; }
  static Value *addFastMathFlag(Value *V) {
    if (isa<FPMathOperator>(V)) {
      FastMathFlags Flags;
-    Flags.setUnsafeAlgebra();
+    Flags.setFast();
      cast<Instruction>(V)->setFastMathFlags(Flags);
    }
    return V;
@@ -2720,7 +2720,7 @@ Value *InnerLoopVectorizer::getStepVector(Value *Val, int StartIdx, Value *Step,
  
    // Floating point operations had to be 'fast' to enable the induction.
    FastMathFlags Flags;
-  Flags.setUnsafeAlgebra();
+  Flags.setFast();
  
    Value *MulOp = Builder.CreateFMul(Cv, Step);
    if (isa<Instruction>(MulOp))
@@ -5396,7 +5396,7 @@ bool LoopVectorizationLegality::canVectorizeInstrs() {
          // operations, shuffles, or casts, as they don't change precision or
          // semantics.
        } else if (I.getType()->isFloatingPointTy() && (CI || I.isBinaryOp()) &&
-                 !I.hasUnsafeAlgebra()) {
+                 !I.isFast()) {
          DEBUG(dbgs() << "LV: Found FP op with unsafe algebra.\n");
          Hints->setPotentiallyUnsafe();
        }
diff --git a/lib/Transforms/Vectorize/SLPVectorizer.cpp b/lib/Transforms/Vectorize/SLPVectorizer.cpp

index 5dcf5528ac92c5267a67a376b10a8c0a93984d29..4232252af36dcfb88a5a62a3d5f8fc5247b41a21 100644 (file)
--- a/lib/Transforms/Vectorize/SLPVectorizer.cpp
+++ b/lib/Transforms/Vectorize/SLPVectorizer.cpp
@@ -4880,7 +4880,7 @@ class HorizontalReduction {
        case RK_Min:
        case RK_Max:
          return Opcode == Instruction::ICmp ||
-               cast<Instruction>(I->getOperand(0))->hasUnsafeAlgebra();
+               cast<Instruction>(I->getOperand(0))->isFast();
        case RK_UMin:
        case RK_UMax:
          assert(Opcode == Instruction::ICmp &&
@@ -5232,7 +5232,7 @@ public:
      Value *VectorizedTree = nullptr;
      IRBuilder<> Builder(ReductionRoot);
      FastMathFlags Unsafe;
-    Unsafe.setUnsafeAlgebra();
+    Unsafe.setFast();
      Builder.setFastMathFlags(Unsafe);
      unsigned i = 0;
  
diff --git a/test/Assembler/fast-math-flags.ll b/test/Assembler/fast-math-flags.ll

index 4ef3607e1d0067e8a102630ce49edc97845ba773..664b1bd271eb585339a4bbcaa0e4a622e13ec4e7 100644 (file)
--- a/test/Assembler/fast-math-flags.ll
+++ b/test/Assembler/fast-math-flags.ll
@@ -7,6 +7,8 @@
  @vec    = external global <3 x float>
  @arr    = external global [3 x float]
  
+declare float @foo(float)
+
  define float @none(float %x, float %y) {
  entry:
  ; CHECK:  %vec = load  <3 x float>,  <3 x float>* @vec
@@ -86,6 +88,28 @@ entry:
    ret float %c
  }
  
+; CHECK: @reassoc(
+define float @reassoc(float %x, float %y) {
+; CHECK: %a = fsub reassoc float %x, %y
+  %a = fsub reassoc float %x, %y
+; CHECK: %b = fmul reassoc float %x, %y
+  %b = fmul reassoc float %x, %y
+; CHECK: %c = call reassoc float @foo(float %b)
+  %c = call reassoc float @foo(float %b)
+  ret float %c
+}
+
+; CHECK: @afn(
+define float @afn(float %x, float %y) {
+; CHECK: %a = fdiv afn float %x, %y
+  %a = fdiv afn float %x, %y
+; CHECK: %b = frem afn float %x, %y
+  %b = frem afn float %x, %y
+; CHECK: %c = call afn float @foo(float %b)
+  %c = call afn float @foo(float %b)
+  ret float %c
+}
+
  ; CHECK: no_nan_inf
  define float @no_nan_inf(float %x, float %y) {
  entry:
@@ -130,10 +154,10 @@ entry:
  ; CHECK:  %arr    = load [3 x float], [3 x float]* @arr
    %arr    = load [3 x float], [3 x float]* @arr
  
-; CHECK:  %a = fadd nnan ninf float %x, %y
-  %a = fadd ninf nnan float %x, %y
-; CHECK:  %a_vec = fadd nnan <3 x float> %vec, %vec
-  %a_vec = fadd nnan <3 x float> %vec, %vec
+; CHECK:  %a = fadd nnan ninf afn float %x, %y
+  %a = fadd ninf nnan afn float %x, %y
+; CHECK:  %a_vec = fadd reassoc nnan <3 x float> %vec, %vec
+  %a_vec = fadd reassoc nnan <3 x float> %vec, %vec
  ; CHECK:  %b = fsub fast float %x, %y
    %b = fsub nnan nsz fast float %x, %y
  ; CHECK:  %b_vec = fsub nnan <3 x float> %vec, %vec
diff --git a/test/Bitcode/compatibility-3.6.ll b/test/Bitcode/compatibility-3.6.ll

index e9313dfba870e2fd616511767b845c6047ec6c02..6c47a853e24a8b815bcbcbb6a766648cdf89fed4 100644 (file)
--- a/test/Bitcode/compatibility-3.6.ll
+++ b/test/Bitcode/compatibility-3.6.ll
@@ -612,7 +612,9 @@ define void @fastmathflags(float %op1, float %op2) {
    %f.arcp = fadd arcp float %op1, %op2
    ; CHECK: %f.arcp = fadd arcp float %op1, %op2
    %f.fast = fadd fast float %op1, %op2
-  ; CHECK: %f.fast = fadd fast float %op1, %op2
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
+  ; CHECK: %f.fast = fadd reassoc nnan ninf nsz arcp float %op1, %op2
    ret void
  }
  
diff --git a/test/Bitcode/compatibility-3.7.ll b/test/Bitcode/compatibility-3.7.ll

index 82fc99055357a38725cdcd8bc7ad8f95f8efc226..55844e5c4986ed96d2ba60fa6ee208b170956694 100644 (file)
--- a/test/Bitcode/compatibility-3.7.ll
+++ b/test/Bitcode/compatibility-3.7.ll
@@ -656,7 +656,9 @@ define void @fastmathflags(float %op1, float %op2) {
    %f.arcp = fadd arcp float %op1, %op2
    ; CHECK: %f.arcp = fadd arcp float %op1, %op2
    %f.fast = fadd fast float %op1, %op2
-  ; CHECK: %f.fast = fadd fast float %op1, %op2
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
+  ; CHECK: %f.fast = fadd reassoc nnan ninf nsz arcp float %op1, %op2
    ret void
  }
  
diff --git a/test/Bitcode/compatibility-3.8.ll b/test/Bitcode/compatibility-3.8.ll

index 2e70a380d10ed104568866835be6fd039e444d3a..a7fa20f2bc087084615739e0b55c9244bc047888 100644 (file)
--- a/test/Bitcode/compatibility-3.8.ll
+++ b/test/Bitcode/compatibility-3.8.ll
@@ -687,7 +687,9 @@ define void @fastmathflags(float %op1, float %op2) {
    %f.arcp = fadd arcp float %op1, %op2
    ; CHECK: %f.arcp = fadd arcp float %op1, %op2
    %f.fast = fadd fast float %op1, %op2
-  ; CHECK: %f.fast = fadd fast float %op1, %op2
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
+  ; CHECK: %f.fast = fadd reassoc nnan ninf nsz arcp float %op1, %op2
    ret void
  }
  
@@ -700,7 +702,9 @@ declare <4 x double> @fmf3()
  ; CHECK-LABEL: fastMathFlagsForCalls(
  define void @fastMathFlagsForCalls(float %f, double %d1, <4 x double> %d2) {
    %call.fast = call fast float @fmf1()
-  ; CHECK: %call.fast = call fast float @fmf1()
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'aml' bits set, so this is not fully 'fast'.
+  ; CHECK: %call.fast = call reassoc nnan ninf nsz arcp float @fmf1()
  
    ; Throw in some other attributes to make sure those stay in the right places.
  
diff --git a/test/Bitcode/compatibility-3.9.ll b/test/Bitcode/compatibility-3.9.ll

index 7c84daa7d3c449474297fbe4644ad162e0b5113f..c456fefe9d4086adb95f14f06fc4d683e540a40a 100644 (file)
--- a/test/Bitcode/compatibility-3.9.ll
+++ b/test/Bitcode/compatibility-3.9.ll
@@ -758,7 +758,9 @@ define void @fastmathflags(float %op1, float %op2) {
    %f.arcp = fadd arcp float %op1, %op2
    ; CHECK: %f.arcp = fadd arcp float %op1, %op2
    %f.fast = fadd fast float %op1, %op2
-  ; CHECK: %f.fast = fadd fast float %op1, %op2
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
+  ; CHECK: %f.fast = fadd reassoc nnan ninf nsz arcp float %op1, %op2
    ret void
  }
  
@@ -771,7 +773,9 @@ declare <4 x double> @fmf3()
  ; CHECK-LABEL: fastMathFlagsForCalls(
  define void @fastMathFlagsForCalls(float %f, double %d1, <4 x double> %d2) {
    %call.fast = call fast float @fmf1()
-  ; CHECK: %call.fast = call fast float @fmf1()
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
+  ; CHECK: %call.fast = call reassoc nnan ninf nsz arcp float @fmf1()
  
    ; Throw in some other attributes to make sure those stay in the right places.
  
diff --git a/test/Bitcode/compatibility-4.0.ll b/test/Bitcode/compatibility-4.0.ll

index 9e34d48c95f763458fc814232d1a6c844a321ba7..68446a7d5b0a3e1d48fc064ddc58fa01e63ebb8d 100644 (file)
--- a/test/Bitcode/compatibility-4.0.ll
+++ b/test/Bitcode/compatibility-4.0.ll
@@ -757,8 +757,10 @@ define void @fastmathflags(float %op1, float %op2) {
    ; CHECK: %f.nsz = fadd nsz float %op1, %op2
    %f.arcp = fadd arcp float %op1, %op2
    ; CHECK: %f.arcp = fadd arcp float %op1, %op2
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
    %f.fast = fadd fast float %op1, %op2
-  ; CHECK: %f.fast = fadd fast float %op1, %op2
+  ; CHECK: %f.fast = fadd reassoc nnan ninf nsz arcp float %op1, %op2
    ret void
  }
  
@@ -771,7 +773,9 @@ declare <4 x double> @fmf3()
  ; CHECK-LABEL: fastMathFlagsForCalls(
  define void @fastMathFlagsForCalls(float %f, double %d1, <4 x double> %d2) {
    %call.fast = call fast float @fmf1()
-  ; CHECK: %call.fast = call fast float @fmf1()
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'contract' and 'afn' bits set, so this is not fully 'fast'.
+  ; CHECK: %call.fast = call reassoc nnan ninf nsz arcp float @fmf1()
  
    ; Throw in some other attributes to make sure those stay in the right places.
  
diff --git a/test/Bitcode/compatibility-5.0.ll b/test/Bitcode/compatibility-5.0.ll

index a4b3fca82b7bf14b9a0ed3e36373e9b5095ab695..cdadc032d87b16d4dcad866c43de8b059115fe21 100644 (file)
--- a/test/Bitcode/compatibility-5.0.ll
+++ b/test/Bitcode/compatibility-5.0.ll
@@ -765,7 +765,9 @@ define void @fastmathflags(float %op1, float %op2) {
    %f.contract = fadd contract float %op1, %op2
    ; CHECK: %f.contract = fadd contract float %op1, %op2
    %f.fast = fadd fast float %op1, %op2
-  ; CHECK: %f.fast = fadd fast float %op1, %op2
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'afn' bit set, so this is not fully 'fast'.
+  ; CHECK: %f.fast = fadd reassoc nnan ninf nsz arcp contract float %op1, %op2
    ret void
  }
  
@@ -778,7 +780,9 @@ declare <4 x double> @fmf3()
  ; CHECK-LABEL: fastMathFlagsForCalls(
  define void @fastMathFlagsForCalls(float %f, double %d1, <4 x double> %d2) {
    %call.fast = call fast float @fmf1()
-  ; CHECK: %call.fast = call fast float @fmf1()
+  ; 'fast' used to be its own bit, but this changed in Oct 2017.
+  ; The binary test file does not have the newer 'afn' bit set, so this is not fully 'fast'.
+  ; CHECK: %call.fast = call reassoc nnan ninf nsz arcp contract float @fmf1()
  
    ; Throw in some other attributes to make sure those stay in the right places.
  
diff --git a/test/Bitcode/compatibility.ll b/test/Bitcode/compatibility.ll

index 7d4167f4cb0b71a5a22443d2a6641774d2e37e75..0157fd438a70f243d9ad695619cb119357eb4fc1 100644 (file)
--- a/test/Bitcode/compatibility.ll
+++ b/test/Bitcode/compatibility.ll
@@ -775,6 +775,10 @@ define void @fastmathflags(float %op1, float %op2) {
    ; CHECK: %f.arcp = fadd arcp float %op1, %op2
    %f.contract = fadd contract float %op1, %op2
    ; CHECK: %f.contract = fadd contract float %op1, %op2
+  %f.afn = fadd afn float %op1, %op2
+  ; CHECK: %f.afn = fadd afn float %op1, %op2
+  %f.reassoc = fadd reassoc float %op1, %op2
+  ; CHECK: %f.reassoc = fadd reassoc float %op1, %op2
    %f.fast = fadd fast float %op1, %op2
    ; CHECK: %f.fast = fadd fast float %op1, %op2
    ret void
diff --git a/unittests/IR/IRBuilderTest.cpp b/unittests/IR/IRBuilderTest.cpp

index d361107cc0d2062a73dc9f8bd75f14384f155625..bb74756d81a9fb193150bbd7fd555f2db4da9eca 100644 (file)
--- a/unittests/IR/IRBuilderTest.cpp
+++ b/unittests/IR/IRBuilderTest.cpp
@@ -144,17 +144,40 @@ TEST_F(IRBuilderTest, FastMathFlags) {
    FastMathFlags FMF;
    Builder.setFastMathFlags(FMF);
  
+  // By default, no flags are set.
    F = Builder.CreateFAdd(F, F);
    EXPECT_FALSE(Builder.getFastMathFlags().any());
+  ASSERT_TRUE(isa<Instruction>(F));
+  FAdd = cast<Instruction>(F);
+  EXPECT_FALSE(FAdd->hasNoNaNs());
+  EXPECT_FALSE(FAdd->hasNoInfs());
+  EXPECT_FALSE(FAdd->hasNoSignedZeros());
+  EXPECT_FALSE(FAdd->hasAllowReciprocal());
+  EXPECT_FALSE(FAdd->hasAllowContract());
+  EXPECT_FALSE(FAdd->hasAllowReassoc());
+  EXPECT_FALSE(FAdd->hasApproxFunc());
  
-  FMF.setUnsafeAlgebra();
+  // Set all flags in the instruction.
+  FAdd->setFast(true);
+  EXPECT_TRUE(FAdd->hasNoNaNs());
+  EXPECT_TRUE(FAdd->hasNoInfs());
+  EXPECT_TRUE(FAdd->hasNoSignedZeros());
+  EXPECT_TRUE(FAdd->hasAllowReciprocal());
+  EXPECT_TRUE(FAdd->hasAllowContract());
+  EXPECT_TRUE(FAdd->hasAllowReassoc());
+  EXPECT_TRUE(FAdd->hasApproxFunc());
+
+  // All flags are set in the builder.
+  FMF.setFast();
    Builder.setFastMathFlags(FMF);
  
    F = Builder.CreateFAdd(F, F);
    EXPECT_TRUE(Builder.getFastMathFlags().any());
+  EXPECT_TRUE(Builder.getFastMathFlags().all());
    ASSERT_TRUE(isa<Instruction>(F));
    FAdd = cast<Instruction>(F);
    EXPECT_TRUE(FAdd->hasNoNaNs());
+  EXPECT_TRUE(FAdd->isFast());
  
    // Now, try it with CreateBinOp
    F = Builder.CreateBinOp(Instruction::FAdd, F, F);
@@ -162,21 +185,23 @@ TEST_F(IRBuilderTest, FastMathFlags) {
    ASSERT_TRUE(isa<Instruction>(F));
    FAdd = cast<Instruction>(F);
    EXPECT_TRUE(FAdd->hasNoNaNs());
+  EXPECT_TRUE(FAdd->isFast());
  
    F = Builder.CreateFDiv(F, F);
-  EXPECT_TRUE(Builder.getFastMathFlags().any());
-  EXPECT_TRUE(Builder.getFastMathFlags().UnsafeAlgebra);
+  EXPECT_TRUE(Builder.getFastMathFlags().all());
    ASSERT_TRUE(isa<Instruction>(F));
    FDiv = cast<Instruction>(F);
    EXPECT_TRUE(FDiv->hasAllowReciprocal());
  
+  // Clear all FMF in the builder.
    Builder.clearFastMathFlags();
  
    F = Builder.CreateFDiv(F, F);
    ASSERT_TRUE(isa<Instruction>(F));
    FDiv = cast<Instruction>(F);
    EXPECT_FALSE(FDiv->hasAllowReciprocal());
-
+ 
+  // Try individual flags.
    FMF.clear();
    FMF.setAllowReciprocal();
    Builder.setFastMathFlags(FMF);
@@ -225,7 +250,25 @@ TEST_F(IRBuilderTest, FastMathFlags) {
    FAdd = cast<Instruction>(FC);
    EXPECT_TRUE(FAdd->hasAllowContract());
  
+  FMF.setApproxFunc();
+  Builder.clearFastMathFlags();
+  Builder.setFastMathFlags(FMF);
+  // Now 'aml' and 'contract' are set.
+  F = Builder.CreateFMul(F, F);
+  FAdd = cast<Instruction>(F);
+  EXPECT_TRUE(FAdd->hasApproxFunc());
+  EXPECT_TRUE(FAdd->hasAllowContract());
+  EXPECT_FALSE(FAdd->hasAllowReassoc());
+  
+  FMF.setAllowReassoc();
    Builder.clearFastMathFlags();
+  Builder.setFastMathFlags(FMF);
+  // Now 'aml' and 'contract' and 'reassoc' are set.
+  F = Builder.CreateFMul(F, F);
+  FAdd = cast<Instruction>(F);
+  EXPECT_TRUE(FAdd->hasApproxFunc());
+  EXPECT_TRUE(FAdd->hasAllowContract());
+  EXPECT_TRUE(FAdd->hasAllowReassoc());
  
    // Test a call with FMF.
    auto CalleeTy = FunctionType::get(Type::getFloatTy(Ctx),
author	Sanjay Patel <spatel@rotateright.com>
	Mon, 6 Nov 2017 16:27:15 +0000 (16:27 +0000)
committer	Sanjay Patel <spatel@rotateright.com>
	Mon, 6 Nov 2017 16:27:15 +0000 (16:27 +0000)
docs/LangRef.rst		patch \| blob \| history
include/llvm/IR/Instruction.h		patch \| blob \| history
include/llvm/IR/Operator.h		patch \| blob \| history
include/llvm/Transforms/Utils/LoopUtils.h		patch \| blob \| history
lib/AsmParser/LLLexer.cpp		patch \| blob \| history
lib/AsmParser/LLParser.h		patch \| blob \| history
lib/AsmParser/LLToken.h		patch \| blob \| history
lib/Bitcode/Reader/BitcodeReader.cpp		patch \| blob \| history
lib/Bitcode/Writer/BitcodeWriter.cpp		patch \| blob \| history
lib/CodeGen/ExpandReductions.cpp		patch \| blob \| history
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp		patch \| blob \| history
lib/IR/AsmWriter.cpp		patch \| blob \| history
lib/IR/Instruction.cpp		patch \| blob \| history
lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp		patch \| blob \| history
lib/Target/AMDGPU/AMDGPULibCalls.cpp		patch \| blob \| history
lib/Transforms/InstCombine/InstCombineAddSub.cpp		patch \| blob \| history
lib/Transforms/InstCombine/InstCombineCalls.cpp		patch \| blob \| history
lib/Transforms/InstCombine/InstCombineMulDivRem.cpp		patch \| blob \| history
lib/Transforms/Scalar/Reassociate.cpp		patch \| blob \| history
lib/Transforms/Utils/LoopUtils.cpp		patch \| blob \| history
lib/Transforms/Utils/SimplifyLibCalls.cpp		patch \| blob \| history
lib/Transforms/Vectorize/LoopVectorize.cpp		patch \| blob \| history
lib/Transforms/Vectorize/SLPVectorizer.cpp		patch \| blob \| history
test/Assembler/fast-math-flags.ll		patch \| blob \| history
test/Bitcode/compatibility-3.6.ll		patch \| blob \| history
test/Bitcode/compatibility-3.7.ll		patch \| blob \| history
test/Bitcode/compatibility-3.8.ll		patch \| blob \| history
test/Bitcode/compatibility-3.9.ll		patch \| blob \| history
test/Bitcode/compatibility-4.0.ll		patch \| blob \| history
test/Bitcode/compatibility-5.0.ll		patch \| blob \| history
test/Bitcode/compatibility.ll		patch \| blob \| history
unittests/IR/IRBuilderTest.cpp		patch \| blob \| history