Skip to content

Conversation

@umangyadav
Copy link
Member

@umangyadav umangyadav commented Dec 18, 2025

Motivation

Pulls changes from upstream llvm upto this commit
https://github.com/ROCm/llvm-project/tree/95f194e3c67b77d6326e2ce71718842414b92594

Technical Details

External LIT Tests

  • check-llvm
  • check-mlir

Jenkins Internal CI

  • Weekly (parameterSweeps + Tuning)
  • Nightly CI
  • PR CI

MIGraphX CI

Performance

  • Compare tuning runtime on CI with one of the weekly runs to see if tuning time has regressed or not.

Navi2X

  • parameterSweeps (manual running)
  • Nightly E2E tests (w/ fixed data)
  • Nightly E2E tests (w/ random data)
  • PR E2E tests

List of files that are different

Diff files for review

Open these files in notepad++ or any DiffViewer with "diff" language
diff -rup upstream/llvm-project/llvm rocMLIR/external/llvm-project/llvm &> llvm_diffs.txt
llvm_diffs.txt

diff -rup upstream/llvm-project/mlir rocMLIR/external/llvm-project/mlir &> mlir_diffs.txt
mlir_diffs.txt

@pabloantoniom @justinrosner can you please check if all your "EXTERNAL" commit changes are in here or not ?

…c67b

95f194e3c67b merge main into amd-staging (#884)
100ba3c7978c update revert_patches.txt
8723e1f1c101 AMDGPU/PromoteAlloca: Fix handling of users of multiple allocas (#172771)
3b2ef9209499 AMDGPU/PromoteAlloca: Refactor into analysis / commit phases (#170512)
d4a1bfb8fb9b merge main into amd-staging (#882)
29c3943574f2 merge main into amd-staging
0206f18a9cc3 [clang-tidy][NFC] Remove redundant braces with clang-format 'RemoveBracesLLVM' (3/N) (#172752)
233a88579fd5 [clang-tidy][NFC] Remove redundant braces with clang-format 'RemoveBracesLLVM' (1/N) (#172748)
cba7bb9d2f8a [LLVM][CodeGen][X86] Make printConstant's output for vector ConstantFP match that of ConstantVector. (#172679)
5f84dfff532b [X86] Add tests showing failure to concat matching CVTPS2DQ/CVTTPS2DQ vector ops (#172836)
6f748698f74a Revert "[bolt][aarch64] simplify rodata/literal load for X86 & AArch6… (#172822)
5c05824d2bd3 [CodeGen] Rename expand-fp to expand-ir-insts (#172681)
80e354837202 [llvm][AMDGPU] Fix signed/unsigned comparison warning in 32-bit builds (#172623)
11e8237545da [LowerAllowCheck] Move tests to Transforms/LowerAllowCheck (#172028)
35c2dbd4814a [AMDGPU] Remove trivially true predicates from GCNSubtarget. NFC. (#172830)
f4e941b20988 [lldb] Use AST nodes as Subscript and BitExtraction arguments in DIL (#169363)
d6f159dd05a2 AMDGPU: Add pattern for copysign of 0 (#172699)
7fe5953a44bb [lldb][windows] add Windows Virtual Console support (#168729)
0af137c0b018 [CIR] Introduce syntax for scalable vectors (#172683)
4e859c5a95ec [clang] Fix an invalidate iterator in PCH with -ftime-trace enabled. (#172658)
8cfda791054b [HLSL][SPIR-V] Implement vk::push_constant (#166793)
e957c8175001 [InstCombine] Use getSigned() for negative number in shift transform
1c06165c9b20 [libc++] Make std::align an inline function (#167472)
b036d70eeef7 Revert "[OMPT][Offload][OpenMP] Fixes for OMPT data used by libomptarget" (#172827)
64e3de71b0f2 [gn build] Port 2093d2eea1d2
c8bf96328297 [AArch64][SVE] Rework VECTOR_COMPRESS lowering (#171162)
c501a71e8a66 [CIR] Add flattened version of ResumeOp (#172534)
ad761003103f [clang][bytecode] Add missing cstdint include
726452720ccf [OMPT][Offload][OpenMP] Fixes for OMPT data used by libomptarget (#156020)
2093d2eea1d2 [clang][bytecode] Refactor `InitMapPtr` (#172665)
98b82f90dfb7 [PowerPC]: Add check for cast when shufflevector (#172443)
e655317cf134 [LV][EVL] Add test case for checking debug info when tail folding by EVL. nfc (#172429)
f8ab74f25e6c [RISC-V] Add schedule information for Qualcomm extensions (#172601)
71760f324ff9 [CodeGen] Merge ExpandLargeDivRem into ExpandFp (#172680)
f125c8db5c73 [OFFLOAD] Add plugin with support for Intel oneAPI Level Zero (#158900)
24d9550b27c6 [VectorCombine] foldShuffleOfBinops - if both operands are the same don't duplicate the total new cost (#172719)
999c9382571d [bolt][aarch64] simplify rodata/literal load for X86 & AArch64 (#165723)
055afb7db3b2 [llvm][docs] Forbid AI use for "good first issue" issues (#172515)
50ea2d855155 [RISCV] Extract vector from passthru when combining tuple_extract+vlseg. (#172743)
b9bd1dad5ae9 merge main into amd-staging (#878)
8e648380a173 [LoongArch][NFC] Add tests for issue #172154
6d405d6b5eb4 [RISCV] Replace enablePExtCodeGen with hasStdExtP for scalar code in RISCVISelDAGToDAG.cpp (#172785)
5623d1f4e67f [mlir][Linalg] Allow more control in drop unit dims (#171796)
55ff003344b7 [LoongArch][NFC] Partial revert "Custom lowering for vector logical right shifts of integers"
1e90a273fe60 [RISCV][llvm] Support fminimum, fmaximum, fminnum, fmaxnum, fminimumnum, fmaximumnum codegen for zvfbfa (#171794)
4d7382f1e96a [RISCV] Make an assert in RISCVDAGToDAGISel::Select stricter. NFC
48c8c4547584 [ORC] Fix typo in method name. NFCI. (#172787)
4a8e6a36b668 [BOLT][AArch64] Speed up ICF pass (#172783)
188bf2769a59 Fix variable only used by assert in #172716. (#172788)
b2ce1a981efd [clang-tidy][NFC] Replace `llvm::StringLiteral` with `StringRef` (#172765)
686c2a14766a [clang-tidy][NFC] Prefer `isa<T>` over `T::classof` (#172772)
8a0cdb88f911 [RISCV] Add short forward branch support for `qc.e.lb(u)`, `qc.e.lh(u)` and `qc.e.lw` (#172629)
f34900b7dd0f [lldb-dap] Bump the version to 0.4.1
cd081f8a5a4a [RISCV] Enable the zext.h alias for PACK with P. (#172756)
6cd651ae21f2 Revert "Make result variables obey their dynamic values in subsequent expressions (#168611)" (#172780)
cd75676928ad [RISCV] Prefer li over pli in RISCVMatInt. (#172778)
52a2d53eb50a merge main into amd-staging
5c5046397cf6 [OpenMP][Offload] Fix upstream merge issue (#873)
f15c56637efa merge main into amd-staging (#877)
35b2b24e624b Implement reserveAllocationSpace for SectionMemoryManager (#71968)
e88f3d8d8022 [bazel] Port 2b9e47749ca1eb337ea26b8084dad52732dc7186 (#172776)
fa9b3059546e [bazel] Port 7f1a30ebd242b2a55e8393717f1e594f9cd61569 (#172712)
457f93d4483f [LoongArch] Fix OptimizeW crash when MI operand is not a virtual register (#172604)
88461e8f585f [RISCV] Prevent unnecessary calls to hasAllBUsers/AllHUsers. NFC (#172768)
94e03a78942a [RISCV] Enable use of PACK in RISCVMatInt with P extension. (#172760)
54eee1e947f7 Reapply "[mlir][math] Add FP software implementation lowering pass: math-to-apfloat" (#172714) (#172716)
796fafeff92f [IR] Update `PHINode::removeIncomingValueIf()` to use the swap strategy like `PHINode::removeIncomingValue()` (#172639)
5766b8dfed58 [gn build] Port f58d2f32c0eb
10d4e73fed27 [gn build] Port e4c4498798b0
5f4b9ae8514a [clang][deps] Extract `CompilerInvocation` creation (#172744)
5d1618b20110 [BOLT][AArch64] Use minimal code alignment for cold functions (#172598)
0036c6744595 [RISCV]: Implemented softening of `FCANONICALIZE` (#169234)
84ae97d2016a merge main into amd-staging
b1d0e5f43b2d [gn] port a1d3cdc88aa2
f8377f59fbae [lldb/test] Fix libcxx configuration handling for remote platforms (#172761)
af38a1e83f69 merge main into amd-staging (#875)
76a17d99a748 [MLIR] Fix AsmPrinter alias uniqueness check (#172734)
9c7708a20fd8 [CAS] For `tryLockFileThreadSafe()` return immediately if timeout is 0 (#172729)
79ed37ca9247 [MLIR][Transform] Fix transform.smt.constrain_params's verifier (#172753)
5acdf790d3f3 [clang] NFC: Qualify argument with `const` (#172745)
bbf48d77ffe0 [RISCV] Fix incorrect check-prefix in rv32zbkb-only-valid.s. NFC
9cc1585b137c [VPlan] Add VPBlockUtils::transferSuccessors (NFCI).
4014d83cf7c3 clang: Remove unnecessary host-supports-cuda from test (#171174)
eecce1689a56 ValueTracking: Add baseline test for amdgcn_rcp handling (#172489)
53005fd43542 Use the Propeller CFG profile in the PGO analysis map if it is available. (#163252)
7b9168caa9c2 ValueTracking: Handle amdgcn_exp2 in computeKnownFPClass (#172495)
82cdc300fa88 ValueTracking: Handle amdgcn_log in computeKnownFPClass (#172492)
e7892d702fe7 [DAGCombiner] Fix assertion failure in vector division lowering (#172321)
e7ef7229d376 [MLIR][XeGPU] Fix the test issue brought by PR172125 (#172742)
316116e0cdde [scudo] Add an option to zero memory on deallocation. (#142394)
60b6c53f2575 [HLSL][Matrix] Add support for single subscript accessor (#170779)
26c9598c103a [lldb] Fix TestIvarProtocols to use +new instead of +alloc (NFC) (#172740)
534f334b84ad Revert "Reland [clang][modules-driver] Add scanner to detect C++20 module presence (#153497)" (2nd Attempt) (#172701)
2824225d27cc [LLD] [COFF] Fix implicit DLL entry point for MinGW (#171680)
21fd8cc2a558 Make STLExtras's (all|any|none)_of() Utility Functions Constexpr-Friendly (#172536)
33d7cd67a829 ValueTracking: Add baseline test for amdgcn_exp2 handling (#172494)
3734ace841fe ValueTracking: Add baseline test for nofpclass handling of amdgcn_log (#172491)
ad5a02a603cc [ORC] Rewrite loop to use structured bindings. NFCI. (#172631)
a587ccd87d6f fix `llvm.fma.f16` double rounding issue when there is no native support (#171904)
558760009cf4 [ORC] Rename WrapperFunctionResult to WrapperFunctionBuffer. NFCI. (#172633)
b6d06de34d0d Revert "[flang][cuda] Add support for derived-type initialization on device" (#172737)
c6400cd3a904 device-libs: Use copysign in daz_opt path of conversion functions (#872)
b6bfa856860b [aarch64] Mix the frame pointer with the stack cookie when protecting the stack (#161114)
1d374fdb0625 merge main into amd-staging
bab0dc4d4841 Reapply "[LV] Mark checks as never succeeding for high cost cutoff."
db1fd3fd2a0c [clang] Remove pointless `hasDiagnostics()` checks (#172705)
f09f578c0d89 [NVPTX][DagCombiner] Eliminate guards on shift amount because PTX shifts automatically clamp (#172431)
521080386e13 [libc] Cast the error value to int (#172724)
c8cca1eb9f18 [sanitizer_common] [Darwin] Add inline frame support for AtosSymbolizer (#170815)
a450c3801241 [libc][fenv] Fix performance regression for x86 mxcsr utilities. (#172717)
8d3fb12b5b2f [clang-format] Fix comparison warning in 32-bit builds (#172627)
2b9e47749ca1 [MLIR][XeGPU] Refactor Layout access interface  (#172125)
51ef9302a6f7 [MLIR] Apply clang-tidy fixes for llvm-qualified-auto in TransformInterfaces.cpp (NFC)
7794502b33e4 [MLIR] Add non-regression bytecode test for #170993
a4baf2c7ffc7 Revert "[mlir][bytecode] Add support for deferred attribute/type parsing. (#170993)"
399b33086f1c AMDGPU: Add baseline tests for fcopysign with 0 magnitude (#172698)
7ead626bf1c8 [MLIR] Apply clang-tidy fixes for modernize-use-equals-default in TestXeGPUTransforms.cpp (NFC)
bf161806dc6e [MLIR] Apply clang-tidy fixes for readability-identifier-naming in MemRefOps.cpp (NFC)
841d20f93f93 [llc][NPM] Return error code instead of calling exit() in NPM driver (#171250)
c7706d9472fe [flang] Don't evaluate initializers for arrays with invalid rank (#171163)
dd63127b5678 [libc] Support opaque FILE* on baremetal (#168931)
2b89b6f01363 merge main into amd-staging (#870)
d176c8d20f22 [VectorCombine] foldShuffleOfBinops - add test showing failure to recognise that the new shuffle is repeated (so only a single cost) (#172708)
621fe03eaad3 Revert "[mlir][math] Add FP software implementation lowering pass: math-to-apfloat" (#172714)
208553460a70 [lldb] Prefer exact address match when looking up symbol by address (#172055)
a34118044443 [bazel] fix #170267 (#172697)
7f1a30ebd242 [mlir][math] Add FP software implementation lowering pass: math-to-apfloat (#171221)
6ac581975940 [flang][OpenMP] Convert more clauses in pre-lowering (#172334)
658df648d993 [LifetimeSafety] Cross-TU Vs Intra-TU Annotation Suggestions (#171972)
ca2275b25fea [flang][cuda] Add support for derived-type initialization on device (#172568)
12cb3763216d [flang][OpenMP] Diagnose whole assumed-size arrays on two clauses (#172510)
966b5217e60f [libc++] Fix the formatting for percents in compare-benchmarks
e2d21b2eb870 [SPIR-V] Legalize vector arithmetic and intrinsics for large vectors (#170668)
8618bb0e4cb0 [libc++] Add the --ignore-under option to compare-benchmarks
b5dd6cc74642 [clang][BufferUsage] Fix crash when parsing invalid format string (#170496)
68e4a7530fd3 [LifetimeSafety] Guard debug-only function with NDEBUG (#172696)
3c55e9a5e801 [NFC][clang][DependecnyScanning] Fix else after return in bb1bfb1c7558931c59533fed312db6b7d0454f4b.
1e9af7ffb1a4 [CIR] Support PredefinedExpr for ConstantLValue (#172564)
5f15fee8acc7 [mlir][amdgpu] Add tensor load store operations (#172686)
e5fe825c1338 [Support] On Windows, fix rpmalloc TLS destructor when running inside a DLL (#171465)
ea2f797db262 device-libs: Use copysign in daz_opt path of conversion functions
05218825ee46 [libc++] Fix `ranges::for_each` taking whole associative containers (#172605)
e5f93db6c054 [flang][NFC] Strip trailing whitespace from tests (14 of 14)
e128c4612fc2 [flang][docs] Fix text and links in doxygen landing page
2f715e96c027 [CIR][X86] Add support for alignd builtins (#171621)
32feb01dc3c9 [lldb][NFCI] Remove unused field in BreakpointResolverName (#172574)
fbe5e8c9f482 Revert "Revert "Reland [clang][modules-driver] Add scanner to detect C++20 module presence (#153497)"" (#172693)
1f2818cea3f6 libc: Prefix Rounding Control Error enum to avoid conflict in windows build (#172562)
fa78d6a5f15e [SDAG] Shrink (abd? (?ext x) (?ext y)) (#171865)
d4e732314869 [PPC] mark ppc/fixtfti_test.c and ppc/fixunstfti_test.c as UNSUPPORTED because they pass/fail in different builds
4947f68970bb Revert "Reland [clang][modules-driver] Add scanner to detect C++20 module presence" (#172655)
b8ee6ed79449 [X86] combineConcatVectorOps - add handling for FADD/FMUL etc. with a repeated vector op (#172682)
550ccd22f4df [clang][NFC] Use range-based for loop and algorithms in `SemaDeclCXX.cpp` (#169938)
97cf9ea3b93b [clang][NFC] `inferNoReturnAttr` modifies the `Decl`, so it shouldn't be marked `const` (#172571)
5c371b14b0ae [clang][NFC] `getAsVoidPointer` and `getFromVoidPointer` should deal in pointers to `const` (#172572)
06a7aceb4d43 default to LIBOMPTARGET_TREAT_ATTACH_AUTO_AS_ALWAYS on for 534.hpgmgfv
a3b79893da47 [X86] lowerBuildVectorToBitOp - handle cases where either side folds to vector load (#172467)
a7778cac5fae remaster  lit tests for Use  map-type...
77455615a4be [MLIR][GPUToLLVMSPV] Use global & local memory scope for GPUBarrierConversion (#169026)
2fc0d8cb9a65 [X86] combineConcatVectorOps - add handling for ISD::SMIN/SMAX/UMIN/UMAX vector ops (#172667)
a452be50077d [bazel] fix PR172479 for bazel (#172676)
cc23ed207c71 [OpenMP][Clang] Use `ATTACH` map-type for list-items with base-pointers. (#153683)
b9d6ad9ce915 Revert "[mlir][amdgpu] Add tensor load store operations (#170918)" (#172671)
4e4bba73320c [AArch64][llvm] Add intrinsics for SVE BFSCALE (#172025)
ecb94bcfe27d [Offload] Debug message update part 3 (#171684)
101c6ede3b06 [Offload] Debug message update part 2 (#171683)
a98a99da45fe [Offload] Debug message update part 1 (#171672)
c3a5ec03608d [NFC][Offload] Missing test change in #153683 (#172587)
a6f38b37c674 [libc++][vis-historical] Don't create a git.Repo for each commit
c6f3b202792d [libc++] Disable the generation of relative deltas in compare-benchmarks --kpis
7149e05b7f01 [CIR][AArch64] Add lowering for `svlen` builtins (#172346)
edb45d8ae4b3 [SDAG] Allow implicit trunc in BUILD_VECTOR legalization
b8972a70d472 [X86] Add tests showing failure to concat fmuls with repeated vector ops (#172654)
f053ff732df8 [clang] Avoid invalid iterator in MergeDefinitionData (#172559)
f9d15d5de9a0 merge main into amd-staging (#869)
b963def2cb44 [ELFDebugObjectPlugin] Gracefully handle missing section (#172622)
47fc3992ba3c [MLIR] Extend the extractvalue fold method (#172297)
c7af990cb7b0 Reland [mlir][gpu] Use `SmallString`, `FailureOr` and `StringRef` in `module-to-binary` infra (NFC) (#172390)
839090984261 [mlir][x86vector] Lower BF16 vector.contract to FMA using AVX2 BF16 packed ops. (#170267)
c27fb0f9e84c [X86] combineConcatVectorOps - add handling for X86ISD::FMAX/FMIN vector ops (#172648)
f719e11cc17f [MLIR] Apply clang-tidy fixes for misc-use-internal-linkage in SparseTensorDialect.cpp (NFC)
9cae8ec551e5 [MLIR] Apply clang-tidy fixes for bugprone-argument-comment in SparseTensorDescriptor.cpp (NFC)
8cc9c690ebdf [MLIR] Fix clang-tidy fixes for llvm-prefer-isa-or-dyn-cast-in-conditionals in AMDGPUToROCDL.cpp (NFC)
c386d6d3bfa4 Thread Safety Analysis: Add more cast pointer-alias tests (#172638)
1741bf9bc180 [Offload] Fix incorrect size used in llvm-offload-device-info tool (#868)
e4c4498798b0 [lldb][ObjC][NFCI] Replace StringLexer with llvm::StringRef (#172466)
28d4e33b654b [AMDGPU][SIInsertWaitCnt] Optimize loadcnt insertion at function boundaries (#169647)
06e4728f83e5 [Polly] Recalculate dependencies after import-jscop (#172640)
a2715f031f8f [AArch64][llvm-objdump] Fix arm64_32 symbolization (#171164)
04751b47607d [AArch64][llvm-objdump] Add missing arm64_32 architecture checks (#171638)
ce553ab69f0f Revert "[mlir][amdgpu] Expose waitcnt bitpacking infra (#172313)" (#172636)
6390373461b6 [X86] Add tests showing failure to concat matching fmin/fmax vector ops (#172635)
44a52ea8beed [InstCombine] Fix unsafe PHINode cast and simplify logic in PointerReplacer (#172332)
921a6471dc27 [CGObjCGNU] Use getSigned() for instanceSize
dea9ec84a40a [SLSR] Allow implicit truncation for element size
3186ca25bc6c [ARM] Use getSigned() for signed value
30ce1e9c7a50 [CGExprScalar] Allow implicit truncation for CharacterLiteral
857748d4afbf [PatternInit] Explicitly allow implicit truncation
744552d8703e [CodeGen] Use getSigned() for negative values
159f1c048e08 [IR] Optimize PHINode::removeIncomingValue() by swapping removed incoming value with the last incoming value. (#171963)
80f3c0db901a [DA] Introduce OverflowSafeSignedAPInt to prevent potential overflow (#171991)
93013817afab [mlir][amdgpu] Expose waitcnt bitpacking infra (#172313)
dfe84fbe9383 [MLIR][NFC] Fix the pass description to describe what it actually does. (#172306)
fd31ab968771 [llvm][examples] Run LLJITWithRemoteDebugging test only on native Linux 64-bit Intel (#172518)
462393c73180 merge main into amd-staging
f58d2f32c0eb [clang][ssaf] Add SummaryName handle type (#172474)
0ecadb0422c3 [DTLTO] Remove temporary files on abnormal exit, e.g. CTRL-C (#172280)
90adfb2774d2 [AArch64][SME] Add pass remarks to the MachineSMEABIPass (#170277)
6bc779506107 Reapply "[clang][analyzer] Format macro expansions" (#172479)
60418dd8c2b2 [llvm-objdump] Fix memory leak in `mcpuHelp()` (#172594)
10ed050b34bd [libc][CPP] make the string trap on OOM (#172260)
a6f837e9f83f [MLIR][XeGPU] Add sg layout propagation (#170879)
1ba5afeb8c5f [DWARF] Support block-form DW_AT_GNU_call_site_target_clobbered (#172450)
8570c33d7ce7 [Offload] Fix incorrect size used in llvm-offload-device-info tool
a31964f6ccb1 device-libs: Remove DAZ_OPT check in atan2/atan2pi (#863)
f2d48dd15b2e [libcxx][ci] Remove myself as a contact for Linaro bots (#172620)
b512c1971640 [clang][win] Fix size passed to delete[] in vector deleting destructors (#172513)
c563a73cd34b [lldb][docs] Fix formatting in troubleshooting guide
b7a20c1cc4ea [GlobalISel] Don't permit G_*MIN/G_*MAX of pointer vectors (#168872)
31e757d3a124 [lldb][docs] Add register info problems to troubleshooting doc (#171806)
5bfd57e7c9f8 [clang][analyzer] CallAndMessage warnings at pointer to uninitialized struct (#164600)
fcf762c6ecfc [RISCV] Implement Clang Builtins for XAndesPerf Extension (#147018)
d37b43e172d7 [bazel] export Runtimes.h from GpuToROCDLTransforms (#172607)
d502ff094937 [OpenMP][Offload] Add support for lambdas with debug conditions (#172573)
dec5d6637450 [ORC] Make WrapperFunctionResult constructor explicit. (#172602)
19b28074618c [LifetimeSafety] Implement multi-level origins (#168344)
bfbd3ea493e8 [NFC][llvm-ir2vec] Refactoring llvm-ir2vec.cpp for better separation of concerns in the Tooling classes (#170078)
5bf9aa3eda4c merge main into amd-staging (#866)
f440b5c12df5 [BOLT] Synchronize function and section order (#172419)
bc7b6b1745a9 [NFC] Missing riscv-registered-target in the test (#172596)
99553bcdef95 [lldb] [docs] [sbapi]: fix lldb-forward header name (#172578)
fe577b1f3a47 [AST][RISCV] Preserve RISC-V intrinsic pragma in AST (#171981)
282e8ea78eb4 [AMDGPU] Regenerate gfx11/12 dpp disasm test checks. NFC (#172583)
b394bd0d2177 [flang] Support -funsafe-cray-pointers (#172252)
9d76b2cfce0b [MLIR][NVVM] Fix the lowering of legacy mbar.arrive (#172476)
1ff6baff660e merge main into amd-staging (#865)
5bf5657a97d9 [RISCV] Add short forward branch support for `lb`, `lbu`, `lh`, `lhu`, `lw`, `lwu` and `ld` (#170829)
e04ce74fe252 Revert "[llvm-objdump] Support --mcpu=help/--mattr=help without -d" (#172586)
09babf44b634 merge main into amd-staging
e87e2646cf67 [clang-doc] Add a "Home" link to navbar (#171900)
a1d3cdc88aa2 [clang-doc] Add a top-level index.html page (#171899)
c5c6c6fc878b [clang-doc] Add namespaces to namespace template (#172076)
20449bcc9fce [flang][docs] Add an FAQ about an executable stack (#171241)
c9df272c6869 [clang-tidy] Fix parentheses handling in `readability-math-missing-parentheses` (#172423)
8a6fae4208c3 [RISCV] Remove outdated comment in RISCVInstrInfoZvqdotq.td.
cf8e47b24419 merge main into amd-staging (#864)
816c9d64a7fd [TargetLowering] Use getNegative. NFC (#172526)
f7c156799dd2 merge main into amd-staging
c62cd2877cc2 [OpenMP][Offload] Add `LIBOMPTARGET_TREAT_ATTACH_AUTO_AS_ALWAYS` to treat `attach(auto)` as `attach(always)`. (#172382)
ffe973a3e76e [CI] Make premerge advisor exit with code 0 if failures are explained
67389171abd7 [CI] Use the exit code from the premerge advisor
60587b21d485 [compiler-rt] [Darwin] OS/feature detection should use the run-wrapper (#171167)
6b183f4cfdb4 [Codegen][NewPM] Explicitly Nest Passes in CodegenPassBuilder (#169867)
68aea8e202cb AMDGPU: Avoid introducing unnecessary fabs in fast fdiv lowering (#172553)
b4144c134488 [sanitizer_common] Fix missing `check-sanitizer` deps under LLVM_ENABLE_RUNTIMES (#170817)
e98a765020ab [libclang/python] Remove unused exception variable (#172516)
2d08b0c5f09d Revert "[OpenMP][Offload] Add support for lambdas with debug conditions" (#172570)
36655fca1f71 [RISCV] Guard zero-stride load SchedPredicate with subtarget feature as well (#172111)
b971b510d6f3 AMDGPU: Add baseline test for redundant fabs on fdiv expansion (#172552)
2b08cd693e4b update revert_patches.txt  [OpenMP][Offload] Add support for lambdas with debug conditions (#172107)
97cbb692dbce merge main into amd-staging
9c9b6931d9c7 [TableGen][SchedModel] Add logical combiners for SchedPredicates (#172106)
6751eb9137a5 [clang][NFC] Make `CFGElement` `const`-correct (#172547)
4181a0e22c1b Revert "[OpenMP][Offload] Add support for lambdas with debug conditions (#172107)"
eb4866d569c9 merge main into amd-staging
7d126930bd26 [gn build] Port 2c05ae4b8f85
492340aeb143 [MLIR][XeGPU] Add handling for unit-dim expansion in ShapeCast workgroup-to-subgroup distribution (#171758)
c42aece4b530 [clang][NFC] Remove useless `const_cast` in `CFGWalker` (#172549)
a357cac34c1a [clang][NFC] Make `ClassifyRefs` and `TransferFunctions` const-correct (#172544)
97ce113e490d device-libs: Remove DAZ_OPT check in atan2/atan2pi
26dad2cf4c21 merge main into amd-staging
2c05ae4b8f85 [LFI] Introduce AArch64 LFI Target (#167061)
eb1876c960ec DAG: Fix arith_fence handling in SignBitIsZeroFP (#172537)
c956ac300d60 [compiler-rt][ppc] add powerpc fixup of config.compiler_rt_libdir when building with LLVM_ENABLE_PER_TARGET_RUNTIME_DIR=ON (#172361)
83586140f0ec [bazel][libc] Remove unused dep in //libc:pkey_mprotect. (#172540)
33582a9f0538 merge main into amd-staging (#861)
ea9addae8336 [bazel] Port 3c97829d971d133c8984987271a31b90da64da84
601c903dacc0 [libc++] Rename the wg21 paper label to wg21-paper
eb0c7e752f01 [VPlan] Replace BranchOnCount with Compare + BranchOnCond (NFC). (#172181)
cea15b61de91 [MsgPack] Use JSON schema boolean resolution rules (#170561)
18b38cf546a2 [CIR] Support Try catch with handler for specific type (#171042)
13f7b308d23b [LLVM] Remove the requirement for named pipe in jobserver (#169154)
cb43ae446fa0 [libc][fuzzing] Improve printf long double fuzzing (#172113)
50955d6b89ef [C++20][Modules][NFC] Some minor non-functional fixes (#172524)
39f68f737d01 [ADT][NFC] Move FoldingSetBase definition lower in the file (#172503)
8725400921a1 [flang][driver] Add options -fdefault-integer-4 and -fdefault-real-4
b2880eac7c09 [lldb][ClangASTSource][NFC] Clean up RegisterNamespaceMap (#172520)
0729a74e66ae [ROCDL] Added s_wakeup_barrier (GFX1250) (#172320)
3c97829d971d [libc] Add Darwin mutex support via os_sync primitives (#167722)
fd98eae9e059 [Github][CI] Bump VS in Windows Container to VS2022 (#172396)
caeeec0eb1fa [gn build] Port eb3298feda3c
73bcc19aaef3 [OpenMP][Offload] Add support for lambdas with debug conditions (#172107)
8dd75fa473b0 [OpenMP][Offload] Revert format of changed messages (#171995)
eaf6d9a2ff50 [LLDB] Run MSVC STL atomic tests with PDB (#172349)
6da200f72c5a Fix llvm::StripTemplateParameters to not return an empty name. (#157553)
156fc6db2641 [clang][NFC] Remove unused CFGStmtMap.h includes (#172362)
ba05883304d6 [clang][NFC] In `CFGStmtMap`, remove mutable `getBlock` overload. (#172364)
1bd0ec432ff4 [clang-tidy][NFC] Add `doc8` documentations (#167269)
51cdebf33989 [AMDGPU] SIOptimizeExecMaskingPreRA: Fix crash on exec copy fold into INLINEASM (#172481)
0badd4e2cb21 [clang][deps] Prefer `DiagnosticConsumer` over `llvm::{Error,Expected}` (#172389)
eb3298feda3c [Clang] [Headers] Actually install the stddefer.h header (#172512)
d9b72bafc034 [RISCV] Generate Xqcilsm QC_SETWMI store multiple instruction (#172438)
6ad41bcc498b [CodeGen] expand-fp: Change frem expansion criterion (#158285)
a3b3c027bb61 [AMDGPU][NFC] Pre-commit tests for readanylane combines (#172398)
3ca47a767ece merge main into amd-staging
aba8ebbda091 [MLIR][Vector] Add distribution pattern for `vector::ConstantMaskOp` (#172268)
eefe14211227 [flang][NFC] Document an intentional violation of the ISO standard (#172105)
11c3a3b86eb8 merge main into amd-staging (#860)
fe6b9139cc85 Fix build break with BUILD_SHARED_LIBS=ON
f83de973a5da [llvm][clang] Enable IO sandbox for assert builds (#171935)
78711b66bde6 [mlir][Transforms] Legalize nested operations (#172158)
cd806d7e7689 [LLVM] Add plugin hook for back-ends
720b003af037 [clang-tidy] `bugprone-unchecked-optional-access`: handle inheritance from `BloombergLP::bslstl::Optional_Base` to prevent false-positives for allocator-aware BDE types (#168863)
8d0fabe8fe80 [flang] Improve intrinsic error messages when multiple signatures exist (#172099)
5925210020b6 [OpenMP][CIR] Implement basic 'parallel' lowering + some clause infra (#172308)
49f697971c64 [OpenMP][CIR] Implement 'barrier' lowering (#172305)
516dd2b70b63 [llvm-objdump] Support --mcpu=help/--mattr=help without -d (#165661)
18e9b48c4c29 [libc++][tuple] Applied `[[nodiscard]]` (#172008)
23f967ada057 [AArch64] Add support for C1 CPUs (#171124)
2f9ed9d582ac [AArch64][SVE] Select non-temporal instructions for unpredicated loads/stores with the nontemporal flag (#171261)
d4e93550a0e6 [lld][MachO] Add --lto-emit-llvm command line option
de0cb77152a0 [libc++] Store the premerge runner images in the monorepo (#171443)
ecbb44464a3a [mlir][amdgpu] Add tensor load store operations (#170918)
bbbba9c07c65 [CLANG] Fixes the crash on the use of nested requirements in require expressions (#169876)
e89734e61475 [Headers][X86] Allow vector bitcast intrinsics to be used in constexpr (#167180)
9e11754c80da [CGHLSLRuntime] Use getSigned() for total array size
dc69c41a9671 [CGObjCGNU] Set isSigned for negative value
58ee3ece03f2 [ItaniumCXXABI] Use getSigned() for signed offset
f3025d1a33ea [CGBuilder] Use getSigned() for CharUnits
aa7a95c8e96c [flang][OpenMP] Make function name more accurate, NFC (#172328)
385f1866ad07 [libclang/python] Remove unused import in example (#172369)
a0622a66c7f8 [libclang/python] Add LIBCLANG_LIBRARY_PATH and LIBCLANG_LIBRARY_FILE (#170201)
cc13d46d8fe1 [lldb-dap] Fix Windows build (#172475)
1c6e5b2d0408 [LV] Improve code using VPlan::get{ConstantInt,True} (NFC) (#172471)
b6f210b21531 [X86] Correct CPUID checks for AVX10 (#172350)
ce31a369d0e9 Add Use  map-type for list-items to revert_patches.txt
418c77ff7149 Revert "[OpenMP][Clang] Use `ATTACH` map-type for list-items with base-pointers. (#153683)"
055864c5685d Enable libarcher in LLVM releases (#172310)
5fe376376f57 merge main into amd-staging
ed5589b0dcec Revert "[clang][analyzer] Format macro expansions" (#172468)
35a22719f503 [lldb-dap] Migrate additional requests to stuctured types (#172283)
c1e829fc3d2d [AMDGPU] Simplify waitcnt insertion on function entry. NFC. (#172461)
c3ebcfe0f584 [libc++] Remove [[nodiscard]] from map etc. operator[] (#172444)
a96ce5f09c3e [Clang] Fix crash on malformed std::partial_ordering static members (#172001)
b10504f93f2e [clang][analyzer] Format macro expansions (#156046)
91eeffdedd0d [NFC][compiler-rt] Fix typos in int_to_fp_impl (#171067)
af479d66b35d [clang-tidy][NFC] Use structured binding in `ClangTidyModule.cpp` (#172432)
58919218017c [CGExprScalar] Use getSigned() for signed value
ea9d44defc97 [CGExpr] Use getSigned() for negative constant
685170dccd73 [CGVTables] Use getSigned() for signed offset
1bc52c34c26d [CGOpenMP] Use getSigned() for NumTeams constant
8485ea220329 [AArch64] Improve expansion of immediates with shifted negated components. (#162286)
cc1e92cfa38c [clang-tidy][NFC] Switch to new file header style: attempt number two (#172451)
f1bf53112e5e [DA] Add test where inferDomainOfAffine causes overflow (NFC) (#171990)
d2a097d6a36e Reland: [ORC] Tailor ELF debugger support plugin to load-address patching only (#169482)
16f41cb1b80b [mlir][amdgpu] Add Python bindings for TDM types (#172309)
7447fbfcf5c7 [libc++] Use the C++11 syntax for the no_thread_safety_analysis attribute (#171800)
f2068e5a7987 device-libs: Reorder cbrt edge case check (#855)
3c100d5d548d [libc++] Enable -Wmissing-prototypes (#116261)
1ba23fff263d [AArch64] Add intrinsics support for SVE2p2 instructions (#163575)
44735e1082db [mlir] Clean up leftovers from moving linalg::Expand/CollapseShapeOp to memref dialect (#172448)
3ed5c191c783 [sancov] Add -diff and -union options to compute set difference and union of sancov files  (#171191)
a08a55cab9ee [mlir][tosa] Extend narrowing pass (#170712)
ff3d477894a7 [utils] Update update_mir_regclass_numbers usage (#172436)
1bfa250dbf7c [Clang][C++26] Remove the notion of replaceability. (#172150)
e60471d9fd7c [NFC] [C++20] [Modules] Add a test for module local template instantiation in namespace
e22010815024 [AArch64][GlobalISel] SIMD fpcvt codegen for rounding nodes (#171446)
c7147a9c09a5 [LangRef] Fix inequalities and add examples for `loop.dependence.*.mask` (#170861)
c0e72f691130 [Clang] Remove 't' from __builtin_amdgcn_ds_atomic_fadd_f32/f64 (#172293)
22f33f9fd568 [MicrosoftCXXABI] Use getSigned() for signed vptr offset
3d069684375a [CodeGen] Use getAllOnesValue() for -1 constants
af3f2da253bb [CGExprScalar] Fix inc/dec of vector larger than 64-bit (#172301)
447c96363af9 [SimplifyLibCalls] Avoid implicit truncation in convertStrToInt()
07f63d94a4da [UnitTests] Avoid implicit truncation
01846a940021 [Mips] Use getSigned() for LwConstant32 immedate
11ca95ac89ff [SandboxIRTest] Use getSigned() for negative constants
fc459d010d48 [RandomIRBuilderTest] Explicitly allow implicit truncation
6b72ef7244b0 [OMPIRBuilderTest] Pass sign flag to ConstantInt::get()
6ef27e2abb51 [gn build] Port 1928c1ea9b57
2c26d56045e6 [SPIRV] Support for the extension SPV_EXT_image_raw10_raw12 (#160032)
1928c1ea9b57 [C++20][Modules] Improve namespace look-up performance for modules. (#171769)
f2be60c40ef4 [orc-rt] Add QueueingTaskDispatcher API. (#172401)
67d0e21a626f Reapply "[VPlan] Remove legacy costing inside VPBlendRecipe::computeCost (#171846)" (#172261)
7d381f2a5634 [AMDGPU] Schedule independent instructions between s_barrier_signal and s_barrier_wait (#172057)
1eba2cbe7288 [LV] Convert uniform-address unmasked scatters to scalar store. (#166114)
da56eb7026c5 merge main into amd-staging (#856)
941809bccc30 [clang-format] Align different categories together (#172242)
f19c83f344ff [Github][CI] Add `doc8` to `code-linter` dependency (#172256)
818ac70ed98b  [HWASan] Compatible with Windows path retrieval (#172194)
e9b1f56d3560 [AMDGPU][GlobalISel] Add RegBankLegalize support for G_BITREVERSE (#172101)
031ec802b742 [CodeGen] Fix volatile inst, to match MSVC code semantics on windows (#171862)
1ea201d73be2 [WoA] Remove extra barriers after ARM LSE instructions with MSVC (#169596)
848094c962ae [Github][CI] Bump Windows CI Container Python to v3.12.3 (#172383)
41ffab0c233b [CI] Fix generate_report call parameters
ff642616f7b6 [CI] Remove unused variable
965fecf813bb [flang][NFC] Strip trailing whitespace from tests (13 of 14)
faf879abbb48 [clang][lex] Introduce new single-module-parse mode (#135813)
1fbf33cd4039 [OpenMP][Clang] Use `ATTACH` map-type for list-items with base-pointers. (#153683)
8af59b240cd2 [sanitizers] build symbolizer with lld (#172093)
a491f4a16901 merge main into amd-staging
bd81f412014b [bazel] Fix for 908a5a8292ea1 (#172385)
4c7d765d0563 Revert "[mlir][gpu] Use `SmallString`, `FailureOr` and `StringRef` in `module-to-binary` infra (NFC) (#172284)" (#172386)
236d0c5c3c98 device-libs: Reorder cbrt edge case check
3a88bb90bb2a [mlir][AMDGPU] Add scaled wmma ops for gfx1250 (#169854)
c8b8b2f1f9ce [mlir][gpu] Use `SmallString`, `FailureOr` and `StringRef` in `module-to-binary` infra (NFC) (#172284)
436c9096eb4b merge main into amd-staging (#854)
f44740afff17 [NFC][Offload] Fix minor debug print issues introduced in #170425. (#172377)
89658f5482c2 [libclang/python] Fix cindex test for cpp lang (#172368)
644fd3b665ae [FastISel] Don't select a CallInst as a BasicBlock in the SelectionDAG fallback if it has bundled ops (#162895)
83eea87a3678 [VPlan] Create header phis once, after constructing VPlan0 (NFC). (#168291)
ef95590a98b6 [gn build] Port fa79e0a4001a
4d0bcf4c58ba [gn build] Port 908a5a8292ea
5db0d5a0f53e [gn build] Port 1e15dbe311eb
518c32824dbb [flang][runtime] Fix GPU output for multiple statements (#172363)
dbb4f5c2dd83 [VPlan] Set VF scale factor in tryToCreatePartialReduction (NFCI).
ffe9a2242a1e merge main into amd-staging
691b04d5a2a9 [gn] port c81d44942e56
4b78647754d9 [MemProf] Add CalleeGUIDs from profile to existing VP metadata (#171495)
908a5a8292ea [clang-tidy] add abseil-unchecked-statusor-access
3c31c84bca28 [mlir][tosa] Allow dynamic dims in `--tosa-validate` pass (#171463)
9741a84946e1 [NFC][HIP] Disable device-side kernel launches for HIP (#171043)
cbb2aa9b2dac InstCombine: Replace some isa<FPMathOperator> with dyn_cast (#172356)
0009536bacde [CIR] Add support for EmbedExpr for ScalarExpr (#172088)
f054c161a5f1 [AArch64] Fix typos in 09e57cfd32b0073b63d568835f07251e0d51affb (#172357)
8fa29f845733 Reapply "[MC] Use a variant to hold MCCFIInstruction state (NFC)" (#170342)
88bd56597ca8 VectorCombine: Improve the insert/extract fold in the narrowing case (#168820)
bb1bfb1c7558 [clang][Dependency Scanning][NFC] Extract the DependencyScanningTool Logic to Initialize the Compiler Instance With Context (#172345)
92abf7c356f0 [NFC][Linalg] Remove redundant doc comments from Conv matchers (#172344)
1b4a74fcdca7 [AArch64] Fix typo in 09e57cfd32b0073b63d568835f07251e0d51affb (#172354)
1d0f4e413c83 [flang] Converting old lowering tests to the new lowering (part 1) (#172246)
09e57cfd32b0 [AArch64] Extend Windows CPU feature detection with more features. (#171930)
34f63032932a [lldb][NFCI] Make LookupInfo const (#171901)
60010b373477 [mlir][linalg] Clarify comments and remove outdated TODO (nfc) (#171695)
5123d36c021e [mlir][amdgpu] Lower make_gather_dma_descriptor. (#172083)
3e327350206a [DWARF] Add support for DW_GNU_call_target_clobbered (#172336)
e4d5a0cfd1e8 merge main into amd-staging (#853)
e3c621c50bc0 [ThinLTO][MemProf] Add option to override max ICP with larger number (#171652)
3eef28844734 [CIR] Add BuildableType traits to relevant constraints (NFC) (#172224)
aea5dadb5139 [RISCV][NFC] Fix gcc 11.4.0 warnings (#172315)
4ab36d8674f3 [NFC][lldb] fix PseudoTerminal documentation (#172318)
60d447d0d158 [lldb][windows] fix a race condition when polling Pipes (#172063)
2185596c07d8 [DirectX] Add lowering support for `llvm.fsh[l|r].*` (#170570)
e53acac02289 [clang] Properly check for -mms-bitfields in Sema/struct-packed-align.c (#172337)
b9885558120c [SLP]Check if the extractelement is part of other buildvector node before marking for erasing
82a8748aad38 [Linalg] Add *Conv3D* matchers (#172141)
566eb2b78ff0 [lldb] Respect max packet size limits for MultiMemRead in ProcessGDBRemote (#172022)
d68afde21d38 [llvm][utils] Add a script to use PRs over pushing to main (#166473)
31ba65c13318 hipcc/ld.lld unable to link separable compilation when dynamic librar… (#169551)
18df40e66986 [Github][Docs] Add testing best practices to CI Best Practices Doc (#172235)
3a522b10803f [Github] Remove use of deprecated login_or_token (#172240)
d6715cd6427e [LLVM] InstCount pass added when enabling stats (#171658)
63d165ac7413 [clang][DependencyScanning] Use structured bindings for initVFSForByNameScanning (NFC) (#172335)
3645cef1ef50 [AMDGPU] LiveRegOptimizer: consider i8/i16 binops on SDWA (#155800)
463c9f08be48 InstCombine: Stop using m_c_BinOp for non-commutative ops (#172327)
b6c0eefefd80 [clang-tidy] Fix misplaced fix-its in `modernize-use-override` (#172196)
01c3e2558603 [flang] restrict fir.convert lowering (#172117)
28e9954a448b llvm: Add missing `VirtualFileSystem.h` include (#171848)
5fcf712f27e2 [clang-tidy][NFC] Add misc-include-cleaner to clang-tidy codebase (#171903)
57fcbd46bfb5 [clang][DependencyScanning] Unify initVFS helpers to return OverlayFileSystem (NFC) (#172166)
5ba62562b634 [clang-tidy] New option to remove arguments from the command line (#164344)
c4a0d7138697 [libc] Fix ifdef in compiler.h. (#172324)
e7dd7b81ac24 [AMDGPU] tensor_{load_to/store_from}_lds => ..._d2 simplification (#171540)
b36835743833 [X86] combineStore - attempt to store i256/i512 types as v4i64/v8i64 vectors (#172288)
ddd7e4bb2dc1 Fix typo in readability-identifier-naming docs (#172319)
efd9dc83f26f Revert "[APFloat] Add exp function for APFloat::IEEESsingle using expf implementation from LLVM libc. (#143959)" (#172325)
f0c9acc33054 [SPIRV][HLSL] Add FixedVector GEP legalization (#171682)
84d1de29d957 Fix build when the build directory isn't just immediately nested in the source dir
4190d576823c [APFloat] Add exp function for APFloat::IEEESsingle using expf implementation from LLVM libc. (#143959)
491e00162713 [HLSL][Matrix] Add Matrix Bool and represent them as i32 elements (#171051)
6e88ece6eeb3 merge main into amd-staging
1847a4efae6b [SDAG] Fix incorrect usage of VECREDUCE_ADD (#171459)
b5968780ba2e [llvm][RISCV] Add frm range check for xsfvfnrclipxfqf (#172135)
ef927ae26318 [llvm][RISCV] Support mulh for P extension codegen (#171581)
c87279505613 merge main into amd-staging (#852)
8975eb327427 [FuzzerMutate] Allow implicit truncation
1154ed8f3c97 [SandboxIRTest] Use larger integer type
015ab4e2e4a7 [Reassociate] Allow implicit truncation when converting adds to mul
42a47bf18aa1 [WPD] Avoid implicit truncation when creating full set
818c9138f989 [SimplifyCFG] Use getSigned() for signed value
3f82a8a7848a [ExpandFp] Use getSignMask() (NFC)
0b2fe07e6b21 [VectorCombine] Prevent redundant cost computation for repeated operand pairs in foldShuffleOfIntrinsics (#171965)
0fff58a50ff7 [NFC][SPIRV] Re-work extension parsing (#171826)
de776fb60a89 [OpenMP] Fix libarcher tests on Ubuntu 22.04 (#170671)
df14096e00b8 [NFC][AMDGPU] Refactor the multiclass for WMMA_F8F6F4 instructions (#172245)
2e2e48f17197 [OpenMP][CIR] Add basic infrastructure for CIR lowering (#171902)
f5a198b8aae8 [lldb][test] Xfail 3 backtrace related tests on Windows on Arm (#172300)
a68fde578095 [DAG] foldAddToAvg - optimize nested m_Reassociatable matchers (#171681)
a93214c5828d Build and ship OpenMP with LLVM releases (#160581)
7fefee3a82b4 [LLVM][CodeGen][AArch64] Add NEON lowering for vector.(de)interleave intrinsics. (#169700)
ee5b9cd59d16 [llvm-symbolizer] Recognize and symbolize archive members (#150401)
6e01ea4babc8 [flang][OpenMP] Generalize checks of loop construct structure (#170735)
792704038a54 [DebugInfo][DWARF] Use DW_AT_call_target_clobbered for exprs with volatile regs (#172167)
2f9bf3f29228 [GlobalISel](NFC) Refactor construction of LLTs in `LegalizerHelper` (#170664)
b6396b41f1fd merge main into amd-staging
72f3995363eb [CodeExtractor] Optimize PHI incoming value removal using removeIncomingValueIf() (NFC) (#171956)
c9c46a08206f [CloneFunction] Optimize PHI incoming value removal using reverse iteration (NFC) (#171955)
9f176e30e603 [libcxx][docs] Fix boostrapping build configure command (#172015)
b225907804af [AArch64]Enable aggressive interleaving for A320 (#169825)
10767aad898e [llvm][examples] Disable some JIT examples when threading is disabled (#172282)
515c3bdda00b [AMDGPU] Stop handling soft waitcnts in pseudoToMCOpcode. NFC. (#172278)
57aab634179f [libc++] Fix std::for_each(associative-container) not using std:invoke and projections (#171984)
7d0865122eaa [clang][NVPTX] Add support for mixed-precision FP arithmetic (#168359)
0636225b938c [VPlan] Directly unroll VectorPointerRecipe (#168886)
b3ec8be22bda [mlir][gpu] Expose some utility functions from `gpu-to-binary` infra (#172205)
4e95718a2aba [libc++] Remove unused __parent_pointer alias from __tree and map (#172185)
ed79fd714fa9 [Clang][x86]:  allow PCLMULQDQ intrinsics to be used in constexpr (#169214)
f024026a219c AMDGPU/GlobalISel: Regbanklegalize for G_CONCAT_VECTORS (#171471)
f3e508ceec85 [mlir:bazel] Fix missing dependency introduced in #171727. (#172267)
90783f5c4acc [lldb][AppleObjCDeclVendor] Fix format specifiers when printing log (#172263)
96881c1226a5 llvm: Export IndexedCodeGenDataLazyLoading (#169563)
5785b4a4fb29 Add .gitignore file in .cache/clangd/index (#170003)
a5bfe8e5c3c4 [Clang] Recompute the value category when rebuilding SubstNonTypeTemplateParmExpr (#172251)
17f29c22abc5 [AArch64] Support lowering smaller than legal LOOP_DEP_MASKs to whilewr/rw (#171982)
80b900e91c88 [InstSimplify] Support ptrtoaddr in simplifyICmpInst() (#171985)
37c7f695dce3 [libc++][char_traits] Applied `[[nodiscard]]` (#172244)
db557bee1e2c [clang][bytecode][NFC] Add Block::getBlockDesc<T>() (#172218)
ce1b04720af5 [SelectOptimize] Respect optnone (#170858)
c13bf9eb2667 Reapply "[AMDGPU][SDAG] Add missing cases for SI_INDIRECT_SRC/DST (#170323) (#171838)
e309272467c5 [AArch64][ARM] Regenerate llvm-mca tests. NFC
1e9e38983cea [AArch64] Add a performBICiCombine function.
6ff3df87d1c6 [libc++][unordered_set] Applied `[[nodiscard]]` (#170435)
e22ff9b3d931 [libc++][unordered_multiset] Applied `[[nodiscard]]` (#171664)
a5b7c42ab2de [libc++][unordered_multimap] Applied `[[nodiscard]]` (#171659)
77d6e40b1db2 merge main into amd-staging (#851)
ffaa6f23fdd5 [RISCV] Custom legalize i32 saddo/ssubo on RV64 to return a sign extended value for the data result. (#172112)
7fa062ad5869 [RISCV] Add BFloat16 to mangleRISCVFixedRVVVectorType. (#172095)
c878cf4580b1 [SelectionDAG] Consistently use doxygen comments in the NodeType enum. NFC (#172178)
61908c595776 [orc-rt] Prevent RTTIExtends from being used for errors. (#172250)
5a581acb2970 [CIR] Rename allEnumCasesCovered to all_enum_cases_covered (#172153)
35315a84b4a3 [offload] Fix CUDA args size by subtracting tail padding (#172249)
35b23172c590 [AArch64] Support USDOT in performAddDotCombine (#171864)
1d821b0c6b71 [AArch64] use `isTRNMask` to calculate shuffle costs (#171524)
8f51da369e6e [orc-rt] Add Error / Exception interop. (#172247)
c24f66e33bcb [llvm][RISCV] Add bf16 vfabs and vfneg intrinsics for zvfbfa. (#172130)
4ce266b07b61 merge main into amd-staging
9a03a30706cc [libc++][unordered_map] Applied `[[nodiscard]]` (#170423)
7ac01771c342 [mlir][ExecutionEngine] Remove stderr printing when propagating errors (#171997)
00b92e3d814f [orc-rt] Add config.h.in (missing from 7ccf968d0bf).
59fb3bc3e77a [libc++][pair] Applied `[[nodiscard]]` (#171999)
b6d940d9bc47 [libc++][multimap] Applied `[[nodiscard]]` (#171644)
febc580e1ea9 add -amdsmi to force use (#849)
6fb6b9d45654 merge main into amd-staging (#848)
b9d1432213cb [flang-rt][device] Use snprintf result for length (#172239)
adaca1348eaf [BOLT] Introduce getOutputBinaryFunctions(). NFCI (#172174)
ca81d7c2db9a [orc-rt] Ensure EH/RTTI=On overrides LLVM opts, applies to unit tests. (#172155)
21a25f44af3b [clang-tidy] Suggest `std::views::reverse` instead of `std::ranges::reverse_view` in `modernize-use-ranges` (#172199)
ecfdf8cb05d1 [bazel] One more fix for f785ca0d72cc37ac951afe81cba37c292b0027eb
14c69497b310 Partially revert "[NFCI][lldb][test][asm] Enable AT&T syntax explicitly (#166770)" (#172233)
f12fcf030c87 [MLIR][Transform][Python] transform.foreach wrapper and .owner OpViews (#172228)
423919d31f4b [NFCI][lldb][test][asm] Enable AT&T syntax explicitly (#166770)
bebc28a0acc2 [bazel] Port for f785ca0d72cc37ac951afe81cba37c292b0027eb
b9fe6532a70c Revert "[MLIR][Transform][Python] transform.foreach wrapper and .owner OpViews" (#172225)
bcbbe2c2bcbe [VPlan] Pass backedge value directly to FOR and reduction phis (NFC).
4cdec92827e6 [MLIR][Transform][Python] transform.foreach wrapper and .owner OpViews (#171544)
53cf22f3a1e6 [VPlan] Simplify live-ins early using SCEV. (#155304)
a401ea0d57c6 merge main into amd-staging
06f631aeed1a merge main into amd-staging (#847)
9975cb166ea3 [libc++][expected] Applied `[[nodiscard]]` (#170245)
8f93365b19e2 [tsan] Export __cxa_guard_ interceptors from TSan runtime. (#171921)
4b267d5caa9d [MLIR][MemRef] Emit error on atomic generic result op defined outside the region (#172190)
dd33690686e6 [X86] combineVectorSizedSetCCEquality - convert to mayFoldIntoVector helper (#172215)
e82edd28f6dc [lldb-dap] Migrate locations request to structured types (#171099)
969f72690745 merge main into amd-staging
2eec1d4c9c16 merge main into amd-staging (#846)
9de41eef6e4c [LLDB][NativePDB] Create typedefs in structs (#169248)
8a53c01b6796 [XRay][test] Mark fdr-mode.cpp test as unsupported for RISC-V
8d7c3fa6e476 [X86] combineVectorSizedSetCCEquality - ensure the load is a normal load (#172212)
f56e9db27f31 merge main into amd-staging (#844)
1a1c5df7f9ec [ARM] Introduce intrinsics for MVE fp-converts under strict-fp. (#170686)
1281693c3f72 merge main into amd-staging
b97d24796c77 [ARM] Introduce intrinsics for MVE vcmp under strict-fp. (#169798)
1b93f8b48f9d [ARM] Introduce intrinsics for MVE vrnd under strict-fp. (#169797)
66d92d4cfb76 [LLVM][Examples] Disable broken JIT + plugin tests (AIX, Sparc)
755a6932993b [DAG] SDPatternMatch - Replace runtime data structures with lengths known at compile time (#172064)
f785ca0d72cc [mlir][nvgpu] Move memref memspace attributes conversion to single place (#172156)
8680feb9133e [libc++] Use native wait in std::barrier instead of sleep loop (#171041)
86dc131997af [libc++][flat_multiset] Applied `[[nodiscard]]` (#169984)
61f4cc7cd263 [libc++][flat_multimap] Applied `[[nodiscard]]` (#169986)
d78e43191717 [clangd] Add option to fuzzy-match macros in code-complete (#169880)
4a1b6966ac5b [clangd] Add a (currently hidden) --strong-workspace-mode flag (#172160)
7db97696a225 [ARM][AArch64] Replace ".f16(bfloat" with ".bf16(bfloat" in intrinsics. NFC
72574b81956a [clang-tidy] Add `IgnoreMacro` option to `bugprone-chained-comparison` (#171975)
f0557d8eb783 Revert "[X86][APX] Add pattern for zext(X86setcc ..) -> SETZUCCr (#170806)" (#172192)
aa979095e350 merge main into amd-staging
1ea9f44f29f8 [IR] Optimize PHINode::removeIncomingValueIf() using two-pointer (#171961)
2a85673b4a25 [fopenmp-target-fast] use -fno-gpu-flang-rt disable libs (#835)
9e4487c55365 cleanup C2y from revert_patches.txt (#842)
fd95803a3565 [LoopRotate] Simplify PHINode::removeIncomingValue usage (NFC) (#171958)
1156629f4ff2 [ARM][MVE] Avoid `PHINode::removeIncomingValue()` with `PHINode::setIncomingValue()` and `PHINode::setIncomingBlock()` (NFC) (#171960)
681dbf99414f [WinEH] Use removeIncomingValueIf() in UpdatePHIOnClonedBlock() (NFC) (#171962)
7f2c5d187b94 merge main into amd-staging (#841)
b33354f27236 [MLIR][Python][Transform] Print diagnostics also upon success (#172188)
49ad1e9ea25b [clang-tidy][NFC] Remove obsolete FIXME comment (#172120)
ff1433d08f1b [C2y] Support WG14 N3457, the __COUNTER__ macro (#162662) (#840)
16727674e09d AArch64: Use AArch64InstrInfo instead of base class in frame lowering (#172183)
a99a9824400e [LV] Add test coverage for remark for unprofitable RT checks.
ba52c73f26be merge main into amd-staging
0cdc1b6dd4a8 [SelectionDAG] Support integer types with multiple registers in ComputePHILiveOutRegInfo. (#172081)
61db1ff8c594 [lldb] Add unit tests for NonNullSharedPtr (#172173)
a630642585cc [-Wunsafe-buffer-usage] Check isValueDependent before EvaluateAsBooleanCondition (#172091)
3c2f81820cbb [BOLT] Introduce BinaryFunctionListType. NFC (#172128)
1e15dbe311eb [libc++] Implement adjacent_view (#165089)
b2d9356719a5 DAG: Make more use of the LibcallImpl overload of getExternalSymbol (#172171)
a5625ed1280f [Clang] [NFC] Add an accessor for `ASTUnit::CodeGenOpts` (#172164)
2490bb7e4b72 [ADT] Only call reserve on empty containers in append_values (#172109)
a523ee6e7acb llvm/cmake/config.guess: add support for e2k (Elbrus-2000) (#162460)
fa1dceb67f6f [DebugInfo][DWARF] Allow memory locations in DW_AT_call_target expressions (#171183)
f721a3965cae [GitHub][CI] Drop manual build of universal-ctags from abi container (#172096)
ff652d82a945 [AMDGPU][MC] Disallow op_sel in some VOP3P dot instructions (#100485) (#839)
2aafb179813f merge main into amd-staging (#838)
96891b73a95c [X86] EltsFromConsecutiveLoads - attempt to match consecutive truncated loads (#172051)
249acb6f8796 [LLVM][Examples] Disable tests on AIX
0c698f6ea882 [bazel] Port for e0379b8f91e52e978208887e2f74ea9efda3180d
5b75dea5851d merge main into amd-staging (#836)
c3a084933ff0 [CIR] Add support for the ArrayTypeTraitExpr (#171710)
06e762598030 merge main into amd-staging
4ea8157773f2 Revert "[VPlan] Remove legacy costing inside VPBlendRecipe::computeCost (#171846)"
95e4dc62b14b [CIR] Add support for the RequiresExpr (#171818)
fa79e0a4001a [libc++][ranges] implement `ranges::elements_of` (#91414)
d8b03f282a50 DAG: Use the LibcallImpl to get calling conv in ExpandDivRemLibCall (#172152)
3afa68fb0eb0 [Flang] Rename modfile75.f90 to modfile81.f90. (NFC)
09197e4633c9 [Docs] Fix typo: missing closing parenthesis in __attribute__ (#172148)
0b64dc96df59 [LLVM][Examples][Cygwin] Exclude examples that are not built from test dependencies (#172145)
bea172c08bdb [AArch64][GlobalISel] Fix incorrect codegen for FPR16/FPR8 to GPR copies (#171499)
ad8d9e142872 [mlir][gpu] Use `arith` dialect to lower gpu.global_id (#171614)
9f5c96318d03 [LLVM][Example] More test feature fixes for s390 and RISC-V
eeaf43585984 [MLIR][Remarks] Improve the doc (#171128)
b5019c2de49f [LLVM][Examples] Fix test requirements
d52d761c23cb [libc++][complex] Applied `[[nodiscard]]` (#171027)
c81d44942e56 [LLVM][CMake] Build examples for llvm-test-depends
e7c652bf9e8e [gn build] Port e0379b8f91e5
7ccf968d0bfa [orc-rt] Add build options for EH and RTTI, and a config.h header. (#172129)
b6c7a27c121d [orc-rt] Refactor ErrorHandlerTraits to use CallableTraitsHelper. (#172126)
9483353ba2ee [SelectionDAG] Remove single quote around GET_ROUNDING in doxygen comment in ISDOPcode.h. NFC (#172114)
90d593220126 merge main into amd-staging
e0379b8f91e5 [clang-tidy] Moved Multiple Inheritence check from fuchsia to misc module (#171565)
a517d8c5b689 merge main into amd-staging (#834)
62ee2cf0f4e1 [libc] Add `IN6_IS_ADDR_{LINK, SITE}LOCAL` (#168207)
b3bc0058321e [libc++][map] Applied `[[nodiscard]]` (#169971)
387f93dd5b89 merge main into amd-staging (#832)
e45241a4fe9e [AMDGPU] Hoist s_set_vgpr_msb past SALU program state instructions (#172108)
6337c424bba2 Manual update of LLVM_MAIN_REVISION to 562423 (#833)
8176194f36b9 [OpenMPOpt] Allow spmdization in presence of functions taking callbacks. (#828)
07e7a3932757 merge main into amd-staging
9878bac3a83b Revert "[lldb] Still echo the command if we print the error." (#172110)
0de943421f0f fix rpath issue for libLLVMOffload.so libhsa-runtime64.so (#830)
7927597860b5 Revert "[lldb/test] Enable debug info for TestFrameProviderCircularDependency.py"
b50f6f24fe63 [libc] Properly fix printf long double subnormals (#172103)
835b0bdd5e32 Regen lit ../llvm/test/CodeGen/AMDGPU/insert-waitcnts-merge.ll
ad2fca7513aa [clang][DependencyScanning] Move driver-command logic for by-name scanning into DependencyScanningTool (#171238)
95d123ff0ea3 merge main into amd-staging
eb501b211aad [lldb] Still echo the command if we print the error. (#171931)
47b4c6a7d7f5 [acc][test] add tests for RegionBranchOpInterface for acc regions (#172073)
cf4be781be86 [flang][runtime] Debug PRINT *, "HI" on GPU (#172087)
6a41acef89b4 [flang] Initializers for proc pointers in module files (#170349)
6f1e3c396812 [alpha.webkit.UncountedLocalVarsChecker] Ignore a VarDecl in "if" with trivial "then" (#171764)
e23e5705e69a [webkit.UncountedLambdaCapturesChecker] Ignore a lambda which gets called immediately (#162977)
20e6f4939d24 Revert "AMDGPU/PromoteAlloca: Refactor into analysis / commit phases (#170512)"
13b4eb9452d3 [lldb/test] Enable debug info for TestFrameProviderCircularDependency.py
536163650e14 [mlir][LLVM] refactor FailOnUnsupportedFP (#172054)
560fe76506ef [docs] Point to `git bisect --first-parent` (#171728)
54ae1222ef1f VectorCombine: Fold chains of shuffles fed by length-changing shuffles (#168819)
e6e3f94b5c61 [VPlan] Re-add clarifying comment regarding part to extract. (NFC)
333ee931df88 [LV] Update stale comment after 4e05d702f02a. (NFC)
20c67c75ec3a [libc++] Produce summary reports in compare-benchmarks
0171e881b52b [VPlan] Strip stray whitespace when printing VPWidenIntOrFpInduction.
5ee641d85612 [MLIR][OpenMP] Fix allocation location with split distribute and parallel. (#827)
f3c16454b4f7 [Reland][AMDGPU][GlobalISel] Add register bank legalization for buffer_load byte and short (#172065)
f195d5278f68 [CIR] Support wide string literals in CIR codegen (#171541)
0d53746eaa7e [HLSL][Matrix] Add support for ICK_HLSL_Matrix_Splat to add splat cast of scalars (#170885)
1307b77de31f Delete unused code in WebKit checkers (#171768)
a9a2a26ae9b0 merge main into amd-staging
a6c211dfa9e3 [bazel] Port d107b3c82a7abd1a6a0e2900e8cd01e2a7c46748 (#172077)
9bc38df587de [LoopFusion] Simplifying the legality checks (#171889)
0603d4af1d8f Fix misprint in computeKnownFPClass in GISelValueTracking.cpp (#171566)
d901485655b9 [Mips] Add compact branch patterns for MipsR6 (#171131)
c9ad896dd733 [clang-doc] Add functions to namespace template (#171938)
5f6a5e02cd51 [alpha.webkit.ForwardDeclChecker] Add a missing nullptr check (#171740)
d107b3c82a7a [MLIR][AMDGPU] Implement reifyDimOfResult for FatRawBufferCastOp (#171839)
a94920cdd56c [NVPTX] Fixup and refactor brx.idx support (#171933)
8d5ade8feb0e [mlir] enable APFloatWrappers on MacOS (#172070)
8f264586d752 [clang-doc] Add class template to HTML (#171937)
e0e5b6e1f72e [GISel][Inlineasm] Support inlineasm i/s constraint for symbols (#170094)
76c3eed67321 [AArch64][GlobalISel] Fix vector lrint/llrint fallbacks (#170814)
69ea1440b0d1 merge main into amd-staging (#826)
53972216d108 [lldb] Add arm32/thumb register layout to Scripted{Frame,Thread} (#172005)
ed42c81bd69c [clang-doc] Add JSON output to existing template tests (#171936)
112a6126ef1c Fixes non-functional changes found static analyzer (#171197)
68535970ab9f [Delinearization] Fix unused variable from 5cdb757
a0e7476be565 [bazel] Port 568ce76c6e8134ab9b631e357c134091d2fd4aa8 (#172059)
858fa0e1edd8 [CIR][NFC] Fix the mms-bitfields test file (#172060)
ef21740781f8 [LoopPeel] Check for onlyAccessesInaccessibleMemory instead of llvm.assume in peelToTurnInvariantLoadsDereferenceable. (#171910)
5cdb757cc380 [Delinearization] Remove `isKnownNonNegative` (#171817)
366f3ac144a3 [CIR] Add support for the ConceptSpecializationExpr (#171824)
26ff16663777 [lldb][ClangExpressionParser] Emit more accurate language note for Objective-C++ fallback (#172047)
b8816a4e833e Revert "[AMDGPU][GlobalISel] Add register bank legalization for buffer_load byte and short (#167798)"
4ca2caeab6b6 [HLSL] Implement ddx/ddy_fine intrinsics (#168874)
8515ddaa2bf6 [lldb] fix failing diagnostics test when Unicode is supported (#172038)
c14c2561704c [flang][TBAA] fix unsafe optional deref after #170908 (#172033)
1760effa3374 [clang] Implement gcc_struct attribute on Itanium targets (#71148)
44aec0e768d9 [CIR] Add support for TypeTraitExpr with bool result (#171687)
9dc6f18a3e4f [MLIR][NVVM] Fix results-check for mbarrier Op (#171657)
79a4e26defed device-libs: Move special case check in rsqrt f64 implementation (#821)
2af693bbec82 AMDGPU: Fix selection failure on bf16 inverse sqrt (#172044)
4ffd373e3853 [runtimes] Remove dependencies on cxx_experimental for test-suite installs (#171678)
b880428bf0fc [X86] Cleanup check prefixes for any/zero_extend_vector_inreg_of_broadcast_from_memory.ll tests (#172043)
dbd0122cf566 [CIR] Add support the ChooseExpr for scalar (#171882)
4dbd16bb62ca [AMDGPU][GlobalISel] Add register bank legalization for buffer_load byte and short (#167798)
ff3dcd06a94a [GlobalOpt][profcheck] Mark as `unknown` the branch weights of global shrunk to boolean (#171530)
0570cab7c13f [MLIR] Apply clang-tidy fixes for misc-use-internal-linkage in IndexingUtils.cpp (NFC)
d7cbc7f9e465 [lldb][InstrumentationRuntime] Run sanitizer utility expressions as C (#172019)
66f2b6625e14 [lldb][nfc] Change ProcessGDBRemote::ParseMultiMemReadPacket signature (#172020)
aeee85f35c6e merge main into amd-staging
3bb43fcb49c0 [GPURUN] new binding script uses numa info from: lscpu and amd-smi li… (#823)
44c0469e5f5e ValueTracking: Handle amdgcn.rsq intrinsic in computeKnownFPClass (#171837)
55c0e2e20f5b [AMDGPU] Add missing cases for V_INDIRECT_REG_{READ/WRITE}_GPR_IDX and V/S_INDIRECT_REG_WRITE_MOVREL (#171835)
ec1bf9c56250 Use `llvm::SmallVector` instead of `OwningArrayRef` in `VTableLayout`. (#168768)
3ae5f2782e3c [ROCDL] Added LDS barrier ops to ROCDL (gfx1250) (#171810)
7a43921af824 [AArch64][SVE] Fix -msve-vector-bits=256 fixed width vector crash (#171776)
7345233fb601 [lldb] improve the heuristics for checking if a terminal supports Unicode (#171832)
95e6d23f24ec [X86] combineHorizOpWithShuffle - ensure we handle undef elements from widened shuffle (#172014)
568ce76c6e81 [MLIR][LLVM] Add pass to update ops with default visibility (#171727)
3a04e01f347e [libc][wctype][codegen] Add generation script for conversion data (#170868)
7d21334127fc [flang][NFC] Strip trailing whitespace from tests (12 of 14)
6f44be6f3e9f [LLDB][NativePDB] Use original struct name when searching for constants (#166845)
42defcd39c48 [gn build] Port 48d942c7158a
5ebb92853211 [mlir][amdgpu] Adds make_dma_gather_base (#171857)
1451f3d9b008 [flang][OpenMP] Use StylizedInstance in converted clauses (#171907)
099985fded94 [gn] port 3d1f04425a8fe
ffac200e715c [gn] port 4e9e7c5816bc1
1dbff713127a [OpenACC][CIR] 'bind' lowering with identifier (#171749)
eb98089a26b8 [llvm-objcopy] Allow -p on COFF targets (#171237)
2d3bf2008e58 device-libs: Move special case check in rsqrt f64 implementation
234c41413fba [gn] "port" 8e999e3d7857 (LLVM_ENABLE_IO_SANDBOX)
66601d87345e [gn] port 4c6aa8fd8abe7e4f0 (clang/Analysis/Scalable)
7d151cf17084 [LLDB][NFC] Remove redundant target/process checks in SBFrame (#153258)
e151434b0f05 [AMDGPU][InsertWaitCnts][NFC] Merge VMEM_ACCESS and VMEM_READ_ACCESS into a single event type (#171973)
87b3bf5b6615 ValueTracking: Add baseline test for fpclass handling of amdgcn.rsq (#171836)
07eb9fa43f4f [MLIR][NVVM] Support for dense and sparse MMA with block scaling (#170566)
df7b90b9dbf0 [clang-tidy][NFC] Refactor `bugprone-branch-clone` (#171849)
04b197599e73 [MLIR] [Vector] Fix canonicalization for vector.scatter with tensor output (#168824)
81a75b1af98c [X86] Remove rest of AMX-TRANSPOSE (#171906)
6e47d4ef45f1 Reapply "InstCombine: Fold ldexp with constant exponent to fmul" (#171895) (#171977)
975bda005ea4 InstCombine: Add more ldexp by constant tests (#171976)
f9a3076180bf [clang-tidy] Fix some false negatives in `readability-redundant-typename` (#171947)
54744bc0a608 [LLVM][AArch64] Add "u" variants of sve.[s,u]hsub intrinsics (#170894)
ad62c049b679 [flang] add includes to AbstractConverter.h after #171501 (#171987)
80ec43d455a5 [CIR] Implement builtin reduce fadd/fmul/fmax/fmin (#171633)
c18d9eabd4d5 [AArch64] Generalize bfdotq_lane patterns to work for f32/i32 duplanes  (#171146)
e4733424bc9f [mlir][Vector] Improve vector.transferx store-to-load-forwarding (#171840)
0deee8ca98ac Fix a typo in the Modules.rst doc.
b123b7059f5a [NFC] Fix build error: multi-line comment in opencl-c.h (#171953)
9c5744cbc8fb [flang] add missing headers in ConvertVariable.h after #171501 (#171983)
04ce013d7c87 Reapply "[llvm][lit] Add option to run only the failed tests" (#171588)
51c439cce508 [OpenMP] Output scratch memory usage in kernel trace (#824)
49adebce00b2 merge main into amd-staging (#822)
123d4d9b85e0 [AMGGPUInstCombine] Use getSigned() for frexp exponent
89c37fee25b0 [WPD] Use getSigned() for offset
1d7bfb752f7f [SafeStack] Use getSigned() for negative value
917e458b9644 [mlir] Cleanup the addLegalOp of convert-linalg-to-std pass (NFC) (#171979)
2aa345054fa2 [flang][OpenACC] remap component references in structured constructs (#171501)
f0d7d833f53b [lldb-dap] Allow empty memory reference in disassemble arguments (#162517)
71c3acb18b9e [Analysis][AArch64] Add cost model for loop.dependence.{war/raw}.mask (#167551)
51bd0edb566e [libc++][valarray] Applied `[[nodiscard]]` (#170996)
d714a6c21062 Reland [MS][clang] Add support for vector deleting destructors (#170337)
a318c5011071 [mlir][tosa] Remove NegateOp to SubOp and 48-bit promotion in TosaToLinalg (#170622)
19e1011df51e [SelectionDAG] Fix unsafe cases for loop.dependence.{war/raw}.mask (#168565)
fbde1dcfb3bc [flang][OpenACC] do not load pointer and allocatables component in data clauses (#171445)
025d0c0d1daa (reland) [AMDGPU][SIInsertWaitCnts] Use RegUnits-based tracking (#162077) (#171779)
b492b3523c4c [LoopInterchange] Motivating example for interchange. NFC. (#171631)
3e2a8e2effd1 [libc++][multiset] Applied `[[nodiscard]]` (#171654)
5bc7b9d462f0 [llvm][dwarfdump] Print the name (if available) of entities referenced by DW_AT_import (#171859)
64dfc26237d9 [Mips] Support "$sp" named register (#171793)
a34a92d9e253 [libc++] Always return bool from bitset::operator[](size_t) const (#169894)
4fc5b6d8c4d8 [libc++] Optimize {std,ranges}::for_each for iterating over __trees (#164405)
d0d8359c0140 [InstSimplify] Remove redundant icmp+ptrtoint fold (#171807)
294fb60e5b25 [SandboxIR] Fix ConstantInt::get() for vector types (#171852)
43a4442fac49 [ExpandFp] Fix incorrect ConstantInt construction (#171861)
3383004a958d [clang-tidy] Support comments in WarningsAsErrors (#171816)
c05a3ac915ae [libc++][filesystem] Applied `[[nodiscard]]` (#171085)
727581773944 [TableGen] Improve generated comments for RegClassByHwMode tables
f0bec9ec468f [RISCV] Use OPERAND_MEMORY as the OperandType for CVrr. NFC (#171967)
618b874d8495 [RISCV] Add OperandType to tsimm5 used by Xsfvcp. (#171964)
63ea393f969b [libc++][optional] Applied `[[nodiscard]]` (#170045)
c3e7a1ab8f60 [NFC][PowerPC] Optimize vector compares for not equal to non zero vectors (#171635)
cdfdb06c9155 [TSan] Zero-initialize Trace.local_head
c7ca7047a758 [flang][docs] Reorganize the table of contents (#171240)
1335a05ab8bc [MLIR][Python] Fix AffineIfOp insertion point (#171957)
ecaf673850be [libc++][format] Applied `[[nodiscard]]` to more classes (#170808)
cea98135650a [RISCV] Add an OperandType to VMaskOp. NFC (#171926)
8deb4221e2bb [RISCV] Use VMV0 instead of VMaskOp in masked vector pseudoinstructions. NFC (#171924)
9d5403c8924b [RISCV] Fix incorrect chapter number in comments in RISCVSchedSpacemitX60.td. (#171765)
8378ec44ff96 [libc++][set] Applied `[[nodiscard]]` (#169982)
96b6594c901d [Clang] Remove the early-check for anonymous struct in ShouldDeleteSpecialMember (#171799)
b0d340557841 SROA: Recognize llvm.protected.field.ptr intrinsics.
2b1fa68ad0f7 [HLSL] Add the DXC matrix orientation flags (#171550)
e13998f1162f [bazel] Port 8e999e3d7857ce131d03bab4fd5c42b0e8edd980 (#171946)
a1b35864925c [flang][docs] Remove stale inline links to Intel and IBM compiler option
e760d0619fe5 AMDGPU/PromoteAlloca: Refactor into analysis / commit phases (#170512)
b2f36149d820 [CIR] Implement function personality attribute and its lowering (#171001)
d8d87b594eb6 [CIR] Add support for global member pointer values (#171888)
e2818009243c Revert: check-builtins target for LLVM_ENABLE_RUNTIMES (#171940)
ee22217d1f37 [Sanitizer] show configure.log for libz build (#171932)
86cc934b4a89 [python] Expose replaceUsesOfWith C API (#171892)
8e999e3d7857 [llvm][clang] Sandbox filesystem reads (#165350)
f36792b2a391 [scudo] Fix typo in multiple files (#171925)
5e6316796522 [libc] Refactor static polymorphism in WriteBuffer (NFC). (#169089)
efd472c88d7b [scudo] Show the resident pages from the secondary cache. (#170568)
c1b60b96ec85 workflows/upload-release-artifact: Upload a separate attestation for each artifact (#171525)
2f14fb3ddea9 The C++ dynamic typing tests are failing on Windows, skip for now. (#171922)
43bfec29cbec [flang][cuda] Fix condition in barrier_try_wait lowering (#171916)
20cc9fe95d53 [RISCV] Remove now unused ixlenimm Operand. NFC (#171896)
429dbce8d750 [BOLT][AArch64] Tweak heuristics for epilogue recognition (#169584)
02ab4bdecaad [M68k] Emit MOV16ds for moves from CCR on 68000 (#171703)
bb12c435735e [Hexagon] Add HVX patterns for vector arithmetic (#170704)
6344e3aa8106 Make result variables obey their dynamic values in subsequent expressions (#168611)
a50a7ea2e290 [libc] Add support for getpagesize. (#171713)
65deac0872c2 [VPlan] Remove vector type checking in inferScalartType (NFC).
374063b1af2c merge main into amd-staging
4e05d702f02a [LV] Always include middle block cost in isOutsideLoopWorkProfitable. (#171102)
a2658d0be330 merge main into amd-staging (#819)
dd6c14742b67 [libc][darwin] avoid importing macros conflicting with function names in overlay mode (#171691)
3e414b940a70 [FunctionLoweringInfo] Use KnownBits::intersectWith. NFC (#171893)
98a8072a6599 [FunctionLoweringInfo] Remove unnecesary check for isVectorTy when isIntegerTy is true. NFC (#171880)
71ee84acc4f7 [MLIR][Vec…
…ing LDSBarrierOp and some fixes for the interface change for the Target,

And apply this patch ROCm/llvm-project@ecbb444, it is necessary to pass unit-test
@umangyadav umangyadav requested a review from causten as a code owner December 18, 2025 20:57
@umangyadav umangyadav mentioned this pull request Dec 18, 2025
12 tasks
Copy link
Contributor

@justinrosner justinrosner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes in mlir_diff and llvm_diff have all of the modifications that I made in external. Looks good to me.

umangyadav and others added 3 commits December 19, 2025 18:48
This change is causing crash with this attention kernel :
rocmlir-gen --operation attention --arch gfx950 -t f16 -transK -g 256 -seq_len_q 1024 -seq_len_k 1024 -num_heads_q 1 -num_heads_kv 1 -head_dim_qk 128 -head_dim_v 128  -split_kv 1  | rocmlir-tuning-driver --benchmark-config="attn:v3:32,256,128,16,32,128,32,4,1,1,2,0,1"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants