| Antonio Sánchez | e5f3fa2d61 | Add gemmtr implementation. | 2025-09-05 22:31:30 +00:00 |
| Antonio Sanchez | f426eff949 | Add inline/device-function attributes to fma. | 2025-09-02 22:51:35 +00:00 |
| Antonio Sánchez | da1a34a6ba | Zero-out matrix for empty set of triplets. | 2025-09-02 22:51:17 +00:00 |
| Evan Porter | 52fc978c6f | fixed typo sparcity -> sparsity | 2025-09-02 19:34:43 +00:00 |
| Antonio Sánchez | 8a8fbc8f5e | Don't enable AVX for wasm. | 2025-08-29 21:50:25 +00:00 |
| Antonio Sanchez | 70d8d99d0d | Only build docs on push to master branch, not MRs. | 2025-08-29 18:33:09 +00:00 |
| Antonio Sánchez | 7f0cb638c5 | Specialize numext::madd for half/bfloat16. | 2025-08-29 18:11:25 +00:00 |
| Antonio Sánchez | 1e9d7ed7d3 | Add missing semicolon to has_fma definitions to fix GPU builds. | 2025-08-29 17:19:28 +00:00 |
| Antonio Sanchez | 5d4485e767 | Move more jobs to gitlab runners. | 2025-08-29 10:06:35 -07:00 |
| Antonio Sánchez | 2e8cc042a1 | Replace calls to numext::fma with numext:madd. | 2025-08-28 21:40:19 +00:00 |
| Antonio Sánchez | 52f570a409 | Move GPU ci jobs to gitlab-hosted runners. | 2025-08-28 18:24:41 +00:00 |
| Charles Schlosser | 38b51d5b7e | Mitigate setConstant regression with custom scalars | 2025-08-26 20:04:17 +00:00 |
| Antonio Sanchez | d2a70fe4e2 | Make permutation products aliasing by default. | 2025-08-25 18:39:06 +00:00 |
| Antonio Sánchez | 4ae5647355 | Fix direct index aliased assignment. | 2025-08-25 18:17:18 +00:00 |
| Antonio Sánchez | 1a45d2168e | Fix use of FMA in triangular solver for boost multiprecision. | 2025-08-25 18:05:22 +00:00 |
| anonymouspc | 05e74b1a40 | Tiny fix in unsupported/Eigen/CXX11/src/Tensor/TensorContraction.h | 2025-08-25 10:26:46 +00:00 |
| Tyler Veness | d368998120 | Fix MSVC error about missing std::bit_cast | 2025-08-23 22:25:52 +00:00 |
| Aleksei Nikiforov | c487a4fe9e | Clean up most of testsuite on s390x | 2025-08-15 20:04:25 +00:00 |
| Charles Schlosser | 4033cfcc1d | Fix dangling reference in VectorwiseOp::iterator: Episode II: The Dependent Typedef Strikes Back | 2025-08-14 16:30:19 +00:00 |
| Charles Schlosser | e9dfbad618 | Fix dangling reference in VectorwiseOp::iterator | 2025-08-14 00:04:01 +00:00 |
| Charles Schlosser | 43a65a9cbd | add RealView api | 2025-08-12 16:55:05 +00:00 |
| Rasmus Munk Larsen | 954e21152e | Include <limits> in test main.h | 2025-08-10 21:23:31 +00:00 |
| Artem Bishev | e15cd620a0 | Remove select class | 2025-08-10 17:44:09 +00:00 |
| Cheng Wang | 1c0048a08c | Fix inconsistency between ptrue and pcmp_* in HVX | 2025-08-09 19:32:30 +00:00 |
| Artem Bishev | ddce1d7d12 | Fixes #2952 | 2025-08-07 16:58:22 +00:00 |
| Tyler Veness | 8b9dbcdaaf | Fix numext::bit_cast() compilation failure in C++20 | 2025-08-07 00:03:33 +00:00 |
| Rasmus Munk Larsen | 975a5aba4f | Fix TODO: Use std::bit_cast or __builtin_bit_cast if available. | 2025-08-06 19:00:08 +00:00 |
| Rasmus Munk Larsen | 4be7e6b4e0 | Fix pcmp_* for HVX to comply with the new definition of true = Scalar(1) | 2025-08-04 20:56:24 +00:00 |
| Antonio Sánchez | edcf4c135f | Remove fortran dependency for eigenblas. | 2025-08-04 19:11:43 +00:00 |
| Antonio Sanchez | e4493233e8 | Fix EIGEN_OPTIMIZATION_BARRIER for clang-cl | 2025-07-31 17:02:43 +00:00 |
| Charles Schlosser | f5ead2d34c | Fix intel packet math header inclusion order | 2025-07-29 01:00:37 +00:00 |
| Charles Schlosser | 1e65707aa2 | Suppress Warray-bounds warning in generic ploaduSegment, fix edge case for vectorized cast | 2025-07-23 22:26:40 +00:00 |
| Rasmus Munk Larsen | abeba85356 | Use proper float literals in SpecialFunctionsImpl.h. | 2025-07-19 01:17:12 +00:00 |
| Rasmus Munk Larsen | b5bef9dcb0 | Fix bug in Erfc introduced in !1862. | 2025-07-18 17:58:48 -07:00 |
| Rasmus Munk Larsen | 97c7cc6200 | Explicitly use the packet trait HasPow to control whether Pow is vectorized. | 2025-07-18 21:51:42 +00:00 |
| Rasmus Munk Larsen | efe5b6979d | Unconditionally include <memory>. Some c++20 builds are currently broken because it is needed for std::assume_aligned. | 2025-07-18 18:06:28 +00:00 |
| Rasmus Munk Larsen | 2cf66d4b0d | Use numext::fma in more places in SparseCore. | 2025-07-17 21:20:39 +00:00 |
| jacques FRANC | d7fa5ebe0e | Fix API incompatibility for ILU in superLU support | 2025-07-17 15:27:26 +00:00 |
| Kuan-Ting | cedf1f4c17 | Fix typo: duplicated 'for' in docs | 2025-07-16 01:12:48 +00:00 |
| Charles Schlosser | 302fc46bc3 | arm packet alignment requirements and aligned loads/stores | 2025-07-15 23:49:04 +00:00 |
| Sean McBride | 430e35fbd1 | Fixed -Wshadow warning by renaming variables | 2025-07-11 11:30:23 -04:00 |
| Antonio Sánchez | bd0cd1d67b | Fix self-adjoint products when multiplying by a compile-time vector. | 2025-07-08 21:48:59 +00:00 |
| Charles Schlosser | 6854da2ea0 | Fix 1x1 selfadjoint matrix-vector product bug | 2025-07-07 17:32:54 +00:00 |
| Sean McBride | ac1b29f823 | Set CMake POLICY CMP0177 to NEW | 2025-07-07 16:37:01 +00:00 |
| Antonio Sánchez | 849a336243 | Move default builds/tests to GitLab runners. | 2025-07-05 04:37:08 +00:00 |
| Rasmus Munk Larsen | 8ac2fb077d | Use numext::fma for sparse x dense dot product. | 2025-07-02 23:19:26 +00:00 |
| Antonio Sánchez | cc0be00435 | Fix docs build. | 2025-07-02 22:10:33 +00:00 |
| Antonio Sánchez | f169c13d8e | Replace PPC g++-10 with g++14. | 2025-07-02 17:07:44 +00:00 |
| Henric Ryden | 7fa069ef90 | tensor documentation | 2025-06-29 03:47:42 +00:00 |
| Antonio Sánchez | 7c636dd5db | Move HIP/CUDA defines to Core. | 2025-06-27 16:48:07 +00:00 |