Evan Porter | 6cd6284f7f | Make the sparse matrix printer pretty | 2025-09-08 20:05:46 +00:00
Antonio Sanchez | f426eff949 | Add inline/device-function attributes to fma. | 2025-09-02 22:51:35 +00:00
Antonio Sánchez | da1a34a6ba | Zero-out matrix for empty set of triplets. | 2025-09-02 22:51:17 +00:00
Evan Porter | 52fc978c6f | fixed typo sparcity -> sparsity | 2025-09-02 19:34:43 +00:00
Antonio Sánchez | 8a8fbc8f5e | Don't enable AVX for wasm. | 2025-08-29 21:50:25 +00:00
Antonio Sánchez | 7f0cb638c5 | Specialize numext::madd for half/bfloat16. | 2025-08-29 18:11:25 +00:00
Antonio Sánchez | 1e9d7ed7d3 | Add missing semicolon to has_fma definitions to fix GPU builds. | 2025-08-29 17:19:28 +00:00
Antonio Sánchez | 2e8cc042a1 | Replace calls to numext::fma with numext:madd. | 2025-08-28 21:40:19 +00:00
Charles Schlosser | 38b51d5b7e | Mitigate setConstant regression with custom scalars | 2025-08-26 20:04:17 +00:00
Antonio Sanchez | d2a70fe4e2 | Make permutation products aliasing by default. | 2025-08-25 18:39:06 +00:00
Antonio Sánchez | 4ae5647355 | Fix direct index aliased assignment. | 2025-08-25 18:17:18 +00:00
Antonio Sánchez | 1a45d2168e | Fix use of FMA in triangular solver for boost multiprecision. | 2025-08-25 18:05:22 +00:00
Aleksei Nikiforov | c487a4fe9e | Clean up most of testsuite on s390x | 2025-08-15 20:04:25 +00:00
Charles Schlosser | 4033cfcc1d | Fix dangling reference in VectorwiseOp::iterator: Episode II: The Dependent Typedef Strikes Back | 2025-08-14 16:30:19 +00:00
Charles Schlosser | e9dfbad618 | Fix dangling reference in VectorwiseOp::iterator | 2025-08-14 00:04:01 +00:00
Charles Schlosser | 43a65a9cbd | add RealView api | 2025-08-12 16:55:05 +00:00
Artem Bishev | e15cd620a0 | Remove select class | 2025-08-10 17:44:09 +00:00
Cheng Wang | 1c0048a08c | Fix inconsistency between ptrue and pcmp_* in HVX | 2025-08-09 19:32:30 +00:00
Artem Bishev | ddce1d7d12 | Fixes #2952 | 2025-08-07 16:58:22 +00:00
Tyler Veness | 8b9dbcdaaf | Fix numext::bit_cast() compilation failure in C++20 | 2025-08-07 00:03:33 +00:00
Rasmus Munk Larsen | 975a5aba4f | Fix TODO: Use std::bit_cast or __builtin_bit_cast if available. | 2025-08-06 19:00:08 +00:00
Rasmus Munk Larsen | 4be7e6b4e0 | Fix pcmp_* for HVX to comply with the new definition of true = Scalar(1) | 2025-08-04 20:56:24 +00:00
Antonio Sanchez | e4493233e8 | Fix EIGEN_OPTIMIZATION_BARRIER for clang-cl | 2025-07-31 17:02:43 +00:00
Charles Schlosser | 1e65707aa2 | Suppress Warray-bounds warning in generic ploaduSegment, fix edge case for vectorized cast | 2025-07-23 22:26:40 +00:00
Rasmus Munk Larsen | 97c7cc6200 | Explicitly use the packet trait HasPow to control whether Pow is vectorized. | 2025-07-18 21:51:42 +00:00
Rasmus Munk Larsen | 2cf66d4b0d | Use numext::fma in more places in SparseCore. | 2025-07-17 21:20:39 +00:00
jacques FRANC | d7fa5ebe0e | Fix API incompatibility for ILU in superLU support | 2025-07-17 15:27:26 +00:00
Kuan-Ting | cedf1f4c17 | Fix typo: duplicated 'for' in docs | 2025-07-16 01:12:48 +00:00
Charles Schlosser | 302fc46bc3 | arm packet alignment requirements and aligned loads/stores | 2025-07-15 23:49:04 +00:00
Sean McBride | 430e35fbd1 | Fixed -Wshadow warning by renaming variables | 2025-07-11 11:30:23 -04:00
Antonio Sánchez | bd0cd1d67b | Fix self-adjoint products when multiplying by a compile-time vector. | 2025-07-08 21:48:59 +00:00
Charles Schlosser | 6854da2ea0 | Fix 1x1 selfadjoint matrix-vector product bug | 2025-07-07 17:32:54 +00:00
Rasmus Munk Larsen | 8ac2fb077d | Use numext::fma for sparse x dense dot product. | 2025-07-02 23:19:26 +00:00
Antonio Sánchez | 7c636dd5db | Move HIP/CUDA defines to Core. | 2025-06-27 16:48:07 +00:00
Antonio Sánchez | 26616fe5b8 | Fix VSX packetmath psin and pcast tests. | 2025-06-27 04:08:20 +00:00
Antonio Sánchez | a395ee162d | Fix a collection of random failures encountered when testing with Bazel. | 2025-06-26 16:58:24 +00:00
Antonio Sánchez | db8bd5b825 | Modify pselect and various masks to use Scalar(1) for true. | 2025-06-20 22:40:46 +00:00
Charles Schlosser | 81044ec13d | Provide macro to explicitly disable alloca | 2025-06-19 04:23:35 +00:00
Charles Schlosser | bcce88c99e | Faster emulated half comparisons | 2025-06-17 17:05:58 +00:00
Filippo Basso | ac6955ebc6 | Remove MSVC warnings in FindCoeff.h | 2025-06-17 00:39:02 +00:00
Antonio Sánchez | 67a898a079 | Fix unprotected SIZE in macro. | 2025-06-16 22:54:25 +00:00
Antonio Sánchez | cdf6a1f5ed | Add OpenBLAS sbgemm. | 2025-06-16 18:23:03 +00:00
Charles Schlosser | d228bcdf8f | Fix neon compilation bug | 2025-06-10 21:52:01 +00:00
Charles Schlosser | 994f3d107a | Fix neon packet math tests, add missing neon intrinsics | 2025-06-09 17:13:31 +00:00
AnonymousPC | cda19a6255 | Make Eigen::Map<const Vector>::operator[] return correct type | 2025-06-06 19:15:18 +00:00
Charles Schlosser | d0b490ee09 | Optimize maxCoeff and friends | 2025-06-06 14:55:49 +00:00
Charles Schlosser | 21e89b930c | Enable default behavior for pmin<PropagateFast>, predux_min, etc | 2025-06-02 17:23:37 +00:00
Charles Schlosser | 4fdf87bbf5 | clean up intel packet reductions | 2025-05-30 19:18:07 +00:00
Hs293Go | a7f183cadb | Add factory/getters for quat coeffs in both orders | 2025-05-28 18:39:55 -04:00
Sergiu Deitsch | d81aa18f4d | Explicitly construct the scalar for non-implicitly convertible types | 2025-05-15 17:40:29 +02:00