Rasmus Munk Larsen
|
2067b54b13
|
Fix bug in minmax_coeff_visitor for matrix of all NaNs.
|
2023-03-13 18:25:22 +00:00 |
|
Rasmus Munk Larsen
|
ee0ff0ab3a
|
Fix typo in MathFunctions.h
|
2023-03-13 15:50:40 +00:00 |
|
Rasmus Munk Larsen
|
21c49e8f8e
|
Delete mystery character from Eigen/src/Core/arch/NEON/MathFunctions.h
|
2023-03-10 23:27:24 +00:00 |
|
Rasmus Munk Larsen
|
6bb9609bcb
|
Make new Select implementation backwards compatible.
|
2023-03-10 23:07:47 +00:00 |
|
Antonio Sánchez
|
394aabb0a3
|
Fix failing MSVC tests due to compiler bugs.
|
2023-03-10 22:36:57 +00:00 |
|
Rasmus Munk Larsen
|
d6235d76db
|
Clean up generic packetmath specializations for various backends with the help of a macro.
|
2023-03-10 22:02:23 +00:00 |
|
Rasmus Munk Larsen
|
e8fdf127c6
|
Work around compiler bug in Tridiagonalization.h
|
2023-03-10 21:21:07 +00:00 |
|
Rasmus Munk Larsen
|
adf26b6840
|
Add newline to end of file.
|
2023-03-10 16:53:22 +00:00 |
|
Rasmus Munk Larsen
|
3492d9e2e5
|
s/Lesser/Less/
|
2023-03-10 00:28:31 +00:00 |
|
Rasmus Munk Larsen
|
2419632cf5
|
Revert change to allFinite(), since the new version does not work for complex numbers.
|
2023-03-09 21:50:43 +00:00 |
|
Zach Davis
|
b1beba8a3e
|
Fix LinAlgSVD example code
|
2023-03-08 17:04:59 +00:00 |
|
Charles Schlosser
|
7bf2968fed
|
Specify Permutation Index for PartialPivLU and FullPivLU
|
2023-03-07 20:28:05 +00:00 |
|
Antonio Sánchez
|
eb4dbf6135
|
Modify failing cwise test to get it to pass.
|
2023-03-07 19:47:42 +00:00 |
|
Timofey Pushkin
|
e577f43ab2
|
Set CMAKE_* cache variables only when Eigen is a top-level project
|
2023-03-07 14:39:45 +00:00 |
|
Charles Schlosser
|
1ce8b25825
|
Vectorize any() / all()
|
2023-03-06 23:54:02 +00:00 |
|
Charles Schlosser
|
cb8e6d4975
|
Fix 2240, 2620
|
2023-03-06 23:11:06 +00:00 |
|
Charles Schlosser
|
d670039309
|
fix tensor comparison test
|
2023-03-06 13:11:14 +00:00 |
|
Chip Kerchner
|
2b513ca2a0
|
Added partial linear access for LHS & Output - 30% faster for bfloat16 GEMM MMA (Power)
|
2023-03-02 19:22:43 +00:00 |
|
Charles Schlosser
|
0b396c3167
|
Scalarize comps
|
2023-03-02 17:06:23 +00:00 |
|
Charles Schlosser
|
3abe12472e
|
fix signed shift test
|
2023-03-01 14:31:13 +00:00 |
|
Antonio Sánchez
|
ba7417f146
|
Fix gpu conv3d out-of-resources failure.
|
2023-02-28 21:25:00 +00:00 |
|
Antonio Sánchez
|
62d5cfe835
|
Fix ODR issues with Intel's AVX512 TRSM kernels.
|
2023-02-27 07:54:52 +00:00 |
|
Charles Schlosser
|
826627f653
|
vectorize comparisons and select by enabling typed comparisons
|
2023-02-25 20:52:11 +00:00 |
|
Rasmus Munk Larsen
|
2e9b945baf
|
Fix bug that disabled vectorization for coeffMin/coeffMax.
|
2023-02-25 20:03:54 +00:00 |
|
Antonio Sánchez
|
bc5cdc7a67
|
Guard use of long double on GPU device.
|
2023-02-24 21:49:59 +00:00 |
|
Chip Kerchner
|
e4598fedbe
|
Fix compiler versions for certain instructions on Power.
|
2023-02-23 23:24:41 +00:00 |
|
Rasmus Munk Larsen
|
1c0a6cf228
|
Get rid of EIGEN_HAS_AVX512_MATH workaround.
|
2023-02-23 23:16:41 +00:00 |
|
Rasmus Munk Larsen
|
00844e3865
|
Fix a number of MSAN failures in SVD tests.
|
2023-02-23 18:44:53 +00:00 |
|
Mehdi Goli
|
c3f67063ed
|
[SYCL-2020]- null placeholder accessor issue in Reduction SYCL test
|
2023-02-22 17:44:53 +00:00 |
|
Rasmus Munk Larsen
|
6bcd941ee3
|
Use pmsub in twoprod. This speeds up pow() on Skylake by ~1%.
|
2023-02-21 20:09:29 +00:00 |
|
Rasmus Munk Larsen
|
ce62177b5b
|
Vectorize atanh & add a missing definition and unit test for atan.
|
2023-02-21 03:14:05 +00:00 |
|
Charles Schlosser
|
049a144798
|
Add typed logicals
|
2023-02-18 01:23:47 +00:00 |
|
Chip Kerchner
|
e797974689
|
Add and enable Packet int divide for Power10.
|
2023-02-17 19:04:18 +00:00 |
|
Chip Kerchner
|
54459214a1
|
Fix epsilon and dummy_precision values in long double for double doubles. Prevented some algorithms from converging on PPC.
|
2023-02-16 23:35:42 +00:00 |
|
Antonio Sánchez
|
a16fb889dd
|
Guard complex sqrt on old MSVC compilers.
|
2023-02-16 19:47:00 +00:00 |
|
Charles Schlosser
|
94b19dc5f2
|
Add CArg
|
2023-02-15 21:33:06 +00:00 |
|
Charles Schlosser
|
71a8e60a7a
|
Tweak pasin_float, fix psqrt_complex
|
2023-02-15 01:01:14 +00:00 |
|
Antonio Sánchez
|
384269937f
|
More NEON packetmath fixes.
|
2023-02-14 21:45:25 +00:00 |
|
Antonio Sánchez
|
c15b386203
|
Fix MSVC atan2 test.
|
2023-02-14 18:30:58 +00:00 |
|
Antonio Sánchez
|
2dfbf1b251
|
Fix NEON make_packet2f.
|
2023-02-14 16:52:07 +00:00 |
|
Rasmus Munk Larsen
|
07aaa62e6f
|
Fix compiler warnings in tests.
|
2023-02-14 02:29:03 +00:00 |
|
Chip Kerchner
|
4a03409569
|
Fix problem with array conversions BF16->F32 in Power.
|
2023-02-13 21:30:45 +00:00 |
|
Rasmus Munk Larsen
|
77b48c440e
|
Fix compiler warnings.
|
2023-02-10 20:46:23 +00:00 |
|
Chip Kerchner
|
0ecae61568
|
Disable array BF16 to F32 conversions in Power
|
2023-02-10 20:06:58 +00:00 |
|
Charles Schlosser
|
c999284bad
|
Print diagonal matrix
|
2023-02-10 18:07:29 +00:00 |
|
Chip Kerchner
|
fba12e02b3
|
Fold extra column calculations into an extra MMA accumulator and other bfloat16 MMA GEMM improvements
|
2023-02-10 17:32:06 +00:00 |
|
Chip Kerchner
|
79cfc74f4d
|
Revert ODR changes and make gemm_extra_cols and gemm_complex_extra_cols EIGEN_ALWAYS_INLINE to avoid external functions.
|
2023-02-10 17:05:07 +00:00 |
|
Alexander Grund
|
f9659d91f1
|
Fix ODR violation with gemm_extra_cols on PPC
|
2023-02-09 22:16:06 +00:00 |
|
Charles Schlosser
|
325e3063d9
|
Optimize psign
|
2023-02-09 22:15:26 +00:00 |
|
Charles Schlosser
|
0e490d452d
|
Update file ColPivHouseholderQR_LAPACKE.h
|
2023-02-09 13:45:56 +00:00 |
|