Charles Schlosser
|
18018ed013
|
Unwind Block of Blocks
|
2023-08-29 17:21:41 +00:00 |
|
Charles Schlosser
|
81b48065ea
|
Fix arm32 float division and related bugs
|
2023-08-29 00:36:07 +00:00 |
|
Antonio Sánchez
|
2873916f1c
|
Rename plugin headers to .inc.
|
2023-08-21 16:26:11 +00:00 |
|
Antonio Sánchez
|
6e4d5d4832
|
Add IWYU private pragmas to internal headers.
|
2023-08-21 16:25:22 +00:00 |
|
Antonio Sánchez
|
328b5f9085
|
Add temporary macro to allow unaligned scalar UB.
|
2023-08-15 15:58:41 +00:00 |
|
Charles Schlosser
|
a798d07659
|
Fix tensor stridedlinearbuffercopy
|
2023-08-03 20:36:42 +00:00 |
|
Charles Schlosser
|
8d9f467036
|
fix boost mp test to refer to new svd tests
|
2023-08-02 13:38:12 +00:00 |
|
Antonio Sánchez
|
0ae7d7a361
|
Fix unaligned scalar alignment UB.
|
2023-08-01 19:39:08 +00:00 |
|
cheng wang
|
66e8f38891
|
Add architecture definition files for Qualcomm Hexagon Vector Extension (HVX)
|
2023-08-01 17:47:57 +00:00 |
|
Antonio Sánchez
|
2af700aa40
|
Fix nullptr dereference in SVD.
|
2023-08-01 16:33:16 +00:00 |
|
Rasmus Munk Larsen
|
86a43d8c04
|
Fix clang-tidy warning
|
2023-07-31 21:26:28 +00:00 |
|
Antonio Sánchez
|
0cef325b07
|
Fix another UB access.
|
2023-07-31 19:18:45 +00:00 |
|
Charles Schlosser
|
5527e78a64
|
Add missing x86 pcasts
|
2023-07-28 23:41:38 +00:00 |
|
Alejandro Acosta
|
24d15e086f
|
[SYCL-2020] Add test to validate SYCL in Eigen core.
|
2023-07-28 15:45:08 +00:00 |
|
Antonio Sánchez
|
d4ae542ed1
|
Fix nullptr dereference issue in triangular product.
|
2023-07-27 22:10:21 +00:00 |
|
Chip Kerchner
|
7769eb1b2e
|
Fix problems with recent changes and Tensorflow in Power
|
2023-07-26 16:24:58 +00:00 |
|
Yingnan Wu
|
ba1cb6e45e
|
Fixes #2703 by adding max_digits10 function
|
2023-07-26 16:02:52 +00:00 |
|
Charles Schlosser
|
9995c3da6f
|
Fix -Wmaybe-uninitialized in SVD
|
2023-07-25 22:22:17 +00:00 |
|
Charles Schlosser
|
4e9e493b4a
|
Fix -Waggressive-loop-optimizations
|
2023-07-21 03:47:40 +00:00 |
|
Charles Schlosser
|
6e7abeae69
|
fix arm build warnings
|
2023-07-17 20:37:27 +00:00 |
|
Charles Schlosser
|
81fe2d424f
|
Fix more gcc compiler warnings / sort-of bugs
|
2023-07-14 21:12:45 +00:00 |
|
Charles Schlosser
|
21cd3fe209
|
Optimize check_rows_cols_for_overflow
|
2023-07-10 17:40:17 +00:00 |
|
Antonio Sánchez
|
9297aae66f
|
Fix AVX512 nomalloc issues in trsm.
|
2023-07-10 16:42:13 +00:00 |
|
Charles Schlosser
|
1a2bfca8f0
|
Fix annoying warnings
|
2023-07-07 20:19:58 +00:00 |
|
Antonio Sánchez
|
63dcb429cd
|
Fix use of arg function in CUDA.
|
2023-07-07 18:37:14 +00:00 |
|
Marcus Comstedt
|
8f927fb52e
|
Altivec: fix compilation with C++20 and higher
|
2023-07-05 13:14:02 +00:00 |
|
Kevin Leonardic
|
d4b05454a7
|
Fix argument for _mm256_cvtps_ph imm parameter
|
2023-07-03 13:44:20 +02:00 |
|
Charles Schlosser
|
15ac3765c4
|
Fix ivcSize return type in IndexedViewMethods.h
|
2023-07-03 03:49:37 +00:00 |
|
Chip Kerchner
|
3791ac8a1a
|
Fix supportsMMA to obey EIGEN_ALTIVEC_MMA_DYNAMIC_DISPATCH compilation flag and compiler support.
|
2023-06-28 17:57:21 +00:00 |
|
H S Helson Go
|
bc57b926a0
|
Add Quaternion constructor from real scalar and imaginary vector
|
2023-06-27 05:38:17 +00:00 |
|
Antonio Sánchez
|
31cd2ad371
|
Ensure EIGEN_HAS_ARM64_FP16_VECTOR_ARITHMETIC is always defined on arm.
|
2023-06-26 19:21:54 +00:00 |
|
Antonio Sánchez
|
7465b7651e
|
Disable FP16 arithmetic for arm32.
|
2023-06-26 18:39:42 +00:00 |
|
Rasmus Munk Larsen
|
b3267f6936
|
Remove unused variable in test/svd_common.h.
|
2023-06-23 23:12:19 +00:00 |
|
Chip Kerchner
|
211c5dfc67
|
Add optional offset parameter to ploadu_partial and pstoreu_partial
|
2023-06-23 19:53:05 +00:00 |
|
Charles Schlosser
|
44c20bbbe3
|
rint round floor ceil
|
2023-06-23 16:29:16 +00:00 |
|
Charles Schlosser
|
6ee86fd473
|
delete deprecated function call in svd test
|
2023-06-23 14:17:27 +00:00 |
|
Charles Schlosser
|
387175c258
|
Fix safe_abs in int_pow
|
2023-06-23 04:12:41 +00:00 |
|
Charles Schlosser
|
c6db610bc7
|
Fix svd test
|
2023-06-22 17:37:24 +00:00 |
|
Charles Schlosser
|
969c31eefc
|
Fix AVX pstore
|
2023-06-15 01:47:38 +00:00 |
|
wilfried.karel
|
6c1411e521
|
define a move constructor for Ref<const...>
|
2023-06-14 20:10:51 +00:00 |
|
wilfried.karel
|
d8f3eb87bf
|
Compile- and run-time assertions for the construction of Ref<const>.
|
2023-06-14 15:49:58 +00:00 |
|
Charles Schlosser
|
59b3ef5409
|
Partially Vectorize Cast
|
2023-06-09 16:54:31 +00:00 |
|
Rasmus Munk Larsen
|
7d7576f326
|
Avoid underflow in prsqrt.
|
2023-06-06 14:06:19 -07:00 |
|
Charles Schlosser
|
b7151ffaab
|
Fix unary pow error handling and test
|
2023-06-06 18:46:55 +00:00 |
|
Rasmus Munk Larsen
|
7ac8897431
|
Reduce max relative error of prsqrt from 3 to 2 ulps.
|
2023-06-04 22:25:33 +00:00 |
|
Charles Schlosser
|
1d80e23186
|
Optimize scalar_unary_pow_op error handling
|
2023-06-02 18:53:06 +00:00 |
|
Alexander Shaposhnikov
|
316eab8deb
|
Do not set EIGEN_HAS_ARM64_FP16_SCALAR_ARITHMETIC for cuda compilation
|
2023-05-31 15:15:06 +00:00 |
|
Alejandro Acosta
|
07e4604b19
|
Replace usage of CudaStreamDevice with GpuStreamDevice in tensor benchmarks GPU
|
2023-05-30 15:44:07 +00:00 |
|
Rasmus Munk Larsen
|
8c43bf2b5b
|
Clean up Redux.h and fix vectorization_logic test after changes to traversal order in Redux.
|
2023-05-24 20:26:52 +00:00 |
|
Charles Schlosser
|
da6a71faf0
|
Add linear redux evaluators
|
2023-05-24 17:07:25 +00:00 |
|