Antonio Sánchez
|
f5364331eb
|
Fix some cmake issues.
|
2022-09-02 16:43:14 +00:00 |
|
Antonio Sánchez
|
d816044b6e
|
Fix mixingtypes tests.
|
2022-09-02 15:30:13 +00:00 |
|
Gilles Aouizerate
|
94cc83faa1
|
2 typos fix in the 3rd table.
|
2022-08-31 19:54:42 +00:00 |
|
Antonio Sánchez
|
30c42222a6
|
Fix some test build errors in new unary pow.
|
2022-08-30 17:24:14 +00:00 |
|
Rasmus Munk Larsen
|
bd393e15c3
|
Vectorize acos, asin, and atan for float.
|
2022-08-29 19:49:33 +00:00 |
|
Charles Schlosser
|
e5af9f87f2
|
Vectorize pow for integer base / exponent types
|
2022-08-29 19:23:54 +00:00 |
|
chuckyschluz
|
8acbf5c11c
|
re-enable pow for complex types
|
2022-08-26 17:29:02 -04:00 |
|
Rasmus Munk Larsen
|
7064ed1345
|
Specialize psign<Packet8i> for AVX2, don't vectorize psign<bool>.
|
2022-08-26 17:02:37 +00:00 |
|
Rasmus Munk Larsen
|
98e51c9e24
|
Avoid undefined behavior in array_cwise test due to signed integer overflow
|
2022-08-26 16:19:03 +00:00 |
|
Arthur
|
a7c1cac18b
|
Fix GeneralizedEigenSolver::info() and Asserts
|
2022-08-25 22:05:04 +00:00 |
|
Antonio Sanchez
|
714678fc6c
|
Add missing ptr in realloc call.
|
2022-08-24 22:04:04 -07:00 |
|
Charles Schlosser
|
b2a13c9dd1
|
Sparse Core: Replace malloc/free with conditional_aligned
|
2022-08-23 21:44:22 +00:00 |
|
Rasmus Munk Larsen
|
6aad0f821b
|
Fix psign for unsigned integer types, such as bool.
|
2022-08-22 20:19:35 +00:00 |
|
Rasmus Munk Larsen
|
1a09defce7
|
Protect new pblend implementation with EIGEN_VECTORIZE_AVX2
|
2022-08-22 18:28:03 +00:00 |
|
Rasmus Munk Larsen
|
7c67dc67ae
|
Use proper double word division algorithm for pow<double>. Gives 11-15% speedup.
|
2022-08-17 18:36:23 +00:00 |
|
Matthew Sterrett
|
7a3b667c43
|
Add support for AVX512-FP16 for vectorizing half precision math
|
2022-08-17 18:15:21 +00:00 |
|
Charles Schlosser
|
76a669fb45
|
add fixed power unary operation
|
2022-08-16 21:32:36 +00:00 |
|
Matthew Sterrett
|
39fcc89798
|
Removed unnecessary checks for FP16C
|
2022-08-16 18:14:41 +00:00 |
|
Romain Biessy
|
2f7cce2dd5
|
[SYCL] Fix some SYCL tests
|
2022-08-16 17:37:54 +00:00 |
|
Arthur
|
27367017bd
|
Disable bad "deprecated warning" edge-case in BDCSVD
|
2022-08-11 18:43:31 +00:00 |
|
Antonio Sánchez
|
b8e93bf589
|
Eliminate bool bitwise warnings.
|
2022-08-09 22:42:30 +00:00 |
|
Lexi Bromfield
|
66ea0c09fd
|
Don't double-define Half functions on aarch64
|
2022-08-09 20:00:34 +00:00 |
|
Rasmus Munk Larsen
|
97e0784dc6
|
Vectorize the sign operator in Eigen.
|
2022-08-09 19:54:57 +00:00 |
|
Arthur
|
be20207d10
|
Fix vectorized Jacobi Rotation
|
2022-08-08 19:29:56 +00:00 |
|
Rasmus Munk Larsen
|
7a87ed1b6a
|
Fix code and unit test for a few corner cases in vectorized pow()
|
2022-08-08 18:48:36 +00:00 |
|
Chip Kerchner
|
9e0afe0f02
|
Fix non-VSX PowerPC build
|
2022-08-08 18:18:17 +00:00 |
|
Chip Kerchner
|
84a9d6fac9
|
Fix use of Packet2d type for non-VSX.
|
2022-08-03 20:48:13 +00:00 |
|
Chip Kerchner
|
ce60a7be83
|
Partial Packet support for GEMM real-only (PowerPC). Also fix compilation warnings & errors for some conditions in new API.
|
2022-08-03 18:15:19 +00:00 |
|
Antonio Sánchez
|
5a1c7807e6
|
Fix inner iterator for sparse block.
|
2022-08-03 17:26:12 +00:00 |
|
Antonio Sánchez
|
39d22ef46b
|
Fix flaky packetmath_1 test.
|
2022-08-02 17:42:45 +00:00 |
|
Antonio Sánchez
|
7896c7dc6b
|
Use numext::sqrt in ConjugateGradient.
|
2022-07-29 20:17:23 +00:00 |
|
Ilya Tokar
|
e618c4a5e9
|
Improve pblend AVX implementation
|
2022-07-29 18:45:33 +00:00 |
|
sjusju
|
ef4654bae7
|
Add true determinant to QR and it's variants
|
2022-07-29 18:24:14 +00:00 |
|
Alexander Richardson
|
b7668c0371
|
Avoid including <sstream> with EIGEN_NO_IO
|
2022-07-29 18:02:51 +00:00 |
|
John Mather
|
7dd3dda3da
|
Updated AccelerateSupport documentation after PR 966.
|
2022-07-29 17:42:31 +00:00 |
|
Julian Kent
|
69714ff613
|
Add Sparse Subset of Matrix Inverse
|
2022-07-28 18:04:35 +00:00 |
|
Antonio Sánchez
|
34780d8bd1
|
Include immintrin.h header for enscripten.
|
2022-07-22 02:27:42 +00:00 |
|
Antonio Sánchez
|
2cf4d18c9c
|
Disable AVX512 GEMM kernels by default.
|
2022-07-20 21:22:48 +00:00 |
|
Charles Schlosser
|
a678a3e052
|
Fix aligned_realloc to call check_that_malloc_is_allowed() if ptr == 0
|
2022-07-19 20:59:07 +00:00 |
|
b-shi
|
4a56359406
|
Add option to disable avx512 GEBP kernels
|
2022-07-18 17:59:09 +00:00 |
|
Mathieu Westphal
|
1092574b26
|
Fix wrong doxygen group usage
|
2022-07-12 13:22:46 +02:00 |
|
Antonio Sánchez
|
e1165dbf9a
|
AutoDiff depends on Core, so include appropriate header.
|
2022-07-09 23:57:09 +00:00 |
|
Antonio Sánchez
|
bb51d9f4fa
|
Fix ODR violations.
|
2022-07-09 04:56:36 +00:00 |
|
Rohit Santhanam
|
06a458a13d
|
Enable subtests which use device side malloc since this has been fixed in ROCm 5.2.
|
2022-06-29 17:09:43 +00:00 |
|
Chip Kerchner
|
84cf3ff18d
|
Add pload_partial, pstore_partial (and unaligned versions), pgather_partial, pscatter_partial, loadPacketPartial and storePacketPartial.
|
2022-06-27 19:18:00 +00:00 |
|
Chip Kerchner
|
c603275dc9
|
Better performance for Power10 using more load and store vector pairs for GEMV
|
2022-06-27 18:11:55 +00:00 |
|
Antonio Sanchez
|
0e18714167
|
Fix clang-tidy warnings about function definitions in headers.
|
2022-06-24 15:10:58 +00:00 |
|
Antonio Sánchez
|
8ed3b9dcd6
|
Skip f16/bf16 bessel specializations on AVX512 if unavailable.
|
2022-06-24 15:10:36 +00:00 |
|
Antonio Sánchez
|
bc2ab81634
|
Eliminate undef warnings when not compiling for AVX512.
|
2022-06-24 15:10:10 +00:00 |
|
Antonio Sánchez
|
0e083b172e
|
Use numext::sqrt in Householder.h.
|
2022-06-21 16:29:59 +00:00 |
|