Rasmus Munk Larsen
|
afc014f1b5
|
Allow mixed types for pow(), as long as the exponent is exactly representable in the base type.
|
2022-09-12 21:55:30 +00:00 |
|
Rasmus Munk Larsen
|
e8a2aa24a2
|
Fix a couple of issues with unary pow():
|
2022-09-09 17:21:11 +00:00 |
|
Rohit Santhanam
|
07d0759951
|
[ROCm] Fix for sparse matrix related breakage on ROCm.
|
2022-09-09 14:41:00 +00:00 |
|
Antonio Sánchez
|
fb212c745d
|
Fix g++-6 constexpr and c++20 constexpr build errors.
|
2022-09-09 03:41:45 +00:00 |
|
Thomas Gloor
|
ec9c7163a3
|
Feature/skew symmetric matrix3
|
2022-09-08 20:44:40 +00:00 |
|
Antonio Sánchez
|
311ba66f7c
|
Fix realloc for non-trivial types.
|
2022-09-08 19:39:36 +00:00 |
|
Rasmus Munk Larsen
|
f9dfda28ab
|
Add missing comparison operators for GPU packets.
|
2022-09-07 21:13:45 +00:00 |
|
Tobias Schlüter
|
133498c329
|
Add constexpr, test for C++14 constexpr.
|
2022-09-07 03:42:34 +00:00 |
|
Antonio Sanchez
|
3e44f960ed
|
Reduce compiler warnings for tests.
|
2022-09-06 18:20:56 +00:00 |
|
Florian Richer
|
b7e21d4e38
|
Call check_that_malloc_is_allowed() in aligned_realloc()
|
2022-09-06 18:00:37 +00:00 |
|
Antonio Sánchez
|
f241a2c18a
|
Add asserts for index-out-of-bounds in IndexedView.
|
2022-09-02 17:28:03 +00:00 |
|
Antonio Sánchez
|
30c42222a6
|
Fix some test build errors in new unary pow.
|
2022-08-30 17:24:14 +00:00 |
|
Rasmus Munk Larsen
|
bd393e15c3
|
Vectorize acos, asin, and atan for float.
|
2022-08-29 19:49:33 +00:00 |
|
Charles Schlosser
|
e5af9f87f2
|
Vectorize pow for integer base / exponent types
|
2022-08-29 19:23:54 +00:00 |
|
chuckyschluz
|
8acbf5c11c
|
re-enable pow for complex types
|
2022-08-26 17:29:02 -04:00 |
|
Rasmus Munk Larsen
|
7064ed1345
|
Specialize psign<Packet8i> for AVX2, don't vectorize psign<bool>.
|
2022-08-26 17:02:37 +00:00 |
|
Rasmus Munk Larsen
|
98e51c9e24
|
Avoid undefined behavior in array_cwise test due to signed integer overflow
|
2022-08-26 16:19:03 +00:00 |
|
Rasmus Munk Larsen
|
6aad0f821b
|
Fix psign for unsigned integer types, such as bool.
|
2022-08-22 20:19:35 +00:00 |
|
Rasmus Munk Larsen
|
1a09defce7
|
Protect new pblend implementation with EIGEN_VECTORIZE_AVX2
|
2022-08-22 18:28:03 +00:00 |
|
Rasmus Munk Larsen
|
7c67dc67ae
|
Use proper double word division algorithm for pow<double>. Gives 11-15% speedup.
|
2022-08-17 18:36:23 +00:00 |
|
Matthew Sterrett
|
7a3b667c43
|
Add support for AVX512-FP16 for vectorizing half precision math
|
2022-08-17 18:15:21 +00:00 |
|
Charles Schlosser
|
76a669fb45
|
add fixed power unary operation
|
2022-08-16 21:32:36 +00:00 |
|
Matthew Sterrett
|
39fcc89798
|
Removed unnecessary checks for FP16C
|
2022-08-16 18:14:41 +00:00 |
|
Romain Biessy
|
2f7cce2dd5
|
[SYCL] Fix some SYCL tests
|
2022-08-16 17:37:54 +00:00 |
|
Arthur
|
27367017bd
|
Disable bad "deprecated warning" edge-case in BDCSVD
|
2022-08-11 18:43:31 +00:00 |
|
Lexi Bromfield
|
66ea0c09fd
|
Don't double-define Half functions on aarch64
|
2022-08-09 20:00:34 +00:00 |
|
Rasmus Munk Larsen
|
97e0784dc6
|
Vectorize the sign operator in Eigen.
|
2022-08-09 19:54:57 +00:00 |
|
Rasmus Munk Larsen
|
7a87ed1b6a
|
Fix code and unit test for a few corner cases in vectorized pow()
|
2022-08-08 18:48:36 +00:00 |
|
Chip Kerchner
|
9e0afe0f02
|
Fix non-VSX PowerPC build
|
2022-08-08 18:18:17 +00:00 |
|
Chip Kerchner
|
84a9d6fac9
|
Fix use of Packet2d type for non-VSX.
|
2022-08-03 20:48:13 +00:00 |
|
Chip Kerchner
|
ce60a7be83
|
Partial Packet support for GEMM real-only (PowerPC). Also fix compilation warnings & errors for some conditions in new API.
|
2022-08-03 18:15:19 +00:00 |
|
Ilya Tokar
|
e618c4a5e9
|
Improve pblend AVX implementation
|
2022-07-29 18:45:33 +00:00 |
|
Alexander Richardson
|
b7668c0371
|
Avoid including <sstream> with EIGEN_NO_IO
|
2022-07-29 18:02:51 +00:00 |
|
Antonio Sánchez
|
34780d8bd1
|
Include immintrin.h header for enscripten.
|
2022-07-22 02:27:42 +00:00 |
|
Antonio Sánchez
|
2cf4d18c9c
|
Disable AVX512 GEMM kernels by default.
|
2022-07-20 21:22:48 +00:00 |
|
Charles Schlosser
|
a678a3e052
|
Fix aligned_realloc to call check_that_malloc_is_allowed() if ptr == 0
|
2022-07-19 20:59:07 +00:00 |
|
b-shi
|
4a56359406
|
Add option to disable avx512 GEBP kernels
|
2022-07-18 17:59:09 +00:00 |
|
Mathieu Westphal
|
1092574b26
|
Fix wrong doxygen group usage
|
2022-07-12 13:22:46 +02:00 |
|
Chip Kerchner
|
84cf3ff18d
|
Add pload_partial, pstore_partial (and unaligned versions), pgather_partial, pscatter_partial, loadPacketPartial and storePacketPartial.
|
2022-06-27 19:18:00 +00:00 |
|
Chip Kerchner
|
c603275dc9
|
Better performance for Power10 using more load and store vector pairs for GEMV
|
2022-06-27 18:11:55 +00:00 |
|
Antonio Sánchez
|
bc2ab81634
|
Eliminate undef warnings when not compiling for AVX512.
|
2022-06-24 15:10:10 +00:00 |
|
b-shi
|
37673ca1bc
|
AVX512 TRSM kernels use alloca if EIGEN_NO_MALLOC requested
|
2022-06-17 18:05:26 +00:00 |
|
Chip Kerchner
|
4d1c16eab8
|
Fix tanh and erf to use vectorized version for EIGEN_FAST_MATH in VSX.
|
2022-06-15 16:06:43 +00:00 |
|
Mehdi Goli
|
7ea823e824
|
[SYCL-Spec] According to [SYCL-2020 spec](...
|
2022-06-13 15:52:29 +00:00 |
|
Arthur
|
ba4d7304e2
|
Document DiagonalBase
|
2022-06-08 17:46:32 +00:00 |
|
Binhao Qin
|
95463b59bc
|
Mark index_remap as EIGEN_DEVICE_FUNC in src/Core/Reshaped.h (Fixes #2493)
|
2022-06-07 20:10:47 +00:00 |
|
Shi, Brian
|
28812d2ebb
|
AVX512 TRSM Kernels respect EIGEN_NO_MALLOC
|
2022-06-07 11:28:42 -07:00 |
|
Arthur
|
14aae29470
|
Provide DiagonalMatrix Product and Initializers
|
2022-06-06 21:43:22 +00:00 |
|
aaraujom
|
8fbb76a043
|
Fix build issues with MSVC for AVX512
|
2022-06-03 14:55:40 +00:00 |
|
aaraujom
|
d49ede4dc4
|
Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
|
2022-05-28 02:00:21 +00:00 |
|