Rasmus Munk Larsen
|
462758e8a3
|
Don't use generic sign function for sign(complex) unless it is vectorizable
|
2022-10-12 16:03:29 +00:00 |
|
Rasmus Munk Larsen
|
c0d6a72611
|
Use pnegate(pzero(x)) as a generic way to generate -0.0. Some compilers do not handle the literal -0.0 properly in fastmath mode.
|
2022-10-12 01:57:05 +00:00 |
|
Laurent Rineau
|
7846c7387c
|
Eigen/Sparse: fix warnings -Wunused-but-set-variable
|
2022-10-11 17:37:04 +00:00 |
|
Rasmus Munk Larsen
|
3167544873
|
Handle NaN inputs to atan2.
|
2022-10-10 19:36:36 -07:00 |
|
Rasmus Munk Larsen
|
72db3f0fa5
|
Remove references to M_PI_2 and M_PI_4.
|
2022-10-11 00:27:16 +00:00 |
|
Rasmus Munk Larsen
|
5ceed0d57f
|
Guard GCC-specific pragmas with "#ifdef EIGEN_COMP_GNUC"
|
2022-10-10 20:38:53 +00:00 |
|
Rasmus Munk Larsen
|
e95c4a837f
|
Simpler range reduction strategy for atan<float>().
|
2022-10-04 18:11:00 +00:00 |
|
Antonio Sánchez
|
80efbfdeda
|
Unconditionally enable CXX11 math.
|
2022-10-04 17:37:47 +00:00 |
|
Antonio Sánchez
|
e5794873cb
|
Replace assert with eigen_assert.
|
2022-10-04 17:11:23 +00:00 |
|
Antonio Sánchez
|
7d6a9925cc
|
Fix 4x4 inverse when compiling with -Ofast.
|
2022-10-04 16:05:49 +00:00 |
|
Rasmus Munk Larsen
|
1414a76fa9
|
Only vectorize atan<double> for Altivec if VSX is available.
|
2022-10-03 22:06:58 +00:00 |
|
Rasmus Munk Larsen
|
c475228b28
|
Vectorize atan() for double.
|
2022-10-01 01:49:30 +00:00 |
|
Rasmus Munk Larsen
|
1e1848fdb1
|
Add a vectorized implementation of atan2 to Eigen.
|
2022-09-28 20:46:49 +00:00 |
|
Rasmus Munk Larsen
|
b3bf8d6a13
|
Try to reduce size of GEBP kernel for non-ARM targets.
|
2022-09-28 02:37:18 +00:00 |
|
Rasmus Munk Larsen
|
13b69fc1b0
|
Try to reduce compilation time/memory for GEBP kernel using EIGEN_IF_CONSTEXPR
|
2022-09-23 20:09:42 +00:00 |
|
Rasmus Munk Larsen
|
ed8cda3ce4
|
Move EIGEN_NEON_GEBP_NR macro to the right place in GeneralBlockPanelKernel.h
|
2022-09-23 02:24:27 +00:00 |
|
Rasmus Munk Larsen
|
e2ea866515
|
Add a macro to set the nr trait in the GEBP kernel for NEON.
|
2022-09-22 23:56:34 +00:00 |
|
Lianhuang Li
|
23299632c2
|
Use 3px8/2px8/1px8/1x8 gebp_kernel on arm64-neon
|
2022-09-21 16:36:40 +00:00 |
|
Rasmus Munk Larsen
|
7b2901e2aa
|
Add vectorized integer division for int32 with AVX512, AVX or SSE.
|
2022-09-21 00:27:23 +00:00 |
|
Rasmus Munk Larsen
|
f913a40678
|
Revert "Add AVX int32_t pdiv"
This reverts commit ea84e7ad638c259397fc36fe6e3d82b9cb3b89d0
|
2022-09-16 22:48:08 +00:00 |
|
Rasmus Munk Larsen
|
273e0c884e
|
Revert "Add constexpr, test for C++14 constexpr."
|
2022-09-16 21:14:29 +00:00 |
|
Charles Schlosser
|
ea84e7ad63
|
Add AVX int32_t pdiv
|
2022-09-16 17:06:29 +00:00 |
|
Rasmus Munk Larsen
|
afc014f1b5
|
Allow mixed types for pow(), as long as the exponent is exactly representable in the base type.
|
2022-09-12 21:55:30 +00:00 |
|
Rasmus Munk Larsen
|
e8a2aa24a2
|
Fix a couple of issues with unary pow():
|
2022-09-09 17:21:11 +00:00 |
|
Rohit Santhanam
|
07d0759951
|
[ROCm] Fix for sparse matrix related breakage on ROCm.
|
2022-09-09 14:41:00 +00:00 |
|
Antonio Sánchez
|
fb212c745d
|
Fix g++-6 constexpr and c++20 constexpr build errors.
|
2022-09-09 03:41:45 +00:00 |
|
Thomas Gloor
|
ec9c7163a3
|
Feature/skew symmetric matrix3
|
2022-09-08 20:44:40 +00:00 |
|
Antonio Sánchez
|
311ba66f7c
|
Fix realloc for non-trivial types.
|
2022-09-08 19:39:36 +00:00 |
|
Rasmus Munk Larsen
|
f9dfda28ab
|
Add missing comparison operators for GPU packets.
|
2022-09-07 21:13:45 +00:00 |
|
Tobias Schlüter
|
133498c329
|
Add constexpr, test for C++14 constexpr.
|
2022-09-07 03:42:34 +00:00 |
|
Antonio Sanchez
|
3e44f960ed
|
Reduce compiler warnings for tests.
|
2022-09-06 18:20:56 +00:00 |
|
Florian Richer
|
b7e21d4e38
|
Call check_that_malloc_is_allowed() in aligned_realloc()
|
2022-09-06 18:00:37 +00:00 |
|
Michael Palomas
|
525f066671
|
Fixed MSVC compilation error in GeneralizedEigenSolver.h
|
2022-09-04 17:50:43 +00:00 |
|
Antonio Sánchez
|
f241a2c18a
|
Add asserts for index-out-of-bounds in IndexedView.
|
2022-09-02 17:28:03 +00:00 |
|
Antonio Sánchez
|
30c42222a6
|
Fix some test build errors in new unary pow.
|
2022-08-30 17:24:14 +00:00 |
|
Rasmus Munk Larsen
|
bd393e15c3
|
Vectorize acos, asin, and atan for float.
|
2022-08-29 19:49:33 +00:00 |
|
Charles Schlosser
|
e5af9f87f2
|
Vectorize pow for integer base / exponent types
|
2022-08-29 19:23:54 +00:00 |
|
chuckyschluz
|
8acbf5c11c
|
re-enable pow for complex types
|
2022-08-26 17:29:02 -04:00 |
|
Rasmus Munk Larsen
|
7064ed1345
|
Specialize psign<Packet8i> for AVX2, don't vectorize psign<bool>.
|
2022-08-26 17:02:37 +00:00 |
|
Rasmus Munk Larsen
|
98e51c9e24
|
Avoid undefined behavior in array_cwise test due to signed integer overflow
|
2022-08-26 16:19:03 +00:00 |
|
Arthur
|
a7c1cac18b
|
Fix GeneralizedEigenSolver::info() and Asserts
|
2022-08-25 22:05:04 +00:00 |
|
Antonio Sanchez
|
714678fc6c
|
Add missing ptr in realloc call.
|
2022-08-24 22:04:04 -07:00 |
|
Charles Schlosser
|
b2a13c9dd1
|
Sparse Core: Replace malloc/free with conditional_aligned
|
2022-08-23 21:44:22 +00:00 |
|
Rasmus Munk Larsen
|
6aad0f821b
|
Fix psign for unsigned integer types, such as bool.
|
2022-08-22 20:19:35 +00:00 |
|
Rasmus Munk Larsen
|
1a09defce7
|
Protect new pblend implementation with EIGEN_VECTORIZE_AVX2
|
2022-08-22 18:28:03 +00:00 |
|
Rasmus Munk Larsen
|
7c67dc67ae
|
Use proper double word division algorithm for pow<double>. Gives 11-15% speedup.
|
2022-08-17 18:36:23 +00:00 |
|
Matthew Sterrett
|
7a3b667c43
|
Add support for AVX512-FP16 for vectorizing half precision math
|
2022-08-17 18:15:21 +00:00 |
|
Charles Schlosser
|
76a669fb45
|
add fixed power unary operation
|
2022-08-16 21:32:36 +00:00 |
|
Matthew Sterrett
|
39fcc89798
|
Removed unnecessary checks for FP16C
|
2022-08-16 18:14:41 +00:00 |
|
Romain Biessy
|
2f7cce2dd5
|
[SYCL] Fix some SYCL tests
|
2022-08-16 17:37:54 +00:00 |
|