Charles Schlosser
|
21cd3fe209
|
Optimize check_rows_cols_for_overflow
|
2023-07-10 17:40:17 +00:00 |
|
Charles Schlosser
|
1a2bfca8f0
|
Fix annoying warnings
|
2023-07-07 20:19:58 +00:00 |
|
Antonio Sánchez
|
31cd2ad371
|
Ensure EIGEN_HAS_ARM64_FP16_VECTOR_ARITHMETIC is always defined on arm.
|
2023-06-26 19:21:54 +00:00 |
|
Antonio Sánchez
|
7465b7651e
|
Disable FP16 arithmetic for arm32.
|
2023-06-26 18:39:42 +00:00 |
|
Charles Schlosser
|
59b3ef5409
|
Partially Vectorize Cast
|
2023-06-09 16:54:31 +00:00 |
|
Alexander Shaposhnikov
|
316eab8deb
|
Do not set EIGEN_HAS_ARM64_FP16_SCALAR_ARITHMETIC for cuda compilation
|
2023-05-31 15:15:06 +00:00 |
|
Charles Schlosser
|
fbf7189bd5
|
Fix cuda compilation
|
2023-05-08 16:15:47 +00:00 |
|
Mehdi Goli
|
0623791930
|
[SYCL-2020] Enabling USM support for SYCL. SYCL-1.2.1 did not have support for USM.
|
2023-05-05 17:30:36 +00:00 |
|
Tobias Wood
|
94f57867fe
|
Thread pool
|
2023-05-05 16:23:34 +00:00 |
|
Antoine Hoarau
|
9b48d10215
|
Guard all malloc, realloc and free() fonctions with check_that_malloc_is_allowed()
|
2023-04-04 04:24:22 +00:00 |
|
Colin Broderick
|
8f9b8e3630
|
Replaced all instances of internal::(U)IntPtr with std::(u)intptr_t. Remove ICC workaround.
|
2023-03-21 16:50:23 +00:00 |
|
Antonio Sánchez
|
2c8011c2dd
|
Fix arm builds.
|
2023-03-20 16:59:38 +00:00 |
|
Jonas Schulze
|
81cb6a51d0
|
Fix some typos
|
2023-03-16 23:11:43 +00:00 |
|
Chip Kerchner
|
d71ac6a755
|
Fix recent PowerPC warnings and clang warning
|
2023-03-15 16:50:46 +00:00 |
|
Charles Schlosser
|
7bf2968fed
|
Specify Permutation Index for PartialPivLU and FullPivLU
|
2023-03-07 20:28:05 +00:00 |
|
Antonio Sánchez
|
bc5cdc7a67
|
Guard use of long double on GPU device.
|
2023-02-24 21:49:59 +00:00 |
|
Charles Schlosser
|
049a144798
|
Add typed logicals
|
2023-02-18 01:23:47 +00:00 |
|
Antonio Sánchez
|
e256ad1823
|
Remove LGPL Code and references.
|
2023-02-08 01:25:06 +00:00 |
|
Antonio Sánchez
|
f6cc359e10
|
More EIGEN_DEVICE_FUNC fixes for CUDA 10/11/12.
|
2023-02-03 19:18:45 +00:00 |
|
Charles Schlosser
|
fd2fd48703
|
Update file ForwardDeclarations.h
|
2023-02-01 16:52:20 +00:00 |
|
Antonio Sánchez
|
17ae83a966
|
Fix bugs exposed by enabling GPU asserts.
|
2023-01-27 21:43:00 +00:00 |
|
Charles Schlosser
|
6d9f662a70
|
Tweak atan2
|
2023-01-26 17:38:21 +00:00 |
|
Rasmus Munk Larsen
|
576448572f
|
More fixes for __GNUC_PATCHLEVEL__.
|
2023-01-23 17:04:24 +00:00 |
|
Rasmus Munk Larsen
|
164ddf75ab
|
Use __GNUC_PATCHLEVEL__ rather than __GNUC_PATCH__, according to the documentation https://gcc.gnu.org/onlinedocs/cpp/Common-Predefined-Macros.html
|
2023-01-23 16:56:14 +00:00 |
|
Antonio Sánchez
|
08c961e837
|
Add custom ODR-safe assert.
|
2023-01-20 17:38:13 +00:00 |
|
Sean McBride
|
d70b4864d9
|
issue #2581: review and cleanup of compiler version checks
|
2023-01-17 18:58:34 +00:00 |
|
tttapa
|
bae119bb7e
|
Support per-thread is_malloc_allowed() state
|
2023-01-16 01:34:56 +00:00 |
|
Antonio Sánchez
|
2e61c0c6b4
|
Add missing EIGEN_DEVICE_FUNC in a few places when called by asserts.
|
2023-01-15 02:06:17 +00:00 |
|
Charles Schlosser
|
68082b8226
|
Fix QR, again
|
2023-01-13 03:23:17 +00:00 |
|
Sergey Fedorov
|
4d05765345
|
Altivec fixes for Darwin: do not use unsupported VSX insns
|
2023-01-12 16:33:33 +00:00 |
|
Rasmus Munk Larsen
|
6156797016
|
Revert "Add template to specify QR permutation index type, Fix ColPivHouseholderQR Lapacke bindings"
This reverts commit be7791e097c1fc21d4f2e8713467431784f3a4fd
|
2023-01-11 18:50:52 +00:00 |
|
Charles Schlosser
|
be7791e097
|
Add template to specify QR permutation index type, Fix ColPivHouseholderQR Lapacke bindings
|
2023-01-11 15:57:28 +00:00 |
|
Chip Kerchner
|
d20fe21ae4
|
Improve performance for Power10 MMA bfloat16 GEMM
|
2023-01-06 23:08:37 +00:00 |
|
Antonio Sánchez
|
3564668908
|
Fix overalign check.
|
2023-01-05 17:10:48 +00:00 |
|
Charles Schlosser
|
f3929ac7ed
|
Fix EIGEN_HAS_CXX17_OVERALIGN for icc
|
2023-01-03 17:30:10 +00:00 |
|
Antonio Sánchez
|
dbf7ae6f9b
|
Fix up C++ version detection macros and cmake tests.
|
2022-12-20 18:06:03 +00:00 |
|
Pedro Caldeira
|
31ab62d347
|
Add support for Power10 (AltiVec) MMA instructions for bfloat16.
|
2022-11-30 23:33:37 +00:00 |
|
Charles Schlosser
|
044f3f6234
|
Fix bug in handmade_aligned_realloc
|
2022-11-18 22:35:31 +00:00 |
|
Gabriele Buondonno
|
6431dfdb50
|
Cross product for vectors of size 2. Fixes #1037
|
2022-11-15 22:39:42 +00:00 |
|
Charles Schlosser
|
82b152dbe7
|
Add signbit function
|
2022-11-04 00:31:20 +00:00 |
|
Antonio Sánchez
|
ab407b2b6e
|
Fix handmade_aligned_malloc offset computation.
|
2022-10-27 17:33:47 +00:00 |
|
Charles Schlosser
|
a226371371
|
Change handmade_aligned_malloc/realloc/free to store a 1 byte offset instead of absolute address
|
2022-10-22 22:51:31 +00:00 |
|
Antonio Sánchez
|
80efbfdeda
|
Unconditionally enable CXX11 math.
|
2022-10-04 17:37:47 +00:00 |
|
Rasmus Munk Larsen
|
13b69fc1b0
|
Try to reduce compilation time/memory for GEBP kernel using EIGEN_IF_CONSTEXPR
|
2022-09-23 20:09:42 +00:00 |
|
Rasmus Munk Larsen
|
273e0c884e
|
Revert "Add constexpr, test for C++14 constexpr."
|
2022-09-16 21:14:29 +00:00 |
|
Thomas Gloor
|
ec9c7163a3
|
Feature/skew symmetric matrix3
|
2022-09-08 20:44:40 +00:00 |
|
Antonio Sánchez
|
311ba66f7c
|
Fix realloc for non-trivial types.
|
2022-09-08 19:39:36 +00:00 |
|
Tobias Schlüter
|
133498c329
|
Add constexpr, test for C++14 constexpr.
|
2022-09-07 03:42:34 +00:00 |
|
Florian Richer
|
b7e21d4e38
|
Call check_that_malloc_is_allowed() in aligned_realloc()
|
2022-09-06 18:00:37 +00:00 |
|
chuckyschluz
|
8acbf5c11c
|
re-enable pow for complex types
|
2022-08-26 17:29:02 -04:00 |
|