Commit Graph

  • e4f58816d9 Get rid of custom implementation of equal_to and not_equal_no. No longer needed with c+14. Rasmus Munk Larsen 2023-02-07 21:36:44 -08:00
  • e256ad1823 Remove LGPL Code and references. Antonio Sánchez 2023-02-08 01:25:06 +00:00
  • e71f88abce Change in Power eigen_asserts to eigen_internal_asserts since it is putting unnecessary error checking and assertions without NDEBUG. Chip Kerchner 2023-02-08 00:57:30 +00:00
  • 232b18fa8a Fixes #2602 Gregory Kramida 2023-02-06 22:52:39 +00:00
  • f6cc359e10 More EIGEN_DEVICE_FUNC fixes for CUDA 10/11/12. Antonio Sánchez 2023-02-03 19:18:45 +00:00
  • 2a90653395 fix lapacke config Charles Schlosser 2023-02-03 16:40:08 +00:00
  • 3460f3558e Use VERIFY_IS_EQUAL to compare to zeros. Rasmus Munk Larsen 2023-02-01 13:49:56 -08:00
  • 13a1f25da9 Revert StlIterators edit from "Fix undefined behavior..." Jeremy Nimmer 2023-02-01 20:01:36 +00:00
  • fd2fd48703 Update file ForwardDeclarations.h Charles Schlosser 2023-02-01 16:52:20 +00:00
  • 37b2e97175 Tweak special case handling in atan2. Rasmus Munk Larsen 2023-01-31 17:48:00 -08:00
  • a1cdcdb038 Fix undefined behavior in Block access Jeremy Nimmer 2023-02-01 00:40:45 +00:00
  • 4a58f30aa0 Fix pre-POWER8_VECTOR bugs in pcmp_lt and pnegate and reactivate psqrt. Chip Kerchner 2023-01-31 19:40:24 +00:00
  • 12ad99ce60 Remove unused variables from GenericPacketMathFunctions.h Rasmus Munk Larsen 2023-01-29 18:10:28 +00:00
  • 6987a200bb Fix stupid sparse bugs with outerSize == 0 Charles Schlosser 2023-01-28 02:03:09 +00:00
  • 0471e61b4c Optimize various mathematical packet ops Charles Schlosser 2023-01-28 01:34:26 +00:00
  • 1aa6dc2007 Fix sparse warnings Charles Schlosser 2023-01-27 22:47:42 +00:00
  • 17ae83a966 Fix bugs exposed by enabling GPU asserts. Antonio Sánchez 2023-01-27 21:43:00 +00:00
  • ab8725d947 Turn off vectorize version of rsqrt - doesn't match generic version Chip Kerchner 2023-01-27 18:28:54 +00:00
  • 6d9f662a70 Tweak atan2 Charles Schlosser 2023-01-26 17:38:21 +00:00
  • 6fc9de7d93 Fix slowdown in bfloat16 MMA when rows is not a multiple of 8 or columns is not a multiple of 4. Chip Kerchner 2023-01-25 18:22:20 +00:00
  • 6d4221af76 Revert qr tests Charles Schlosser 2023-01-23 22:23:08 +00:00
  • 7f58bc98b1 Refactor sparse Charles Schlosser 2023-01-23 17:55:50 +00:00
  • 576448572f More fixes for __GNUC_PATCHLEVEL__. Rasmus Munk Larsen 2023-01-23 17:04:24 +00:00
  • 164ddf75ab Use __GNUC_PATCHLEVEL__ rather than __GNUC_PATCH__, according to the documentation https://gcc.gnu.org/onlinedocs/cpp/Common-Predefined-Macros.html Rasmus Munk Larsen 2023-01-23 16:56:14 +00:00
  • 5a7ca681d5 Fix sparse insert Charles Schlosser 2023-01-20 21:32:32 +00:00
  • 08c961e837 Add custom ODR-safe assert. Antonio Sánchez 2023-01-20 17:38:13 +00:00
  • 3fe8c51104 Replace the Deprecated $<CONFIGURATION> with $<CONFIG> Amir Masoud Abdol 2023-01-17 19:44:32 +00:00
  • d70b4864d9 issue #2581: review and cleanup of compiler version checks Sean McBride 2023-01-17 18:58:34 +00:00
  • b523120687 [SYCL-2020 Support] Enabling Intel DPCPP Compiler support to Eigen Mehdi Goli 2023-01-16 07:04:08 +00:00
  • bae119bb7e Support per-thread is_malloc_allowed() state tttapa 2023-01-16 01:34:56 +00:00
  • fa0bd2c34e improve sparse permutations Charles Schlosser 2023-01-15 03:21:25 +00:00
  • 2e61c0c6b4 Add missing EIGEN_DEVICE_FUNC in a few places when called by asserts. Antonio Sánchez 2023-01-15 02:06:17 +00:00
  • 4aca06f63a avoid move assignment in ColPivHouseholderQR Charles Schlosser 2023-01-15 01:34:10 +00:00
  • 68082b8226 Fix QR, again Charles Schlosser 2023-01-13 03:23:17 +00:00
  • 4d05765345 Altivec fixes for Darwin: do not use unsupported VSX insns Sergey Fedorov 2023-01-12 16:33:33 +00:00
  • 6156797016 Revert "Add template to specify QR permutation index type, Fix ColPivHouseholderQR Lapacke bindings" Rasmus Munk Larsen 2023-01-11 18:50:52 +00:00
  • be7791e097 Add template to specify QR permutation index type, Fix ColPivHouseholderQR Lapacke bindings Charles Schlosser 2023-01-11 15:57:28 +00:00
  • 9463fc95f4 change insert strategy Charles Schlosser 2023-01-11 06:24:49 +00:00
  • c54785b071 Fix error: unused parameter 'tmp' [-Werror,-Wunused-parameter] on clang/32-bit arm Martin Burchell 2023-01-10 21:15:28 +00:00
  • f47472603b Add missing header for GPU tests. Antonio Sanchez 2023-01-09 11:21:13 -08:00
  • 81172cbdcb Overhaul Sparse Core Charles Schlosser 2023-01-07 22:09:42 +00:00
  • 9255181891 Modified spbenchsolver help message because it could be misunderstood Robin Miquel 2023-01-07 21:35:46 +00:00
  • d20fe21ae4 Improve performance for Power10 MMA bfloat16 GEMM Chip Kerchner 2023-01-06 23:08:37 +00:00
  • fe7f527787 Fix guard macros for emulated FP16 operators on GPU Ryan Senanayake 2023-01-06 22:02:51 +00:00
  • b8422c99cd Update file jacobisvd.cpp Rasmus Munk Larsen 2023-01-06 21:14:17 +00:00
  • 262194f12c Fix a bunch of minor build and test issues. Antonio Sánchez 2023-01-06 16:37:26 +00:00
  • 3564668908 Fix overalign check. Antonio Sánchez 2023-01-05 17:10:48 +00:00
  • f3929ac7ed Fix EIGEN_HAS_CXX17_OVERALIGN for icc Charles Schlosser 2023-01-03 17:30:10 +00:00
  • 1b33a6374b Fixes git add . doesn't include scripts/buildtests.in LAI Bruce 2023-01-03 17:06:36 +00:00
  • a8bab0d8ae Patch SparseLU Charles Schlosser 2022-12-31 04:52:36 +00:00
  • 910f6f65d0 Adjust thresholds for bfloat16 product tests that are currently failing. Antonio Sánchez 2022-12-28 19:32:25 +00:00
  • 311cc0f9cc Enable NEON pcmp, plset, and complex psqrt Arthur 2022-12-22 05:38:34 +00:00
  • dbf7ae6f9b Fix up C++ version detection macros and cmake tests. Antonio Sánchez 2022-12-20 18:06:03 +00:00
  • bb6675caf7 Fix incorrect NEON native fp16 multiplication. Antonio Sánchez 2022-12-19 20:46:44 +00:00
  • dd85d26946 Revert "Avoid mixing types in CompressedStorage.h" Rasmus Munk Larsen 2022-12-19 20:09:37 +00:00
  • c4fb6af24b Enable NEON pabs for unsigned int types Arthur Feeney 2022-12-18 18:30:02 -08:00
  • 400bc5cd5b Add sparse_basic_1 to smoke tests. Rasmus Munk Larsen 2022-12-16 22:03:33 +00:00
  • 04e4f0bb24 Add missing colon in SparseMatrix.h. Rasmus Munk Larsen 2022-12-16 21:50:00 +00:00
  • 3d8a8def8a Avoid mixing types in CompressedStorage.h Rasmus Munk Larsen 2022-12-16 20:11:02 +00:00
  • 4bb2446796 Add operators to CompressedStorageIterator Charles Schlosser 2022-12-16 16:48:50 +00:00
  • e1aee4ab39 Update test of numext::signbit. Rasmus Munk Larsen 2022-12-15 11:39:32 -08:00
  • 3717854a21 Use numext::signbit instead of std::signbit, which is not defined for bfloat16. Rasmus Munk Larsen 2022-12-15 18:41:46 +00:00
  • 37de432907 Avoid using std::raise() for divide by zero Alexander Richardson 2022-12-14 20:06:16 +00:00
  • 62de593c40 Allow std::initializer_list constructors in constexpr expressions Alexander Richardson 2022-12-14 17:05:37 +00:00
  • 6d3e3678b4 optimize equalspace packetop Charles Schlosser 2022-12-13 01:22:25 +00:00
  • 2004831941 add EqualSpaced / setEqualSpaced Charles Schlosser 2022-12-13 00:54:57 +00:00
  • 273f803846 Add BDCSVD_LAPACKE binding Melven Roehrig-Zoellner 2022-12-09 18:50:12 +00:00
  • 03c9b4738c Enable direct access for NestByValue. Antonio Sánchez 2022-12-07 18:21:45 +00:00
  • b59f18b4f7 Increase L2 and L3 cache size for Power10. Chip Kerchner 2022-12-07 18:20:33 +00:00
  • c614b2bbd3 Fix index type for sparse index sorting. Antonio Sánchez 2022-12-06 00:02:31 +00:00
  • 44fe539150 add sparse sort inner vectors function Charles Schlosser 2022-12-01 19:28:56 +00:00
  • d194167149 Fix the bug using neon instruction fmla for data type half Lianhuang Li 2022-12-01 17:28:57 +00:00
  • 31ab62d347 Add support for Power10 (AltiVec) MMA instructions for bfloat16. Pedro Caldeira 2022-11-30 23:33:37 +00:00
  • dcb042a87d Fix serialization for non-compressed matrices. Antonio Sánchez 2022-11-30 18:16:47 +00:00
  • 2260e11eb0 Fix reshape strides when input has non-zero inner stride. Antonio Sánchez 2022-11-29 19:39:29 +00:00
  • 23524ab6fc Changing BiCGSTAB parameters initialization so that it works with custom types Alexandre Hoffmann 2022-11-29 19:37:46 +00:00
  • ab2b26fbc2 Fix sparseLU solver when destination has a non-unit stride. Antonio Sánchez 2022-11-29 19:37:03 +00:00
  • 551eebc8ca Add synchronize method to all devices. Antonio Sánchez 2022-11-29 19:35:02 +00:00
  • b7551bff92 Fix a bunch of annoying compiler warnings in tests Charles Schlosser 2022-11-21 20:07:19 +00:00
  • e7b1ad0315 Add serialization for sparse matrix and sparse vector. Antonio Sánchez 2022-11-21 19:43:07 +00:00
  • 044f3f6234 Fix bug in handmade_aligned_realloc Charles Schlosser 2022-11-18 22:35:31 +00:00
  • 6728683938 Small cleanup of IDRS.h Chris 2022-11-16 13:51:23 +00:00
  • 02805bd56c Fix AVX2 psignbit Charles Schlosser 2022-11-16 13:43:11 +00:00
  • 399ce1ed63 Fix duplicate execution code for Power 8 Altivec in pstore_partial. Chip Kerchner 2022-11-16 13:41:42 +00:00
  • 6431dfdb50 Cross product for vectors of size 2. Fixes #1037 Gabriele Buondonno 2022-11-15 22:39:42 +00:00
  • 8588d8c74b Correct pnegate for floating-point zero. Antonio Sánchez 2022-11-15 18:07:23 +00:00
  • 5eacb9e117 Put brackets around unsigned type names. Antonio Sanchez 2022-11-15 09:09:08 -08:00
  • 37e40dca85 Fix ambiguity in PPC for vec_splats call. Antonio Sánchez 2022-11-14 18:58:16 +00:00
  • 7dc6db75d4 Fix typo in CholmodSupport Antonio Sánchez 2022-11-08 23:49:56 +00:00
  • 9b6d624eab fix neon Charles Schlosser 2022-11-08 20:03:01 +00:00
  • 7e398e9436 Add missing return keyword in psignbit for NEON. Rasmus Munk Larsen 2022-11-04 16:13:09 +00:00
  • 82b152dbe7 Add signbit function Charles Schlosser 2022-11-04 00:31:20 +00:00
  • 8f8e36458f Remove recently added sparse assert in SparseMapBase. Antonio Sánchez 2022-11-03 17:29:05 +00:00
  • 01a31b81b2 Remove unused parameter name. Antonio Sanchez 2022-11-01 15:51:25 -07:00
  • c5b896c5a3 Allow empty matrices to be resized. Antonio Sánchez 2022-10-27 20:33:35 +00:00
  • 886aad1361 Disable patan for double on PPC. Antonio Sánchez 2022-10-27 17:56:08 +00:00
  • ab407b2b6e Fix handmade_aligned_malloc offset computation. Antonio Sánchez 2022-10-27 17:33:47 +00:00
  • adb30efb25 Add assert for invalid outerIndexPtr array in SparseMapBase. Antonio Sánchez 2022-10-26 22:51:33 +00:00
  • c27d1abe46 Fix pragma check for disabling fastmath. Antonio Sánchez 2022-10-26 22:50:57 +00:00
  • a226371371 Change handmade_aligned_malloc/realloc/free to store a 1 byte offset instead of absolute address Charles Schlosser 2022-10-22 22:51:31 +00:00