Commit Graph

  • 2a35a917be Fix syntax error in NonBlockingThreadPool.h Rasmus Munk Larsen 2025-01-28 18:34:31 +00:00
  • a056b93114 improve Simplicial Cholesky analyzePattern Charles Schlosser 2025-01-28 17:53:43 +00:00
  • 5d866a7a78 Fix potential data race on spin_count_ NonBlockingThreadPool member variable William Kong 2025-01-28 17:22:15 +00:00
  • bc67025ba7 Clean up and fix the documentation of ForkJoin.h William Kong 2025-01-27 23:12:17 +00:00
  • dc1126e762 Fix threadpool for c++14. Antonio Sánchez 2025-01-27 21:57:23 +00:00
  • cd511a09aa Fix initialization order and remove unused variables in NonBlockingThreadPool.h. Rasmus Munk Larsen 2025-01-27 19:35:49 +00:00
  • f679843dc2 Block doc non square Johannes Zipfel 2025-01-25 17:14:21 +00:00
  • f9705adabb Fix typo introduced in the refactor of NonBlockingThreadPool William Kong 2025-01-25 17:13:24 +00:00
  • b75895a8b6 Try to fix loongarch Antonio Sánchez 2025-01-25 16:38:41 +00:00
  • 4a6ac97d13 Add a ForkJoin-based ParallelFor algorithm to the ThreadPool module William Kong 2025-01-24 22:12:05 +00:00
  • e986838464 Add LoongArch64 architecture LSX support.(build/test ) Pengzhou0810 2025-01-20 18:37:44 +00:00
  • c486af5ad3 Change Eigen::aligned_allocator to not inherit from std::allocator Markus Vieth 2025-01-20 16:04:43 +00:00
  • abac563f5d Update documentation to clarify cross product for complex numbers. Antonio Sánchez 2025-01-16 00:52:40 +00:00
  • 2e76277bd0 Zero-initialize test arrays to avoid uninitialized reads. Antonio Sanchez 2025-01-14 09:15:43 -08:00
  • ad13df7ea4 Fix std::fill_n reference. Antonio Sánchez 2025-01-14 00:43:00 +00:00
  • 9836e8d035 Fix read of uninitialized threshold in SparseQR Frédéric Simonis 2025-01-08 23:40:58 +00:00
  • 7bb23b1e36 CI: don't add ToolChain PPA Charles Schlosser 2024-12-31 14:04:01 +00:00
  • 7bb8c58e7c Fix the missing CUDA device qualifier xsjk 2024-12-28 15:17:55 +00:00
  • 24e0c2a125 use omp_get_max_threads if setNbThreads is not set Joerg Buchwald 2024-12-20 21:16:15 +00:00
  • a32db43966 Add missing #include <new> Jordan Rupprecht 2024-12-19 11:06:08 +00:00
  • c01ff45312 Enable fill_n and memset optimizations for construction and assignment Charles Schlosser 2024-12-14 14:25:04 +00:00
  • af59ada0ac Use alpine for deploying nightly tag. Antonio Sánchez 2024-12-10 22:48:29 +00:00
  • 4a9e32ae0b matrix equality operator Charles Schlosser 2024-12-10 12:40:39 +00:00
  • 00776d1ba4 Remove branch name from nightly tag job. Antonio Sanchez 2024-12-09 20:18:06 -08:00
  • 7f23778593 Add tag to commit instead of branch Antonio Sanchez 2024-12-09 07:47:48 -08:00
  • c30b35a310 Force tag to update to latest head. Antonio Sánchez 2024-12-08 04:48:21 +00:00
  • a26ba67349 Add LICENSE file in correct place so it is picked up by gitlab. Antonio Sánchez 2024-12-08 03:26:43 +00:00
  • 08c31c3ba6 try alpine for formatting Charles Schlosser 2024-12-08 01:09:33 +00:00
  • 1ac1af62ef Update deploy job Antonio Sanchez 2024-12-07 09:19:21 -08:00
  • 7b6623af30 Fix special packetmath erfc flushing for ARM32. Antonio Sánchez 2024-12-07 01:42:30 +00:00
  • fd48fbb260 Update rocm docker again again. Antonio Sánchez 2024-12-06 22:13:53 +00:00
  • a885340ba5 Update rocm docker again. Antonio Sánchez 2024-12-06 17:19:31 +00:00
  • 45a8478d09 Update rocm docker image in CI. Antonio Sanchez 2024-12-06 07:14:59 -08:00
  • de4afcf414 Add a deploy phase to the CI that tags the latest nightly pipeline if it passes. Antonio Sánchez 2024-12-05 15:28:18 +00:00
  • 5e8916050b move constructor / move assignment doc strings Charles Schlosser 2024-12-04 17:42:20 +00:00
  • 77a073aaa8 fix checkformat ci stage Charles Schlosser 2024-12-04 02:45:52 +00:00
  • 41e46ed243 fix IOFormat alignment Charles Schlosser 2024-12-04 01:13:48 +00:00
  • a0d32e40d9 fix map fill logic Charles Schlosser 2024-11-30 13:39:02 +00:00
  • d34b100c13 Fix UB in setZero Charles Schlosser 2024-11-27 19:32:14 +00:00
  • f19a6803c8 Refactor special case handling in pow(x,y) and revert to repeated squaring for <float,int> Rasmus Munk Larsen 2024-11-27 00:24:21 +00:00
  • 5064cb7d5e Add test for using pcast on scalars. Rasmus Munk Larsen 2024-11-25 22:27:26 -08:00
  • 1ea61a5d26 Improve pow(x,y): 25% speedup, increase accuracy for integer exponents. Rasmus Munk Larsen 2024-11-26 06:13:48 +00:00
  • 8ad4344ca7 optimize setConstant, setZero Charles Schlosser 2024-11-22 03:39:19 +00:00
  • 5610a13b77 Simplify and speed up pow() by 5-6% Rasmus Munk Larsen 2024-11-20 12:45:00 +00:00
  • 6c6ce9d06b Enable vectorized erf<double>(x) for SSE and AVX, which was accidentally removed in merge request 1750. Rasmus Munk Larsen 2024-11-19 22:14:29 +00:00
  • e7c799b7c9 Prevent premature overflow to infinity in exp(x). The changes also provide a 3-4% speedup. Rasmus Munk Larsen 2024-11-19 13:08:18 -08:00
  • 00af47102d Revert 040180078d Rasmus Munk Larsen 2024-11-19 10:25:16 -08:00
  • 8ee6f8475a Speed up exp(x). Rasmus Munk Larsen 2024-11-19 17:50:34 +00:00
  • 93ec5450cb disable fill_n optimization for msvc Charles Schlosser 2024-11-19 01:38:48 +00:00
  • 0af6ab4b76 Remove unnecessary check for HasBlend trait. Rasmus Munk Larsen 2024-11-18 13:04:59 -08:00
  • d5eec781b7 Get rid of redundant computation for large arguments to erf(x). Rasmus Munk Larsen 2024-11-18 10:51:58 -08:00
  • 2fc63808e4 Fix C++20 constexpr test compilation failures Tyler Veness 2024-11-18 01:56:55 +00:00
  • 5133c836c0 Vectorize erf(x) for double. Rasmus Munk Larsen 2024-11-16 19:05:16 +00:00
  • d6e3b528b2 Update Assign_MKL.h to cast disparate enum type to int, so it can be compared... Conrad Poelman 2024-11-15 20:00:29 +00:00
  • 040180078d Ensure that destructor's needed by lldb make it into binary in non-inlined fashion breathe1 2024-11-15 17:15:09 +00:00
  • 0fb2ed140d Make element accessors constexpr Tyler Veness 2024-11-14 01:05:29 +00:00
  • e67c494cba Use old syntax for CMake's separate_arguments() to restore compatiblity with old CMake versions. Morris Hafner 2024-11-13 17:01:13 +00:00
  • 8b4efc8ed8 check_size_for_overflow: use numeric limits instead of c99 macro Charles Schlosser 2024-11-13 00:35:35 +00:00
  • 489dbbc651 make fixed_size matrices conform to std::is_standard_layout Charles Schlosser 2024-11-12 23:34:26 +00:00
  • 283d871a3f Add missing EIGEN_DEVICE_FUNCTION decorations. Rasmus Munk Larsen 2024-11-08 14:25:57 -08:00
  • 0d366f6532 Vectorize erfc(x) for double and improve erfc(x) for float. Rasmus Munk Larsen 2024-11-08 17:21:11 +00:00
  • 8adf43640e more avx predux_any Charles Schlosser 2024-11-07 19:58:48 +00:00
  • bc424f617a add missing avx predux_any functions Charles Schlosser 2024-11-07 19:11:29 +00:00
  • e52ac76ca3 use EIGEN_CPLUSPLUS instead of checking cpp version Charles Schlosser 2024-11-06 17:25:22 +00:00
  • 122be167cd Revert "make fixed-size objects trivially move assignable" Rasmus Munk Larsen 2024-11-06 01:09:38 +00:00
  • d49021212b Tensor Roll / Circular Shift / Rotate Tobias Wood 2024-11-05 14:10:19 +00:00
  • 3e7bcf54f7 cherry-pick !1682 Add nvc++ support into 3.4 Morris Hafner 2024-11-04 17:55:47 +00:00
  • bb73be8a2e make fixed-size objects trivially move assignable Charles Schlosser 2024-11-04 17:55:27 +00:00
  • 7fd305ecae Fix GPU builds. Antonio Sánchez 2024-11-01 04:50:03 +00:00
  • c8267654f2 Don't use __builtin_alloca_with_align with nvc++ Morris Hafner 2024-10-30 18:02:08 +00:00
  • 84c446df2c Fix macro redefinition warning in FFTW test Tyler Veness 2024-10-30 17:18:42 +00:00
  • a9584d8e3c Fix clang6 failures. Antonio Sánchez 2024-10-30 14:41:50 +00:00
  • dd4c2805d9 Fix clang6 failures. Antonio Sánchez 2024-10-29 22:18:30 +00:00
  • 9e962d9c54 Fix OOB access in triangular matrix multiplication. Antonio Sánchez 2024-10-29 19:07:07 +00:00
  • 695e49d1bd Fix NVCC builds for CUDA 10+. Antonio Sánchez 2024-10-29 18:38:14 +00:00
  • dae09773fc Don't pass matrices by value. Antonio Sánchez 2024-10-29 18:19:02 +00:00
  • c23ec3420e Add tests for sizeof() with one dynamic dimension. Rasmus Munk Larsen 2024-10-28 13:48:53 -07:00
  • 58b252e5b3 Fix typo in PacketMath.h Rasmus Munk Larsen 2024-10-28 18:19:52 +00:00
  • 6c04d0cd68 Add missing exp2 definition for Altivec. Rasmus Munk Larsen 2024-10-28 18:12:36 +00:00
  • b15ebb1c2d add nextafter for bfloat16 Peter Gavin 2024-10-21 21:23:41 +00:00
  • 53b83cddf9 Include <type_traits> in main.h for std::is_trivial* Rasmus Munk Larsen 2024-10-25 20:55:51 +00:00
  • 37563856c9 Fix stack allocation assert Charles Schlosser 2024-10-25 17:02:43 +00:00
  • 3f067c4850 Add exp2() as a packet op and array method. Rasmus Munk Larsen 2024-10-22 22:09:34 +00:00
  • 4e5136d239 make fixed size matrices and arrays trivially_default_constructible Charles Schlosser 2024-10-21 17:10:15 +00:00
  • b396a6fbb2 Add free-function swap. Antonio Sánchez 2024-10-14 15:51:40 +00:00
  • 820e8a45fb add compile time info to reverse in place Charles Schlosser 2024-10-13 17:55:56 +00:00
  • b55dab7f21 Fix DenseBase::tail for Dynamic template argument Charles Schlosser 2024-10-12 21:03:30 +00:00
  • e0cbc55d92 Update README.md Charles Schlosser 2024-10-10 01:54:30 +00:00
  • 7eea0a9213 Vectorize erfc() for float Rasmus Munk Larsen 2024-10-09 18:38:05 +00:00
  • 78f3c654ee Don't use constexpr with half. Rasmus Munk Larsen 2024-10-08 16:44:40 +00:00
  • 6d7af238fa Adjust array_cwise for 32-bit arm. Antonio Sánchez 2024-10-07 23:15:24 +00:00
  • 74dcfbbd0f Use ppolevl for polynomial evaluation in more places. Rasmus Munk Larsen 2024-10-07 13:27:28 -07:00
  • a097f728fe Avoid producing erf(x) = NaN for large |x|. Rasmus Munk Larsen 2024-10-04 12:15:23 -07:00
  • 44b16f48cb Improve speed and accuracy or erf() Rasmus Munk Larsen 2024-10-03 01:52:16 +00:00
  • 12068cbcdb Fix inverse evaluator for running on CUDA device. Antonio Sánchez 2024-10-01 20:59:54 +00:00
  • 4e8e5e7409 Add max_digits10 in NumTraits for mpreal types. Rasmus Munk Larsen 2024-10-01 11:45:06 -07:00
  • 8e8c319087 Add missing EIGEN_DEVICE_FUNC annotations. Rasmus Munk Larsen 2024-10-01 11:40:58 -07:00
  • 7ad7c1d5c5 fix implicit conversion warning (again) Charles Schlosser 2024-09-24 22:07:00 +00:00
  • d052b7f864 add extra debugging info to float_pow_test_impl, clean up array_cwise tests Charles Schlosser 2024-09-24 21:08:22 +00:00
  • ba5183f98c fix warning in EigenSolver::pseudoEigenvalueMatrix() Charles Schlosser 2024-09-24 17:23:58 +00:00