Commit Graph

  • 3ffb4e50df fix implicit conversion in TensorChipping Charles Schlosser 2024-09-24 16:58:49 +00:00
  • b6b8b54e5e Fixed issue #2858: removed unneeded call to _mm_setzero_si128 Sean McBride 2024-09-24 16:29:45 +00:00
  • 2a3465102a Refactor code to use constexpr for data() functions. Frédéric BRIOL 2024-09-23 16:43:53 +00:00
  • 2d4c9b400c make fixed size matrices and arrays trivially_copy_constructible and trivially_move_constructible Charles Schlosser 2024-09-17 17:43:36 +00:00
  • 132f281f50 Fix generic ceil for SSE2. Antonio Sánchez 2024-09-14 01:31:21 +00:00
  • 84282c42fc optimize new dot product Charles Schlosser 2024-09-11 21:40:43 +00:00
  • fb477b8be1 Better dot products Charles Schlosser 2024-09-10 21:02:31 +00:00
  • 134b526d61 Update NonBlockingThreadPool.h plain asserts to use eigen_plain_assert Sophie Chang 2024-09-10 00:18:27 +00:00
  • 072ec9d954 Fix a bug for pcmp_lt_or_nan and Add sqrt support for SVE qile lin 2024-09-04 21:45:39 +00:00
  • 9315389795 Fix bug in bug fix for atanh. Rasmus Munk Larsen 2024-09-04 09:37:59 -07:00
  • f33af052e0 Fix bug for atanh(-1). Rasmus Munk Larsen 2024-09-03 20:54:01 +00:00
  • 66927f7807 Fix out-of-range arguments to _mm_permute_pd. Rasmus Munk Larsen 2024-08-30 10:29:25 -07:00
  • bbdabebf44 Vectorize atanh<double>. Make atanh(x) standard compliant for |x| >= 1. Rasmus Munk Larsen 2024-08-30 17:27:55 +00:00
  • 26e2c4f617 Add nvc++ support Morris Hafner 2024-08-30 12:34:48 +00:00
  • c59332d74a Detect "effectively inner/outer" chipping in TensorChipping Eugene Zhulenev 2024-08-29 17:49:59 +00:00
  • 648bce6cae SSE/AVX Complex FMA Charles Schlosser 2024-08-29 17:37:57 +00:00
  • c21a80be3d BDCSVD: Suppress Wmaybe-uninitialized Charles Schlosser 2024-08-29 02:45:38 +00:00
  • 9d3d37c5b7 Complex Numtraits::HasSign and nmsub test Charles Schlosser 2024-08-28 03:02:47 +00:00
  • c5189ac656 Fix GeneralizedEigenSolver::eigenvectors() not appearing in documentation Valentin Sarthou 2024-08-21 11:17:30 +02:00
  • 3b5a1b4157 sve instrinsics with "_x" suffix will be faster than "_z" suffix qile lin 2024-08-23 12:52:22 +00:00
  • 98f1ac5e65 Fix breakage in GPU build. Rasmus Munk Larsen 2024-08-23 06:08:37 +00:00
  • 231308f690 TensorVolumePatchOp: Suppress Wmaybe-uninitialized caused by unreachable code Charles Schlosser 2024-08-23 01:55:12 +00:00
  • 2bf8fe1489 NEON Complex Intrinsics Tobias Wood 2024-08-22 22:46:16 +00:00
  • f91f8e9ab9 Consolidate float and double implementations of patan(). Rasmus Munk Larsen 2024-08-21 20:44:18 +00:00
  • 87239e058a vectorize squaredNorm() for complex types Charles Schlosser 2024-08-21 10:54:17 +00:00
  • 32d95bb097 Add vectorized implementation of tanh<double> Rasmus Munk Larsen 2024-08-21 02:29:45 +00:00
  • cc240eea2f Speed up and improve accuracy of tanh. Rasmus Munk Larsen 2024-08-16 23:46:28 +00:00
  • 92e373e6f5 Speed up StableNorm for non-trivial sizes and improve consistency between aligned and unaligned inputs. Rasmus Munk Larsen 2024-08-14 21:42:04 +00:00
  • 1dbc7581ec Include <thread> for std::this_thread::yield(). Rasmus Munk Larsen 2024-08-14 17:44:14 +00:00
  • ab310943d6 Add a yield instruction in the two spinloops of the threaded matmul implementation. Rasmus Munk Larsen 2024-08-09 10:48:24 -07:00
  • 99ffad1971 A few cleanups to threaded product code and test. Rasmus Munk Larsen 2024-08-09 09:35:23 -07:00
  • 59498c96fe SSE/AVX use fmaddsub for complex products Charles Schlosser 2024-08-05 21:26:05 +00:00
  • 1dcae7cefc Revert "BDCSVD fix -Wmaybe-uninitialized" Rasmus Munk Larsen 2024-08-05 18:17:01 +00:00
  • d14b0a4e53 Remove C++23 check around has_denorm deprecation suppression Tyler Veness 2024-08-03 21:34:27 +00:00
  • 24db460503 hlog symbol lookup should not restricted to global namespace Jatin Chaudhary 2024-08-03 03:59:13 +00:00
  • 767e60e290 Fix Woverflow warnings in PacketMathFP16 Alexander Grund 2024-08-03 03:57:18 +00:00
  • 8025683226 Fix conversion of Eigen::half to _Float16 in AVX512 code Alexander Grund 2024-08-03 03:49:51 +00:00
  • ec18dd09c8 fix pi in kissfft Alexey Korepanov 2024-08-02 22:57:47 +00:00
  • 2b7b7aac57 Speed up complex * complex matrix multiplication. Rasmus Munk Larsen 2024-08-02 20:40:53 +00:00
  • b3e3b7b0ec Remove implicit this capture in lambdas Devon Loehr 2024-08-02 20:06:35 +00:00
  • e44db21092 Optimize ThreadPool spinning Eugene Zhulenev 2024-08-02 19:18:34 +00:00
  • c593e9e948 Fix typos Mike Taves 2024-08-02 00:06:24 +00:00
  • fd98cc49f1 Avoid atomic false sharing in RunQueue Eugene Zhulenev 2024-08-01 17:41:16 +00:00
  • 0b646f3f36 Update file .clang-format Charles Schlosser 2024-08-01 03:18:50 +00:00
  • 1dcb07bb2a Update file eigen_navtree_hacks.js Charles Schlosser 2024-08-01 02:51:04 +00:00
  • ddb163ffb1 Update file .clang-format Charles Schlosser 2024-08-01 00:29:36 +00:00
  • 3f06651fd6 BDCSVD fix -Wmaybe-uninitialized Charles Schlosser 2024-07-30 22:53:06 +00:00
  • 6331da95eb fixing a lot of typos Frédéric Chapoton 2024-07-30 22:15:49 +00:00
  • c29c800126 Fix formatting in README.md Alexander Hans 2024-06-14 10:59:50 +00:00
  • 33d0937c6b Add async support for 'chip' and 'extract_volume_patches' adambanas 2024-06-27 09:34:19 +02:00
  • d791d48859 Fix AVX512FP16 build failure Rasmus Munk Larsen 2024-06-18 22:34:32 +00:00
  • 2fae4d7a77 Revert "fix scalar pselect" Charles Schlosser 2024-06-15 20:02:28 +00:00
  • b430eb31e2 AVX512F double->int64_t cast Charles Schlosser 2024-06-15 17:45:02 +00:00
  • 02bcf9b591 fix scalar pselect Charles Schlosser 2024-06-10 17:30:22 +00:00
  • 392b95bdf1 allow pointer_based_stl_iterator to conform to the contiguous_iterator concept if we are in c++20 Louis David 2024-06-06 21:38:09 +00:00
  • 27f8176254 fixing warning C5054: operator '==': deprecated between enumerations of different types Victor Ceballos 2024-06-04 16:44:13 +03:00
  • eac6355df2 Fix warnings created by other warnings fix Charles Schlosser 2024-06-01 03:37:04 +00:00
  • 7029a2e971 Vectorize allFinite() Rasmus Munk Larsen 2024-06-01 03:24:26 +00:00
  • e605227030 Fix warnings Charles Schlosser 2024-05-31 14:33:37 +00:00
  • 38b9cc263b Fix warnings about repeated deinitions of macros. Rasmus Munk Larsen 2024-05-29 13:38:00 -07:00
  • f02f89bf2c Don't redefine EIGEN_DEFAULT_IO_FORMAT in main.h. Rasmus Munk Larsen 2024-05-29 18:14:32 +00:00
  • 9148c47d67 Vectorize isfinite and isinf. Rasmus Munk Larsen 2024-05-29 00:20:12 +00:00
  • 5a9f66fb35 Fix Thread tests Tobias Wood 2024-05-24 16:50:14 +00:00
  • c4d84dfddc Fix compilation failures on constexpr matrices with GCC 14 Tyler Veness 2024-05-22 12:29:01 +00:00
  • 99adca8b34 Incorporate Threadpool in Eigen Core Charles Schlosser 2024-05-20 23:42:51 +00:00
  • d165c7377f Format EIGEN_STATIC_ASSERT() as a statement macro Tyler Veness 2024-05-20 23:02:42 +00:00
  • f78dfe36b0 use built in alloca with align if available Charles Schlosser 2024-05-19 19:32:49 +00:00
  • b9b1c8661e Suppress C++23 deprecation warnings for std::has_denorm and std::has_denorm_loss Tyler Veness 2024-05-13 12:46:15 -07:00
  • 3d2e738f29 fix performance-no-int-to-ptr Charlie Schlosser 2024-05-16 23:25:42 -04:00
  • de8013fa67 Fix ubsan failure in array_for_matrix Antonio Sánchez 2024-05-16 18:47:36 +00:00
  • 5e4f3475b5 Remove call to deprecated method initParallel() in SparseDenseProduct.h Rasmus Munk Larsen 2024-05-15 23:12:32 +00:00
  • 59cf0df1d6 SparseMatrix::insert add checks for valid indices Charles Schlosser 2024-05-15 16:14:32 +00:00
  • c0fe6ce223 Fixed a clerical error at documentation of class Matrix. Anabasis 2024-05-13 02:51:40 +00:00
  • afb17288cb Fix gcc6 compile error. Antonio Sánchez 2024-05-10 19:13:21 +00:00
  • 4d1d14e069 Change predux on PowerPC for Packet4i to NOT saturate the sum of the elements (like other architectures). Chip Kerchner 2024-05-08 22:39:27 +00:00
  • ff174f7926 fix issue: cmake package does not set include path correctly daizhirui 2024-05-07 21:21:08 +00:00
  • e16d70bd4e Fix FFT when destination does not have unit stride. Antonio Sánchez 2024-05-07 17:18:29 +00:00
  • 99c18bce6e Msvc muluh Charles Schlosser 2024-05-07 16:30:58 +00:00
  • 8e47971789 Bit shifting functions Charles Schlosser 2024-05-03 18:55:02 +00:00
  • 9700fc847a Reorganize CMake and minimize configuration for non-top-level builds. Antonio Sánchez 2024-05-01 17:42:53 +00:00
  • c1d637433e Judge unitary-ness relative to scaling. Antonio Sánchez 2024-04-30 22:28:46 +00:00
  • 9000b37677 Fix new generic nearest integer ops on GPU. Rasmus Munk Larsen 2024-04-30 22:18:25 +00:00
  • 0ee5c90aa9 Eigen transpose product Charles Schlosser 2024-04-30 13:32:52 +00:00
  • fb95e90f7f Add truncation op Charles Schlosser 2024-04-29 23:45:49 +00:00
  • d5524fc57b Remove unnecessary semicolons. Jonathan Freed 2024-04-29 21:31:26 +00:00
  • ae5280aa8d Fix more hard-coded magic bounds. Antonio Sánchez 2024-04-29 21:21:11 +00:00
  • a5e147305b Fix undefined behavior for generating inputs to the predux_mul test. Antonio Sánchez 2024-04-29 20:32:09 +00:00
  • dcceb9afec Unbork avx512 preduce_mul on MSVC. Antonio Sánchez 2024-04-26 15:28:03 +00:00
  • 42aa3d17cd Slightly adjust error bound for nonlinear tests. Antonio Sánchez 2024-04-25 18:04:49 +00:00
  • 1c8c734c8b Fix sin/cos on PPC. Antonio Sanchez 2024-04-24 15:57:44 -07:00
  • 34967b0b5b Revert "fix transposed matrix product bug" Charles Schlosser 2024-04-23 14:07:11 +00:00
  • 9cec679ef1 Don't let the PPC runner try to cross-compile. Antonio Sánchez 2024-04-23 03:40:40 +00:00
  • 574bc8820d fix transposed matrix product bug Charles Schlosser 2024-04-23 03:25:57 +00:00
  • 112ad8b846 Revert part of !1583, which may cause underflow on ARM. Rasmus Munk Larsen 2024-04-22 21:14:38 +00:00
  • 8cafbc4736 Fix unused variable warnings in TensorIO Charles Schlosser 2024-04-22 18:14:54 +00:00
  • 4de870b6eb fix autodiff enum comparison warnings Charles Schlosser 2024-04-22 18:14:20 +00:00
  • 2265242aa1 Update CI scripts. Antonio Sánchez 2024-04-20 01:08:19 +00:00
  • ee9d57347b Fix tridiagonalization_inplace_selector::run() when called from CUDA ahmed 2024-04-19 21:06:59 +00:00
  • 1550c99541 Eigen select Charles Schlosser 2024-04-19 17:52:34 +00:00
  • 5635d37f46 more pblend optimizations Charles Schlosser 2024-04-19 02:02:27 +00:00