5029 Commits

Author
SHA1 Message Date
Antonio Sánchez
cc941d69a5 Update error about c++14 requirement. 2024-02-29 20:45:13 +00:00
Antonio Sánchez
6893287c99 Add degenerate checks before calling BLAS routines. 2024-02-29 18:56:36 +00:00
Charles Schlosser
b334910700 delete shadowed typedefs 2024-02-28 02:40:45 +00:00
Antonio Sánchez
a962a27594 Fix MSVC GPU build. 2024-02-27 23:26:06 +00:00
Antonio Sánchez
7a88cdd6ad Fix signed integer UB in random. 2024-02-24 13:16:23 +00:00
Antonio Sánchez
feaafda30a Change array_size result from enum to constexpr. 2024-02-22 22:52:25 +00:00
Rasmus Munk Larsen
db6b9db33b Make header guards in GeneralMatrixMatrix.h and Parallelizer.h consistent:... 2024-02-20 20:03:18 +00:00
Antonio Sánchez
b56e30841c Enable direct access for IndexedView. 2024-02-20 18:21:45 +00:00
Antonio Sánchez
6b365e74d6 Fix GPU build for ptanh_float. 2024-02-20 16:08:50 +00:00
Damiano Franzò
be06c9ad51 Implement float pexp_complex 2024-02-17 00:26:57 +00:00
Rasmus Munk Larsen
4d419e2209 Rename generic_fast_tanh_float to ptanh_float and move it to... 2024-02-16 21:27:22 +00:00
Antonio Sánchez
2a9055b50e Fix random for custom scalars that don't have constexpr digits(). 2024-02-16 02:30:54 +00:00
Antonio Sánchez
a24bf2e9a2 Disable float16 packet casting if native AVX512 f16 is available. 2024-02-14 20:05:00 +00:00
Alec Jacobson
7e655c9a5d Fixes 2780 2024-02-13 02:57:43 +00:00
Antonio Sánchez
9229cfa822 Fix division by zero UB in packet size logic. 2024-02-12 21:01:19 +00:00
Gautam Jha
4eac211e96 Fix C++20 error, Arithmetic between different enumeration types 2024-02-12 04:25:04 +00:00
Antonio Sánchez
7b87b21910 Fix UB in bool packetmath test. 2024-02-09 19:46:45 +00:00
Charles Schlosser
431e4a913b Fix the fuzz 2024-02-07 04:52:19 +00:00
Antonio Sánchez
3ebaab8a63 Fix PPC rand and other failures. 2024-02-05 20:07:15 +00:00
Charles Schlosser
d626762e3f improve random 2024-01-31 08:16:29 +00:00
Damiano Franzò
7fd7a3f946 Implement plog_complex 2024-01-30 19:06:05 +00:00
Antonio Sánchez
a73970a864 Fix arm32 issues. 2024-01-23 22:04:55 +00:00
Cheng Wang
2c6b61c006 Add half and quarter vector support to HVX architecture 2024-01-22 21:23:21 +00:00
Antonio Sánchez
f6e41e6433 Revert "Clean up stableNorm" 2024-01-19 20:22:47 +00:00
Nuno Gonçalves
b0f906419e add missing constexpr qualifier 2024-01-19 18:49:53 +00:00
Antonio Sanchez
b2814d53a7 Fix stableNorm when input is zero-sized. 2024-01-16 10:14:51 -08:00
Charles Schlosser
c29a410116 check pointers before freeing 2024-01-12 06:09:46 +00:00
Charles Schlosser
a1a96fafde Clean up stableNorm 2024-01-08 23:28:41 +00:00
Antonio Sánchez
a2cf99ec6f Fix GPU+clang+asan. 2024-01-04 17:29:37 +00:00
Charles Schlosser
2c4541f735 fix msvc clz 2023-12-13 03:33:49 +00:00
Antonio Sánchez
75e273afcc Add internal ctz/clz implementation. 2023-12-11 21:03:09 +00:00
Antonio Sánchez
3252ecc7a4 Fix scalar_logistic_function overflow for complex inputs. 2023-12-05 18:21:04 +00:00
Tobias Wood
f38e16c193 Apply clang-format 2023-11-29 11:12:48 +00:00
Drew Lewis
9ea520fc45 Ensure that mc is not smaller than Traits::nr 2023-11-28 22:48:53 +00:00
Antonio Sánchez
dd8c71e628 Fix typecasting for arm32 2023-11-23 00:47:50 +00:00
Charles Schlosser
283dec7f25 Update file GeneralMatrixVector.h 2023-11-21 19:50:35 +00:00
Pavel Labath
66b9f4ed5c Fix (u)int64_t->float conversion on arm 2023-11-21 16:09:12 +00:00
Charles Schlosser
d1b03fb5c9 Gemv microoptimization 2023-11-20 17:26:39 +00:00
Rasmus Munk Larsen
3cf6bb6f1c Fix a bug in commit 76e8c0455396446f8166c798da5efe879e010bdc: 2023-11-15 21:45:37 +00:00
Charles Schlosser
32165c6f0c Fix Wshorten-64-to-32 warning in gemm parallelizer 2023-11-14 13:51:27 +00:00
Rasmus Munk Larsen
b33dbb5765 Fix implicit narrowing warning in Parallelizer.h. 2023-11-13 21:30:39 +00:00
wk
f78c37f0af traits<Ref>::match: use correct strides 2023-11-11 14:10:56 +00:00
Rasmus Munk Larsen
516d08a490 Fix typo in Parallelizer.h 2023-11-10 20:29:29 +00:00
Rasmus Munk Larsen
76e8c04553 Generalize parallel GEMM implementation in Core to work with ThreadPool in addition to OpenMP. 2023-11-10 17:42:30 +00:00
Antonio Sánchez
4d54c43d6c Fix typo to allow nomalloc test to pass on AVX512. 2023-11-06 18:58:43 +00:00
Kyle Macfarlan
5de0f2f89e Fixes #2735: Component-wise cbrt 2023-10-25 03:06:13 +00:00
Antonio Sánchez
48b254a4bc Disable denorm deprecation warnings in MSVC C++23. 2023-10-23 17:56:04 +00:00
Antonio Sánchez
aa6964bf3a Work around MSVC issue in Block XprType. 2023-10-19 22:02:03 +00:00
Anatoly Borisov
877c2d1e9b fix typo in comment 2023-10-18 12:58:49 +00:00
Antonio Sánchez
0c9526912c Pass div_ceil arguments by value. 2023-10-17 18:46:19 +00:00