Charles Schlosser | bb73be8a2e | make fixed-size objects trivially move assignable | 2024-11-04 17:55:27 +00:00 |
Antonio Sánchez | 7fd305ecae | Fix GPU builds. | 2024-11-01 04:50:03 +00:00 |
Morris Hafner | c8267654f2 | Don't use __builtin_alloca_with_align with nvc++ | 2024-10-30 18:02:08 +00:00 |
Tyler Veness | 84c446df2c | Fix macro redefinition warning in FFTW test | 2024-10-30 17:18:42 +00:00 |
Antonio Sánchez | a9584d8e3c | Fix clang6 failures. | 2024-10-30 14:41:50 +00:00 |
Antonio Sánchez | dd4c2805d9 | Fix clang6 failures. | 2024-10-29 22:18:30 +00:00 |
Antonio Sánchez | 9e962d9c54 | Fix OOB access in triangular matrix multiplication. | 2024-10-29 19:07:07 +00:00 |
Antonio Sánchez | 695e49d1bd | Fix NVCC builds for CUDA 10+. | 2024-10-29 18:38:14 +00:00 |
Antonio Sánchez | dae09773fc | Don't pass matrices by value. | 2024-10-29 18:19:02 +00:00 |
Rasmus Munk Larsen | c23ec3420e | Add tests for sizeof() with one dynamic dimension. | 2024-10-28 13:48:53 -07:00 |
Rasmus Munk Larsen | 58b252e5b3 | Fix typo in PacketMath.h | 2024-10-28 18:19:52 +00:00 |
Rasmus Munk Larsen | 6c04d0cd68 | Add missing exp2 definition for Altivec. | 2024-10-28 18:12:36 +00:00 |
Peter Gavin | b15ebb1c2d | add nextafter for bfloat16 | 2024-10-26 00:08:25 +00:00 |
Rasmus Munk Larsen | 53b83cddf9 | Include <type_traits> in main.h for std::is_trivial* | 2024-10-25 20:55:51 +00:00 |
Charles Schlosser | 37563856c9 | Fix stack allocation assert | 2024-10-25 17:02:43 +00:00 |
Rasmus Munk Larsen | 3f067c4850 | Add exp2() as a packet op and array method. | 2024-10-22 22:09:34 +00:00 |
Charles Schlosser | 4e5136d239 | make fixed size matrices and arrays trivially_default_constructible | 2024-10-21 17:10:15 +00:00 |
Antonio Sánchez | b396a6fbb2 | Add free-function swap. | 2024-10-14 15:51:40 +00:00 |
Charles Schlosser | 820e8a45fb | add compile time info to reverse in place | 2024-10-13 17:55:56 +00:00 |
Charles Schlosser | b55dab7f21 | Fix DenseBase::tail for Dynamic template argument | 2024-10-12 21:03:30 +00:00 |
Charles Schlosser | e0cbc55d92 | Update README.md | 2024-10-10 01:54:30 +00:00 |
Rasmus Munk Larsen | 7eea0a9213 | Vectorize erfc() for float | 2024-10-09 18:38:05 +00:00 |
Rasmus Munk Larsen | 78f3c654ee | Don't use constexpr with half. | 2024-10-08 16:44:40 +00:00 |
Antonio Sánchez | 6d7af238fa | Adjust array_cwise for 32-bit arm. | 2024-10-07 23:15:24 +00:00 |
Rasmus Munk Larsen | 74dcfbbd0f | Use ppolevl for polynomial evaluation in more places. | 2024-10-07 13:27:28 -07:00 |
Rasmus Munk Larsen | a097f728fe | Avoid producing erf(x) = NaN for large |x|. | 2024-10-04 12:15:23 -07:00 |
Rasmus Munk Larsen | 44b16f48cb | Improve speed and accuracy or erf() | 2024-10-03 01:52:16 +00:00 |
Antonio Sánchez | 12068cbcdb | Fix inverse evaluator for running on CUDA device. | 2024-10-01 20:59:54 +00:00 |
Rasmus Munk Larsen | 4e8e5e7409 | Add max_digits10 in NumTraits for mpreal types. | 2024-10-01 18:57:17 +00:00 |
Rasmus Munk Larsen | 8e8c319087 | Add missing EIGEN_DEVICE_FUNC annotations. | 2024-10-01 11:40:58 -07:00 |
Charles Schlosser | 7ad7c1d5c5 | fix implicit conversion warning (again) | 2024-09-24 22:07:00 +00:00 |
Charles Schlosser | d052b7f864 | add extra debugging info to float_pow_test_impl, clean up array_cwise tests | 2024-09-24 21:08:22 +00:00 |
Charles Schlosser | ba5183f98c | fix warning in EigenSolver::pseudoEigenvalueMatrix() | 2024-09-24 17:23:58 +00:00 |
Charles Schlosser | 3ffb4e50df | fix implicit conversion in TensorChipping | 2024-09-24 16:58:49 +00:00 |
Sean McBride | b6b8b54e5e | Fixed issue #2858: removed unneeded call to _mm_setzero_si128 | 2024-09-24 16:29:45 +00:00 |
Frédéric BRIOL | 2a3465102a | Refactor code to use constexpr for data() functions. | 2024-09-23 16:43:53 +00:00 |
Charles Schlosser | 2d4c9b400c | make fixed size matrices and arrays trivially_copy_constructible and trivially_move_constructible | 2024-09-17 17:43:36 +00:00 |
Antonio Sánchez | 132f281f50 | Fix generic ceil for SSE2. | 2024-09-14 01:31:21 +00:00 |
Charles Schlosser | 84282c42fc | optimize new dot product | 2024-09-11 21:40:43 +00:00 |
Charles Schlosser | fb477b8be1 | Better dot products | 2024-09-10 21:02:31 +00:00 |
Sophie Chang | 134b526d61 | Update NonBlockingThreadPool.h plain asserts to use eigen_plain_assert | 2024-09-10 00:18:27 +00:00 |
qile lin | 072ec9d954 | Fix a bug for pcmp_lt_or_nan and Add sqrt support for SVE | 2024-09-04 21:45:39 +00:00 |
Rasmus Munk Larsen | 9315389795 | Fix bug in bug fix for atanh. | 2024-09-04 09:37:59 -07:00 |
Rasmus Munk Larsen | f33af052e0 | Fix bug for atanh(-1). | 2024-09-03 20:54:01 +00:00 |
Rasmus Munk Larsen | 66927f7807 | Fix out-of-range arguments to _mm_permute_pd. | 2024-08-30 17:31:52 +00:00 |
Rasmus Munk Larsen | bbdabebf44 | Vectorize atanh<double>. Make atanh(x) standard compliant for |x| >= 1. | 2024-08-30 17:27:55 +00:00 |
Morris Hafner | 26e2c4f617 | Add nvc++ support | 2024-08-30 12:34:48 +00:00 |
Eugene Zhulenev | c59332d74a | Detect "effectively inner/outer" chipping in TensorChipping | 2024-08-29 17:49:59 +00:00 |
Charles Schlosser | 648bce6cae | SSE/AVX Complex FMA | 2024-08-29 17:37:57 +00:00 |
Charles Schlosser | c21a80be3d | BDCSVD: Suppress Wmaybe-uninitialized | 2024-08-29 02:45:38 +00:00 |