Mehdi Goli
0623791930
[SYCL-2020] Enabling USM support for SYCL. SYCL-1.2.1 did not have support for USM.
2023-05-05 17:30:36 +00:00
Antonio Sánchez
1f79a6078f
Return NaN in ndtri for values outside valid input range.
2023-05-05 16:27:26 +00:00
Tobias Wood
94f57867fe
Thread pool
2023-05-05 16:23:34 +00:00
Charles Schlosser
e2bbf496f6
Use select ternary op in tensor select evaulator
2023-04-18 20:52:16 +00:00
Rasmus Munk Larsen
b8b8a26145
Add more missing vectorized casts for int on x86, and remove redundant unit tests
2023-03-24 16:02:00 +00:00
Colin Broderick
8f9b8e3630
Replaced all instances of internal::(U)IntPtr with std::(u)intptr_t. Remove ICC workaround.
2023-03-21 16:50:23 +00:00
Rasmus Munk Larsen
0488b708b4
Vectorize tensor.isnan() by using typed predicates.
2023-03-16 04:04:22 +00:00
Charles Schlosser
d670039309
fix tensor comparison test
2023-03-06 13:11:14 +00:00
Charles Schlosser
826627f653
vectorize comparisons and select by enabling typed comparisons
2023-02-25 20:52:11 +00:00
Mehdi Goli
c3f67063ed
[SYCL-2020]- null placeholder accessor issue in Reduction SYCL test
2023-02-22 17:44:53 +00:00
Chip Kerchner
54459214a1
Fix epsilon and dummy_precision values in long double for double doubles. Prevented some algorithms from converging on PPC.
2023-02-16 23:35:42 +00:00
Rasmus Munk Larsen
07aaa62e6f
Fix compiler warnings in tests.
2023-02-14 02:29:03 +00:00
Antonio Sánchez
17ae83a966
Fix bugs exposed by enabling GPU asserts.
2023-01-27 21:43:00 +00:00
Sean McBride
d70b4864d9
issue #2581 : review and cleanup of compiler version checks
2023-01-17 18:58:34 +00:00
Mehdi Goli
b523120687
[SYCL-2020 Support] Enabling Intel DPCPP Compiler support to Eigen
2023-01-16 07:04:08 +00:00
Antonio Sánchez
262194f12c
Fix a bunch of minor build and test issues.
2023-01-06 16:37:26 +00:00
Rasmus Munk Larsen
3c4637640b
Remove unused typedef.
2022-09-23 19:11:31 +00:00
Romain Biessy
2f7cce2dd5
[SYCL] Fix some SYCL tests
2022-08-16 17:37:54 +00:00
Julian Kent
69714ff613
Add Sparse Subset of Matrix Inverse
2022-07-28 18:04:35 +00:00
Antonio Sanchez
0e18714167
Fix clang-tidy warnings about function definitions in headers.
2022-06-24 15:10:58 +00:00
Oleg Shirokobrod
f542b0a71f
Adding an MKL adapter in FFT module.
2022-06-02 18:10:43 +00:00
Tobias Wood
a9868bd5be
Add arg() to tensor
2022-05-20 03:33:01 +00:00
Guoqiang QI
00b75375e7
Adding PocketFFT support in FFT module since kissfft has some flaw in accuracy and performance
2022-05-11 17:44:22 +00:00
Erik Schultheis
b9d2900e8f
added a missing typename and fixed a unused typedef warning
2022-03-24 12:07:18 +02:00
Essex Edwards
cd3c81c3bc
Add a NNLS solver to unsupported - issue #655
2022-03-23 20:20:44 +00:00
Erik Schultheis
421cbf0866
Replace Eigen type metaprogramming with corresponding std types and make use of alias templates
2022-03-16 16:43:40 +00:00
Antonio Sánchez
9296bb4b93
Fix edge-case in zeta for large inputs.
2022-03-08 21:21:20 +00:00
Antonio Sánchez
008ff3483a
Fix broken tensor executor test, allow tensor packets of size 1.
2022-03-07 20:30:37 +00:00
Antonio Sánchez
d819a33bf6
Remove poor non-convergence checks in NonLinearOptimization.
2022-03-02 19:31:20 +00:00
Antonio Sanchez
1c2690ed24
Adjust tolerance of matrix_power test for MSVC.
2022-03-01 23:33:05 +00:00
Antonio Sánchez
ae86a146b1
Modify test expression to avoid numerical differences ( #2402 ).
2022-02-23 16:37:03 +00:00
Romain Biessy
2dd879d4b0
[SYCL] Fix CMake for SYCL support
2022-02-22 16:53:27 +00:00
Rasmus Munk Larsen
18eab8f997
Add convenience method constexpr std::size_t size() const
to Eigen::IndexList
2022-02-12 04:23:03 +00:00
Rasmus Munk Larsen
ea2c02060c
Add reciprocal packet op and fast specializations for float with SSE, AVX, and AVX512.
2022-01-21 23:49:18 +00:00
Erik Schultheis
970640519b
Cleanup
2022-01-21 01:48:59 +00:00
Kolja Brix
8d81a2339c
Reduce usage of reserved names
2022-01-10 20:53:29 +00:00
Jens Wehner
c6fa0ca162
Idrsstabl
2021-12-06 20:00:00 +00:00
Erik Schultheis
cc11e240ac
Some further cleanup
2021-12-06 18:01:15 +00:00
Jens Wehner
f63c6dd1f9
Bicgstabl
2021-12-02 22:48:22 +00:00
Xinle Liu
7ef5f0641f
Remove macro EIGEN_GPU_TEST_C99_MATH
...
Remove macro EIGEN_GPU_TEST_C99_MATH which is used in a single test file only and always defaults to true.
2021-12-01 14:48:56 +00:00
Erik Schultheis
ec2fd0f7ed
Require recent GCC and MSCV and removed EIGEN_HAS_CXX14
and some other feature test macros
2021-12-01 00:48:34 +00:00
Erik Schultheis
4a76880351
Updated CMake
...
This patch updates the minimum required CMake version to 3.10 and removes the EIGEN_TEST_CXX11 CMake option, including corresponding logic.
2021-11-29 20:24:20 +00:00
Erik Schultheis
f33a31b823
removed EIGEN_HAS_CXX11_* and redundant EIGEN_COMP_CXXVER checks
2021-11-29 19:18:57 +00:00
David Tellenbach
08da52eb85
Remove DenseBase::nonZeros() which just calls DenseBase::size()
...
Fixes #2382 .
2021-11-27 14:31:00 +00:00
Erik Schultheis
ec4efbd696
remove EIGEN_HAS_CXX11
2021-11-24 20:08:49 +00:00
Rasmus Munk Larsen
96aeffb013
Make the new TensorIO implementation work with TensorMap with const elements.
2021-11-17 18:16:04 -08:00
cpp977
f73c95c032
Reimplemented the Tensor stream output.
2021-11-16 17:36:58 +00:00
Ben Barsdell
50df8d3d6d
Avoid integer overflow in EigenMetaKernel indexing
...
- The current implementation computes `size + total_threads`, which can
overflow and cause CUDA_ERROR_ILLEGAL_ADDRESS when size is close to
the maximum representable value.
- The num_blocks calculation can also overflow due to the implementation
of divup().
- This patch prevents these overflows and allows the kernel to work
correctly for the full representable range of tensor sizes.
- Also adds relevant tests.
2021-11-05 16:39:37 +11:00
Rasmus Munk Larsen
55e3ae02ac
Compare summation results against forward error bound.
2021-11-04 18:04:04 -07:00
Antonio Sanchez
f6c8cc0e99
Fix TensorReduction warnings and error bound for sum accuracy test.
...
The sum accuracy test currently uses the default test precision for
the given scalar type. However, scalars are generated via a normal
distribution, and given a large enough count and strong enough random
generator, the expected sum is zero. This causes the test to
periodically fail.
Here we estimate an upper-bound for the error as `sqrt(N) * prec` for
summing N values, with each having an approximate epsilon of `prec`.
Also fixed a few warnings generated by MSVC when compiling the
reduction test.
2021-10-30 14:59:00 -07:00