Charles Schlosser
7c6020e424
Fix -Waggressive-loop-optimizations
...
(cherry picked from commit 4e9e493b4abc57dba377fc326082b40d08193619)
2023-11-24 19:06:40 +00:00
arthurfeeney
2e3f1d8044
Fix implicit conversion warning in GEBP kernel's packing
...
(cherry picked from commit 937c3d73cbf55b6edc658decf7931b90ede1daef)
2023-11-18 18:17:21 +00:00
Silvio Traversaro
fc5575264f
Backport "disambiguate overloads for empty index list" to 3.4 branch
2023-11-10 04:03:11 +00:00
Antonio Sanchez
bae907b8f6
Update version to 3.4.1
...
Tests all pass: https://gitlab.com/libeigen/eigen_ci_cross_testing/-/pipelines/1060764169
2023-11-06 13:53:54 -08:00
Charles Schlosser
cf207eacd5
Patch SparseLU
...
(cherry picked from commit a8bab0d8ae0601392287526840ddcb1af3bb85a6)
2023-11-02 21:17:17 -07:00
Chip Kerchner
e734787bb7
Fix pre-POWER8_VECTOR bugs in pcmp_lt and pnegate and reactivate psqrt.
...
(cherry picked from commit 4a58f30aa083d2f5753465a36730ca7734b483be)
2023-10-25 15:19:37 -07:00
Antonio Sanchez
1217390db4
Fix windows+CUDA builds
2023-10-25 20:55:59 +00:00
Antonio Sanchez
7176ae1623
Make 3.4.1 compatible with c++03
2023-10-16 15:38:25 -07:00
Antonio Sánchez
0db5928f00
Eliminate use of _res.
...
(cherry picked from commit 5bdf58b8df31b0377bc5f57ba63d8479793b7bae)
2023-10-16 13:38:17 -07:00
Erik Schultheis
764b132a79
ensure that eigen::internal::size is not found by ADL, rename to ssize and...
...
(cherry picked from commit 9210e71fb378a0f1542272506dc2759b6c147237)
2023-08-24 12:42:34 -07:00
Fabian Keßler
d0bfdc1658
optimize cmake scripts for subproject use
...
(cherry picked from commit 19cacd3ecb9dab73c2dd7bc39d9193e06ba92bdd)
2023-07-26 12:01:28 -07:00
Antonio Sánchez
75ebef26b6
Adds new CMake Options for controlling build components.
...
(cherry picked from commit cf82186416d04ea5df2a397d8fe09dc78d40ca65)
2023-07-26 11:52:47 -07:00
Charles Schlosser
208e44c979
fix warnings in tensorreduction and memory
2023-07-19 16:48:07 +00:00
Antonio Sánchez
17d57fb168
Fix up PowerPC MMA flags so it builds by default.
...
(cherry picked from commit 591906477bc8c8102dbefceefe10d81648865394)
2023-07-11 16:27:32 -07:00
Antonio Sánchez
6973687c70
Fix up PowerPC MMA flags so it builds by default.
...
(cherry picked from commit 65eeedf9646ee6efc457cc3a8f8d9030a6f83689)
2023-07-11 16:20:57 -07:00
Antonio Sanchez
ac561cd038
Reduce tensor_contract_gpu test.
...
The original test times out after 60 minutes on Windows, even when
setting flags to optimize for speed. Reducing the number of
contractions performed from 3600->27 for subtests 8,9 allow the
two to run in just over a minute each.
(cherry picked from commit be9e7d205f38e3e8effdfdded88817b371673930)
2023-07-11 11:27:31 -07:00
Antonio Sanchez
554982beef
Disable Tree reduction for GPU.
...
For moderately sized inputs, running the Tree reduction quickly
fills/overflows the GPU thread stack space, leading to memory errors.
This was happening in the `cxx11_tensor_complex_gpu` test, for example.
Disabling tree reduction on GPU fixes this.
(cherry picked from commit 24ebb37f38287d65c0e0b60c714e39faffeb5b94)
2023-07-10 16:09:30 -07:00
Antonio Sanchez
89a71f3126
Fix gpu special function tests.
...
Some checks used incorrect values, partly from copy-paste errors,
partly from the change in behaviour introduced in !398 .
Modified results to match scipy, simplified tests by updating
`VERIFY_IS_CWISE_APPROX` to work for scalars.
(cherry picked from commit 701f5d1c91c770e558c7760da14ff3365757e275)
2023-07-10 15:57:08 -07:00
Antonio Sanchez
a605d6b996
Rename EIGEN_CUDA_FLAGS to EIGEN_CUDA_CXX_FLAGS
...
Also add a missing space for clang.
(cherry picked from commit 846d34384af80b80793d32257a7f917eeece41d4)
2023-07-10 15:30:41 -07:00
Antonio Sanchez
dfcd6de20a
Clean up CUDA CMake files.
...
- Unify test/CMakeLists.txt and unsupported/test/CMakeLists.txt
- Added `EIGEN_CUDA_FLAGS` that are appended to the set of flags passed
to the cuda compiler (nvcc or clang).
The latter is to support passing custom flags (e.g. `-arch=` to nvcc,
or to disable cuda-specific warnings).
(cherry picked from commit 7b00e8b186a7679b0f46be742809a55d07d4efe8)
2023-07-10 15:30:41 -07:00
Antonio Sanchez
1ec1b16d36
Add buildtests_gpu and check_gpu to simplify GPU testing.
...
This is in preparation of adding GPU tests to the CI, allowing
us to limit building/testing of GPU-specific tests for a given
GPU-capable runner.
GPU tests are tagged with the label "gpu". The new targets
```
make buildtests_gpu
make check_gpu
```
allow building and running only the gpu tests.
(cherry picked from commit 16f9a20a6f408fe8896d9c07c91e23e8159a9d2c)
2023-07-10 15:30:41 -07:00
Antonio Sánchez
0f39c851a5
Fix use of arg function in CUDA.
...
(cherry picked from commit 63dcb429cd91f7e1cdfaffb894bb368502ee0c38)
2023-07-10 15:30:41 -07:00
Kevin Leonardic
daa0b70a65
Fix argument for _mm256_cvtps_ph imm parameter
...
(cherry picked from commit d4b05454a7b33139ce6636584550780ff15af6ed)
2023-07-10 15:30:41 -07:00
Antonio Sánchez
33ba98b641
Ensure EIGEN_HAS_ARM64_FP16_VECTOR_ARITHMETIC is always defined on arm.
...
(cherry picked from commit 31cd2ad371bf8b7486951dfcd20381427aba5546)
2023-07-10 15:30:41 -07:00
Antonio Sánchez
e6e921f0e3
Disable FP16 arithmetic for arm32.
...
(cherry picked from commit 7465b7651edfb58322557179658853243eb96372)
2023-07-10 15:30:41 -07:00
Alexander Shaposhnikov
ebfdd6bdea
Do not set EIGEN_HAS_ARM64_FP16_SCALAR_ARITHMETIC for cuda compilation
...
(cherry picked from commit 316eab8deb574d150f9cfc7f8b170156dc0cdd9f)
2023-07-10 15:30:41 -07:00
Alejandro Acosta
357bb11066
Replace usage of CudaStreamDevice with GpuStreamDevice in tensor benchmarks GPU
...
(cherry picked from commit 07e4604b1961a32bbe21841a1e97fc274b50c443)
2023-07-10 15:30:40 -07:00
Rasmus Munk Larsen
9b3d104c02
Add missing braces in Umeyama.h
...
(cherry picked from commit 1321821e86ebbfdf8ccda7f89b59f19feb023dbc)
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
af3ca50f0b
Work around compiler bug in Umeyama.h.
...
(cherry picked from commit 524c329ab23fb565eff8aa2bf36d134e97773a99)
2023-07-10 14:52:08 -07:00
Antonio Sánchez
26b8fabd80
Return NaN in ndtri for values outside valid input range.
...
(cherry picked from commit 1f79a6078fb77da47069c8aec23c4e309fb982e2)
2023-07-10 14:52:08 -07:00
Charles Schlosser
385a0b38f8
JacobiSVD: set m_nonzeroSingularValues to zero if not finite
...
(cherry picked from commit fdc749de2ac1dd6ab25298dd60ab8d594992fd07)
2023-07-10 14:52:08 -07:00
Antonio Sanchez
a4ecfd8ead
Fix boolean bitwise and warning.
...
(cherry picked from commit 70410310a40bf6f790e8f3c785e78aa2b8f4ca12)
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
f296720d7d
Make sure we return +/-1 above the clamping point for Erf().
...
(cherry picked from commit b378014fef017a829fb42c7fad15f3764bfb8ef9)
2023-07-10 14:52:08 -07:00
Rob Conde
f04d02dbf6
exclude Eigen/Core
and Eigen/src/Core
from being ignored due to core
ignore rule
...
(cherry picked from commit 990a282fc40e9fb62a7aea1ba67b5c00ed838732)
2023-07-10 14:52:08 -07:00
Rohit Goswami
6f9bffe8dd
DOC: Update documentation for 3.4.x
...
(cherry picked from commit b0eded878d5d162d61583a286c0d8a45406ad1bc)
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
d4c24eca96
Don't crash on empty tensor contraction.
...
(cherry picked from commit b0f877f8e01e90a5b0f3a79d46ea234899f8b499)
2023-07-10 14:52:08 -07:00
Antonio Sánchez
72b0759451
Fix arm builds.
...
(cherry picked from commit 2c8011c2dd72d6c2086b181aad8cbb6204fed5db)
2023-07-10 14:52:08 -07:00
Jonas Schulze
34d0d83278
Fix some typos
...
(cherry picked from commit 81cb6a51d0358151a7174481e140074401ca4afa)
2023-07-10 14:52:08 -07:00
Antonio Sánchez
63e8b31c94
Fix parsing of command-line arguments when already specified as a cmake list.
...
(cherry picked from commit 555cec17edc2cae91b3310ef8915acbeb3951cab)
2023-07-10 14:52:08 -07:00
Antonio Sánchez
99473f255b
Fix failing MSVC tests due to compiler bugs.
...
(cherry picked from commit 394aabb0a3976d95a5c6f286d49e43bb49558cc2)
2023-07-10 14:52:08 -07:00
Antonio Sánchez
2ce5dc428f
Guard use of long double on GPU device.
...
(cherry picked from commit bc5cdc7a675621ce54c4a9277d041d9dd7aae757)
2023-07-10 14:52:08 -07:00
Chip Kerchner
8f1b6198c2
Fix epsilon and dummy_precision values in long double for double doubles. Prevented some algorithms from converging on PPC.
...
(cherry picked from commit 54459214a1b9c67df04bc529474fca1ec9f4c84f)
2023-07-10 14:52:08 -07:00
Antonio Sánchez
dae8c6d7ad
Guard complex sqrt on old MSVC compilers.
...
(cherry picked from commit a16fb889dd5890b2a0788af10568f19155e6b262)
2023-07-10 14:52:07 -07:00
Antonio Sánchez
2dfdaa2abf
More NEON packetmath fixes.
...
(cherry picked from commit 384269937f707669fb1ab65bee7e9bfca2c2dfa1)
2023-07-10 14:52:03 -07:00
Antonio Sánchez
a659b5dbb2
Fix NEON make_packet2f.
...
(cherry picked from commit 2dfbf1b251e7a32c140f36fc865b154b8a725bdd)
2023-07-10 14:34:09 -07:00
Antonio Sánchez
879854382c
Fix MSVC arm build.
...
(cherry picked from commit 0a5392d6061134a4a32d0025fa154f830b83d606)
2023-07-10 14:34:09 -07:00
Jeremy Nimmer
90dce8dfa3
Fix undefined behavior in Block access
...
(cherry picked from commit a1cdcdb038cda474aefb900171222254599e9dd8)
2023-07-10 14:34:09 -07:00
Martin Burchell
b26ada1e03
Fix error: unused parameter 'tmp' [-Werror,-Wunused-parameter] on clang/32-bit arm
...
(cherry picked from commit c54785b071e6297c062883cf43f323525ff0e6fb)
2023-07-10 14:34:09 -07:00
Antonio Sánchez
f5593b4baa
Fix reshape strides when input has non-zero inner stride.
...
(cherry picked from commit 2260e11eb019161cc861ef2b832ce3b8a92efecd)
2023-07-10 14:34:09 -07:00
Alexandre Hoffmann
3eb0c8b69e
Changing BiCGSTAB parameters initialization so that it works with custom types
...
(cherry picked from commit 23524ab6fcbf557b4ae1885d25d867727330c0c2)
2023-07-10 14:34:09 -07:00