eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-08-05 03:30:37 +08:00

Author	SHA1	Message	Date
Antonio Sánchez	4786edba26	Fix pragma check for disabling fastmath. (cherry picked from commit c27d1abe460c32a432e1f019be17f2c0f876ccac)	2023-07-10 10:09:09 -07:00
Rasmus Munk Larsen	3fbb1c1b48	Guard GCC-specific pragmas with "#ifdef EIGEN_COMP_GNUC" (cherry picked from commit 5ceed0d57f14b0d9d62b8732f7f686b3aae56738)	2023-07-10 10:09:09 -07:00
Antonio Sánchez	28cd280726	Fix 4x4 inverse when compiling with -Ofast. (cherry picked from commit 7d6a9925cc38842359750f3e06263e20b7635436)	2023-07-10 10:09:09 -07:00
Antonio Sanchez	5b83d3c4bc	Make inverse 3x3 faster and avoid gcc bug. There seems to be a gcc 4.7 bug that incorrectly flags the current 3x3 inverse as using uninitialized memory. I'm pretty sure it's a false positive, but it's hard to trigger. The same warning does not trigger with clang or later compiler versions. In trying to find a work-around, this implementation turns out to be faster anyways for static-sized matrices. ``` name old cpu/op new cpu/op delta BM_Inverse3x3<DynamicMatrix3T<float>> 423ns ± 2% 433ns ± 3% +2.32% (p=0.000 n=98+96) BM_Inverse3x3<DynamicMatrix3T<double>> 425ns ± 2% 427ns ± 3% +0.48% (p=0.003 n=99+96) BM_Inverse3x3<StaticMatrix3T<float>> 7.10ns ± 2% 0.80ns ± 1% -88.67% (p=0.000 n=114+112) BM_Inverse3x3<StaticMatrix3T<double>> 7.45ns ± 2% 1.34ns ± 1% -82.01% (p=0.000 n=105+111) BM_AliasedInverse3x3<DynamicMatrix3T<float>> 409ns ± 3% 419ns ± 3% +2.40% (p=0.000 n=100+98) BM_AliasedInverse3x3<DynamicMatrix3T<double>> 414ns ± 3% 413ns ± 2% ~ (p=0.322 n=98+98) BM_AliasedInverse3x3<StaticMatrix3T<float>> 7.57ns ± 1% 0.80ns ± 1% -89.37% (p=0.000 n=111+114) BM_AliasedInverse3x3<StaticMatrix3T<double>> 9.09ns ± 1% 2.58ns ±41% -71.60% (p=0.000 n=113+116) ``` (cherry picked from commit 5ad8b9bfe2bf75620bc89467c5cc051fc2a597df)	2021-08-04 22:06:52 +00:00
Guoqiang QI	69ec4907da	Make a copy of input matrix when try to do the inverse in place, this fixes #2285 . (cherry picked from commit 4bcd42c271761dc5341f8e08ca7d357c3614cb01)	2021-07-08 17:07:54 +00:00
Antonio Sanchez	b6db013435	Fix inverse nullptr/asan errors for LU. For empty or single-column matrices, the current `PartialPivLU` currently dereferences a `nullptr` or accesses memory out-of-bounds. Here we adjust the checks to avoid this. (cherry picked from commit 154f00e9eacaec5667215784c7601b55024e2f61)	2021-07-01 22:57:25 +00:00
Antonio Sanchez	ee4e099aa2	Remove pset, replace with ploadu. We can't make guarantees on alignment for existing calls to `pset`, so we should default to loading unaligned. But in that case, we should just use `ploadu` directly. For loading constants, this load should hopefully get optimized away. This is causing segfaults in Google Maps. (cherry picked from commit 12e8d57108c50d8a63605c6eb0144c838c128337)	2021-06-17 17:11:08 +00:00
Rasmus Munk Larsen	1cb1ffd5b2	Use bit_cast to create -0.0 for floating point types to avoid compiler optimization changing sign with --ffast-math enabled. (cherry picked from commit fc87e2cbaa65e7e93a2c695ce5a9dc048a64a985)	2021-06-11 02:57:02 +00:00
Rasmus Munk Larsen	54425a39b2	Make vectorized compute_inverse_size4 compile with AVX. (cherry picked from commit 85a76a16ea835fcfa7d4c185a338ae2aef9a272a)	2021-04-22 17:25:25 +00:00
Christoph Hertzberg	6197ce1a35	Replace `-2147483648` by `-0.0f` or `-0.0` constants (this should fix #2189 ). Also, remove unnecessary `pgather` operations.	2021-04-07 11:25:27 +00:00
Steve Bronder	e7b8643d70	Revert "Revert "Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()"" This reverts commit 5f0b4a4010af4cbf6161a0d1a03a747addc44a5d.	2021-03-24 18:14:56 +00:00
David Tellenbach	5f0b4a4010	Revert "Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()" This reverts commit 6cbb3038ac48cb5fe17eba4dfbf26e3e798041f1 because it breaks clang-10 builds on x86 and aarch64 when C++11 is enabled.	2021-03-05 13:16:43 +01:00
Steve Bronder	6cbb3038ac	Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()	2021-03-04 18:58:08 +00:00
Antonio Sanchez	60218829b7	EOF newline added to InverseSize4. Causing build breakages due to `-Wnewline-eof -Werror` that seems to be common across Google.	2020-11-18 07:58:33 -08:00
Guoqiang QI	394f564055	Unify Inverse_SSE.h and Inverse_NEON.h into a single generic implementation using PacketMath.	2020-11-17 12:27:01 +00:00
David Tellenbach	f66f3393e3	Use reinterpret_cast instead of C-style cast in Inverse_NEON.h	2020-10-04 00:35:09 +02:00
Rasmus Munk Larsen	22c971a225	Don't cast away const in Inverse_NEON.h.	2020-10-02 15:06:34 -07:00
Rasmus Munk Larsen	068121ec02	Add missing newline at the end of Inverse_NEON.h	2020-09-29 15:32:52 +00:00
Rasmus Munk Larsen	31a6b88ff3	Disable double version of compute_inverse_size4 on Inverse_NEON.h if Packet2d is not supported.	2020-09-17 23:51:06 +00:00
Stephen Zheng	5f25bcf7d6	Add Inverse_NEON.h Implemented fast size-4 matrix inverse (mimicking Inverse_SSE.h) using NEON intrinsics. ``` Benchmark Time CPU Time Old Time New CPU Old CPU New -------------------------------------------------------------------------------------------------------- BM_float -0.1285 -0.1275 568 495 572 499 BM_double -0.2265 -0.2254 638 494 641 496 ```	2020-09-04 10:55:47 +00:00
Gael Guennebaud	115da6a1ea	Fix conversion warnings	2019-02-19 14:00:15 +01:00
Gael Guennebaud	796db94e6e	bug #1194 : implement slightly faster and SIMD friendly 4x4 determinant.	2019-02-18 16:21:27 +01:00
Gael Guennebaud	bdcb5f3304	Let's properly use Score instead of std::abs, and remove deprecated FIXME ( a /= b does a/b and not a * (1/b) as it was a long time ago...)	2019-02-11 22:56:19 +01:00
Gael Guennebaud	eb46f34a8c	Speed up 2x2 LU by a factor 2, and other small fixed sizes by about 10%. Not sure that's so critical, but this does not complexify the code base much.	2019-02-11 17:59:35 +01:00
Gael Guennebaud	ab6e6edc32	Speedup PartialPivLU for small matrices by passing compile-time sizes when available. This change set also makes a better use of Map<>+OuterStride and Ref<> yielding surprising speed up for small dynamic sizes as well. The table below reports times in micro seconds for 10 random matrices: \| ------ float --------- \| ------- double ------- \| size \| before after ratio \| before after ratio \| fixed 1 \| 0.34 0.11 2.93 \| 0.35 0.11 3.06 \| fixed 2 \| 0.81 0.24 3.38 \| 0.91 0.25 3.60 \| fixed 3 \| 1.49 0.49 3.04 \| 1.68 0.55 3.01 \| fixed 4 \| 2.31 0.70 3.28 \| 2.45 1.08 2.27 \| fixed 5 \| 3.49 1.11 3.13 \| 3.84 2.24 1.71 \| fixed 6 \| 4.76 1.64 2.88 \| 4.87 2.84 1.71 \| dyn 1 \| 0.50 0.40 1.23 \| 0.51 0.40 1.26 \| dyn 2 \| 1.08 0.85 1.27 \| 1.04 0.69 1.49 \| dyn 3 \| 1.76 1.26 1.40 \| 1.84 1.14 1.60 \| dyn 4 \| 2.57 1.75 1.46 \| 2.67 1.66 1.60 \| dyn 5 \| 3.80 2.64 1.43 \| 4.00 2.48 1.61 \| dyn 6 \| 5.06 3.43 1.47 \| 5.15 3.21 1.60 \|	2019-02-11 13:58:24 +01:00
Gael Guennebaud	8a06c699d0	bug #1669 : fix PartialPivLU/inverse with zero-sized matrices.	2019-01-29 10:27:13 +01:00
Gael Guennebaud	be05d0030d	Make FullPivLU use conjugateIf<>	2019-01-17 12:01:00 +01:00
Patrick Peltzer	15e53d5d93	PR 567: makes all dense solvers inherit SoverBase (LU,Cholesky,QR,SVD). This changeset also includes: * add HouseholderSequence::conjugateIf * define int as the StorageIndex type for all dense solvers * dedicated unit tests, including assertion checking * _check_solve_assertion(): this method can be implemented in derived solver classes to implement custom checks * CompleteOrthogonalDecompositions: add applyZOnTheLeftInPlace, fix scalar type in applyZAdjointOnTheLeftInPlace(), add missing assertions * Cholesky: add missing assertions * FullPivHouseholderQR: Corrected Scalar type in _solve_impl() * BDCSVD: Unambiguous return type for ternary operator * SVDBase: Corrected Scalar type in _solve_impl()	2019-01-17 01:17:39 +01:00
Gael Guennebaud	7f32109c11	Add conjugateIf<bool> members to DesneBase, TriangularView, SelfadjointView, and make PartialPivLU use it.	2019-01-17 11:33:43 +01:00
Gael Guennebaud	f566724023	Fix StorageIndex FIXME in dense LU solvers	2019-01-13 17:54:30 +01:00
Gael Guennebaud	37c91e1836	bug #1644 : fix warning	2018-12-11 22:07:20 +01:00
Rasmus Munk Larsen	bfc5091dd5	Cast to diagonalSize to RealScalar instead Scalar.	2018-08-09 14:46:17 -07:00
Rasmus Munk Larsen	8603d80029	Cast diagonalSize() to Scalar before multiplication. Without this, automatic differentiation in Ceres breaks because Scalar is a custom type that does not support multiplication by Index.	2018-08-09 11:09:10 -07:00
Andrea Bocci	f7124b3e46	Extend CUDA support to matrix inversion and selfadjointeigensolver	2018-06-11 18:33:24 +02:00
Gael Guennebaud	2f833b1c64	bug #1509 : fix computeInverseWithCheck for complexes	2018-04-04 15:47:46 +02:00
luz.paz	e3912f5e63	MIsc. source and comment typos Found using `codespell` and `grep` from downstream FreeCAD	2018-03-11 10:01:44 -04:00
Benoit Steiner	09ae0e6586	Adjusted the EIGEN_DEVICE_FUNC qualifiers to make sure that: * they're used consistently between the declaration and the definition of a function * we avoid calling host only methods from host device methods.	2017-03-01 11:47:47 -08:00
Gael Guennebaud	3ecb343dc3	Fix regression in X = (X*X.transpose())/s with X rectangular by deferring resizing of the destination after the creation of the evaluator of the source expression.	2016-10-26 22:50:41 +02:00
Benoit Steiner	59e9edfbf1	Removed EIGEN_DEVICE_FUNC qualifers for the lu(), fullPivLu(), partialPivLu(), and inverse() functions since they aren't ready to run on GPU	2016-09-19 14:13:20 -07:00
Benoit Steiner	c0d56a543e	Added several missing EIGEN_DEVICE_FUNC qualifiers	2016-09-14 14:06:21 -07:00
Gael Guennebaud	73c8f2f697	bug #1285 : fix regression introduced in changeset 00c29c2caef8fb0c6b1d2ba5ecdf6780c0c766d4	2016-09-13 07:58:39 +02:00
Gael Guennebaud	3cb914f332	bug #1266 : remove CUDA guards on MatrixBase::<decomposition> definitions. (those used to break old nvcc versions that we propably don't care anymore)	2016-09-06 09:55:50 +02:00
Gael Guennebaud	8c48d42530	Fix 4x4 inverse with non-linear destination	2016-08-30 23:16:38 +02:00
Gael Guennebaud	35a8e94577	bug #1167 : simplify installation of header files using cmake's install(DIRECTORY ...) command.	2016-08-29 10:59:37 +02:00
Gael Guennebaud	9c663e4ee8	Clean references to MKL in LAPACKe support.	2016-07-25 18:20:08 +02:00
Gael Guennebaud	0c06077efa	Rename MKL files	2016-07-25 18:00:47 +02:00
Gael Guennebaud	4d54e3dd33	bug #173 : remove dependency to MKL for LAPACKe backend.	2016-07-25 17:55:07 +02:00
Gael Guennebaud	7f7839c12f	Add documentation and exemples for inplace decomposition.	2016-07-04 17:18:26 +02:00
Gael Guennebaud	32a41ee659	bug #707 : add inplace decomposition through Ref<> for Cholesky, LU and QR decompositions.	2016-07-04 15:13:35 +02:00
Gael Guennebaud	66e99ab6a1	Relax mixing-type constraints for binary coefficient-wise operators: - Replace internal::scalar_product_traits<A,B> by Eigen::ScalarBinaryOpTraits<A,B,OP> - Remove the "functor_is_product_like" helper (was pretty ugly) - Currently, OP is not used, but it is available to the user for fine grained tuning - Currently, only the following operators have been generalized: ,/,+,-,=,=,/=,+=,-= - TODO: generalize all other binray operators (comparisons,pow,etc.) - TODO: handle "scalar op array" operators (currently only * is handled) - TODO: move the handling of the "void" scalar type to ScalarBinaryOpTraits	2016-06-06 15:11:41 +02:00

1 2 3 4 5 ...

296 Commits