eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-20 12:01:07 +08:00

Author	SHA1	Message	Date
Steve Bronder	e7b8643d70	Revert "Revert "Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()"" This reverts commit 5f0b4a4010af4cbf6161a0d1a03a747addc44a5d.	2021-03-24 18:14:56 +00:00
David Tellenbach	5f0b4a4010	Revert "Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()" This reverts commit 6cbb3038ac48cb5fe17eba4dfbf26e3e798041f1 because it breaks clang-10 builds on x86 and aarch64 when C++11 is enabled.	2021-03-05 13:16:43 +01:00
Steve Bronder	6cbb3038ac	Adds EIGEN_CONSTEXPR and EIGEN_NOEXCEPT to rows(), cols(), innerStride(), outerStride(), and size()	2021-03-04 18:58:08 +00:00
Rasmus Munk Larsen	923ee9aba3	Fix the embarrassingly incomplete fix to the embarrassing bug in blocked transpose.	2020-04-29 17:27:36 +00:00
Rasmus Munk Larsen	a32923a439	Fix (embarrassing) bug in blocked transpose.	2020-04-29 17:02:27 +00:00
Rasmus Munk Larsen	1e41406c36	Add missing transpose in cleanup loop. Without it, we trip an assertion in debug mode.	2020-04-29 01:30:51 +00:00
Rasmus Munk Larsen	b47c777993	Block transposeInPlace() when the matrix is real and square. This yields a large speedup because we transpose in registers (or L1 if we spill), instead of one packet at a time, which in the worst case makes the code write to the same cache line PacketSize times instead of once. rmlarsen@rmlarsen4:.../eigen_bench/google3$ benchy --benchmarks=.TransposeInPlace.float.* --reference=srcfs experimental/users/rmlarsen/bench:matmul_bench 10 / 10 [====================================================================================================================================================================================================================] 100.00% 2m50s (Generated by http://go/benchy. Settings: --runs 5 --benchtime 1s --reference "srcfs" --benchmarks ".TransposeInPlace.float.*" experimental/users/rmlarsen/bench:matmul_bench) name old time/op new time/op delta BM_TransposeInPlace<float>/4 9.84ns ± 0% 6.51ns ± 0% -33.80% (p=0.008 n=5+5) BM_TransposeInPlace<float>/8 23.6ns ± 1% 17.6ns ± 0% -25.26% (p=0.016 n=5+4) BM_TransposeInPlace<float>/16 78.8ns ± 0% 60.3ns ± 0% -23.50% (p=0.029 n=4+4) BM_TransposeInPlace<float>/32 302ns ± 0% 229ns ± 0% -24.40% (p=0.008 n=5+5) BM_TransposeInPlace<float>/59 1.03µs ± 0% 0.84µs ± 1% -17.87% (p=0.016 n=5+4) BM_TransposeInPlace<float>/64 1.20µs ± 0% 0.89µs ± 1% -25.81% (p=0.008 n=5+5) BM_TransposeInPlace<float>/128 8.96µs ± 0% 3.82µs ± 2% -57.33% (p=0.008 n=5+5) BM_TransposeInPlace<float>/256 152µs ± 3% 17µs ± 2% -89.06% (p=0.008 n=5+5) BM_TransposeInPlace<float>/512 837µs ± 1% 208µs ± 0% -75.15% (p=0.008 n=5+5) BM_TransposeInPlace<float>/1k 4.28ms ± 2% 1.08ms ± 2% -74.72% (p=0.008 n=5+5)	2020-04-28 16:08:16 +00:00
Christoph Hertzberg	870e53c0f2	Bug #1788 : Fix rule-of-three violations inside the stable modules. This fixes deprecated-copy warnings when compiling with GCC>=9 Also protect some additional Base-constructors from getting called by user code code (#1587)	2019-12-19 17:30:11 +01:00
Christoph Hertzberg	4ccd1ece92	bug #1707 : Fix deprecation warnings, or disable warnings when testing deprecated functions	2019-05-10 14:57:05 +02:00
Gael Guennebaud	83309068b4	bug #1680 : improve MSVC inlining by declaring many triavial constructors and accessors as STRONG_INLINE.	2019-02-15 16:35:35 +01:00
Gael Guennebaud	562985bac4	bug #1646 : fix false aliasing detection for A.row(0) = A.col(0); This changeset completely disable the detection for vectors for which are current mechanism cannot detect any positive aliasing anyway.	2019-01-17 00:14:27 +01:00
Gael Guennebaud	502f717980	bug #1646 : disable aliasing detection for empty and 1x1 expression	2019-01-16 14:33:45 +01:00
Andrea Bocci	f7124b3e46	Extend CUDA support to matrix inversion and selfadjointeigensolver	2018-06-11 18:33:24 +02:00
Benoit Steiner	f3e9c42876	Added missing EIGEN_DEVICE_FUNC qualifiers	2017-02-28 09:46:30 -08:00
Gael Guennebaud	3ecb343dc3	Fix regression in X = (X*X.transpose())/s with X rectangular by deferring resizing of the destination after the creation of the evaluator of the source expression.	2016-10-26 22:50:41 +02:00
Gael Guennebaud	c1d900af61	bug #178 : remove additional const on nested expression, and remove several const_cast.	2016-01-28 21:43:20 +01:00
Gael Guennebaud	8b0d1eb0f7	Fix numerous doxygen shortcomings, and workaround some clang -Wdocumentation warnings	2016-01-01 21:45:06 +01:00
Gael Guennebaud	0bb12fa614	Add LU::transpose().solve() and LU::adjoint().solve() API.	2015-12-01 14:38:47 +01:00
Gael Guennebaud	1f5024332e	First part of a big refactoring of alignment control to enable the handling of arbitrarily aligned buffers. It includes: - AlignedBit flag is deprecated. Alignment is now specified by the evaluator through the 'Alignment' enum, e.g., evaluator<Xpr>::Alignment. Its value is in Bytes. - Add several enums to specify alignment: Aligned8, Aligned16, Aligned32, Aligned64, Aligned128. AlignedMax corresponds to EIGEN_MAX_ALIGN_BYTES. Such enums are used to define the above Alignment value, and as the 'Options' template parameter of Map<> and Ref<>. - The Aligned enum is now deprecated. It is now an alias for Aligned16. - Currently, traits<Matrix<>>, traits<Array<>>, traits<Ref<>>, traits<Map<>>, and traits<Block<>> also expose the Alignment enum.	2015-08-06 15:31:07 +02:00
Gael Guennebaud	846b227bb7	Get rid of class internal::nested<> (still have to updated Tensor module)	2015-06-19 17:56:39 +02:00
Gael Guennebaud	6fc5438205	Remove a few deprecated internal expressions	2015-06-19 17:06:12 +02:00
Gael Guennebaud	4aba24a1b2	Clean argument names of some functions	2015-06-09 13:32:12 +02:00
Gael Guennebaud	cc641aabb7	Remove deprecated usage of expr::Index.	2015-02-16 14:46:51 +01:00
Gael Guennebaud	fc202bab39	Index refactoring: StorageIndex must be used for storage only (and locally when it make sense). In all other cases use the global Index type.	2015-02-13 18:57:41 +01:00
Gael Guennebaud	fe51319980	Merge Index-refactoring branch with default, fix PastixSupport, remove some useless typedefs	2015-02-13 10:03:53 +01:00
Gael Guennebaud	c6eb84aabc	Enable vectorization of transposeInPlace for PacketSize x PacketSize matrices	2015-01-26 17:09:01 +01:00
Christoph Hertzberg	e8cdbedefb	bug #877 , bug #572 : Introduce a global Index typedef. Rename Sparse::Index to StorageIndex, make Dense::StorageIndex an alias to DenseIndex. Overall this commit gets rid of all Index conversion warnings.	2014-12-04 22:48:53 +01:00
Christoph Hertzberg	d3f52debc6	Make cuda_basic test compile again by adding lots of EIGEN_DEVICE_FUNC. Although the test passes now, there might still be some missing.	2014-10-13 17:18:26 +02:00
Christoph Hertzberg	36448c9e28	Make constructors explicit if they could lead to unintended implicit conversion	2014-09-23 14:28:23 +02:00
Gael Guennebaud	0ca43f7e9a	Remove deprecated code not used by evaluators	2014-09-18 15:15:27 +02:00
Gael Guennebaud	0bf5894861	workaround one more shadowing issue with MSVC	2014-09-16 18:21:39 -07:00
Gael Guennebaud	26db954776	Re-enable aliasing checks when using evaluators	2014-09-14 19:06:08 +02:00
Gael Guennebaud	3849cc65ee	Implement binaryop and transpose evaluators for sparse matrices	2014-06-23 10:40:03 +02:00
Gael Guennebaud	78bb808337	1- Introduce sub-evaluator types for unary, binary, product, and map expressions to ease specializing them. 2- Remove a lot of code which should not be there with evaluators, in particular coeff/packet methods implemented in the expressions.	2014-06-20 15:39:38 +02:00
Gael Guennebaud	74b1d79d77	merge default and evaluator branches	2014-03-12 16:24:25 +01:00
Gael Guennebaud	8dd3b716e3	Move evaluation related flags from traits to evaluator and fix evaluators of MapBase and Replicate	2014-03-12 13:34:11 +01:00
Gael Guennebaud	da6ec81282	Move CoeffReadCost mechanism to evaluators	2014-03-10 23:24:40 +01:00
Christoph Hertzberg	3e439889e0	Specify what non-resizeable objects are in transposeInPlace and adjointInPlace (cf bug #749 )	2014-02-24 13:12:10 +01:00
Gael Guennebaud	2f593ee67c	merge with main branch	2013-07-17 13:21:35 +02:00
Gael Guennebaud	84f52ad317	Remove double const qualifier	2013-07-10 23:54:53 +02:00
Jitse Niesen	419b5cff44	doc: Mention vec=vec.head(n) in aliasing page.	2013-07-02 13:35:36 +01:00
Gael Guennebaud	c21a04bcf9	fix compilation of ArrayBase::transposeInPlace	2013-06-24 13:35:13 +02:00
Gael Guennebaud	33788b97dd	Fix compilation issue with some compilers (when doing using Base::foo;, foo must be visible in the direct base class)	2013-06-18 00:48:47 +02:00
Gael Guennebaud	9cd2d14005	merge with default branch	2013-04-19 11:21:39 +02:00
Jitse Niesen	986f60127d	Guard against transposeInPlace on non-square non-resizable matrix. Inspired by question by Martin Drozdik at stackoverflow.com/q/14954983	2013-02-20 14:03:14 +00:00
Gael Guennebaud	5adcc6c7b4	Add support for NVCC5: most of the Core and part of LU are callable from CUDA code. Still a lot to do.	2013-02-07 19:06:14 +01:00
Gael Guennebaud	a1b405c92e	Add missing const in some casts	2012-08-05 10:40:46 +02:00
Benoit Jacob	69124cfca2	Automatic relicensing to MPL2 using Keirs script. Manual fixup follows.	2012-07-13 14:42:47 -04:00
Gael Guennebaud	62c504e7bf	fix most of the shadow warnings in Core/*.h	2012-06-22 16:32:45 +02:00
Jitse Niesen	3c412183b2	Get rid of include directives inside namespace blocks (bug #339 ).	2012-04-15 11:06:28 +01:00

1 2 3

147 Commits