eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-20 20:11:07 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	e5d301dc96	various work on the Sparse module: * added some glue to Eigen/Core (SparseBit, ei_eval, Matrix) * add two new sparse matrix types: HashMatrix: based on std::map (for random writes) LinkedVectorMatrix: array of linked vectors (for outer coherent writes, e.g. to transpose a matrix) * add a SparseSetter class to easily set/update any kind of matrices, e.g.: { SparseSetter<MatrixType,RandomAccessPattern> wrapper(mymatrix); for (...) wrapper->coeffRef(rand(),rand()) = rand(); } * automatic shallow copy for RValue * and a lot of mess ! plus: * remove the remaining ArrayBit related stuff * don't use alloca in product for very large memory allocation	2008-06-26 23:22:26 +00:00
Benoit Jacob	25ba9f377c	* add bench/benchVecAdd.cpp by Gael, fix crash (ei_pload on non-aligned) * introduce packet(int), make use of it in linear vectorized paths --> completely fixes the slowdown noticed in benchVecAdd. * generalize coeff(int) to linear-access xprs * clarify the access flag bits * rework api dox in Coeffs.h and util/Constants.h * improve certain expressions's flags, allowing more vectorization * fix bug in Block: start(int) and end(int) returned dyndyn size fix bug in Block: just because the Eval type has packet access doesn't imply the block xpr should have it too.	2008-06-26 16:06:41 +00:00
Benoit Jacob	3b94436d2f	* vectorize dot product, copying code from sum. * make the conj functor vectorizable: it is just identity in real case, and complex doesn't use the vectorized path anyway. * fix bug in Block: a 3x1 block in a 4x4 matrix (all fixed-size) should not be vectorizable, since in fixed-size we are assuming the size to be a multiple of packet size. (Or would you prefer Vector3d to be flagged "packetaccess" even though no packet access is possible on vectors of that type?) * rename: isOrtho for vectors ---> isOrthogonal isOrtho for matrices ---> isUnitary * add normalize() * reimplement normalized with quotient1 functor	2008-06-24 15:13:00 +00:00
Benoit Jacob	dc9206cec5	split sum away from redux and vectorize it. (could come back to redux after it has been vectorized, and could serve as a starting point for that) also make the abs2 functor vectorizable (for real types).	2008-06-23 10:32:48 +00:00
Gael Guennebaud	32c5ea388e	work on rotations in the Geometry module: - convertions are done trough constructors and operator= - added a EulerAngles class	2008-06-21 15:01:49 +00:00
Gael Guennebaud	54238961d6	* added a pseudo expression Array giving access to: - matrix-scalar addition/subtraction operators, e.g.: m.array() += 0.5; - matrix/matrix comparison operators, e.g.: if (m1.array() < m2.array()) {} * fix compilation issues with Transform and gcc < 4.1	2008-06-20 12:38:03 +00:00
Gael Guennebaud	fb4a151982	* more cleaning in Product * make Matrix2f (and similar) vectorized using linear path * fix a couple of warnings and compilation issues with ICC and gcc 3.3/3.4 (cannot get Transform compiles with gcc 3.3/3.4, see the FIXME)	2008-06-19 23:00:51 +00:00
Gael Guennebaud	82c3cea1d5	* refactoring of Product: * use ProductReturnType<>::Type to get the correct Product xpr type * Product is no longer instanciated for xpr types which are evaluated * vectorization of "a.transpose() * b" for the normal product (small and fixed-size matrix) * some cleanning * removed ArrayBase	2008-06-19 17:33:57 +00:00
Gael Guennebaud	5dbfed1902	fix two bugs dicovered by the previous commit.	2008-06-16 16:39:58 +00:00
Benoit Jacob	bb1f4e44f1	* Block: row and column expressions in the inner direction now have the Like1D flag. * Big renaming: packetCoeff ---> packet VectorizableBit ---> PacketAccessBit Like1DArrayBit ---> LinearAccessBit	2008-06-16 14:54:31 +00:00
Benoit Jacob	c905b31b42	* Big rework of Assign.h: Much better organization Fix a few bugs Add the ability to unroll only the inner loop Add an unrolled path to the Like1D vectorization. Not well tested. ** Add placeholder for sliced vectorization. Unimplemented. * Rework of corrected_flags: improve rules determining vectorizability for vectors, the storage-order is indifferent, so we tweak it to allow vectorization of row-vectors. * fix compilation in benchmark, and a warning in Transpose.	2008-06-16 10:49:44 +00:00
Gael Guennebaud	bc0c7c57ed	Added an extensible mechanism to support any kind of rotation representation in Transform via the template static class ToRotationMatrix. Added a lightweight AngleAxis class (similar to Rotation2D).	2008-06-15 17:22:41 +00:00
Gael Guennebaud	0ee6b08128	* split Product to a DiagonalProduct template specialization to optimize matrix-diag and diag-matrix products without making Product over complicated. * compilation fixes in Tridiagonalization and HessenbergDecomposition in the case of 2x2 matrices. * added an Orientation2D small class with similar interface than Quaternion (used by Transform to handle 2D and 3D orientations seamlessly) * added a couple of features in Transform.	2008-06-15 11:54:18 +00:00
Gael Guennebaud	fbbd8afe30	Started a Transform class in the Geometry module to represent homography. Fix indentation in Quaternion.h	2008-06-15 08:33:44 +00:00
Gael Guennebaud	f07f907810	Add QR and Cholesky module instantiations in the lib. To try it with the unit tests set the cmake variable TEST_LIB to ON.	2008-06-14 13:02:41 +00:00
Benoit Jacob	c90c77051f	* make the _Flags template parameter of Matrix default to the corrected flags. This ensures that unless explicitly messed up otherwise, a Matrix type is equal to its own Eval type. This seriously reduces the number of types instantiated. Measured +13% compile speed, -7% binary size. * Improve doc of Matrix template parameters.	2008-06-13 07:53:45 +00:00
Gael Guennebaud	6998037930	* move some compile time "if" to their respective unroller (assign and dot) * fix a couple of compilation issues when unrolling is disabled * reduce default unrolling limit to a more reasonable value	2008-06-07 01:07:48 +00:00
Gael Guennebaud	48262b9734	added a static assertion mechanism (see notes in Core/util/StaticAssert.h for details)	2008-06-04 11:16:11 +00:00
Gael Guennebaud	a9cf229e15	add a geometry unit test and fix a couple of typo in Quaternion.h	2008-06-03 07:32:12 +00:00
Benoit Jacob	8de4d92b70	- get the doc of the enums in MatrixBase right - get the doc of the flags in Constants right - finally give up with SEPARATE_MEMBER_PAGES: it triggers too big Doxygen bugs, and produces too many small pages. So we have one huge page for MatrixBase at currently 300kb and going up, so the solution especially for users with low bandwidth will be to provide an archive of the html documentation.	2008-06-03 02:06:18 +00:00
Gael Guennebaud	06752b2b77	* added a Tridiagonalization class for selfadjoint matrices * added MatrixBase::real() * added the ability to extract a selfadjoint matrix from the lower or upper part of a matrix, e.g.: m.extract<Upper\|SelfAdjoint>() will ignore the strict lower part and return a selfadjoint. This is compatible with ZeroDiag and UnitDiag.	2008-06-01 17:20:18 +00:00
Gael Guennebaud	64169389ed	added an optional Eigen2 dynamic library. it allows the possiblity to save some compilation time by linking to it and defining the token EIGEN_EXTERN_INSTANCIATIONS	2008-05-31 23:21:49 +00:00
Gael Guennebaud	fcf4457b78	added optimized matrix times diagonal matrix product via Diagonal flag shortcut.	2008-05-31 21:35:11 +00:00
Gael Guennebaud	310f7aa096	moved purely "array" related stuff to a new module Array. This include: - cwise Pow,Sin,Cos,Exp... - cwise Greater and other comparison operators - .any(), .all() and partial reduction - random	2008-05-31 18:11:48 +00:00
Gael Guennebaud	e2ac5d244e	Added ArrayBit to get the ability to manipulate a Matrix like a simple scalar. In particular this flag changes the behavior of operator* to a coeff wise product.	2008-05-29 22:33:07 +00:00
Benoit Jacob	486fdb26a1	many small fixes and documentation improvements, this should be alpha5.	2008-05-29 03:12:30 +00:00
Gael Guennebaud	c1559d3079	* updated the assignement operator macro so that overloads in MatrixBase work * removed product_selector and cleaned Product.h a bit * cleaned Assign.h a bit	2008-05-28 22:56:19 +00:00
Benoit Jacob	f54760c889	hehe, the complicated nesting scheme in Flagged in the previous commit was a sign that we were doing something wrong. In fact, having NestByValue as a special case of Flagged was wrong, and the previous commit, while not buggy, was inefficient because then when the resulting NestByValue xpr was nested -- hence copied -- the original xpr which was already nested by value was copied again; hence instead of 1 copy we got 3 copies. The solution was to ressuscitate the old Temporary.h (renamed NestByValue.h) as it was the right approach.	2008-05-28 05:14:16 +00:00
Benoit Jacob	aebecae510	* find the proper way of nesting the expression in Flagged: finally that's more subtle than just using ei_nested, because when flagging with NestByValueBit we want to store the expression by value already, regardless of whether it already had the NestByValueBit set. * rename temporary() ----> nestByValue() * move the old Product.h to disabled/, replace by what was ProductWIP.h * tweak -O and -g flags for tests and examples * reorder the tests -- basic things go first * simplifications, e.g. in many methoeds return derived() and count on implicit casting to the actual return type. * strip some not-really-useful stuff from the heaviest tests	2008-05-28 04:38:16 +00:00
Benoit Jacob	953efdbfe7	- introduce Part and Extract classes, splitting and extending the former Triangular class - full meta-unrolling in Part - move inverseProduct() to MatrixBase - compilation fix in ProductWIP: introduce a meta-selector to only do direct access on types that support it. - phase out the old Product, remove the WIP_DIRTY stuff. - misc renaming and fixes	2008-05-27 05:47:30 +00:00
Gael Guennebaud	94e1629a1b	* improved product performance: - fallback to normal product for small dynamic matrices - overloaded "c += (a * b).lazy()" to avoid the expensive and useless temporary and setZero() in such very common cases. * fix a couple of issues with the flags	2008-05-22 14:51:25 +00:00
Gael Guennebaud	c6789a279c	Fix compilation issues with MSVC and NVCC. Added a few typedef of complex return types in MatrixBase (Needed by MSVC)	2008-05-15 09:40:11 +00:00
Benoit Jacob	5da60897ab	Introduce generic Flagged xpr, remove already Lazy.h and Temporary.h Rename DefaultLostFlagMask --> HerediraryBits	2008-05-14 08:20:15 +00:00
Gael Guennebaud	4317fad869	* Added several cast to int of the enums (needed for some compilers) * Fix a mistake in CwiseNullary. * Added a CoreDeclarions header that declares only the forward declarations and related basic stuffs.	2008-05-12 18:09:30 +00:00
Benoit Jacob	678f18fce4	put inline keywords everywhere appropriate. So we don't need anymore to pass -finline-limit=1000 to gcc to get good performance. By the way some cleanup.	2008-05-12 17:34:46 +00:00
Gael Guennebaud	45cda6704a	* Draft of a eigenvalues solver (does not support complex and does not re-use the QR decomposition) * Rewrite the cache friendly product to have only one instance per scalar type ! This significantly speeds up compilation time and reduces executable size. The current drawback is that some trivial expressions might be evaluated like conjugate or negate. * Renamed "cache optimal" to "cache friendly" * Added the ability to directly access matrix data of some expressions via: - the stride()/_stride() methods - DirectAccessBit flag (replace ReferencableBit)	2008-05-12 10:23:09 +00:00
Benoit Jacob	3562b01105	* Give Konstantinos a copyright line * Fix compilation of Inverse.h with vectorisation * Introduce EIGEN_GNUC_AT_LEAST(x,y) macro doing future-proof (e.g. gcc v5.0) check * Only use ProductWIP if vectorisation is enabled * rename EIGEN_ALWAYS_INLINE -> EIGEN_INLINE with fall-back to inline keyword * some cleanup/indentation	2008-05-12 08:12:40 +00:00
Gael Guennebaud	bf5326c3ca	* Added ReferencableBit flag to known if coeffRef is available. (needed by the new product implementation) * Make the packet* members template to support aligned and unaligned access. This makes Block vectorizable. Combined with ReferencableBit, we should be able to determine at runtime (in some specific cases) if an aligned vectorization is possible or not. * Improved the new product implementation to robustly handle all cases, it now passes all the tests. * Renamed the packet version ei_predux to ei_preduxp to avoid name collision.	2008-05-08 08:12:52 +00:00
Gael Guennebaud	64c49de7ba	* split PacketMath.h to SSE and Altivec specific files * improved the flexibility of the new product implementation, now all sizes seems to be properly handled.	2008-05-05 17:19:47 +00:00
Gael Guennebaud	46fa4c713f	* Started support for unaligned vectorization. * Introduce a new highly optimized matrix-matrix product for large matrices. The code is still highly experimental and it is activated only if you define EIGEN_WIP_PRODUCT at compile time. Currently the third dimension of the product must be a factor of the packet size (x4 for floats) and the right handed side matrix must be column major. Moreover, currently c = ab; actually computes c += ab !! Therefore, the code is provided for experimentation purpose only ! These limitations will be fixed soon or later to become the default product implementation.	2008-05-05 10:23:29 +00:00
Gael Guennebaud	4c92150676	Added Triangular expression to extract upper or lower (strictly or not) part of a matrix. Triangular also provide an optimised method for forward and backward substitution. Further optimizations regarding assignments and products might come later. Updated determinant() to take into account triangular matrices. Started the QR module with a QR decompostion algorithm. Help needed to build a QR algorithm (eigen solver) based on it.	2008-04-26 18:26:05 +00:00
Gael Guennebaud	a451835bce	Make the explicit vectorization much more flexible: - support dynamic sizes - support arbitrary matrix size when the matrix can be seen as a 1D array (except for fixed size matrices where the size in Bytes must be a factor of 16, this is to allow compact storage of a vector of matrices) Note that the explict vectorization is still experimental and far to be completely tested.	2008-04-25 15:46:18 +00:00
Gael Guennebaud	9385793f71	Fix a couple of issue with the vectorization. In particular, default ei_p* functions are provided to handle not suported types seemlessly. Added a generic null-ary expression with null-ary functors. They replace Zero, Ones, Identity and Random.	2008-04-24 18:35:39 +00:00
Benoit Jacob	6ae037dfb5	give up on OpenMP... for now	2008-04-18 07:57:46 +00:00
Benoit Jacob	acfd6f3bda	- add _packetCoeff() to Inverse, allowing vectorization. - let Inverse take template parameter MatrixType instead of ExpressionType, in order to reduce executable code size when taking inverses of xpr's. - introduce ei_corrected_matrix_flags : the flags template parameter to the Matrix class is only a suggestion. This is also useful in ei_eval.	2008-04-16 07:18:27 +00:00
Benoit Jacob	9789c04467	when evaluating an xpr, the result can now be vectorizable even if the xpr itself wasn't vectorizable.	2008-04-14 08:55:12 +00:00
Benoit Jacob	ea3ccb1e8c	* Start of the LU module, with matrix inversion already there and fully optimized. * Even if LargeBit is set, only parallelize for large enough objects (controlled by EIGEN_PARALLELIZATION_TRESHOLD).	2008-04-14 08:20:24 +00:00
Benoit Jacob	ab4046970b	* Add fixed-size template versions of corner(), start(), end(). * Use them to write an unrolled path in echelon.cpp, as an experiment before I do this LU module. * For floating-point types, make ei_random() use an amplitude of 1.	2008-04-12 17:37:27 +00:00
Benoit Jacob	dcebc46cdc	- cleaner use of OpenMP (no code duplication anymore) using a macro and _Pragma. - use OpenMP also in cacheOptimalProduct and in the vectorized paths as well - kill the vector assignment unroller. implement in operator= the logic for assigning a row-vector in a col-vector. - CMakeLists support for building tests/examples with -fopenmp and/or -msse2 - updates in bench/, especially replace identity() by ones() which prevents underflows from perturbing bench results.	2008-04-11 14:28:42 +00:00
Benoit Jacob	613c49b475	* add typedefs for matrices/vectors with LargeBit * add -pedantic to CXXFLAGS * cleanup intricated expressions with && and \|\| which gave warnings because of "missing" parentheses * fix compile error in NumTraits, apparently discovered by -pedantic	2008-04-10 10:33:50 +00:00

1 2 3

102 Commits