eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-09-18 12:23:13 +08:00

Author	SHA1	Message	Date
Benoit Jacob	11fa2ae2c6	temporarily disable linear traversal. Actually I don't think it's buggy. But it probably triggers existing bugs, I suspect that some xprs have LinearAccessBit and shouldn't have it. Also this fixes the "bugs" with JacobiSVD ---> now it works again	2009-11-18 16:31:14 -05:00
Benoit Jacob	94c706d04f	Assign.h: add LinearTraversal (non-vectorized index-based traversal) Rename some constants to make names match more closely what they mean.	2009-11-18 11:57:07 -05:00
Gael Guennebaud	aa0974286f	fix compilation adding a makeconst helper struct	2009-11-07 09:07:23 +01:00
Gael Guennebaud	65fe5f76fd	rename back MayAliasBit to EvalBeforeAssigningBit	2009-08-16 00:14:05 +02:00
Gael Guennebaud	50c703f0c7	As proposed on the list: - rename EvalBeforeAssignBit to MayAliasBit - make .lazy() remove the MayAliasBit only, and mark it as deprecated - add a NoAlias pseudo expression, and MatrixBase::noalias() function Todo: - we have to decide whether += and -= assume no aliasing by default ? - once we agree on the API: update the Sparse module and the unit tests respectively.	2009-08-15 18:35:51 +02:00
Benoit Jacob	ce033ebdfe	add EIGEN_DEBUG_VAR	2009-08-11 16:12:34 -04:00
Gael Guennebaud	ea884e6f48	remove #include Bidiagonalization, and add missing ";"	2009-08-11 15:08:03 +02:00
Benoit Jacob	216ee335ac	LinearVectorization: If the destination isn't aligned, we have to do runtime checks and we don't unroll, so it's only good for large enough sizes	2009-08-09 22:19:12 +02:00
Benoit Jacob	1f1705868b	now you can #define EIGEN_DEBUG_ASSIGN, and all the values in ei_assign_traits are printed	2009-08-09 21:35:13 +02:00
Benoit Jacob	3cde9c0e35	apply Gael's idea for auto transpose in mixed fixed/dynamic case	2009-08-03 16:04:15 +02:00
Benoit Jacob	6809f7b1cd	new implementation of diagonal matrices and diagonal matrix expressions	2009-06-28 21:27:37 +02:00
Benoit Jacob	6347b1db5b	remove sentence "Eigen itself is part of the KDE project." it never made very precise sense. but now does it still make any?	2009-05-22 20:25:33 +02:00
Gael Guennebaud	67b4fab4e3	fix assertion issue in slice vectorization	2009-02-16 10:17:21 +00:00
Benoit Jacob	f6aa60bcf3	centralize those static asserts more upstream, reduces duplication and ensures they can't be bypassed (e.g. until now it was possible to bypass the static assert on sizes)	2009-01-27 15:40:05 +00:00
Benoit Jacob	4336cf3833	* add unit-tests to check allowed and forbidden mixing of different scalar types * fix issues in Product revealed by this test * in Dot.h forbid mixing of different types (at least for now, might allow real.dot(complex) in the future).	2008-12-22 19:17:44 +00:00
Gael Guennebaud	5f6fbaa0e7	* fix a vectorization issue in Product * use _mm_malloc/_mm_free on other platforms than linux or MSVC (e.g., cygwin, OSX) * replace a lot of inline keywords by EIGEN_STRONG_INLINE to compensate for poor MSVC inlining	2008-12-19 15:38:39 +00:00
Benoit Jacob	89f468671d	* replace postfix ++ by prefix ++ wherever that makes sense in Eigen/ * fix some "unused variable" warnings in the tests; there remains a libstdc++ "deprecated" warning which I haven't looked much into	2008-12-17 14:30:01 +00:00
Gael Guennebaud	a164646c77	more warning fixes by Armin Berres	2008-12-15 12:30:04 +00:00
Benoit Jacob	c1e2156d8a	* Much better, consistent error msgs when mixing different scalar types: - in matrix-matrix product, static assert on the two scalar types to be the same. - Similarly in CwiseBinaryOp. POTENTIALLY CONTROVERSIAL: we don't allow anymore binary ops to take two different scalar types. The functors that we defined take two args of the same type anyway; also we still allow the return type to be different. Again the reason is that different scalar types are incompatible with vectorization. Better have the user realize explicitly what mixing different numeric types costs him in terms of performance. See comment in CwiseBinaryOp constructor. - This allowed to fix a little mistake in test/regression.cpp, mixing float and double - Remove redundant semicolon (;) after static asserts	2008-12-03 21:01:55 +00:00
Benoit Jacob	00f89a8f37	Update e-mail address	2008-11-24 13:40:43 +00:00
Gael Guennebaud	3bbd1b3114	Bugfix regarding alignment in Assign.h (updated map unit test to detect this bug) Anyway: LinearVectorization+CompleteUnrolling actually uses the InnerVectorization unrollers, so these two cases could be merged to a single one...	2008-09-03 14:42:36 +00:00
Gael Guennebaud	63d3ef8204	* remove debug code committed by mistake in Assign * keep going on the doc: added a short geometry tutorial	2008-08-26 23:07:33 +00:00
Gael Guennebaud	440664cd5d	temporary fix of the previous commit	2008-08-24 15:27:05 +00:00
Gael Guennebaud	7aba51ce53	* Added .all() and .any() members to PartialRedux * Bug fixes in euler angle snippet, Assign and MapBase * Started a "quick start guide" (draft state)	2008-08-20 00:58:25 +00:00
Gael Guennebaud	8a3e6b1ee2	change solveTriangularInPlace() to take a pointer as input (as discussed on IRC). extended the documentation of the triangular solver.	2008-08-12 07:49:59 +00:00
Gael Guennebaud	4fa40367e9	* Big change in Block and Map: - added a MapBase base xpr on top of which Map and the specialization of Block are implemented - MapBase forces both aligned loads (and aligned stores, see below) in expressions such as "x.block(...) += other_expr" * Significant vectorization improvement: - added an AlignedBit flag meaning the first coeff/packet is aligned, this allows to not generate extra code to deal with the first unaligned part - removed all unaligned stores when no unrolling - removed unaligned loads in Sum when the input has the DirectAccessBit flag * Some code simplification in CacheFriendly product * Some minor documentation improvements	2008-08-09 18:41:24 +00:00
Benoit Jacob	c94be35bc8	introduce copyCoeff and copyPacket methods in MatrixBase, used by Assign, in preparation for new Swap impl reusing Assign code. remove last remnant of old Inverse class in Transform.	2008-08-05 18:00:23 +00:00
Gael Guennebaud	842c4f8bfa	Several compilation fixes for MSVC and NVCC, basically: - added explicit enum to int conversion where needed - if a function is not defined as declared and the return type is "tricky" then the type must be typedefined somewhere. A "tricky return type" can be: * a template class with a default parameter which depends on another template parameter * a nested template class, or type of a nested template class	2008-07-29 16:33:07 +00:00
Gael Guennebaud	b7bd1b3446	Add a very efficient evaluation path for both col-major matrix * vector and vector * row-major products. Currently, it is enabled only if the matrix has DirectAccessBit flag and the product is "large enough". Added the respective unit tests in test/product.cpp.	2008-07-12 12:12:02 +00:00
Benoit Jacob	2b53fd4d53	some performance fixes in Assign.h reported by Gael. Some doc update in Cwise.	2008-07-10 16:15:55 +00:00
Benoit Jacob	a9d319d44f	* do the ActualPacketAccesBit change as discussed on list * add comment in Product.h about CanVectorizeInner * fix typo in test/product.cpp	2008-07-04 12:43:55 +00:00
Gael Guennebaud	027818d739	* added innerSize / outerSize functions to MatrixBase * added complete implementation of sparse matrix product (with a little glue in Eigen/Core) * added an exhaustive bench of sparse products including GMM++ and MTL4 => Eigen outperforms in all transposed/density configurations !	2008-06-28 23:07:14 +00:00
Benoit Jacob	e27b2b95cf	* rework Map, allow vectorization * rework PacketMath and DummyPacketMath, make these actual template specializations instead of just overriding by non-template inline functions * introduce ei_ploadt and ei_pstoret, make use of them in Map and Matrix * remove Matrix::map() methods, use Map constructors instead.	2008-06-27 01:22:35 +00:00
Benoit Jacob	25ba9f377c	* add bench/benchVecAdd.cpp by Gael, fix crash (ei_pload on non-aligned) * introduce packet(int), make use of it in linear vectorized paths --> completely fixes the slowdown noticed in benchVecAdd. * generalize coeff(int) to linear-access xprs * clarify the access flag bits * rework api dox in Coeffs.h and util/Constants.h * improve certain expressions' flags, allowing more vectorization * fix bug in Block: start(int) and end(int) returned dyndyn size * fix bug in Block: just because the Eval type has packet access doesn't imply the block xpr should have it too.	2008-06-26 16:06:41 +00:00
Gael Guennebaud	ac9aa47bbc	optimize linear vectorization both in Assign and Sum (optimal amortized perf)	2008-06-23 15:50:28 +00:00
Benoit Jacob	dc9206cec5	split sum away from redux and vectorize it. (could come back to redux after it has been vectorized, and could serve as a starting point for that) also make the abs2 functor vectorizable (for real types).	2008-06-23 10:32:48 +00:00
Benoit Jacob	8a967fb17c	* implement slice vectorization. Because it uses unaligned packet access, it is not certain that it will bring a performance improvement: benchmarking needed. * improve logic choosing slice vectorization. * fix typo in SSE packet math, causing crash in unaligned case. * fix bug in Product, causing crash in unaligned case. * add TEST_SSE3 CMake option.	2008-06-22 15:02:05 +00:00
Gael Guennebaud	e735692e37	move "enum" back to "const int" in ei_assign_impl: in fact, casting enums to int is enough to get compile time constants with ICC.	2008-06-20 07:10:50 +00:00
Gael Guennebaud	fb4a151982	* more cleaning in Product * make Matrix2f (and similar) vectorized using linear path * fix a couple of warnings and compilation issues with ICC and gcc 3.3/3.4 (cannot get Transform to compile with gcc 3.3/3.4, see the FIXME)	2008-06-19 23:00:51 +00:00
Benoit Jacob	bb1f4e44f1	* Block: row and column expressions in the inner direction now have the Like1D flag. * Big renaming: packetCoeff ---> packet VectorizableBit ---> PacketAccessBit Like1DArrayBit ---> LinearAccessBit	2008-06-16 14:54:31 +00:00
Benoit Jacob	9857764ae7	aaargh.	2008-06-16 11:20:29 +00:00
Benoit Jacob	478bfaf228	fix bug in computation of unrolling limit: div instead of mul	2008-06-16 11:18:59 +00:00
Benoit Jacob	c905b31b42	* Big rework of Assign.h: Much better organization Fix a few bugs Add the ability to unroll only the inner loop Add an unrolled path to the Like1D vectorization. Not well tested. ** Add placeholder for sliced vectorization. Unimplemented. * Rework of corrected_flags: improve rules determining vectorizability for vectors, the storage-order is indifferent, so we tweak it to allow vectorization of row-vectors. * fix compilation in benchmark, and a warning in Transpose.	2008-06-16 10:49:44 +00:00
Gael Guennebaud	6998037930	* move some compile time "if" to their respective unroller (assign and dot) * fix a couple of compilation issues when unrolling is disabled * reduce default unrolling limit to a more reasonable value	2008-06-07 01:07:48 +00:00
Gael Guennebaud	48262b9734	added a static assertion mechanism (see notes in Core/util/StaticAssert.h for details)	2008-06-04 11:16:11 +00:00
Gael Guennebaud	f5e599e489	* replace compile-time-if by meta-selector in Assign.h as it speeds up compilation. * fix minor typo introduced in the previous commit	2008-05-31 14:42:07 +00:00
Gael Guennebaud	c1559d3079	* updated the assignment operator macro so that overloads in MatrixBase work * removed product_selector and cleaned Product.h a bit * cleaned Assign.h a bit	2008-05-28 22:56:19 +00:00
Gael Guennebaud	8711e26c8a	* change Flagged to take into account NestByValue only * bugfix in Assign and cache friendly product (weird that worked before) * improved argument evaluation in Product	2008-05-28 22:11:47 +00:00
Benoit Jacob	953efdbfe7	- introduce Part and Extract classes, splitting and extending the former Triangular class - full meta-unrolling in Part - move inverseProduct() to MatrixBase - compilation fix in ProductWIP: introduce a meta-selector to only do direct access on types that support it. - phase out the old Product, remove the WIP_DIRTY stuff. - misc renaming and fixes	2008-05-27 05:47:30 +00:00
Gael Guennebaud	4317fad869	* Added several casts to int of the enums (needed for some compilers) * Fix a mistake in CwiseNullary. * Added a CoreDeclarions header that declares only the forward declarations and related basic stuff.	2008-05-12 18:09:30 +00:00

64 Commits