eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-05-05 10:14:05 +08:00

Author	SHA1	Message	Date
Benoit Jacob	09ef7db9d9	Add partial pivoting runtime option to LU. Note: in fact, inverse() always uses partial pivoting because the algo currently used doesn't make sense with complete pivoting. No num stability issue so far even with size 200x200. If there is any problem we can of course reimplement inverse on top of LU.	2008-08-05 15:43:11 +00:00
Benoit Jacob	e741b7beca	further big perf improvement in Inverse	2008-08-04 23:47:09 +00:00
Benoit Jacob	79a0feee68	big performance improvement in inverse and LU	2008-08-04 23:34:21 +00:00
Benoit Jacob	c2f8ecf466	* LU decomposition, supporting all rectangular matrices, with full pivoting for better numerical stability. For now the only application is determinant. * New determinant unit-test. * Disable most of Swap.h for now as it makes LU fail (mysterious). Anyway Swap needs a big overhaul as proposed on IRC. * Remnants of old class Inverse removed. * Some warnings fixed.	2008-08-04 04:45:59 +00:00
Gael Guennebaud	f81dfcf00b	fix two perf issues in product. fix positive definite test in Cholesky. remove #include <cstring> in CoreDeclaration.	2008-08-03 20:23:06 +00:00
Benoit Jacob	49ae3fca89	fix compile errors with gcc 4.3: unresolved func call to ei_cache_friendly_product, and undeclared memcpy	2008-08-03 15:44:06 +00:00
Gael Guennebaud	6d11a07e5e	Added an ei_palign function to align a packet from two others. This allows much faster code when dealing with unaligned data, as in the updated matrix-vector product functions.	2008-08-03 15:15:46 +00:00
Gael Guennebaud	55aeb1f83a	Optimizations: * faster matrix-matrix and matrix-vector products (especially for not aligned cases) * faster tridiagonalization (make it using our matrix-vector impl.) Others: * fix Flags of Map * split the test_product to two smaller ones	2008-08-01 23:44:59 +00:00
Gael Guennebaud	b32b186c14	removed the packet specializations of some functors (GCC generates better code without those "optimizations")	2008-07-31 21:03:11 +00:00
Gael Guennebaud	842c4f8bfa	Several compilation fixes for MSVC and NVCC, basically: - added explicit enum to int conversion where needed - if a function is not defined as declared and the return type is "tricky" then the type must be typedefined somewhere. A "tricky return type" can be: * a template class with a default parameter which depends on another template parameter * a nested template class, or type of a nested template class	2008-07-29 16:33:07 +00:00
Gael Guennebaud	44d95e0540	fix some internal asserts in CacheFriendlyProduct	2008-07-27 22:14:08 +00:00
Gael Guennebaud	e9e5261664	Fix a couple issues introduced in the previous commit: * removed DirectAccessBit from Part * use a template specialization in inverseProduct() to transform a Part xpr to a Flagged xpr	2008-07-26 23:05:44 +00:00
Gael Guennebaud	e77ccf2928	* Rewrite the triangular solver so that we can take advantage of our efficient matrix-vector products: => up to 6 times faster ! * Added DirectAccessBit to Part * Added an example of a cwise operator * Renamed perpendicular() => someOrthogonal() (geometry module) * Fix a weird bug in ei_constant_functor: the default copy constructor did not copy the imaginary part when the single member of the class is a complex...	2008-07-26 20:40:29 +00:00
Gael Guennebaud	2940617e6f	bugfix in some internal asserts of CacheFriendlyProduct	2008-07-26 12:26:27 +00:00
Benoit Jacob	f997a3e902	update the inverse test a little make use of static asserts in Map fix 2 warnings in CacheFriendlyProduct: unused var 'Vectorized'	2008-07-26 12:08:28 +00:00
Gael Guennebaud	b466c266a0	* Fix some complex alignment issues in the cache friendly matrix-vector products. * Minor update of the cores of the Cholesky algorithms to make them more friendly wrt to matrix-vector products => speedup x5 !	2008-07-23 17:30:00 +00:00
Gael Guennebaud	172000aaeb	Add .perpendicular() function in Geometry module (adapted from Eigen1) Documentation: * add an overview for each module. * add an example for .all() and Cwise::operator<	2008-07-22 10:54:42 +00:00
Gael Guennebaud	516db2c3b9	Fix compilation issues with icc and g++ < 4.1. Those include: - conflicts with operator * overloads - discard the use of ei_pdiv for integer (g++ handles operators on __m128* types, this is why it worked) - weird behavior of icc in fixed size Block() constructor complaining the initializers of m_blockRows and m_blockCols were missing while we are in fixed size (maybe this hides a deeper problem since this is a recent one, but icc gives only little feedback)	2008-07-21 12:40:56 +00:00
Gael Guennebaud	c10f069b6b	* Merge Extract and Part to the Part expression. Renamed "MatrixBase::extract() const" to "MatrixBase::part() const" * Renamed static functions identity, zero, ones, random with an upper case first letter: Identity, Zero, Ones and Random.	2008-07-21 00:34:46 +00:00
Gael Guennebaud	ce425d92f1	Various documentation improvements, in particular in Cholesky and Geometry module. Added doxygen groups for Matrix typedefs and the Geometry module	2008-07-20 15:18:54 +00:00
Gael Guennebaud	269f683902	Add cholesky's members to MatrixBase Various documentation improvements including new snippets (AngleAxis and Cholesky)	2008-07-19 22:59:05 +00:00
Gael Guennebaud	6e2c53e056	Added an automatically generated list of selected examples in the documentation. Added the custom gemetry_module tag, and use it.	2008-07-19 20:36:41 +00:00
Gael Guennebaud	05ad083467	Added MatrixBase::Unit() static function to easily create unit/basis vectors. Removed EulerAngles, added typedefs for Quaternion and AngleAxis, and added automatic conversions from Quaternion/AngleAxis to Matrix3 such that: Matrix3f m = AngleAxisf(0.2,Vector3f::UnitX) * AngleAxisf(0.2,Vector3f::UnitY); just works.	2008-07-19 13:03:23 +00:00
Gael Guennebaud	7245c63067	Complete rewrite of partial reduction according to mailing list discussions.	2008-07-19 11:36:32 +00:00
Benoit Jacob	8b4945a5a2	add some static asserts, use them, fix gcc 4.3 warning in Product.h.	2008-07-19 00:25:41 +00:00
Gael Guennebaud	22a816ade8	* Fix a couple of issues related to the recent cache friendly products * Improve the efficiency of matrixvector in unaligned cases Trivial fixes in the destructors of MatrixStorage * Removed the matrixNorm in test/product.cpp (twice faster and that assumed the matrix product was ok while checking that !!)	2008-07-19 00:09:01 +00:00
Benoit Jacob	62ec1dd616	* big rework of Inverse.h: - remove all invertibility checking, will be redundant with LU - general case: adapt to matrix storage order for better perf - size 4 case: handle corner cases without falling back to gen case. - rationalize with selectors instead of compile time if - add C-style computeInverse() * update inverse test. * in snippets, default cout precision to 3 decimal places * add some cmake module from kdelibs to support btl with cmake 2.4	2008-07-15 23:56:17 +00:00
Gael Guennebaud	b970a9c8aa	trivial fix in EulerAngles constructor	2008-07-15 22:42:55 +00:00
Gael Guennebaud	99a625243f	Optimization: added super efficient rowmajor * vector product (and vector * colmajor). It basically performs 4 dot products at once reducing loads of the vector and improving instructions scheduling. With 3 cache friendly algorithms, we now handle all product configurations with outstanding perf for large matrices.	2008-07-13 01:22:54 +00:00
Gael Guennebaud	861d18d553	* Optimization: added a specialization of Block for xpr with DirectAccessBit * some simplifications and fixes in cache friendly products	2008-07-12 22:59:34 +00:00
Gael Guennebaud	b7bd1b3446	Add a very efficient evaluation path for both col-major matrix * vector and vector * row-major products. Currently, it is enabled only if the matrix has DirectAccessBit flag and the product is "large enough". Added the respective unit tests in test/product.cpp.	2008-07-12 12:12:02 +00:00
Benoit Jacob	2b53fd4d53	some performance fixes in Assign.h reported by Gael. Some doc update in Cwise.	2008-07-10 16:15:55 +00:00
Gael Guennebaud	c9b046d5d5	* added optimized paths for matrix-vector and vector-matrix products (using either a cache friendly strategy or re-using dot-product vectorized implementation) * add LinearAccessBit to Transpose	2008-07-09 22:30:18 +00:00
Gael Guennebaud	5f55ab524c	* added a lazyAssign overload skipping .lazy() so that, for c = (<xpr>).lazy(), the lazyAssign overloads of <xpr> are automatically called (this also reduces assign instantiations)	2008-07-09 13:54:21 +00:00
Gael Guennebaud	783eb6da9b	I forgot that the previous commit needed minor changes outside the bench folder	2008-07-08 17:25:58 +00:00
Benoit Jacob	6f09d3a67d	- many updates after Cwise change - fix compilation in product.cpp with std::complex - fix bug in MatrixBase::operator!=	2008-07-08 07:56:01 +00:00
Benoit Jacob	f5791eeb70	the big Array/Cwise rework as discussed on the mailing list. The new API can be seen in Eigen/src/Core/Cwise.h.	2008-07-08 00:49:10 +00:00
Benoit Jacob	a9d319d44f	* do the ActualPacketAccesBit change as discussed on list * add comment in Product.h about CanVectorizeInner * fix typo in test/product.cpp	2008-07-04 12:43:55 +00:00
Gael Guennebaud	8463b7d3f4	* fix compilation issue in Product * added some tests for product and swap * overload .swap() for dynamic-sized matrix of same size	2008-07-02 16:05:33 +00:00
Gael Guennebaud	9433df83a7	* resurrected Flagged::_expression used to optimize m+=(a*b).lazy() (equivalent to the GEMM blas routine) * added a GEMM benchmark	2008-07-01 16:20:06 +00:00
Gael Guennebaud	37a50fa526	* added an in-place version of inverseProduct which might be twice faster for small fixed size matrices * added a sparse triangular solver (sparse version of inverseProduct) * various other improvements in the Sparse module	2008-06-29 21:29:12 +00:00
Gael Guennebaud	027818d739	* added innerSize / outerSize functions to MatrixBase * added complete implementation of sparse matrix product (with a little glue in Eigen/Core) * added an exhaustive bench of sparse products including GMM++ and MTL4 => Eigen outperforms in all transposed/density configurations !	2008-06-28 23:07:14 +00:00
Benoit Jacob	55e08f7102	fix breakage from my last commit	2008-06-28 17:15:16 +00:00
Benoit Jacob	844f69e4a9	* update CMakeLists, only build instantiations if TEST_LIB is defined * allow default Matrix constructor in dynamic size, defaulting to (1, 1), this is convenient in mandelbrot example.	2008-06-27 10:53:30 +00:00
Benoit Jacob	6de4871c8c	fix a couple of issues in the new Map.h	2008-06-27 01:42:44 +00:00
Benoit Jacob	e27b2b95cf	* rework Map, allow vectorization * rework PacketMath and DummyPacketMath, make these actual template specializations instead of just overriding by non-template inline functions * introduce ei_ploadt and ei_pstoret, make use of them in Map and Matrix * remove Matrix::map() methods, use Map constructors instead.	2008-06-27 01:22:35 +00:00
Gael Guennebaud	e5d301dc96	various work on the Sparse module: * added some glue to Eigen/Core (SparseBit, ei_eval, Matrix) * add two new sparse matrix types: HashMatrix: based on std::map (for random writes) LinkedVectorMatrix: array of linked vectors (for outer coherent writes, e.g. to transpose a matrix) * add a SparseSetter class to easily set/update any kind of matrices, e.g.: { SparseSetter<MatrixType,RandomAccessPattern> wrapper(mymatrix); for (...) wrapper->coeffRef(rand(),rand()) = rand(); } * automatic shallow copy for RValue * and a lot of mess ! plus: * remove the remaining ArrayBit related stuff * don't use alloca in product for very large memory allocation	2008-06-26 23:22:26 +00:00
Benoit Jacob	c5bd1703cb	change derived classes methods from "private:_method()" to "public:method()" i.e. reimplementing the generic method() from MatrixBase. improves compilation speed by 7%, reduces almost by half the call depth of trivial functions, making gcc errors and application backtraces nicer...	2008-06-26 20:08:16 +00:00
Benoit Jacob	25ba9f377c	* add bench/benchVecAdd.cpp by Gael, fix crash (ei_pload on non-aligned) * introduce packet(int), make use of it in linear vectorized paths --> completely fixes the slowdown noticed in benchVecAdd. * generalize coeff(int) to linear-access xprs * clarify the access flag bits * rework api dox in Coeffs.h and util/Constants.h * improve certain expressions's flags, allowing more vectorization * fix bug in Block: start(int) and end(int) returned dyndyn size fix bug in Block: just because the Eval type has packet access doesn't imply the block xpr should have it too.	2008-06-26 16:06:41 +00:00
Benoit Jacob	5b0da4b778	make use of ei_pmadd in dot-product: will further improve performance on architectures having a packed-mul-add assembly instruction.	2008-06-24 18:08:35 +00:00


5136 Commits