eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-04-30 15:54:13 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	83309068b4	bug #1680 : improve MSVC inlining by declaring many triavial constructors and accessors as STRONG_INLINE.	2019-02-15 16:35:35 +01:00
Gael Guennebaud	6ec6bf0b0d	Enable visitor on empty matrices (the visitor is left unchanged), and protect min/maxCoeff(Index,Index) on empty matrices by an assertion (+ doc & unit tests)	2019-01-15 15:21:14 +01:00
Gael Guennebaud	bfa2a81a50	Make redux_vec_unroller more flexible regarding packet-type	2018-10-09 23:30:41 +02:00
Gael Guennebaud	bac36d0996	Demangle Travseral and Unrolling in Redux	2018-09-21 23:03:45 +02:00
Gael Guennebaud	b00e48a867	Improve slice-vectorization logic for redux (significant speed-up for reduxion of blocks)	2018-09-21 13:45:56 +02:00
Gael Guennebaud	eb3d8f68bb	fix unused warning	2018-07-12 16:59:47 +02:00
Gael Guennebaud	0537123953	bug #1565 : help MSVC to generatenot too bad ASM in reductions.	2018-07-05 09:21:26 +02:00
Gael Guennebaud	d625564936	Simplify redux_evaluator using inheritance, and properly rename parameters in reducers.	2018-07-02 11:50:41 +02:00
Basil Fierz	624df50945	Adds missing EIGEN_STRONG_INLINE to support MSVC properly inlining small vector calculations When working with MSVC often small vector operations are not properly inlined. This behaviour is observed even on the most recent compiler versions.	2017-10-26 22:44:28 +02:00
Benoit Steiner	33443ec2b0	Added missing EIGEN_DEVICE_FUNC qualifiers	2017-02-28 09:50:10 -08:00
Gael Guennebaud	2e238bafb6	Big 279: enable mixing types for comparisons, min, and max.	2016-06-10 15:05:43 +02:00
Gael Guennebaud	66e99ab6a1	Relax mixing-type constraints for binary coefficient-wise operators: - Replace internal::scalar_product_traits<A,B> by Eigen::ScalarBinaryOpTraits<A,B,OP> - Remove the "functor_is_product_like" helper (was pretty ugly) - Currently, OP is not used, but it is available to the user for fine grained tuning - Currently, only the following operators have been generalized: ,/,+,-,=,=,/=,+=,-= - TODO: generalize all other binray operators (comparisons,pow,etc.) - TODO: handle "scalar op array" operators (currently only * is handled) - TODO: move the handling of the "void" scalar type to ScalarBinaryOpTraits	2016-06-06 15:11:41 +02:00
Gael Guennebaud	8b6f53222b	bug #1193 : fix lpNorm<Infinity> for empty input.	2016-06-02 15:29:59 +02:00
Gael Guennebaud	f253e19296	Disable some long to float conversion warnings	2016-05-26 17:27:14 +02:00
Christoph Hertzberg	33ca7e3c8d	bug #1207 : Add and fix logical-op warnings	2016-05-11 19:36:34 +02:00
Gael Guennebaud	bbb8854bf7	Enable half-packet in reduxions.	2016-04-13 13:02:34 +02:00
Gael Guennebaud	8531304858	Simplify cost computations based on HugeCost being smaller that unrolling limit	2015-10-28 13:39:02 +01:00
Gael Guennebaud	77ff3386b7	Refactoring of the cost model: - Dynamic is now an invalid value - introduce a HugeCost constant to be used for runtime-cost values or arbitrarily huge cost - add sanity checks for cost values: must be >=0 and not too large This change provides several benefits: - it fixes shortcoming is some cost computation where the Dynamic case was not properly handled. - it simplifies cost computation logic, and should avoid future similar shortcomings. - it allows to distinguish between different level of dynamic/huge/infinite cost - it should enable further simplifications in the computation of costs (save compilation time)	2015-10-28 11:42:14 +01:00
Gael Guennebaud	e78bc111f1	bug #1090 : fix a shortcoming in redux logic for which slice-vectorization plus unrolling might happen.	2015-10-21 20:58:33 +02:00
Gael Guennebaud	72bd05b6d8	Cleaning in Redux.h	2015-10-09 12:07:42 +02:00
Gael Guennebaud	aa768add0b	Since there is no reason for evaluators to be nested by reference, let's remove the evaluator<>::nestedType indirection.	2015-09-02 22:10:39 +02:00
Gael Guennebaud	65bfa5fce7	Allow to use arbitrary packet-types during evaluation. This is implemented by adding a PacketType template parameter to packet and writePacket members of evaluator<>.	2015-08-07 12:01:39 +02:00
Gael Guennebaud	ce57dbd937	Let unpacket_traits<> exposes the required alignment and make use of it everywhere	2015-08-07 10:44:01 +02:00
Gael Guennebaud	2afdef6a54	Generalize first_aligned to take the requested alignment as a template parameter, and add a first_default_aligned variante calling first_aligned with the requirement of the largest packet for the given scalar type.	2015-08-06 17:52:01 +02:00
Gael Guennebaud	1f5024332e	First part of a big refactoring of alignment control to enable the handling of arbitrarily aligned buffers. It includes: - AlignedBit flag is deprecated. Alignment is now specified by the evaluator through the 'Alignment' enum, e.g., evaluator<Xpr>::Alignment. Its value is in Bytes. - Add several enums to specify alignment: Aligned8, Aligned16, Aligned32, Aligned64, Aligned128. AlignedMax corresponds to EIGEN_MAX_ALIGN_BYTES. Such enums are used to define the above Alignment value, and as the 'Options' template parameter of Map<> and Ref<>. - The Aligned enum is now deprecated. It is now an alias for Aligned16. - Currently, traits<Matrix<>>, traits<Array<>>, traits<Ref<>>, traits<Map<>>, and traits<Block<>> also expose the Alignment enum.	2015-08-06 15:31:07 +02:00
Gael Guennebaud	7baa1ba03e	Remove the usage of result_of for DenseBase::redux as discussed in bug #1006	2015-06-15 22:40:18 +02:00
Gael Guennebaud	1b7e12847d	Fix some calls to result_of on binary functors as unary ones.	2015-02-19 23:30:41 +01:00
Gael Guennebaud	cc641aabb7	Remove deprecated usage of expr::Index.	2015-02-16 14:46:51 +01:00
Christoph Hertzberg	d3f52debc6	Make cuda_basic test compile again by adding lots of EIGEN_DEVICE_FUNC. Although the test passes now, there might still be some missing.	2014-10-13 17:18:26 +02:00
Christoph Hertzberg	36448c9e28	Make constructors explicit if they could lead to unintended implicit conversion	2014-09-23 14:28:23 +02:00
Gael Guennebaud	0ca43f7e9a	Remove deprecated code not used by evaluators	2014-09-18 15:15:27 +02:00
Gael Guennebaud	749b56f6af	merge with default branch	2014-09-14 17:34:54 +02:00
Gael Guennebaud	6162672dc5	Runtime alignement is not possible if AlignedOnScalar is not true (e.g., for complex<double>)	2014-09-08 10:04:26 +02:00
Gael Guennebaud	4dd55a2958	Optimize reduxions for Homogeneous	2014-08-01 17:00:20 +02:00
Gael Guennebaud	b29b81a1f4	merge with default branch	2014-06-20 15:55:44 +02:00
Gael Guennebaud	f74ed34539	Fix regressions in redux_evaluator flags and evaluator<Block> flags	2014-03-12 18:14:08 +01:00
Gael Guennebaud	5e26b7cf9d	Extend evaluation traits debuging info	2014-03-12 18:13:18 +01:00
Gael Guennebaud	8dd3b716e3	Move evaluation related flags from traits to evaluator and fix evaluators of MapBase and Replicate	2014-03-12 13:34:11 +01:00
Gael Guennebaud	da6ec81282	Move CoeffReadCost mechanism to evaluators	2014-03-10 23:24:40 +01:00
Benoit Steiner	64a85800bd	Added support for AVX to Eigen.	2014-01-29 11:43:05 -08:00
Gael Guennebaud	f0b82c3ab9	Make reductions compatible with evaluators	2013-12-02 17:54:38 +01:00
Gael Guennebaud	9cd2d14005	merge with default branch	2013-04-19 11:21:39 +02:00
Gael Guennebaud	d7f3cfb56e	bug #564 : document the fact that minCoeff/maxCoeff members have undefined behavior if the matrix contains NaN.	2013-04-09 11:27:54 +02:00
Gael Guennebaud	5adcc6c7b4	Add support for NVCC5: most of the Core and part of LU are callable from CUDA code. Still a lot to do.	2013-02-07 19:06:14 +01:00
Benoit Jacob	69124cfca2	Automatic relicensing to MPL2 using Keirs script. Manual fixup follows.	2012-07-13 14:42:47 -04:00
Jitse Niesen	3c412183b2	Get rid of include directives inside namespace blocks (bug #339 ).	2012-04-15 11:06:28 +01:00
Gael Guennebaud	9c86ee2695	fix static inline versus inline static issues (the former is the correct order)	2012-01-31 12:58:52 +01:00
Gael Guennebaud	87f2af5930	workaround ICC compilation error with -strict-ansi	2012-01-25 15:45:01 +01:00
Gael Guennebaud	3e4a68cc60	optimize vectorized reductions by peeling the loop: - x2 for squaredNorm() on double - peeling the loop with a peeling factor of 4 leads to even better perf for large vectors (e.g., >64) but it makes more difficult to keep good performance on smaller ones.	2011-11-12 09:19:48 +01:00
Benoit Jacob	25579df2d4	'fix' a couple of clang -Wconstant-logical-operand warnings (still not convinced about the pertinence of that warning)	2011-02-22 08:54:55 -05:00

1 2 3

101 Commits