eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-13 08:31:28 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	0fa9e4a15c	Fix performance regression in dgemm introduced by changeset 5d51a7f12c69138ed2a43df240bdf27a5313f7ce	2016-07-02 17:35:08 +02:00
Gael Guennebaud	672076db5d	Fix performance regression introduced in changeset e56aabf205a1e8f581dd8a46d7d46ce79c45e158 . Register blocking sizes are better handled by the cache size heuristics. The current code introduced very small blocks, for instance for 9x9 matrix, thus killing performance.	2016-07-02 15:40:56 +02:00
Justin Carpentier	6126886a67	Use complete nested namespace Eigen::internal	2016-06-28 20:09:25 +02:00
Benoit Jacob	328c5d876a	Undo changes in AltiVec --- I don't have any way to test there.	2016-06-28 11:15:25 -04:00
Benoit Jacob	38fb606052	Avoid global variables with static constructors in NEON/Complex.h	2016-06-28 11:12:49 -04:00
Gael Guennebaud	d476cadbb8	bug #1247 : fix regression in compilation of pow(integer,integer), and add respective unit tests.	2016-06-25 10:12:06 +02:00
Gael Guennebaud	c50c73cae2	Fix missing specialization.	2016-06-24 23:10:39 +02:00
Gael Guennebaud	cd577a275c	Relax promote_scalar_arg logic to enable promotion to Expr::Scalar if conversion to Expr::Literal fails. This is useful to cancel expression template at the scalar level, e.g. with AutoDiff<AutoDiff<>>. This patch also defers calls to NumTraits in cases for which types are not directly compatible.	2016-06-24 11:28:54 +02:00
Gael Guennebaud	deb45ad4bc	bug #1245 : fix compilation with msvc	2016-06-24 09:52:25 +02:00
Gael Guennebaud	55fc04e8b5	Fix operator priority	2016-06-23 15:36:42 +02:00
Gael Guennebaud	7c6561485a	merge PR 194	2016-06-23 15:29:57 +02:00
Gael Guennebaud	76faf4a965	Introduce a NumTraits<T>::Literal type to be used for literals, and improve mixing type support in operations between arrays and scalars: - 2 * ArrayXcf is now optimized in the sense that the integer 2 is properly promoted to a float instead of a complex<float> (fix a regression) - 2.1 * ArrayXi is now forbiden (previously, 2.1 was converted to 2) - This mechanism should be applicable to any custom scalar type, assuming NumTraits<T>::Literal is properly defined (it defaults to T)	2016-06-23 14:27:20 +02:00
Gael Guennebaud	a3f7edf7e7	Biug 1242: fix comma init with empty matrices.	2016-06-23 10:25:04 +02:00
Benoit Steiner	b055590e91	Made log1p_impl usable inside a GPU kernel	2016-06-16 11:37:40 -07:00
Gael Guennebaud	eb91345d64	Move scalar/expr to ArrayBase and fix documentation	2016-06-15 15:22:03 +02:00
Gael Guennebaud	4794834397	Propagate functor to ScalarBinaryOpTraits	2016-06-15 09:58:49 +02:00
Gael Guennebaud	c55035b9c0	Include the cost of stores in unrolling of triangular expressions.	2016-06-15 09:57:33 +02:00
Gael Guennebaud	4e7c3af874	Cleanup useless helper: internal::product_result_scalar	2016-06-15 00:04:10 +02:00
Gael Guennebaud	101ea26f5e	Include the cost of stores in unrolling (also fix infinite unrolling with expression costing 0 like Constant)	2016-06-15 00:01:16 +02:00
Gael Guennebaud	76236cdea4	merge	2016-06-14 15:33:47 +02:00
Gael Guennebaud	1004c4df99	Cleanup unused functors.	2016-06-14 15:27:28 +02:00
Gael Guennebaud	70dad84b73	Generalize expr/expr and scalar/expr wrt scalar types.	2016-06-14 15:26:37 +02:00
Gael Guennebaud	62134082aa	Update AutoDiffScalar wrt to scalar-multiple.	2016-06-14 15:06:35 +02:00
Gael Guennebaud	396d9cfb6e	Generalize expr.pow(scalar), pow(expr,scalar) and pow(scalar,expr). Internal: scalar_pow_op (unary) is removed, and scalar_binary_pow_op is renamed scalar_pow_op.	2016-06-14 14:10:07 +02:00
Gael Guennebaud	a8c08e8b8e	Implement expr+scalar, scalar+expr, expr-scalar, and scalar-expr as binary expressions, and generalize supported scalar types. The following functors are now deprecated: scalar_add_op, scalar_sub_op, and scalar_rsub_op.	2016-06-14 12:06:10 +02:00
Gael Guennebaud	bcc0f38f98	Add unittesting plugins to scalar_product_op and scalar_quotient_op to help chaking that types are properly propagated.	2016-06-14 11:31:27 +02:00
Gael Guennebaud	f5b1c73945	Set cost of constant expression to 0 (the cost should be amortized through the expression)	2016-06-14 11:29:06 +02:00
Gael Guennebaud	deb8306e60	Move MatrixBase::operaotr*(UniformScaling) as a free function in Scaling.h, and fix return type.	2016-06-14 11:28:03 +02:00
Gael Guennebaud	64fcfd314f	Implement scalar multiples and division by a scalar as a binary-expression with a constant expression. This slightly complexifies the type of the expressions and implies that we now have to distinguish between scalarexpr and exprscalar to catch scalar-multiple expression (e.g., see BlasUtil.h), but this brings several advantages: - it makes it clear on each side the scalar is applied, - it clearly reflects that we are dealing with a binary-expression, - the complexity of the type is hidden through macros defined at the end of Macros.h, - distinguishing between "scalar op expr" and "expr op scalar" is important to support non commutative fields (like quaternions) - "scalar op expr" is now fully equivalent to "ConstantExpr(scalar) op expr" - scalar_multiple_op, scalar_quotient1_op and scalar_quotient2_op are not used anymore in officially supported modules (still used in Tensor)	2016-06-14 11:26:57 +02:00
Gael Guennebaud	3c12e24164	Add bind1st_op and bind2nd_op helpers to turn binary functors into unary ones, and implement scalar_multiple2 and scalar_quotient2 on top of them.	2016-06-13 16:18:59 +02:00
Gael Guennebaud	7a9ef7bbb4	Add default template parameters for the second scalar type of binary functors. This enhences backward compatibility.	2016-06-13 16:17:23 +02:00
Gael Guennebaud	5fdd703629	Enable mixing types in numext::pow	2016-06-10 15:58:04 +02:00
Gael Guennebaud	2e238bafb6	Big 279: enable mixing types for comparisons, min, and max.	2016-06-10 15:05:43 +02:00
Gael Guennebaud	0028049380	bug #1240 : Remove any assumption on NEON vector types.	2016-06-09 23:08:11 +02:00
Gael Guennebaud	2c462f4201	Clean handling for void type in EIGEN_CHECK_BINARY_COMPATIBILIY	2016-06-06 23:11:38 +02:00
Gael Guennebaud	3d71d3918e	Disable shortcuts for res ?= prod when the scalar types do not match exactly.	2016-06-06 23:10:55 +02:00
Gael Guennebaud	66e99ab6a1	Relax mixing-type constraints for binary coefficient-wise operators: - Replace internal::scalar_product_traits<A,B> by Eigen::ScalarBinaryOpTraits<A,B,OP> - Remove the "functor_is_product_like" helper (was pretty ugly) - Currently, OP is not used, but it is available to the user for fine grained tuning - Currently, only the following operators have been generalized: ,/,+,-,=,=,/=,+=,-= - TODO: generalize all other binray operators (comparisons,pow,etc.) - TODO: handle "scalar op array" operators (currently only * is handled) - TODO: move the handling of the "void" scalar type to ScalarBinaryOpTraits	2016-06-06 15:11:41 +02:00
Benoit Steiner	1f1e0b9e30	Silenced compilation warning	2016-06-05 12:59:11 -07:00
Benoit Steiner	5b95b4daf9	Moved static assertions into the class constructor to make the code more portable	2016-06-05 12:57:48 -07:00
Sean Templeton	bd21243821	Fix compile errors initializing packets on ARM DS-5 5.20 The ARM DS-5 5.20 compiler fails compiling with the following errors: "src/Core/arch/NEON/PacketMath.h", line 113: Error: #146: too many initializer values Packet4f countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/PacketMath.h", line 118: Error: #146: too many initializer values Packet4i countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/Complex.h", line 30: Error: #146: too many initializer values static uint32x4_t p4ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET4(0x00000000, 0x80000000, 0x00000000, 0x80000000); ^ "src/Core/arch/NEON/Complex.h", line 31: Error: #146: too many initializer values static uint32x2_t p2ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET2(0x00000000, 0x80000000); ^ The vectors are implemented as two doubles, hence the too many initializer values error. Changed the code to use intrinsic load functions which all compilers implementing NEON should have.	2016-06-03 10:51:35 -05:00
Gael Guennebaud	1fc2746417	Make Arrays's ctor/assignment noexcept	2016-06-09 22:52:37 +02:00
Benoit Steiner	8fd57a97f2	Enable the vectorization of adds and mults of fp16	2016-06-07 18:22:18 -07:00
Benoit Steiner	33f0340188	Implement result_of for the new ternary functors	2016-06-06 12:06:42 -07:00
Eugene Brevdo	39baff850c	Add TernaryFunctors and the betainc SpecialFunction. TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.	2016-06-02 17:04:19 -07:00
Gael Guennebaud	8d97ba6b22	bug #725 : make move ctor/assignment noexcept.	2016-06-03 14:28:25 +02:00
Gael Guennebaud	f2c2465acc	Fix function dependencies	2016-06-03 07:44:18 +02:00
Gael Guennebaud	53feb73b45	Remove dead code.	2016-06-02 22:19:55 +02:00
Gael Guennebaud	2c00ac0b53	Implement generic scalarexpr and exprscalar operator based on scalar_product_traits. This is especially useful for custom scalar types, e.g., to enable float*expr<multi_prec> without conversion.	2016-06-02 22:16:37 +02:00
Gael Guennebaud	8b6f53222b	bug #1193 : fix lpNorm<Infinity> for empty input.	2016-06-02 15:29:59 +02:00
Gael Guennebaud	360e311b66	Doc: add some cross references (also fix empty macro argument warning)	2016-06-01 23:34:09 +02:00

1 2 3 4 5 ...

3102 Commits