eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-08-05 03:30:37 +08:00

Author	SHA1	Message	Date
Rasmus Munk Larsen	765615609d	Update comment for fast sqrt.	2016-10-04 15:08:41 -07:00
Rasmus Munk Larsen	3ed67cb0bb	Fix a bug in the implementation of Carmack's fast sqrt algorithm in Eigen (enabled by EIGEN_FAST_MATH), which causes the vectorized parts of the computation to return -0.0 instead of NaN for negative arguments. Benchmark speed in Giga-sqrts/s Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz ----------------------------------------- SSE AVX Fast=1 2.529G 4.380G Fast=0 1.944G 1.898G Fast=1 fixed 2.214G 3.739G This table illustrates the worst case in terms speed impact: It was measured by repeatedly computing the sqrt of an n=4096 float vector that fits in L1 cache. For large vectors the operation becomes memory bound and the differences between the different versions almost negligible.	2016-10-04 14:22:56 -07:00
Benoit Steiner	881b90e984	Use explicit type casting to generate packets of zeros.	2016-10-04 08:23:38 -07:00
Benoit Steiner	409e887d78	Added support for constand std::complex numbers on GPU	2016-10-03 11:06:24 -07:00
Gael Guennebaud	9d6d0dff8f	bug #1317 : fix performance regression with some Block expressions and clang by helping it to remove dead code. The trick is to get rid of the nested expression in the evaluator by copying only the required information (here, the strides).	2016-10-01 15:37:00 +02:00
Gael Guennebaud	33500050c3	bug #1308 : fix compilation of some small products involving nullary-expressions.	2016-09-29 09:40:44 +02:00
Benoit Steiner	27d7628f16	Updated the list of warnings to reflect the new message ids introduced in cuda 8.0	2016-09-28 17:42:59 -07:00
Gael Guennebaud	f3a00dd2b5	Merged in sergiu/eigen (pull request PR-229) Disabled MSVC level 4 warning C4714	2016-09-27 09:28:08 +02:00
Gael Guennebaud	892afb9416	Add debug info.	2016-09-26 23:53:57 +02:00
Gael Guennebaud	779774f98c	bug #1311 : fix alignment logic in some cases of (scalar*small).lazyProduct(small)	2016-09-26 23:53:40 +02:00
Gael Guennebaud	48dfe98abd	bug #1308 : fix compilation of vector * rowvector::nullary.	2016-09-25 14:54:35 +02:00
Sergiu Deitsch	fe29157d02	disabled MSVC level 4 warning C4714 The level 4 warning (/W4) warns about functions marked as __forceinline not inlined, and generates a lot of noise.	2016-09-25 14:25:47 +02:00
Benoit Steiner	2a69290ddb	Added a specialization of Eigen::numext::real and Eigen::numext::imag for std::complex<T> to be used when compiling a cuda kernel. This is unfortunately necessary to be able to process complex numbers from a CUDA kernel on MacOS.	2016-09-22 15:52:23 -07:00
Gael Guennebaud	77e27fbeee	bump to 3.3-rc1	2016-09-22 22:37:39 +02:00
Benoit Steiner	50e3bbfc90	Calls x.imag() instead of imag(x) when x is a complex number since the former is a constexpr while the later isn't. This fixes compilation errors triggered by nvcc on Mac.	2016-09-22 13:17:25 -07:00
Felix Gruber	8bde7da086	fix documentation of LinSpaced The index of the highest value in a LinSpace is size-1.	2016-09-22 14:50:07 +02:00
Gael Guennebaud	66cbabafed	Add a note regarding gcc bug #72867	2016-09-22 11:18:52 +02:00
Gael Guennebaud	9fa2c8650e	Fix alignement of statically allocated temporaries in symv, and trmv.	2016-09-21 17:34:24 +02:00
Gael Guennebaud	ac5377e161	Improve cost estimation of complex division	2016-09-21 17:26:04 +02:00
Benoit Steiner	26f9907542	Added missing typedefs	2016-09-20 12:58:03 -07:00
RJ Ryan	b2c6dc48d9	Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.	2016-09-20 07:18:20 -07:00
Benoit Steiner	59e9edfbf1	Removed EIGEN_DEVICE_FUNC qualifers for the lu(), fullPivLu(), partialPivLu(), and inverse() functions since they aren't ready to run on GPU	2016-09-19 14:13:20 -07:00
Gael Guennebaud	4cc2c73e6a	Fix alignement of statically allocated temporaries in gemv.	2016-09-17 12:52:27 +02:00
Gael Guennebaud	ca7f061a5f	bug #828 : clarify documentation of SparseMatrixBase's methods returning a sub-matrix.	2016-09-16 11:23:19 +02:00
Gael Guennebaud	50e203c717	bug #828 : clarify documentation of SparseMatrixBase's unary methods.	2016-09-16 10:40:50 +02:00
Gael Guennebaud	b33144e4df	merge	2016-09-15 11:22:16 +02:00
Benoit Steiner	c0d56a543e	Added several missing EIGEN_DEVICE_FUNC qualifiers	2016-09-14 14:06:21 -07:00
Benoit Steiner	779faaaeba	Fixed compilation warnings generated by nvcc 6.5 (and below) when compiling the EIGEN_THROW macro	2016-09-14 09:56:11 -07:00
Gael Guennebaud	1c8347e554	Fix product for custom complex type. (conjugation was ignored)	2016-09-14 18:28:49 +02:00
Benoit Steiner	ff47717f25	Suppress warning 2527 and 2529, which correspond to the "calling a __host__ function from a __host__ __device__ function is not allowed" message in nvcc 6.5.	2016-09-13 12:49:40 -07:00
Benoit Steiner	309190cf02	Suppress message 1222 when compiling with nvcc: this ensures that we don't warnings about unknown warning messages when compiling with older versions of nvcc	2016-09-13 12:42:13 -07:00
Benoit Steiner	5f50f12d2c	Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem.	2016-09-12 13:46:13 -07:00
Gael Guennebaud	228ae29591	Fix compilation on 32 bits systems.	2016-09-09 22:34:38 +02:00
Gael Guennebaud	471eac5399	bug #1195 : move NumTraits::Div<>::Cost to internal::scalar_div_cost (with some specializations in arch/SSE and arch/AVX)	2016-09-08 08:36:27 +02:00
Gael Guennebaud	d780983f59	Doc: explain minimal requirements on nullary functors	2016-09-06 23:14:52 +02:00
Gael Guennebaud	85fb517eaf	Generalize ScalarBinaryOpTraits to any complex-real combination as defined by NumTraits (instead of supporting std::complex only).	2016-09-06 17:23:15 +02:00
Gael Guennebaud	447f269561	Disable previous workaround.	2016-09-06 15:49:02 +02:00
Gael Guennebaud	b046a3f87d	Workaround MSVC instantiation faillure of has_ary_operator at the level of triats<Ref>::match so that the has_ary_operator are really properly instantiated throughout the compilation unit.	2016-09-06 15:47:04 +02:00
Gael Guennebaud	19a95b3309	Fix shadowing wrt Eigen::Index	2016-09-05 17:19:47 +02:00
Gael Guennebaud	e13071dd13	Workaround a weird msvc 2012 compilation error.	2016-09-05 15:50:41 +02:00
Gael Guennebaud	d123717e21	Fix for msvc 2012 and older	2016-09-05 15:26:56 +02:00
Benoit Steiner	373c340b71	Fixed a typo	2016-09-02 15:41:17 -07:00
Benoit Steiner	5a6be66cef	Turned the Index type used by the nullary wrapper into a template parameter.	2016-09-02 14:10:29 -07:00
Gael Guennebaud	d6c8366d84	Fix compilation with MSVC 2012	2016-09-02 15:23:32 +02:00
Gael Guennebaud	ef54723dbe	One more msvc fix iteration, the previous one was over-simplified for visual	2016-09-01 15:04:53 +02:00
Gael Guennebaud	f9f32e9e2d	Fix compilation with nvcc	2016-09-01 13:06:14 +02:00
Gael Guennebaud	3d946e42b3	Fix compilation with visual studio	2016-09-01 12:59:32 +02:00
Gael Guennebaud	836fa25a82	Make sure sizeof is truelly needed, thus improving SFINAE portability.	2016-08-31 23:40:18 +02:00
Gael Guennebaud	84cf6e42ca	minor tweaks in has_* helpers	2016-08-31 23:04:14 +02:00
Gael Guennebaud	218c37beb4	bug #1286 : automatically detect the available prototypes of functors passed to CwiseNullaryExpr such that functors have only to implement the operators that matters among: operator()() operator()(i) operator()(i,j) Linear access is also automatically detected based on the availability of operator()(i,j).	2016-08-31 15:45:25 +02:00

1 2 3 4 5 ...

3235 Commits