eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-05-18 23:57:39 +08:00

Author	SHA1	Message	Date
Christoph Hertzberg	449ff74672	Fix most Doxygen warnings. Also add links to stable documentation from unsupported modules (by using the corresponding Doxytags file). Manually grafted from d107a371c61b764c73fd1570b1f3ed1c6400dd7e	2018-10-19 21:10:28 +02:00
Rasmus Munk Larsen	d8f285852b	Only set EIGEN_CONSTEXPR_ARE_DEVICE_FUNC for clang++ if cxx_relaxed_constexpr is available.	2018-10-18 16:55:02 -07:00
Gael Guennebaud	0f780bb0b4	Fix float-to-double warning	2018-10-16 09:19:45 +02:00
Gael Guennebaud	a39e0f7438	bug #1612 : fix regression in "outer-vectorization" of partial reductions for PacketSize==1 (aka complex<double>)	2018-10-16 01:04:25 +02:00
Gael Guennebaud	d2d570c116	Remove useless (and broken) resize	2018-10-16 00:42:48 +02:00
Gael Guennebaud	f0fb95135d	Iterative solvers: unify and fix handling of multiple rhs. m_info was not properly computed and the logic was repeated in several places.	2018-10-15 23:47:46 +02:00
Gael Guennebaud	3a33db4de5	merge	2018-10-15 09:22:27 +02:00
Mark D Ryan	aa110e681b	PR 526: Speed up multiplication of small, dynamically sized matrices The Packet16f, Packet8f and Packet8d types are too large to use with dynamically sized matrices typically processed by the SliceVectorizedTraversal specialization of the dense_assignment_loop. Using these types is likely to lead to little or no vectorization. Significant slowdown in the multiplication of these small matrices can be observed when building with AVX and AVX512 enabled. This patch introduces a new dense_assignment_kernel that is used when computing small products whose operands have dynamic dimensions. It ensures that the PacketSize used is no larger than 4, thereby increasing the chance that vectorized instructions will be used when computing the product. I tested all 969 possible combinations of M, K, and N that are handled by the dense_assignment_loop on x86 builds. Although a few combinations are slowed down by this patch they are far outnumbered by the cases that are sped up, as the following results demonstrate. Disabling Packed8d on AVX512 builds: Total Cases: 969 Better: 511 Worse: 85 Same: 373 Max Improvement: 169.00% (4 8 6) Max Degradation: 36.50% (8 5 3) Median Improvement: 35.46% Median Degradation: 17.41% Total FLOPs Improvement: 19.42% Disabling Packet16f and Packed8f on AVX512 builds: Total Cases: 969 Better: 658 Worse: 5 Same: 306 Max Improvement: 214.05% (8 6 5) Max Degradation: 22.26% (16 2 1) Median Improvement: 60.05% Median Degradation: 13.32% Total FLOPs Improvement: 59.58% Disabling Packed8f on AVX builds: Total Cases: 969 Better: 663 Worse: 96 Same: 210 Max Improvement: 155.29% (4 10 5) Max Degradation: 35.12% (8 3 2) Median Improvement: 34.28% Median Degradation: 15.05% Total FLOPs Improvement: 26.02%	2018-10-12 15:20:21 +02:00
Eugene Zhulenev	d9392f9e55	Fix code format	2018-11-02 14:51:35 -07:00
Eugene Zhulenev	118520f04a	Workaround nbcc+msvc compiler bug	2018-11-02 14:48:28 -07:00
Christoph Hertzberg	24dc076519	Explicitly convert 0 to Scalar for custom types	2018-10-12 10:22:19 +02:00
Gael Guennebaud	43633fbaba	Fix warning with AVX512f	2018-10-11 10:13:48 +02:00
Gael Guennebaud	97e2c808e9	Fix avx512 plog(NaN) to return NaN instead of +inf	2018-10-11 10:13:13 +02:00
Gael Guennebaud	b3f66d29a5	Enable avx512 plog with clang	2018-10-11 10:12:21 +02:00
Gael Guennebaud	f0aa7e40fc	Fix regression in changeset 5335659c47d69d3ee1b6f9792fea5998731f9a53	2018-10-10 23:47:30 +02:00
Gael Guennebaud	ce243ee45b	bug #520 : add diagmat +/- diagmat operators.	2018-10-10 23:38:22 +02:00
Gael Guennebaud	5335659c47	Merged in ezhulenev/eigen-02 (pull request PR-525) Fix bug in partial reduction of expressions requiring evaluation	2018-10-10 20:59:00 +00:00
Gael Guennebaud	eec0dfd688	bug #632 : add specializations for res ?= dense +/- sparse and res ?= sparse +/- dense. They are rewritten as two compound assignment to by-pass hybrid dense-sparse iterator.	2018-10-10 22:50:15 +02:00
Eugene Zhulenev	8e6dc2c81d	Fix bug in partial reduction of expressions requiring evaluation	2018-10-10 13:23:52 -07:00
Eugene Zhulenev	2bf1a31d81	Use void type if stl-style iterators are not supported	2018-10-10 10:31:40 -07:00
Rasmus Munk Larsen	e8918743c1	Merged in ezhulenev/eigen-01 (pull request PR-523) Compile time detection for unimplemented stl-style iterators	2018-10-09 23:42:01 +00:00
Eugene Zhulenev	c0ca8a9fa3	Compile time detection for unimplemented stl-style iterators	2018-10-09 15:28:23 -07:00
Gael Guennebaud	1dd1f8e454	bug #65 : add vectorization of partial reductions along the outer-dimension, for instance: colmajor_mat.rowwise().mean()	2018-10-09 23:36:50 +02:00
Gael Guennebaud	bfa2a81a50	Make redux_vec_unroller more flexible regarding packet-type	2018-10-09 23:30:41 +02:00
Christoph Hertzberg	f6359ad795	Small Doxygen fixes	2018-10-09 19:33:35 +02:00
Gael Guennebaud	7a882c05ab	Fix compilation on CUDA	2018-10-09 17:02:16 +02:00
Gael Guennebaud	e00487f7d2	bug #1603 : add parenthesis around ternary operator in function body as well as a harmless attempt to make MSVC happy.	2018-10-08 22:27:04 +02:00
Gael Guennebaud	649d4758a6	merge	2018-10-08 17:35:18 +02:00
Gael Guennebaud	774bb9d6f7	fix a doxygen issue	2018-10-08 09:30:15 +02:00
Gael Guennebaud	bcb7c66b53	Workaround gcc's alloc-size-larger-than= warning	2018-10-07 21:55:59 +02:00
Gael Guennebaud	6512c5e136	Implement a better workaround for GCC's bug #87544	2018-10-07 15:00:05 +02:00
Gael Guennebaud	409132bb81	Workaround gcc bug making it trigger an invalid warning	2018-10-07 09:23:15 +02:00
Gael Guennebaud	c6a1ab4036	Workaround MSVC compilation issue	2018-10-06 13:49:17 +02:00
Gael Guennebaud	e21766c6f5	Clarify doc of rowwise/colwise/vectorwise.	2018-10-05 23:12:09 +02:00
Gael Guennebaud	d92f004ab7	Simplify API by removing allCols/allRows and reusing rowwise/colwise to define iterators over rows/columns	2018-10-05 23:11:21 +02:00
Gael Guennebaud	3e64b1fc86	Move iterators to internal, improve doc, make unit test c++03 friendly	2018-10-03 15:13:15 +02:00
Gael Guennebaud	2b2b4d0580	fix unused warning	2018-10-03 14:16:21 +02:00
Gael Guennebaud	5f26f57598	Change the logic of A.reshaped<Order>() to be a simple alias to A.reshaped<Order>(AutoSize,fix<1>). This means that now AutoOrder is allowed, and it always return a column-vector.	2018-10-03 11:41:47 +02:00
Gael Guennebaud	0481900e25	Add pointer-based iterator for direct-access expressions	2018-10-02 23:44:36 +02:00
Gael Guennebaud	8c38528168	Factorize RowsProxy/ColsProxy and related iterators using subVector<>(Index)	2018-10-02 14:03:26 +02:00
Gael Guennebaud	12487531ce	Add templated subVector<Vertical/Horizonal>(Index) aliases to col/row(Index) methods (plus subVectors<>() to retrieve the number of rows/columns)	2018-10-02 14:02:34 +02:00
Gael Guennebaud	37e29fc893	Use Index instead of ptrdiff_t or int, fix random-accessors.	2018-10-02 13:29:32 +02:00
Gael Guennebaud	de2efbc43c	bug #1605 : workaround ABI issue with vector types (aka __m128) versus scalar types (aka float)	2018-10-01 23:45:55 +02:00
Gael Guennebaud	b0c66adfb1	bug #231 : initial implementation of STL iterators for dense expressions	2018-10-01 23:21:37 +02:00
Christoph Hertzberg	564ca71e39	Merged in deven-amd/eigen/HIP_fixes (pull request PR-518) PR with HIP specific fixes (for the eigen nightly regression failures in HIP mode)	2018-10-01 16:51:04 +00:00
Deven Desai	94898488a6	This commit contains the following (HIP specific) updates: - unsupported/Eigen/CXX11/src/Tensor/TensorReductionGpu.h Changing "pass-by-reference" argument to be "pass-by-value" instead (in a __global__ function decl). "pass-by-reference" arguments to __global__ functions are unwise, and will be explicitly flagged as errors by the newer versions of HIP. - Eigen/src/Core/util/Memory.h - unsupported/Eigen/CXX11/src/Tensor/TensorContraction.h Changes introduced in recent commits breaks the HIP compile. Adding EIGEN_DEVICE_FUNC attribute to some functions and calling ::malloc/free instead of the corresponding std:: versions to get the HIP compile working again - unsupported/Eigen/CXX11/src/Tensor/TensorReduction.h Change introduced a recent commit breaks the HIP compile (link stage errors out due to failure to inline a function). Disabling the recently introduced code (only for HIP compile), to get the eigen nightly testing going again. Will submit another PR once we have te proper fix. - Eigen/src/Core/util/ConfigureVectorization.h Enabling GPU VECTOR support when HIP compiler is in use (for both the host and device compile phases)	2018-10-01 14:28:37 +00:00
Gael Guennebaud	af3ad4b513	oops, I've been too fast in previous copy/paste	2018-09-27 09:28:57 +02:00
Gael Guennebaud	24b163a877	#pragma GCC diagnostic push/pop is not supported prioro to gcc 4.6	2018-09-27 09:23:54 +02:00
Gael Guennebaud	41c3a2ffc1	Fix documentation of reshape to vectors.	2018-09-25 16:35:44 +02:00
Christoph Hertzberg	2c083ace3e	Provide EIGEN_OVERRIDE and EIGEN_FINAL macros to mark virtual function overrides	2018-09-24 18:01:17 +02:00

... 2 3 4 5 6 ...

5889 Commits