eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-09-29 09:43:13 +08:00

Author	SHA1	Message	Date
Benoit Steiner	23f69ab936	Added implementations of pexp, plog, psqrt, and prsqrt optimized for AVX512	2016-02-04 10:36:36 -08:00
Benoit Steiner	6c9cf117c1	Fixed indentation	2016-02-04 10:34:10 -08:00
Benoit Steiner	d93b71a301	Updated the packetmath test to call predux_half instead of predux4	2016-02-01 15:18:33 -08:00
Benoit Steiner	ef66f2887b	Updated the matrix multiplication code to make it compile with AVX512 enabled.	2016-02-01 14:38:05 -08:00
Benoit Steiner	85b6d82b49	Generalized predux4 to support AVX512 packets, and renamed it predux_half. Disabled the implementation of pabs for avx512 since the corresponding intrinsics are not shipped with gcc	2016-02-01 14:35:51 -08:00
Benoit Steiner	c1a42c2d0d	Don't disable the AVX implementations of plset when compiling with AVX512 enabled	2016-01-14 17:21:39 -08:00
Benoit Steiner	0366478df8	Added alignment requirement to the AVX512 packet traits.	2016-01-14 17:02:39 -08:00
Benoit Steiner	3cfd16f3af	Fixed the signature of the plset primitives for AVX512	2016-01-14 16:58:01 -08:00
Benoit Steiner	67f44365ea	Fixed the AVX512 signature of the ptranspose primitives	2016-01-14 16:51:11 -08:00
Benoit Steiner	a282eb1363	pscatter/pgather use Index instead of int to specify the stride	2016-01-14 16:39:39 -08:00
Benoit Steiner	7832485575	Deleted unnecessary commas and semicolons	2016-01-14 16:36:29 -08:00
Benoit Steiner	99093c0fe0	Added support for AVX512 to the build files	2016-01-05 10:02:49 -08:00
Benoit Steiner	9f9d8d2f62	Disabled part of the matrix matrix peeling code that's incompatible with 512 bit registers	2015-12-21 13:04:52 -08:00
Benoit Steiner	b74887d5f2	Implemented most of the packet primitives for AVX512	2015-12-21 11:46:36 -08:00
Benoit Steiner	6ffb208c77	Make sure EIGEN_HAS_MM_MALLOC is set to 1 when using the avx512 instruction set.	2015-12-21 11:23:15 -08:00
Benoit Steiner	994d1c60b9	Free memory allocated using posix_memalign() with free() instead of std::free()	2015-12-21 11:21:39 -08:00
Benoit Steiner	b8861b0c25	Make sure the data is aligned on a 64 byte boundary when using avx512 instructions.	2015-12-11 09:19:57 -08:00
Benoit Steiner	9a415fb1e2	Preliminary support for AVX512	2015-12-10 15:34:57 -08:00
Mark Borgerding	22dd368ea0	sign(complex) compiles for GPU	2015-12-10 16:14:29 -05:00
Gael Guennebaud	46d2f6cd78	Workaround gcc issue with -O3 and the i387 FPU.	2015-12-10 21:33:43 +01:00
Gael Guennebaud	7ad1aaec1d	bug #1103 : fix neon vectorization of pmul(Packet1cd,Packet1cd)	2015-12-10 16:06:33 +01:00
Gael Guennebaud	b0a1d6f2e5	Improve handling of deprecated EIGEN_INCLUDE_INSTALL_DIR variable	2015-12-10 15:47:06 +01:00
Taylor Braun-Jones	b836acb799	Further fixes for CMAKE_INSTALL_PREFIX correctness And other related cmake cleanup, including: - Use CMAKE_CURRENT_LIST_DIR to find UseEigen3.cmake - Use INSTALL_DIR term consistently for variable names - Drop unnecessary extra EIGEN_INCLUDE_INSTALL_DIR - Fix some paths in generated eigen3.pc and Eigen3Config.cmake files missing CMAKE_INSTALL_PREFIX - Fix pkgconfig directory choice ignored if it doesn't exist at configure time (bug #711)	2015-11-07 21:29:24 -05:00
Gael Guennebaud	e73ef4f25e	bug #1109 : use noexcept instead of throw for C++11 compilers	2015-12-10 14:21:23 +01:00
Gael Guennebaud	145ad5d800	Use more explicit names.	2015-12-10 12:03:38 +01:00
Gael Guennebaud	75f0fe3795	Fix usage of "Index" as a compile time integral.	2015-12-10 12:01:06 +01:00
Gael Guennebaud	f248249c1f	bug #1113 : fix name conflict with C99's "I".	2015-12-10 11:57:57 +01:00
Gael Guennebaud	21ed29e2c9	Disable complex scalar types because the compiler might aggressively vectorize the initialization of complex coeffs to 0 before we can check for alignedness	2015-12-09 20:46:09 +01:00
Gael Guennebaud	fbe18d5507	Forbid the creation of SparseCompressedBase object	2015-12-09 15:47:32 +01:00
Gael Guennebaud	dc73430d4b	bug #1074 : forbid the creation of PlainObjectBase object by making its ctor protected	2015-12-09 15:47:08 +01:00
Gael Guennebaud	1257fbd2f9	Fix sign-unsigned issue in enum	2015-12-09 10:06:42 +01:00
Gael Guennebaud	4549549992	Fix and clarify documentation of Transform wrt operator*(MatrixBase)	2015-12-08 16:21:49 +01:00
Gael Guennebaud	543bd28a24	Fix Alignment in coeff-based product, and enable unaligned vectorization	2015-12-08 11:28:05 +01:00
Gael Guennebaud	03ad4fc504	Extend unit test of coeff-based product to check many more combinations	2015-12-08 11:27:43 +01:00
Benoit Steiner	20e2ab1121	Fixed another compilation warning	2015-12-07 16:17:57 -08:00
Benoit Steiner	7dfe75f445	Fixed compilation warnings	2015-12-07 08:12:30 -08:00
Gael Guennebaud	ad3d68400e	Add matrix-free solver example	2015-12-07 12:33:38 +01:00
Gael Guennebaud	b37036afce	Implement wrapper for matrix-free iterative solvers	2015-12-07 12:23:22 +01:00
Benoit Steiner	f4ca8ad917	Use signed integers instead of unsigned ones more consistently in the codebase.	2015-12-04 18:14:16 -08:00
Benoit Steiner	490d26e4c1	Use integers instead of std::size_t to encode the number of dimensions in the Tensor class since most of the code currently already use integers.	2015-12-04 10:15:11 -08:00
Benoit Steiner	d20efc974d	Made it possible to use the sigmoid functor within a CUDA kernel.	2015-12-04 09:38:15 -08:00
Benoit Steiner	e25e3a041b	Added rsqrt() method to the Array class: this method computes the coefficient-wise inverse square root much more efficiently than calling sqrt().inverse().	2015-12-03 18:16:35 -08:00
Benoit Steiner	029052d276	Deleted redundant code	2015-12-03 17:08:47 -08:00
Benoit Steiner	c41e9e4bd0	Merged in Unril/eigen-1/Unril/fixes-internal-compiler-error-while-comp-1449156092576 (pull request PR-147) Fixes internal compiler error while compiling with VC2015 Update1 x64.	2015-12-03 14:26:14 -08:00
Gael Guennebaud	1562e13aba	Add missing Rotation2D::operator=(Matrix2x2)	2015-12-03 22:25:26 +01:00
Nikolay Fedorov	944647c0aa	Fixes internal compiler error while compiling with VC2015 Update1 x64.	2015-12-03 15:21:43 +00:00
Benoit Steiner	d2d4c45d55	Made it possible to leverage several binary functor in a CUDA kernel Explicitely specified the return type of the various scalar_cmp_op functors.	2015-12-02 17:21:33 -08:00
Gael Guennebaud	c5b86893e7	bug #1123 : add missing documentation of angle() and axis()	2015-12-01 14:45:08 +01:00
Gael Guennebaud	0bb12fa614	Add LU::transpose().solve() and LU::adjoint().solve() API.	2015-12-01 14:38:47 +01:00
Rasmus Munk Larsen	1663d15da7	Add internal method _solve_impl_transposed() to LU decomposition classes that solves A^T x = b or A^* x = b.	2015-11-30 13:39:24 -08:00

1 2 3 4 5 ...

7113 Commits