Benoit Steiner
|
23f69ab936
|
Added implementations of pexp, plog, psqrt, and prsqrt optimized for AVX512
|
2016-02-04 10:36:36 -08:00 |
|
Benoit Steiner
|
6c9cf117c1
|
Fixed indentation
|
2016-02-04 10:34:10 -08:00 |
|
Benoit Steiner
|
ef66f2887b
|
Updated the matrix multiplication code to make it compile with AVX512 enabled.
|
2016-02-01 14:38:05 -08:00 |
|
Benoit Steiner
|
85b6d82b49
|
Generalized predux4 to support AVX512 packets, and renamed it predux_half.
Disabled the implementation of pabs for avx512 since the corresponding intrinsics are not shipped with gcc
|
2016-02-01 14:35:51 -08:00 |
|
Benoit Steiner
|
c1a42c2d0d
|
Don't disable the AVX implementations of plset when compiling with AVX512 enabled
|
2016-01-14 17:21:39 -08:00 |
|
Benoit Steiner
|
0366478df8
|
Added alignment requirement to the AVX512 packet traits.
|
2016-01-14 17:02:39 -08:00 |
|
Benoit Steiner
|
3cfd16f3af
|
Fixed the signature of the plset primitives for AVX512
|
2016-01-14 16:58:01 -08:00 |
|
Benoit Steiner
|
67f44365ea
|
Fixed the AVX512 signature of the ptranspose primitives
|
2016-01-14 16:51:11 -08:00 |
|
Benoit Steiner
|
a282eb1363
|
pscatter/pgather use Index instead of int to specify the stride
|
2016-01-14 16:39:39 -08:00 |
|
Benoit Steiner
|
7832485575
|
Deleted unnecessary commas and semicolons
|
2016-01-14 16:36:29 -08:00 |
|
Benoit Steiner
|
9f9d8d2f62
|
Disabled part of the matrix matrix peeling code that's incompatible with 512 bit registers
|
2015-12-21 13:04:52 -08:00 |
|
Benoit Steiner
|
b74887d5f2
|
Implemented most of the packet primitives for AVX512
|
2015-12-21 11:46:36 -08:00 |
|
Benoit Steiner
|
6ffb208c77
|
Make sure EIGEN_HAS_MM_MALLOC is set to 1 when using the avx512 instruction set.
|
2015-12-21 11:23:15 -08:00 |
|
Benoit Steiner
|
994d1c60b9
|
Free memory allocated using posix_memalign() with free() instead of std::free()
|
2015-12-21 11:21:39 -08:00 |
|
Benoit Steiner
|
b8861b0c25
|
Make sure the data is aligned on a 64 byte boundary when using avx512 instructions.
|
2015-12-11 09:19:57 -08:00 |
|
Benoit Steiner
|
9a415fb1e2
|
Preliminary support for AVX512
|
2015-12-10 15:34:57 -08:00 |
|
Mark Borgerding
|
22dd368ea0
|
sign(complex) compiles for GPU
|
2015-12-10 16:14:29 -05:00 |
|
Gael Guennebaud
|
7ad1aaec1d
|
bug #1103: fix neon vectorization of pmul(Packet1cd,Packet1cd)
|
2015-12-10 16:06:33 +01:00 |
|
Gael Guennebaud
|
e73ef4f25e
|
bug #1109: use noexcept instead of throw for C++11 compilers
|
2015-12-10 14:21:23 +01:00 |
|
Gael Guennebaud
|
145ad5d800
|
Use more explicit names.
|
2015-12-10 12:03:38 +01:00 |
|
Gael Guennebaud
|
75f0fe3795
|
Fix usage of "Index" as a compile time integral.
|
2015-12-10 12:01:06 +01:00 |
|
Gael Guennebaud
|
f248249c1f
|
bug #1113: fix name conflict with C99's "I".
|
2015-12-10 11:57:57 +01:00 |
|
Gael Guennebaud
|
fbe18d5507
|
Forbid the creation of SparseCompressedBase object
|
2015-12-09 15:47:32 +01:00 |
|
Gael Guennebaud
|
dc73430d4b
|
bug #1074: forbid the creation of PlainObjectBase object by making its ctor protected
|
2015-12-09 15:47:08 +01:00 |
|
Gael Guennebaud
|
1257fbd2f9
|
Fix sign-unsigned issue in enum
|
2015-12-09 10:06:42 +01:00 |
|
Gael Guennebaud
|
4549549992
|
Fix and clarify documentation of Transform wrt operator*(MatrixBase)
|
2015-12-08 16:21:49 +01:00 |
|
Gael Guennebaud
|
543bd28a24
|
Fix Alignment in coeff-based product, and enable unaligned vectorization
|
2015-12-08 11:28:05 +01:00 |
|
Gael Guennebaud
|
ad3d68400e
|
Add matrix-free solver example
|
2015-12-07 12:33:38 +01:00 |
|
Gael Guennebaud
|
b37036afce
|
Implement wrapper for matrix-free iterative solvers
|
2015-12-07 12:23:22 +01:00 |
|
Benoit Steiner
|
e25e3a041b
|
Added rsqrt() method to the Array class: this method computes the coefficient-wise inverse square root much more efficiently than calling sqrt().inverse().
|
2015-12-03 18:16:35 -08:00 |
|
Benoit Steiner
|
c41e9e4bd0
|
Merged in Unril/eigen-1/Unril/fixes-internal-compiler-error-while-comp-1449156092576 (pull request PR-147)
Fixes internal compiler error while compiling with VC2015 Update1 x64.
|
2015-12-03 14:26:14 -08:00 |
|
Gael Guennebaud
|
1562e13aba
|
Add missing Rotation2D::operator=(Matrix2x2)
|
2015-12-03 22:25:26 +01:00 |
|
Nikolay Fedorov
|
944647c0aa
|
Fixes internal compiler error while compiling with VC2015 Update1 x64.
|
2015-12-03 15:21:43 +00:00 |
|
Benoit Steiner
|
d2d4c45d55
|
Made it possible to leverage several binary functor in a CUDA kernel
Explicitely specified the return type of the various scalar_cmp_op functors.
|
2015-12-02 17:21:33 -08:00 |
|
Gael Guennebaud
|
c5b86893e7
|
bug #1123: add missing documentation of angle() and axis()
|
2015-12-01 14:45:08 +01:00 |
|
Gael Guennebaud
|
0bb12fa614
|
Add LU::transpose().solve() and LU::adjoint().solve() API.
|
2015-12-01 14:38:47 +01:00 |
|
Rasmus Munk Larsen
|
1663d15da7
|
Add internal method _solve_impl_transposed() to LU decomposition classes that solves A^T x = b or A^* x = b.
|
2015-11-30 13:39:24 -08:00 |
|
Gael Guennebaud
|
6c02cbbb0f
|
Fix matrix to quaternion (and angleaxis) conversion for matrix expression.
|
2015-12-01 09:45:56 +01:00 |
|
Gael Guennebaud
|
1d906d883d
|
Fix degenerate cases in syrk and trsm
|
2015-11-30 22:20:31 +01:00 |
|
Gael Guennebaud
|
afa11d646d
|
Fix UmfPackLU ctor for exppressions
|
2015-11-27 22:04:22 +01:00 |
|
Gael Guennebaud
|
6bdeb8cfbe
|
bug #918, umfpack: add access to umfpack return code and parameters
|
2015-11-27 21:58:36 +01:00 |
|
Gael Guennebaud
|
3f32f5ec22
|
ArrayBase::sign: add unit test and fix doc
|
2015-11-27 16:27:53 +01:00 |
|
Gael Guennebaud
|
1261d020c3
|
bug #1120, superlu: mem_usage_t is now uniquely defined, so let's use it.
|
2015-11-27 10:39:09 +01:00 |
|
Gael Guennebaud
|
ca001d7c2a
|
Big 1009, part 2/2: add static assertion on LinearAccessBit in coeff(index)-like methods.
|
2015-11-27 10:06:47 +01:00 |
|
Gael Guennebaud
|
91a7059459
|
bug #1009, part 1/2: make sure vector expressions expose LinearAccessBit flag.
|
2015-11-27 10:06:07 +01:00 |
|
Mark Borgerding
|
7ddcf97da7
|
added scalar_sign_op (both real,complex)
|
2015-11-24 17:15:07 -05:00 |
|
Gael Guennebaud
|
f9fff67a56
|
Disable "decorated name length exceeded, name was truncated" MSVC warning.
|
2015-11-23 15:03:24 +01:00 |
|
Gael Guennebaud
|
f3dca16a1d
|
bug #1117: workaround unused-local-typedefs warning when EIGEN_NO_STATIC_ASSERT and NDEBUG are both defined.
|
2015-11-23 14:07:52 +01:00 |
|
Gael Guennebaud
|
82bd4e546a
|
Merged in dr15jones/eigen (pull request PR-146)
Use a class constructor to initialize CPU cache sizes
|
2015-11-22 22:50:31 +01:00 |
|
Gael Guennebaud
|
35c17a3fc8
|
Use overload instead of template full specialization to please old MSVC
|
2015-11-22 22:09:57 +01:00 |
|