eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-22 21:11:07 +08:00

Author	SHA1	Message	Date
Benoit Steiner	e29c9676b1	Don't mark the cast operator as explicit, since this is a c++11 feature that's not supported by older compilers.	2016-03-12 00:15:58 -08:00
Benoit Steiner	eecd914864	Also replaced uint32_t with unsigned int to make the code more portable	2016-03-11 19:34:21 -08:00
Benoit Steiner	1ca8c1ec97	Replaced a couple more uint16_t with unsigned short	2016-03-11 19:28:28 -08:00
Benoit Steiner	0423b66187	Use unsigned short instead of uint16_t since they're more portable	2016-03-11 17:53:41 -08:00
Benoit Steiner	048c4d6efd	Made half floats usable on hardware that doesn't support them natively.	2016-03-11 17:21:42 -08:00
Benoit Steiner	456e038a4e	Fixed the +=, -=, *= and /= operators to return a reference	2016-03-10 15:17:44 -08:00
Eugene Brevdo	5e7de771e3	Properly fix merge issues.	2016-03-08 17:35:05 -08:00
Eugene Brevdo	73220d2bb0	Resolve bad merge.	2016-03-08 17:28:21 -08:00
Eugene Brevdo	5707004d6b	Fix Eigen's building of sharded tests that use CUDA & more igamma/igammac bugfixes. 0. Prior to this PR, not a single sharded CUDA test was actually being run. Fixed that. GPU tests are still failing for igamma/igammac. 1. Add calls for igamma/igammac to TensorBase 2. Fix up CUDA-specific calls of igamma/igammac 3. Add unit tests for digamma, igamma, igammac in CUDA.	2016-03-07 14:08:56 -08:00
Eugene Brevdo	7ea35bfa1c	Initial implementation of igamma and igammac.	2016-03-03 19:39:41 -08:00
Benoit Steiner	1032441c6f	Enable partial support for half floats on Kepler GPUs.	2016-03-03 10:34:20 -08:00
Benoit Steiner	1da10a7358	Enable the conversion between floats and half floats on older GPUs that support it.	2016-03-03 10:33:20 -08:00
Benoit Steiner	6270d851e3	Declare the half float type as arithmetic.	2016-02-22 13:59:33 -08:00
Benoit Steiner	584832cb3c	Implemented the ptranspose function on half floats	2016-02-21 12:44:53 -08:00
Benoit Steiner	95fceb6452	Added the ability to compute the absolute value of a half float	2016-02-21 20:24:11 +00:00
Benoit Steiner	9ff269a1d3	Moved some of the fp16 operators outside the Eigen namespace to work around some nvcc limitations.	2016-02-20 07:47:23 +00:00
Benoit Steiner	180156ba1a	Added support for tensor reductions on half floats	2016-02-19 10:05:59 -08:00
Benoit Steiner	5c4901b83a	Implemented the scalar division of 2 half floats	2016-02-19 10:03:19 -08:00
Benoit Steiner	f7cb755299	Added support for operators +=, -=, *= and /= on CUDA half floats	2016-02-19 15:57:26 +00:00
Benoit Steiner	dc26459b99	Implemented protate() for CUDA	2016-02-19 15:16:54 +00:00
Benoit Steiner	ac5d706a94	Added support for simple coefficient wise tensor expression using half floats on CUDA devices	2016-02-19 08:19:12 +00:00
Benoit Steiner	0606a0a39b	FP16 on CUDA is only available starting with CUDA 7.5. Disable it when using an older version of CUDA	2016-02-18 23:15:23 -08:00
Benoit Steiner	17b9fbed34	Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa	2016-02-19 06:16:07 +00:00
Benoit Steiner	8ce46f9d89	Improved implementation of ptanh for SSE and AVX	2016-02-18 13:24:34 -08:00
Benoit Steiner	6d8b1dce06	Avoid implicit cast from double to float.	2016-02-10 18:07:11 -08:00
Benoit Steiner	bfb3fcd94f	Optimized implementation of the tanh function for SSE	2016-02-10 08:52:30 -08:00
Benoit Steiner	2d523332b3	Optimized implementation of the hyperbolic tangent function for AVX	2016-02-10 08:48:05 -08:00
Benoit Jacob	e6ee18d6b4	Make the GCC workaround for sqrt GCC-only; detect Emscripten as non-GCC	2016-02-10 11:11:49 -05:00
Benoit Jacob	964a95bf5e	Work around Emscripten bug - https://github.com/kripken/emscripten/issues/4088	2016-02-10 10:37:22 -05:00
Gael Guennebaud	c2bf2f56ef	Remove custom unaligned loads for SSE. They were only useful for core2 CPU.	2016-02-08 14:29:12 +01:00
Benoit Steiner	3ca1ae2bb7	Commented out the version of pexp<Packet8d> since it fails to compile with gcc 5.3	2016-02-04 13:49:06 -08:00
Benoit Steiner	23f69ab936	Added implementations of pexp, plog, psqrt, and prsqrt optimized for AVX512	2016-02-04 10:36:36 -08:00
Benoit Steiner	6c9cf117c1	Fixed indentation	2016-02-04 10:34:10 -08:00
Benoit Steiner	85b6d82b49	Generalized predux4 to support AVX512 packets, and renamed it predux_half. Disabled the implementation of pabs for avx512 since the corresponding intrinsics are not shipped with gcc	2016-02-01 14:35:51 -08:00
Gael Guennebaud	ddf64babde	merge	2016-01-28 13:21:48 +01:00
Gael Guennebaud	7cae8918c0	Fix compilation on old gcc+AVX	2016-01-21 20:30:32 +01:00
Gael Guennebaud	8dca9f97e3	Add numext::sqrt function to enable custom optimized implementation. This changeset adds two specializations for float/double on SSE. Those are mostly useful with GCC, for which std::sqrt adds an extra and costly check on the result of _mm_sqrt_*. Clang does not add this burden. In this changeset, only DenseBase::norm() makes use of it.	2016-01-21 20:18:51 +01:00
Benoit Steiner	c1a42c2d0d	Don't disable the AVX implementations of plset when compiling with AVX512 enabled	2016-01-14 17:21:39 -08:00
Benoit Steiner	0366478df8	Added alignment requirement to the AVX512 packet traits.	2016-01-14 17:02:39 -08:00
Benoit Steiner	3cfd16f3af	Fixed the signature of the plset primitives for AVX512	2016-01-14 16:58:01 -08:00
Benoit Steiner	67f44365ea	Fixed the AVX512 signature of the ptranspose primitives	2016-01-14 16:51:11 -08:00
Benoit Steiner	a282eb1363	pscatter/pgather use Index instead of int to specify the stride	2016-01-14 16:39:39 -08:00
Benoit Steiner	7832485575	Deleted unnecessary commas and semicolons	2016-01-14 16:36:29 -08:00
Gael Guennebaud	70404e07c2	Workaround clang -Wdocumentation warning about "/*<"	2015-12-30 16:46:45 +01:00
Eugene Brevdo	cef81c9084	Merged eigen/eigen into default	2015-12-24 21:17:33 -08:00
Eugene Brevdo	f7362772e3	Add digamma for CPU + CUDA. Includes tests.	2015-12-24 21:15:38 -08:00
Gael Guennebaud	d2e288ae50	Workaround compilers that do not even define _mm256_set_m128.	2015-12-24 16:53:43 +01:00
Benoit Steiner	b74887d5f2	Implemented most of the packet primitives for AVX512	2015-12-21 11:46:36 -08:00
Benoit Steiner	994d1c60b9	Free memory allocated using posix_memalign() with free() instead of std::free()	2015-12-21 11:21:39 -08:00
Benoit Steiner	a6c243617b	Fixed a typo in previous change.	2015-12-21 09:05:45 -08:00

1082 Commits