eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-19 19:41:07 +08:00

Author	SHA1	Message	Date
Benoit Steiner	d485d12c51	Added missing AVX intrinsics for fp16: in particular, implemented predux which is required by the matrix-vector code.	2016-10-06 10:41:03 -07:00
Benoit Steiner	698ff69450	Properly characterize the CUDA packet primitives for fp16 as device only	2016-10-04 16:53:30 -07:00
Benoit Steiner	409e887d78	Added support for constand std::complex numbers on GPU	2016-10-03 11:06:24 -07:00
Benoit Steiner	26f9907542	Added missing typedefs	2016-09-20 12:58:03 -07:00
RJ Ryan	b2c6dc48d9	Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.	2016-09-20 07:18:20 -07:00
Gael Guennebaud	8f4b4ad5fb	use ::hlog if available.	2016-08-29 11:05:32 +02:00
Gael Guennebaud	35a8e94577	bug #1167 : simplify installation of header files using cmake's install(DIRECTORY ...) command.	2016-08-29 10:59:37 +02:00
Gael Guennebaud	d937a420a2	Fix compilation with MSVC by using our portable numext::log1p implementation.	2016-08-22 15:44:21 +02:00
Igor Babuschkin	59bacfe520	Fix compilation on CUDA 8 by removing call to h2log1p	2016-08-15 23:38:05 +01:00
Igor Babuschkin	aee693ac52	Add log1p support for CUDA and half floats	2016-08-08 20:24:59 +01:00
Benoit Steiner	fe778427f2	Fixed the constructors of the new half_base class.	2016-08-04 18:32:26 -07:00
Benoit Steiner	9506343349	Fixed the isnan, isfinite and isinf operations on GPU	2016-08-04 17:25:53 -07:00
Gael Guennebaud	17b9a55d98	Move Eigen::half_impl::half to Eigen::half while preserving the free functions to the Eigen::half_impl namespace together with ADL	2016-08-04 00:00:43 +02:00
Benoit Steiner	02fe89f5ef	half implementation has been moved to half_impl namespace	2016-07-29 15:09:34 -07:00
Christoph Hertzberg	c5b893f434	bug #1266 : half implementation has been moved to half_impl namespace	2016-07-29 18:36:08 +02:00
Gael Guennebaud	395c835f4b	Fix CUDA compilation	2016-07-22 15:30:24 +02:00
Gael Guennebaud	47afc9a365	More cleaning in half: - put its definition and functions in its own half_impl namespace such that the free function does not polute the Eigen namespace while still making them visible for half through ADL. - expose Eigen::half throguh a using statement - move operator<< from std to half_float namespace	2016-07-22 14:33:28 +02:00
Gael Guennebaud	0f350a8b7e	Fix CUDA compilation	2016-07-21 18:47:07 +02:00
Gael Guennebaud	87fbda812f	Add missing log10 and random generator for half.	2016-07-21 15:46:45 +02:00
Gael Guennebaud	01d12d3e82	Some cleanup in Halh: standard functions should be defined in the namespace of the class half to make ADL work, and thus the global is* functions can be removed.	2016-07-21 15:10:48 +02:00
Gael Guennebaud	a96a7ce3f7	Move CUDA's special functions to SpecialFunctions module.	2016-07-11 18:39:11 +02:00
Gael Guennebaud	2f7e2614e7	bug #1232 : refactor special functions as a new SpecialFunctions module, currently in unsupported/.	2016-07-08 11:13:55 +02:00
Benoit Steiner	8fd57a97f2	Enable the vectorization of adds and mults of fp16	2016-06-07 18:22:18 -07:00
Eugene Brevdo	39baff850c	Add TernaryFunctors and the betainc SpecialFunction. TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.	2016-06-02 17:04:19 -07:00
Benoit Steiner	b6e306f189	Improved support for CUDA 8.0	2016-05-31 09:47:59 -07:00
Benoit Steiner	3a5d6a3c38	Disable the use of MMX instructions since the code is broken on many platforms	2016-05-27 09:13:26 -07:00
Benoit Steiner	094f4a56c8	Deleted extra namespace	2016-05-26 14:49:51 -07:00
Gael Guennebaud	7ff5fadcc0	Disable usage of MMX with msvc.	2016-05-26 17:58:46 +02:00
Gael Guennebaud	cc1ab64f29	Add missing inclusion of mmintrin.h	2016-05-26 09:51:50 +02:00
Benoit Steiner	3585ff585e	Silenced a compilation warning	2016-05-25 22:09:19 -07:00
Benoit Steiner	efeb89dcdb	Specify the rounding mode in the correct location	2016-05-25 17:53:24 -07:00
Benoit Steiner	0322c66a3f	Explicitly specify the rounding mode when converting floats to fp16	2016-05-25 15:56:15 -07:00
Benoit Steiner	ed783872ab	Disable the use of MMX instructions on x86_64 since too many compilers only support them in 32bit mode	2016-05-25 08:27:26 -07:00
Gael Guennebaud	bbf9109e25	Fix compilation with ICC.	2016-05-25 10:00:55 +02:00
Benoit Steiner	d041a528da	Cleaned up the fp16 code a little more	2016-05-24 22:43:26 -07:00
Benoit Steiner	ff4a289572	Cleaned up the fp16 code	2016-05-24 18:50:09 -07:00
Benoit Jacob	40a16282c7	Remove now-unused protate PacketMath func	2016-05-24 11:01:18 -04:00
Benoit Steiner	e617711306	Don't attempt to use MMX instructions with visualstudio since they're only partially supported.	2016-05-24 06:43:58 -07:00
Benoit Steiner	334e76537f	Worked around missing clang intrinsic	2016-05-24 00:29:28 -07:00
Benoit Steiner	b517ab349b	Use the generic ploadquad intrinsics since it does the job	2016-05-24 00:11:17 -07:00
Benoit Steiner	646872cb3b	Worked around missing clang intrinsics	2016-05-24 00:07:08 -07:00
Benoit Steiner	3dfc391a61	Added missing EIGEN_DEVICE_FUNC qualifier	2016-05-23 20:56:59 -07:00
Benoit Steiner	33a94f5dc7	Use the Index type instead of integers to specify the strides in pgather/pscatter	2016-05-23 20:37:30 -07:00
Benoit Steiner	6bc684ab6a	Added missing alignment in the fp16 packet traits	2016-05-23 20:32:30 -07:00
Benoit Steiner	283e33dea4	ptranspose is not a template.	2016-05-23 19:55:55 -07:00
Benoit Steiner	5ba0ebe7c9	Avoid unnecessary float to double conversion.	2016-05-23 17:14:31 -07:00
Benoit Steiner	7d980d74e5	Started to vectorize the processing of 16bit floats on CPU.	2016-05-23 15:21:40 -07:00
Christoph Hertzberg	88654762da	Replace multiple constructors of half-type by a generic/templated constructor. This fixes an incompatibility with long double, exposed by the previous commit.	2016-05-23 10:03:03 +02:00
Gael Guennebaud	1395056fc0	Make EIGEN_HAS_C99_MATH user configurable	2016-05-20 14:58:19 +02:00
Benoit Steiner	fae0493f98	Fixed a couple of bugs related to the Pascalfamily of GPUs H: Enter commit message. Lines beginning with 'HG:' are removed.	2016-05-11 23:02:26 -07:00

1 2 3

144 Commits