eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-15 17:41:30 +08:00

Author	SHA1	Message	Date
Benoit Steiner	a452dedb4f	Merged in ibab/eigen/double-tensor-reduction (pull request PR-216) Enable efficient Tensor reduction for doubles on the GPU (continued)	2016-08-18 12:29:54 -07:00
Igor Babuschkin	18c67df31c	Fix remaining CUDA >= 300 checks	2016-08-18 17:18:30 +01:00
Igor Babuschkin	1569a7d7ab	Add the necessary CUDA >= 300 checks back	2016-08-18 17:15:12 +01:00
Benoit Steiner	2b17f34574	Properly detect the type of the result of a contraction.	2016-08-16 16:00:30 -07:00
Igor Babuschkin	59bacfe520	Fix compilation on CUDA 8 by removing call to h2log1p	2016-08-15 23:38:05 +01:00
Benoit Steiner	34ae80179a	Use array_prod instead of calling TotalSize since TotalSize is only available on DSize.	2016-08-15 10:29:14 -07:00
Benoit Steiner	2556565b4b	Merged in ibab/eigen/extend-log1p (pull request PR-218) Fix compilation on CUDA 8 due to missing h2log1p function	2016-08-15 08:31:03 -07:00
Benoit Steiner	30dd6f5e34	Close branch extend-log1p	2016-08-15 08:31:03 -07:00
Benoit Steiner	fe73648c98	Fixed a bug in the documentation.	2016-08-12 10:00:43 -07:00
Christoph Hertzberg	9636a8ed43	bug #1273 : Add parentheses when redefining eigen_assert	2016-08-12 15:34:21 +02:00
Christoph Hertzberg	c83b754ee0	bug #1272 : Disable assertion when total number of columns is zero. Also moved assertion to finished() method and adapted unit-test	2016-08-12 15:15:34 +02:00
Benoit Steiner	e3a8dfb02f	std::erfcf doesn't exist: use numext::erfc instead	2016-08-11 15:24:06 -07:00
Benoit Steiner	64e68cbe87	Don't attempt to optimize partial reductions when the optimized implementation doesn't buy anything.	2016-08-08 19:29:59 -07:00
Benoit Steiner	5157ce8cbf	Merged in ibab/eigen/extend-log1p (pull request PR-217) Add log1p support for CUDA and half floats	2016-08-08 14:50:00 -07:00
Igor Babuschkin	aee693ac52	Add log1p support for CUDA and half floats	2016-08-08 20:24:59 +01:00
Benoit Steiner	72096f3bd4	Merged in suiyuan2009/eigen/fix_tanh_inconsistent_for_tensorflow (pull request PR-215) Fix_tanh_inconsistent_for_tensorflow	2016-08-08 09:06:45 -07:00
Christoph Hertzberg	3e4a33d4ba	bug #1272 : Let CommaInitializer work for more border cases (enhances fix of bug #1242 ). The unit test tests all combinations of 2x2 block-sizes from 0 to 3.	2016-08-08 17:26:48 +02:00
Igor Babuschkin	841e075154	Remove CUDA >= 300 checks and enable outer reductin for doubles	2016-08-06 18:07:50 +01:00
Ziming Dong	1031223c09	fix tanh inconsistent	2016-08-06 19:48:50 +08:00
Ziming Dong	5cf1e4c79b	create fix_tanh_inconsistent branch	2016-08-06 15:54:33 +08:00
Igor Babuschkin	0425118e2a	Merge upstream changes	2016-08-05 14:34:57 +01:00
Igor Babuschkin	9537e8b118	Make use of atomicExch for atomicExchCustom	2016-08-05 14:29:58 +01:00
Christoph Hertzberg	fe4b927e9c	Add aliases Eigen__DIR to Eigen3__DIR This is to make configuring work again after project was renamed from Eigen to Eigen3	2016-08-05 15:21:14 +02:00
Benoit Steiner	fe778427f2	Fixed the constructors of the new half_base class.	2016-08-04 18:32:26 -07:00
Benoit Steiner	5eea1c7f97	Fixed cut and paste bug in debud message	2016-08-04 17:34:13 -07:00
Benoit Steiner	9506343349	Fixed the isnan, isfinite and isinf operations on GPU	2016-08-04 17:25:53 -07:00
Benoit Steiner	b50d8f8c4a	Extended a regression test to validate that we basic fp16 support works with cuda 7.0	2016-08-03 16:50:13 -07:00
Benoit Steiner	fad9828769	Deleted redundant regression test.	2016-08-03 16:08:37 -07:00
Benoit Steiner	373bb12dc6	Check that it's possible to forward declare the hlaf type.	2016-08-03 16:07:31 -07:00
Gael Guennebaud	17b9a55d98	Move Eigen::half_impl::half to Eigen::half while preserving the free functions to the Eigen::half_impl namespace together with ADL	2016-08-04 00:00:43 +02:00
Benoit Steiner	ca2cee2739	Merged in ibab/eigen (pull request PR-206) Expose real and imag methods on Tensors	2016-08-03 11:53:04 -07:00
Benoit Steiner	d92df04ce8	Cleaned up the new float16 test a bit	2016-08-03 11:50:07 -07:00
Benoit Steiner	81099ef482	Added a test for fp16	2016-08-03 11:41:17 -07:00
Benoit Steiner	a20b58845f	CUDA_ARCH isn't always defined, so avoid relying on it too much when figuring out which implementation to use for reductions. Instead rely on the device to tell us on which hardware version we're running.	2016-08-03 10:00:43 -07:00
Gael Guennebaud	819d0cea1b	List PARDISO solver.	2016-08-02 23:32:41 +02:00
Christoph Hertzberg	f4404777ff	Change project name to Eigen3, to be compatible with FindEigen3.cmake and Eigen3Config.cmake. This is related to pull-requests 214.	2016-08-02 17:08:57 +00:00
Benoit Steiner	fd220dd8b0	Use numext::conj instead of std::conj	2016-08-01 18:16:16 -07:00
Benoit Steiner	e256acec7c	Avoid unecessary object copies	2016-08-01 17:03:39 -07:00
Gael Guennebaud	7995cec90c	Fix vectorization logic for coeff-based product for some corner cases.	2016-07-31 15:20:22 +02:00
Benoit Steiner	02fe89f5ef	half implementation has been moved to half_impl namespace	2016-07-29 15:09:34 -07:00
Benoit Steiner	2693fd54bf	bug #1266 : half implementation has been moved to half_impl namespace	2016-07-29 13:45:56 -07:00
Christoph Hertzberg	c5b893f434	bug #1266 : half implementation has been moved to half_impl namespace	2016-07-29 18:36:08 +02:00
Gael Guennebaud	4057f9b1fc	Enable slice-vectorization+inner-unrolling when unaligned vectorization is allowed. For instance, this permits to vectorize 5x5 matrices (including product)	2016-07-28 13:47:33 +02:00
Gael Guennebaud	5fbe7aa604	Update and fix Cholesky mini benchmark	2016-07-28 11:26:30 +02:00
Gael Guennebaud	a72752caac	Vectorize more small product expressions by letting the general assignement logic decides on the sizes that are OK for vectorization.	2016-07-28 11:21:07 +02:00
Gael Guennebaud	cc2f6d68b1	bug #1264 : fix compilation	2016-07-27 23:30:47 +02:00
Gael Guennebaud	188590db82	Add instructions for LAPACKE+Accelerate	2016-07-27 15:07:35 +02:00
Gael Guennebaud	8972323c08	Big 1261: add missing max(ADS,ADS) overload (same for min)	2016-07-27 14:52:48 +02:00
Gael Guennebaud	5d94dc85e5	bug #1260 : add regression test	2016-07-27 14:38:30 +02:00
Gael Guennebaud	0d7039319c	bug #1260 : remove doubtful specializations of ScalarBinaryOpTraits	2016-07-27 14:35:52 +02:00

1 2 3 4 5 ...

8453 Commits