eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-08-14 20:56:00 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	038b55464b	Merged in deven-amd/eigen (pull request PR-425) applying EIGEN_DECLARE_TEST to *gpu unit tests	2018-07-17 21:14:40 +00:00
Deven Desai	f124f07965	applying EIGEN_DECLARE_TEST to gpu tests Also, a few minor fixes for GPU tests running in HIP mode. 1. Adding an include for hip/hip_runtime.h in the Macros.h file For HIP __host__ and __device__ are macros which are defined in hip headers. Their definitions need to be included before their use in the file. 2. Fixing the compile failure in TensorContractionGpu introduced by the commit to "Fuse computations into the Tensor contractions using output kernel" 3. Fixing a HIP/clang specific compile error by making the struct-member assignment explicit	2018-07-17 14:16:48 -04:00
Gael Guennebaud	dff3a92d52	Remove usage of #if EIGEN_TEST_PART_XX in unit tests that does not require them (splitting can thus be avoided for them)	2018-07-17 15:52:58 +02:00
Gael Guennebaud	82f0ce2726	Get rid of EIGEN_TEST_FUNC, unit tests must now be declared with EIGEN_DECLARE_TEST(mytest) { /* code */ }. This provide several advantages: - more flexibility in designing unit tests - unit tests can be glued to speed up compilation - unit tests are compiled with same predefined macros, which is a requirement for zapcc	2018-07-17 14:46:15 +02:00
Gael Guennebaud	37f4bdd97d	Fix VERIFY_EVALUATION_COUNT(EXPR,N) with a complex expression as N	2018-07-17 13:20:49 +02:00
Gael Guennebaud	2b2cd85694	bug #1573 : add noexcept move constructor and move assignment operator to Quaternion	2018-07-17 11:11:33 +02:00
Eugene Zhulenev	43206ac4de	Call OutputKernel in evalGemv	2018-07-12 14:52:23 -07:00
Eugene Zhulenev	e204ecdaaf	Remove SimpleThreadPool and always use {NonBlocking}ThreadPool	2018-07-16 15:06:57 -07:00
Eugene Zhulenev	b324ed55d9	Call OutputKernel in evalGemv	2018-07-12 14:52:23 -07:00
Eugene Zhulenev	01fd4096d3	Fuse computations into the Tensor contractions using output kernel	2018-07-10 13:16:38 -07:00
Gael Guennebaud	5539587b1f	Some warning fixes	2018-07-17 10:29:12 +02:00
Benoit Steiner	8f55956a57	Update the padding computation for PADDING_SAME to be consistent with TensorFlow.	2018-01-30 20:22:12 +00:00
Gael Guennebaud	09a16ba42f	bug #1412 : fix compilation with nvcc+MSVC	2018-01-17 23:13:16 +01:00
Lee.Deokjae	5b3c367926	Fix typos in the contraction example of tensor README	2018-01-06 14:36:19 +09:00
Eugene Chereshnev	f558ad2955	Fix incorrect ldvt in LAPACKE call from JacobiSVD	2018-01-03 12:55:52 -08:00
Benoit Steiner	22de74aa76	Disable use of recurrence for computing twiddle factors.	2018-01-09 18:32:52 +00:00
Gael Guennebaud	73629f8b68	Fix gcc7 warning	2018-01-09 08:59:27 +01:00
RJ Ryan	59985cfd26	Disable use of recurrence for computing twiddle factors. Fixes FFT precision issues for large FFTs. https://github.com/tensorflow/tensorflow/issues/10749#issuecomment-354557689	2017-12-31 10:44:56 -05:00
nluehr	f9bdcea022	For cuda 9.1 replace math_functions.hpp with cuda_runtime.h	2017-12-18 16:51:15 -08:00
Gael Guennebaud	06bf1047f9	Fix compilation of stableNorm with some expressions as input	2017-12-15 15:15:37 +01:00
Gael Guennebaud	73214c4bd0	Workaround nvcc 9.0 issue. See PR 351. https://bitbucket.org/eigen/eigen/pull-requests/351	2017-12-15 14:10:59 +01:00
Gael Guennebaud	31e0bda2e3	Fix cmake warning	2017-12-14 15:48:27 +01:00
Gael Guennebaud	26a2c6fc16	fix unit test	2017-12-14 15:11:04 +01:00
Gael Guennebaud	546ab97d76	Add possibility to overwrite EIGEN_STRONG_INLINE.	2017-12-14 14:47:38 +01:00
Gael Guennebaud	9c3aed9d48	Fix packet and alignment propagation logic of Block<Xpr> expressions. In particular, (A+B).col(j) lost vectorisation.	2017-12-14 14:24:33 +01:00
Gael Guennebaud	76c7dae600	ignore all build sub directories	2017-12-14 14:22:14 +01:00
Gael Guennebaud	b2cacd189e	fix header inclusion	2017-12-14 10:01:02 +01:00
Yangzihao Wang	3122477c86	Update the padding computation for PADDING_SAME to be consistent with TensorFlow.	2017-12-12 11:15:24 -08:00
Benoit Steiner	393b7c4959	Merged in ncluehr/eigen/float2half-fix (pull request PR-349) Replace __float2half_rn with __float2half	2017-12-01 00:29:51 +00:00
nluehr	aefd5fd5c4	Replace __float2half_rn with __float2half The latter provides a consistent definition for CUDA 8.0 and 9.0.	2017-11-28 10:15:46 -08:00
Gael Guennebaud	d0b028e173	clarify Pastix requirements	2017-11-27 22:11:57 +01:00
Gael Guennebaud	3587e481fb	silent MSVC warning	2017-11-27 21:53:02 +01:00
Benoit Steiner	3a327cd3c7	Merged in ncluehr/eigen/predux_fp16_fix (pull request PR-348) Fix incorrect integer cast in half2 predux.	2017-11-21 21:11:45 +00:00
nluehr	dd6de618c3	Fix incorrect integer cast in predux<half2>(). Bug corrupts results on Maxwell and earlier GPU architectures.	2017-11-21 10:47:00 -08:00
Gael Guennebaud	3dc6ff73ca	Handle PGI compiler	2017-11-17 22:54:39 +01:00
Zvi Rackover	599a88da27	Disable gcc-specific workaround for Clang to allow build with AVX512 There is currently a workaround for an issue in gcc that requires invoking gcc with the -fabi-version flag. This workaround is not needed for Clang and moreover is not supported.	2017-11-16 19:53:38 +00:00
Gael Guennebaud	672bdc126b	bug #1479 : fix failure detection in LDLT	2017-11-16 17:55:24 +01:00
Basil Fierz	624df50945	Adds missing EIGEN_STRONG_INLINE to support MSVC properly inlining small vector calculations When working with MSVC often small vector operations are not properly inlined. This behaviour is observed even on the most recent compiler versions.	2017-10-26 22:44:28 +02:00
Benoit Steiner	746a6b7b81	Merged in zzp11/eigen/zzp11/a-small-mistake-quickreferencedox-edited-1510217281963 (pull request PR-346) a small mistake QuickReference.dox edited online with Bitbucket	2018-03-23 01:02:34 +00:00
Benoit Steiner	d2631ef61d	Merged in facaiy/eigen/ENH/exp_support_complex_for_gpu (pull request PR-359) ENH: exp supports complex type for cuda	2018-03-23 00:59:15 +00:00
Benoit Steiner	8fcbd6d4c9	Merged in dtrebbien/eigen (pull request PR-369) Move up the specialization of std::numeric_limits	2018-03-23 00:54:58 +00:00
Rasmus Munk Larsen	e900b010c8	Improve robustness of igamma and igammac to bad inputs. Check for nan inputs and propagate them immediately. Limit the number of internal iterations to 2000 (same number as used by scipy.special.gammainc). This prevents an infinite loop when the function is called with nan or very large arguments. Original change by mfirgunov@google.com	2018-03-19 09:04:54 -07:00
Gael Guennebaud	f7d17689a5	Add static assertion for fixed sizes Ref<>	2018-03-09 10:11:13 +01:00
Gael Guennebaud	f6be7289d7	Implement better static assertion checking to make sure that the first assertion is a static one and not a runtime one.	2018-03-09 10:00:51 +01:00
Gael Guennebaud	d820ab9edc	Add static assertion on selfadjoint-view's UpLo parameter.	2018-03-09 09:33:43 +01:00
Daniel Trebbien	0c57be407d	Move up the specialization of std::numeric_limits This fixes a compilation error seen when building TensorFlow on macOS: https://github.com/tensorflow/tensorflow/issues/17067	2018-02-18 15:35:45 -08:00
Yan Facai (颜发才)	42a8334668	ENH: exp supports complex type for cuda	2018-01-04 16:01:01 +08:00
zhouzhaoping	912e9965ef	a small mistake QuickReference.dox edited online with Bitbucket	2017-11-09 08:49:01 +00:00
Gael Guennebaud	4c03b3511e	Fix issue with boost::multiprec in previous commit	2017-11-08 23:28:01 +01:00
Gael Guennebaud	e9d2888e74	Improve debugging tests and output in BDCSVD	2017-11-08 10:26:03 +01:00

... 6 7 8 9 10 ...

9864 Commits