eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2025-10-15 01:21:29 +08:00

Author	SHA1	Message	Date
Charles Schlosser	99c18bce6e	Msvc muluh	2024-05-07 16:30:58 +00:00
Tobias Wood	f38e16c193	Apply clang-format	2023-11-29 11:12:48 +00:00
Antonio Sánchez	6e4d5d4832	Add IWYU private pragmas to internal headers.	2023-08-21 16:25:22 +00:00
Erik Schultheis	64909b82bd	static const class members turned into constexpr	2022-04-04 17:33:33 +00:00
Erik Schultheis	421cbf0866	Replace Eigen type metaprogramming with corresponding std types and make use of alias templates	2022-03-16 16:43:40 +00:00
Antonio Sánchez	cafeadffef	Fix ODR violations.	2022-02-04 19:01:07 +00:00
Rasmus Munk Larsen	d7d0bf832d	Issue an error in case of direct inclusion of internal headers.	2021-09-10 19:12:26 +00:00
Rasmus Munk Larsen	cc3d0e6a40	Add EIGEN_HAS_INTRINSIC_INT128 macro Add a new EIGEN_HAS_INTRINSIC_INT128 macro, and use this instead of __SIZEOF_INT128__. This fixes related issues with TensorIntDiv.h when building with Clang for Windows, where support for 128-bit integer arithmetic is advertised but broken in practice.	2019-11-06 14:24:33 -08:00
Mehdi Goli	7d08fa805a	[SYCL] This PR adds the minimum modifications to the Eigen unsupported module required to run it on devices supporting SYCL. * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes.	2019-06-28 10:08:23 +01:00
Deven Desai	b6cc0961b1	updates based on PR feedback There are two major changes (and a few minor ones which are not listed here...see PR discussion for details) 1. Eigen::half implementations for HIP and CUDA have been merged. This means that - `CUDA/Half.h` and `HIP/hcc/Half.h` got merged to a new file `GPU/Half.h` - `CUDA/PacketMathHalf.h` and `HIP/hcc/PacketMathHalf.h` got merged to a new file `GPU/PacketMathHalf.h` - `CUDA/TypeCasting.h` and `HIP/hcc/TypeCasting.h` got merged to a new file `GPU/TypeCasting.h` After this change the `HIP/hcc` directory only contains one file `math_constants.h`. That will go away too once that file becomes a part of the HIP install. 2. new macros EIGEN_GPUCC, EIGEN_GPU_COMPILE_PHASE and EIGEN_HAS_GPU_FP16 have been added and the code has been updated to use them where appropriate. - `EIGEN_GPUCC` is the same as `(EIGEN_CUDACC || EIGEN_HIPCC)` - `EIGEN_GPU_DEVICE_COMPILE` is the same as `(EIGEN_CUDA_ARCH || EIGEN_HIP_DEVICE_COMPILE)` - `EIGEN_HAS_GPU_FP16` is the same as `(EIGEN_HAS_CUDA_FP16 or EIGEN_HAS_HIP_FP16)`	2018-06-14 10:21:54 -04:00
Gael Guennebaud	b3fd93207b	Fix typos found using codespell	2018-06-07 14:43:02 +02:00
Katrin Leinweber	ea94543190	Hyperlink DOIs against preferred resolver	2018-05-24 18:55:40 +02:00
Gael Guennebaud	bbd97b4095	Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH aliases	2017-07-17 01:02:51 +02:00
Mehdi Goli	e46e722381	Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying Tensor Contractsycl to be located in any place in the expression tree.	2017-01-16 13:58:49 +00:00
Mehdi Goli	79aa2b784e	Adding sycl backend for TensorPadding.h; disabling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.	2016-12-01 13:02:27 +00:00
Mehdi Goli	7318daf887	Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h.	2016-11-25 16:19:07 +00:00
Gael Guennebaud	18c35747ce	Emulate _BitScanReverse64 for 32 bits builds	2016-07-11 11:38:04 +02:00
Gael Guennebaud	599f8ba617	Change runtime to compile-time conditional.	2016-07-08 11:39:43 +02:00
Benoit Steiner	b084133dbf	Fixed the integer division code on windows	2016-03-09 07:06:36 -08:00
Benoit Steiner	60d9df11c1	Fixed the computation of leading zeros when compiling with msvc.	2016-03-04 16:27:02 -08:00
Benoit Steiner	2c50fc878e	Fixed a typo	2016-03-04 14:09:38 -08:00
Benoit Steiner	d69946183d	Updated the TensorIntDivisor code to work properly on LLP64 systems	2016-02-08 21:03:59 -08:00
Benoit Steiner	547a8608e5	Fixed the implementation of Eigen::internal::count_leading_zeros for MSVC. Also updated the code to silence bogus warnings generated by nvcc when compiling this function.	2015-11-23 12:17:45 -08:00
Benoit Steiner	383d1cc2ed	Added proper support for fast 64bit integer division on CUDA	2015-11-20 11:09:46 -08:00
Benoit Steiner	1dd444ea71	Avoid using the version of TensorIntDiv optimized for 32-bit integers when the divisor can be equal to one since it isn't supported.	2015-11-18 11:37:58 -08:00
Christoph Hertzberg	1bdd06a199	Fix some trivial warnings	2015-08-19 21:38:18 +02:00
Benoit Steiner	a5dc49e7e8	Fixed 2 compilation warnings generated by llvm	2015-07-29 15:06:08 -07:00
Benoit Steiner	0570594f2c	Fixed a few compilation warnings triggered by clang	2015-07-29 11:48:38 -07:00
Benoit Steiner	099597406f	Simplified and generalized the DividerTraits code	2015-07-29 10:02:42 -07:00
Gael Guennebaud	6db3a557f4	Add missing specialization of struct DividerTraits<long>	2015-07-29 11:38:53 +02:00
Benoit Steiner	4200bdec24	Extended the range of value inputs for TensorIntDiv to support tensors with more than 4 billion elements.	2015-07-22 17:02:30 -07:00
Benoit Steiner	3912ca0d53	Fixed a bug in the integer division code that caused some large numerators to be incorrectly handled	2015-07-13 11:14:59 -07:00
Benoit Steiner	a93af65938	Improved and cleaned up the 2d patch extraction code	2015-07-07 08:52:14 -07:00
vanhoucke	4cc0c961f3	Fix undefined behavior.	2015-06-19 15:46:46 +00:00
Benoit Steiner	a81d17b73a	Added new version of the TensorIntDiv class optimized for 32 bit signed integers. It saves 1 register on CPU and 2 on GPU.	2015-05-19 13:59:52 -07:00
Benoit Steiner	ae73859a0a	Fixed incorrect assertion	2015-02-28 08:02:02 -08:00
Benoit Steiner	bb483313f6	Fixed another batch of compilation warnings	2015-02-28 02:32:46 -08:00
Benoit Steiner	f074bb4b5f	Fixed another compilation problem with TensorIntDiv.h	2015-02-26 11:14:23 -08:00
Benoit Steiner	bffb6bdf45	Made TensorIntDiv.h compile with MSVC	2015-02-25 23:54:43 -08:00
Benoit Steiner	27f3fb2bcc	Fixed another clang warning	2015-02-25 22:54:20 -08:00
Benoit Steiner	99d75235a9	Misc improvements and cleanups	2014-10-13 17:02:09 -07:00
Benoit Steiner	33c702c79f	Added support for fast integer divisions by a constant Sped up tensor slicing by a factor of 3 by using these fast integer divisions.	2014-08-14 22:13:21 -07:00

42 Commits