Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							f9eff17e91 
							
						 
					 
					
						
						
							
							Leverage libxsmm kernels within single-threaded contractions  
						
						
						
						
					 
					
						2016-12-21 12:32:06 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							4deafd35b7 
							
						 
					 
					
						
						
							
							Introduce a portable EIGEN_SLEEP macro.  
						
						
						
						
					 
					
						2016-12-09 14:52:15 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							e633a8371f 
							
						 
					 
					
						
						
							
							Simplified includes  
						
						
						
						
					 
					
						2016-11-30 20:21:18 -08:00 
						 
				 
			
				
					
						
							
							
								Mehdi Goli 
							
						 
					 
					
						
						
						
						
							
						
						
							7318daf887 
							
						 
					 
					
						
						
							
							Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h.  
						
						
						
						
					 
					
						2016-11-25 16:19:07 +00:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							2d1aec15a7 
							
						 
					 
					
						
						
							
							Added missing include  
						
						
						
						
					 
					
						2016-11-19 08:09:54 -08:00 
						 
				 
			
				
					
						
							
							
								Luke Iwanski 
							
						 
					 
					
						
						
						
						
							
						
						
							c5130dedbe 
							
						 
					 
					
						
						
							
							Specialised basic math functions for SYCL device.  
						
						
						
						
					 
					
						2016-11-17 11:47:13 +00:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							dad177be01 
							
						 
					 
					
						
						
							
							Added missing includes  
						
						
						
						
					 
					
						2016-11-05 10:04:42 -07:00 
						 
				 
			
				
					
						
							
							
								Mehdi Goli 
							
						 
					 
					
						
						
						
						
							
						
						
							0ebe3808ca 
							
						 
					 
					
						
						
							
							Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size;  
						
						
						
						
					 
					
						2016-11-04 18:18:19 +00:00 
						 
				 
			
				
					
						
							
							
								Luke Iwanski 
							
						 
					 
					
						
						
						
						
							
						
						
							03b63e182c 
							
						 
					 
					
						
						
							
							Added SYCL include in Tensor.  
						
						
						
						
					 
					
						2016-10-20 15:32:44 +01:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							7e4a6754b2 
							
						 
					 
					
						
						
							
							Merged eigen/eigen into default  
						
						
						
						
					 
					
						2016-10-12 22:42:33 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							7f0599b6eb 
							
						 
					 
					
						
						
							
							Manually define int16_t and uint16_t when compiling with Visual Studio  
						
						
						
						
					 
					
						2016-10-08 22:56:32 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							ae1385c7e4 
							
						 
					 
					
						
						
							
							Pull the latest updates from trunk  
						
						
						
						
					 
					
						2016-10-05 14:54:36 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							2f6d1607c8 
							
						 
					 
					
						
						
							
							Cleaned up the random number generation code.  
						
						
						
						
					 
					
						2016-10-04 08:38:23 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							6565f8d60f 
							
						 
					 
					
						
						
							
							Made the initialization of a CUDA device thread safe.  
						
						
						
						
					 
					
						2016-09-26 11:00:32 -07:00 
						 
				 
			
				
					
						
							
							
								Luke Iwanski 
							
						 
					 
					
						
						
						
						
							
						
						
							cb81975714 
							
						 
					 
					
						
						
							
							Partial OpenCL support via SYCL compatible with ComputeCpp CE.  
						
						
						
						
					 
					
						2016-09-19 12:44:13 +01:00 
						 
				 
			
				
					
						
							
							
								Gael Guennebaud 
							
						 
					 
					
						
						
						
						
							
						
						
							2f7e2614e7 
							
						 
					 
					
						
						
							
							bug #1232: refactor special functions as a new SpecialFunctions module, currently in unsupported/.  
						
						
						
						
					 
					
						2016-07-08 11:13:55 +02:00 
						 
				 
			
				
					
						
							
							
								Eugene Brevdo 
							
						 
					 
					
						
						
						
						
							
						
						
							39baff850c 
							
						 
					 
					
						
						
							
							Add TernaryFunctors and the betainc SpecialFunction.  
						
						
						
						
						TernaryFunctors and their executors allow operations on 3-tuples of inputs.
API fully implemented for Arrays and Tensors based on binary functors.
Ported the cephes betainc function (regularized incomplete beta
integral) to Eigen, with support for CPU and GPU, floats, doubles, and
half types.
Added unit tests in array.cpp and cxx11_tensor_cuda.cu
Collapsed revision
* Merged helper methods for betainc across floats and doubles.
* Added TensorGlobalFunctions with betainc().  Removed betainc() from TensorBase.
* Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper.
* betainc: merge incbcf and incbd into incbeta_cfe.  and more cleanup.
* Update TernaryOp and SpecialFunctions (betainc) based on review comments. 
						
						
					 
					
						2016-06-02 17:04:19 -07:00 
						 
				 
			
				
					
						
							
							
								Igor Babuschkin 
							
						 
					 
					
						
						
						
						
							
						
						
							fbd7ed6ff7 
							
						 
					 
					
						
						
							
							Add tensor scan op  
						
						
						
						
						This is the initial implementation of a generic scan operation.
Based on this, cumsum and cumprod methods have been added to TensorBase. 
						
						
					 
					
						2016-06-02 13:35:47 +01:00 
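The scan commit above (fbd7ed6ff7) describes cumsum and cumprod as built on one generic scan. A minimal sketch of that idea on a plain 1-D std::vector follows. It is not Eigen's implementation: the names inclusive_scan_sketch, cumsum_sketch and cumprod_sketch are hypothetical, and the inclusive (running-total) convention is an assumption.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical helper, NOT Eigen's API: inclusive scan over a 1-D sequence.
// out[i] = reduce(reduce(in[0], in[1]), ..., in[i])
template <typename T, typename Reducer>
std::vector<T> inclusive_scan_sketch(const std::vector<T>& in, Reducer reduce) {
  std::vector<T> out;
  out.reserve(in.size());
  for (std::size_t i = 0; i < in.size(); ++i) {
    // The first element is copied; each later one folds in the previous result.
    out.push_back(i == 0 ? in[i] : reduce(out[i - 1], in[i]));
  }
  return out;
}

// Cumulative sum and product are the same scan with a different reducer.
template <typename T>
std::vector<T> cumsum_sketch(const std::vector<T>& in) {
  return inclusive_scan_sketch(in, std::plus<T>());
}

template <typename T>
std::vector<T> cumprod_sketch(const std::vector<T>& in) {
  return inclusive_scan_sketch(in, std::multiplies<T>());
}
```

Keeping the reducer as a template parameter is what lets both methods share one loop; a tensor version would additionally pick the axis along which to scan.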
						 
				 
			
				
					
						
							
							
								Rasmus Munk Larsen 
							
						 
					 
					
						
						
						
						
							
						
						
							0dbd68145f 
							
						 
					 
					
						
						
							
							Roll back changes to core. Move include of TensorFunctors.h up to satisfy dependence in TensorCostModel.h.  
						
						
						
						
					 
					
						2016-05-17 10:25:19 -07:00 
						 
				 
			
				
					
						
							
							
								Rasmus Munk Larsen 
							
						 
					 
					
						
						
						
						
							
						
						
							e55deb21c5 
							
						 
					 
					
						
						
							
							Improvements to parallelFor.  
						
						
						
						
						Move some scalar functors from TensorFunctors.h to Eigen core. 
						
						
					 
					
						2016-05-12 14:07:22 -07:00 
						 
				 
			
				
					
						
							
							
								Gael Guennebaud 
							
						 
					 
					
						
						
						
						
							
						
						
							318e65e0ae 
							
						 
					 
					
						
						
							
							Fix missing inclusion of Eigen/Core  
						
						
						
						
					 
					
						2016-04-27 23:05:40 +02:00 
						 
				 
			
				
					
						
							
							
								Gael Guennebaud 
							
						 
					 
					
						
						
						
						
							
						
						
							3dddd34133 
							
						 
					 
					
						
						
							
							Refactor the unsupported CXX11/Core module to internal headers only.  
						
						
						
						
					 
					
						2016-04-26 11:20:25 +02:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							6fbedf5a4e 
							
						 
					 
					
						
						
							
							Merged in rmlarsen/eigen (pull request PR-177)  
						
						
						
						
						Eigen Tensor cost model part 1. 
						
						
					 
					
						2016-04-14 17:13:19 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							1372156c41 
							
						 
					 
					
						
						
							
							Prepared the migration to the new non blocking thread pool  
						
						
						
						
					 
					
						2016-04-14 16:16:42 -07:00 
						 
				 
			
				
					
						
							
							
								Rasmus Munk Larsen 
							
						 
					 
					
						
						
						
						
							
						
						
							235e83aba6 
							
						 
					 
					
						
						
							
							Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions.  
						
						
						
						
					 
					
						2016-04-14 13:57:35 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							002cf0d1c9 
							
						 
					 
					
						
						
							
							Use a single Barrier instead of a collection of Notifications to reduce the thread synchronization overhead  
						
						
						
						
					 
					
						2016-03-22 15:24:23 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							b2100b83ad 
							
						 
					 
					
						
						
							
							Made sure to include the <random> header file when compiling with visual studio  
						
						
						
						
					 
					
						2016-03-09 16:03:16 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							5cd00068c0 
							
						 
					 
					
						
						
							
							include <iostream> in the tensor header since we now use it to better report cuda initialization errors  
						
						
						
						
					 
					
						2016-02-22 13:59:03 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							4beb447e27 
							
						 
					 
					
						
						
							
							Created a mechanism to enable contraction mappers to determine the best blocking strategy.  
						
						
						
						
					 
					
						2016-01-22 14:37:26 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							6d472d8375 
							
						 
					 
					
						
						
							
							Moved the contraction mapping code to its own file to make the code more manageable.  
						
						
						
						
					 
					
						2016-01-19 17:22:05 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							2ca55a3ae4 
							
						 
					 
					
						
						
							
							Fixed some compilation errors triggered by the tensor code with msvc 2008  
						
						
						
						
					 
					
						2015-12-16 20:45:58 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							df31ca3b9e 
							
						 
					 
					
						
						
							
							Made it possible to refer to a GPUDevice from code compiled with a regular C++ compiler  
						
						
						
						
					 
					
						2015-11-23 10:03:53 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							9fa65d3838 
							
						 
					 
					
						
						
							
							Split TensorDeviceType.h in 3 files to make it more manageable  
						
						
						
						
					 
					
						2015-11-20 17:42:50 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							f8df393165 
							
						 
					 
					
						
						
							
							Added support for 128bit integers on CUDA devices.  
						
						
						
						
					 
					
						2015-11-19 13:57:27 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							d573efe303 
							
						 
					 
					
						
						
							
							Code cleanup  
						
						
						
						
					 
					
						2015-11-06 14:54:28 -08:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							ac99b49249 
							
						 
					 
					
						
						
							
							Added missing glue logic  
						
						
						
						
					 
					
						2015-10-22 16:54:21 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							6585efc553 
							
						 
					 
					
						
						
							
							Tightened the definition of isOfNormalIndex to take into account integer types in addition to arrays of indices  
						
						
						
						
						Only compile the custom index code when EIGEN_HAS_SFINAE is defined. For the time being, EIGEN_HAS_SFINAE is a synonym for EIGEN_HAS_VARIADIC_TEMPLATES, but this might evolve in the future.
Moved some code around. 
						
						
					 
					
						2015-10-14 09:31:37 -07:00 
						 
				 
			
				
					
						
							
							
								Gabriel Nützi 
							
						 
					 
					
						
						
						
						
							
						
						
							6edae2d30d 
							
						 
					 
					
						
						
							
							added CustomIndex capability only to Tensor and not yet to TensorBase.  
						
						
						
						
						using Sfinae and is_base_of to select correct template which converts to array<Index,NumIndices>
 user: Gabriel Nützi <gnuetzi@gmx.ch>
 branch 'default'
 added unsupported/Eigen/CXX11/src/Tensor/TensorMetaMacros.h
 added unsupported/test/cxx11_tensor_customIndex.cpp
 changed unsupported/Eigen/CXX11/Tensor
 changed unsupported/Eigen/CXX11/src/Tensor/Tensor.h
 changed unsupported/Eigen/CXX11/src/Tensor/TensorMeta.h
 changed unsupported/test/CMakeLists.txt 
						
						
					 
					
						2015-10-09 18:52:48 +02:00 
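The CustomIndex commit above (6edae2d30d) mentions using SFINAE and is_base_of to choose the template that converts to array<Index,NumIndices>. The sketch below shows one way such a selection could look with the standard library only. It is a guess at the general mechanism, not the code from that commit: to_index_array and CustomIndex3Sketch are hypothetical names, and std::array stands in for Eigen's array type.

```cpp
#include <array>
#include <cstddef>
#include <type_traits>

// Hypothetical user-defined index type (assumption): anything with operator[].
struct CustomIndex3Sketch {
  long v[3];
  long operator[](std::size_t i) const { return v[i]; }
};

// Overload 1: the argument already is (or derives from) std::array<Index, N>,
// so it is passed through by reference without copying.
template <typename Index, std::size_t N, typename IndexArray>
typename std::enable_if<
    std::is_base_of<std::array<Index, N>, IndexArray>::value,
    const std::array<Index, N>&>::type
to_index_array(const IndexArray& idx) {
  return idx;
}

// Overload 2: any other type is converted element by element via operator[].
template <typename Index, std::size_t N, typename IndexArray>
typename std::enable_if<
    !std::is_base_of<std::array<Index, N>, IndexArray>::value,
    std::array<Index, N> >::type
to_index_array(const IndexArray& idx) {
  std::array<Index, N> out{};
  for (std::size_t i = 0; i < N; ++i) {
    out[i] = static_cast<Index>(idx[i]);
  }
  return out;
}
```

Because the two enable_if conditions are exact complements, exactly one overload is viable for any argument type, so the call is never ambiguous.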
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							f41831e445 
							
						 
					 
					
						
						
							
							Added support for argmax/argmin  
						
						
						
						
					 
					
						2015-08-31 08:18:53 -07:00 
						 
				 
			
				
					
						
							
							
								Christoph Hertzberg 
							
						 
					 
					
						
						
						
						
							
						
						
							0721690dbb 
							
						 
					 
					
						
						
							
							Use standard include syntax in Tensor module (<> for include-path and "" for relative path)  
						
						
						
						
					 
					
						2015-08-18 14:34:00 +02:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							05787f8367 
							
						 
					 
					
						
						
							
							Added support for tensor inflation.  
						
						
						
						
					 
					
						2015-07-16 09:04:05 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							f1f480b116 
							
						 
					 
					
						
						
							
							Added support for user defined custom tensor op.  
						
						
						
						
					 
					
						2015-06-30 15:36:29 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							dc31fcb9ba 
							
						 
					 
					
						
						
							
							Added support for 3D patch extraction  
						
						
						
						
					 
					
						2015-06-30 14:48:26 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							3625734bc8 
							
						 
					 
					
						
						
							
							Moved some utilities to TensorMeta.h to make it easier to reuse them across several tensor operations.  
						
						
						
						
						Created the TensorDimensionList class to encode the list of all the dimensions of a tensor of rank n. This could be done using TensorIndexList; however, TensorIndexList requires cxx11, which isn't yet supported as widely as we'd like. 
						
						
					 
					
						2015-06-29 10:49:55 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							f13b3d4433 
							
						 
					 
					
						
						
							
							Added missing include files  
						
						
						
						
					 
					
						2015-05-28 07:57:28 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							6b800744ce 
							
						 
					 
					
						
						
							
							Moved away from std::async and std::future as the underlying mechanism for the thread pool device. On several platforms, the functions passed to std::async are not scheduled in the order in which they are given to std::async, which leads to massive performance issues in the contraction code.  
						
						
						
						
						Instead we now have a custom thread pool that ensures that the functions are picked up by the threads in the pool in the order in which they are enqueued in the pool. 
						
						
					 
					
						2015-05-20 13:52:07 -07:00 
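The thread pool commit above (6b800744ce) describes a pool whose threads pick up functions in the order in which they were enqueued. A minimal sketch of that FIFO property, using only the C++11 standard library, follows. It is an illustration under assumptions, not the pool from that commit: the class name FifoThreadPoolSketch and its Schedule method are hypothetical.

```cpp
#include <condition_variable>
#include <cstddef>
#include <deque>
#include <functional>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical sketch, NOT Eigen's thread pool: workers dequeue tasks
// strictly in the order in which Schedule() enqueued them.
class FifoThreadPoolSketch {
 public:
  explicit FifoThreadPoolSketch(std::size_t num_threads) {
    for (std::size_t i = 0; i < num_threads; ++i) {
      workers_.emplace_back([this] { WorkerLoop(); });
    }
  }

  // Drains every pending task, then joins the workers.
  ~FifoThreadPoolSketch() {
    {
      std::lock_guard<std::mutex> lock(mu_);
      done_ = true;
    }
    cv_.notify_all();
    for (std::thread& t : workers_) t.join();
  }

  void Schedule(std::function<void()> fn) {
    {
      std::lock_guard<std::mutex> lock(mu_);
      pending_.push_back(std::move(fn));  // enqueue at the back
    }
    cv_.notify_one();
  }

 private:
  void WorkerLoop() {
    for (;;) {
      std::function<void()> fn;
      {
        std::unique_lock<std::mutex> lock(mu_);
        cv_.wait(lock, [this] { return done_ || !pending_.empty(); });
        if (pending_.empty()) return;  // shutting down and nothing left to run
        fn = std::move(pending_.front());  // dequeue from the front: FIFO
        pending_.pop_front();
      }
      fn();  // run outside the lock so other workers can dequeue
    }
  }

  std::mutex mu_;
  std::condition_variable cv_;
  std::deque<std::function<void()> > pending_;
  bool done_ = false;
  std::vector<std::thread> workers_;
};
```

With several workers the tasks still start in enqueue order, though they may finish out of order; with a single worker, execution order equals enqueue order exactly.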
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							91359e1d0a 
							
						 
					 
					
						
						
							
							Added the ability to generate a tensor from a custom user defined 'generator'. This simplifies the creation of constant tensors initialized using specific regular patterns.  
						
						
						
						
						Created a gaussian window generator as a first use case. 
						
						
					 
					
						2015-04-22 11:14:58 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							0f82399fe9 
							
						 
					 
					
						
						
							
							Pulled latest changes from trunk  
						
						
						
						
					 
					
						2015-04-14 19:13:34 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							a1f1e1e51d 
							
						 
					 
					
						
						
							
							Fixed the order of 2 #includes  
						
						
						
						
					 
					
						2015-04-06 10:41:39 -07:00 
						 
				 
			
				
					
						
							
							
								Benoit Steiner 
							
						 
					 
					
						
						
						
						
							
						
						
							74e558cfa8 
							
						 
					 
					
						
						
							
							Pulled latest updates from trunk  
						
						
						
						
					 
					
						2015-04-01 23:24:11 -07:00