Mehdi Goli
|
2062ac9958
|
Changes required for new ComputeCpp CE version.
|
2017-09-18 18:17:39 +01:00 |
|
Benoit Steiner
|
1e736b9ead
|
Merged in mehdi_goli/opencl/SYCLAlignAllocator (pull request PR-7)
Fixing SYCL alignment issue required by TensorFlow.
|
2017-05-26 17:23:00 +00:00 |
|
Mehdi Goli
|
0370d3576e
|
Applying Ronnan's comments.
|
2017-05-26 16:01:48 +01:00 |
|
Mehdi Goli
|
e3f964ed55
|
Applying Benoit's comment;removing dead code.
|
2017-05-25 11:17:26 +01:00 |
|
Mehdi Goli
|
76c0fc1f95
|
Fixing SYCL alignment issue required by TensorFlow.
|
2017-05-22 16:49:32 +01:00 |
|
Mehdi Goli
|
2d17128d6f
|
Fixing suported device list.
|
2017-05-22 16:40:33 +01:00 |
|
Luke Iwanski
|
a91417a7a5
|
Introduces align allocator for SYCL buffer
|
2017-03-20 14:48:54 +00:00 |
|
Luke Iwanski
|
c06861d15e
|
Fixes bug in get_sycl_supported_devices() that was reporting unsupported Intel CPU on AMD platform - causing timeouts in that configuration
|
2017-03-15 19:26:08 +00:00 |
|
Luke Iwanski
|
1b32a10053
|
Use name to distinguish name instead of the vendor
|
2017-03-08 18:26:34 +00:00 |
|
Mehdi Goli
|
5e9a1e7a7a
|
Adding sycl Benchmarks.
|
2017-03-08 14:17:48 +00:00 |
|
Mehdi Goli
|
e2e3f78533
|
Fixing potential race condition on sycl device.
|
2017-03-07 17:48:15 +00:00 |
|
Mehdi Goli
|
f84963ed95
|
Adding TensorIndexTuple and TensorTupleReduceOP backend (ArgMax/Min) for sycl; fixing the address space issue for const TensorMap; converting all discard_write to write due to data missmatch.
|
2017-03-07 14:27:10 +00:00 |
|
Mehdi Goli
|
8296b87d7b
|
Adding sycl backend for TensorCustomOp; fixing the partial lhs modification issue on sycl when the rhs is TensorContraction, reduction or convolution; Fixing the partial modification for memset when sycl backend is used.
|
2017-02-28 17:16:14 +00:00 |
|
Mehdi Goli
|
bab29936a1
|
Reducing warnings in Sycl backend.
|
2017-02-01 15:29:53 +00:00 |
|
Mehdi Goli
|
82ce92419e
|
Fixing the buffer type in memcpy.
|
2017-01-30 11:38:20 +00:00 |
|
Luke Iwanski
|
bf44fed9b7
|
Allows AMD APU
|
2017-01-23 15:56:45 +00:00 |
|
Mehdi Goli
|
602f8c27f5
|
Reverting back to the previous TensorDeviceSycl.h as the total number of buffer is not enough for tensorflow.
|
2017-01-20 18:23:20 +00:00 |
|
Mehdi Goli
|
77cc4d06c7
|
Removing unused variables
|
2017-01-19 17:06:21 +00:00 |
|
Mehdi Goli
|
6bdd15f572
|
Adding non-deferrenciable pointer track for ComputeCpp backend; Adding TensorConvolutionOp for ComputeCpp; fixing typos. modifying TensorDeviceSycl to use the LegacyPointer class.
|
2017-01-19 11:30:59 +00:00 |
|
Mehdi Goli
|
8b1c2108ba
|
Reverting asynchronous exec to Synchronous exec regarding random race condition.
|
2016-12-22 16:45:38 +00:00 |
|
Benoit Steiner
|
70d0172f0c
|
Merged eigen/eigen into default
|
2016-12-16 17:37:04 -08:00 |
|
Benoit Steiner
|
8910442e19
|
Fixed memcpy, memcpyHostToDevice and memcpyDeviceToHost for Sycl.
|
2016-12-16 15:45:04 -08:00 |
|
Mehdi Goli
|
35bae513a0
|
Converting all parallel for lambda to functor in order to prevent kernel duplication name error; adding tensorConcatinationOp backend for sycl.
|
2016-12-16 19:46:45 +00:00 |
|
Mehdi Goli
|
c5e8546306
|
Adding asynchandler to sycl queue as lack of it can cause undefined behaviour.
|
2016-12-15 16:59:57 +00:00 |
|
Mehdi Goli
|
730eb9fe1c
|
Adding asynchronous execution as it improves the performance.
|
2016-12-14 17:38:53 +00:00 |
|
Mehdi Goli
|
2d4a091beb
|
Adding tensor contraction operation backend for Sycl; adding test for contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h
|
2016-12-14 15:30:37 +00:00 |
|
Mehdi Goli
|
79aa2b784e
|
Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.
|
2016-12-01 13:02:27 +00:00 |
|
Benoit Steiner
|
e073de96dc
|
Moved the MemCopyFunctor back to TensorSyclDevice since it's the only caller and it makes TensorFlow compile again
|
2016-11-30 21:36:52 -08:00 |
|
Benoit Steiner
|
fca27350eb
|
Added the deallocate_all() method back
|
2016-11-30 20:45:20 -08:00 |
|
Luke Iwanski
|
26fff1c5b1
|
Added EIGEN_STRONG_INLINE to get_sycl_supported_device().
|
2016-11-30 16:55:22 +00:00 |
|
Mehdi Goli
|
577ce78085
|
Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend.
|
2016-11-29 15:30:42 +00:00 |
|
Mehdi Goli
|
7318daf887
|
Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h.
|
2016-11-25 16:19:07 +00:00 |
|
Mehdi Goli
|
b8cc5635d5
|
Removing unsupported device from test case; cleaning the tensor device sycl.
|
2016-11-23 16:30:41 +00:00 |
|
Benoit Steiner
|
f11da1d83b
|
Made the QueueInterface thread safe
|
2016-11-20 13:17:08 -08:00 |
|
Benoit Steiner
|
6d781e3e52
|
Merged eigen/eigen into default
|
2016-11-20 10:12:54 -08:00 |
|
Benoit Steiner
|
79a07b891b
|
Fixed a typo
|
2016-11-20 07:07:41 -08:00 |
|
Benoit Steiner
|
81151bd474
|
Fixed merge conflicts
|
2016-11-19 19:12:59 -08:00 |
|
Benoit Steiner
|
9265ca707e
|
Made it possible to check the state of a sycl device without synchronization
|
2016-11-19 10:56:24 -08:00 |
|
Benoit Steiner
|
dc601d79d1
|
Added the ability to run test exclusively OpenCL devices that are listed by sycl::device::get_devices().
|
2016-11-18 16:26:50 -08:00 |
|
Benoit Steiner
|
37c2c516a6
|
Cleaned up the sycl device code
|
2016-11-18 12:38:06 -08:00 |
|
Mehdi Goli
|
15e226d7d3
|
adding Benoit changes on the TensorDeviceSycl.h
|
2016-11-18 16:34:54 +00:00 |
|
Mehdi Goli
|
622805a0c5
|
Modifying TensorDeviceSycl.h to always create buffer of type uint8_t and convert them to the actual type at the execution on the device; adding the queue interface class to separate the lifespan of sycl queue and buffers,created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all different devices that are supported by Sycl{CPU; GPU; and Host}.
|
2016-11-18 16:20:42 +00:00 |
|
Benoit Steiner
|
553f50b246
|
Added a way to detect errors generated by the opencl device from the host
|
2016-11-17 21:51:48 -08:00 |
|
Benoit Steiner
|
72a45d32e9
|
Cleanup
|
2016-11-17 21:29:15 -08:00 |
|
Benoit Steiner
|
4349fc640e
|
Created a test to check that the sycl runtime can successfully report errors (like ivision by 0).
Small cleanup
|
2016-11-17 20:27:54 -08:00 |
|
Mehdi Goli
|
f8ca893976
|
Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing.
|
2016-11-14 17:51:57 +00:00 |
|
Mehdi Goli
|
a5c3f15682
|
Adding comment to TensorDeviceSycl.h and cleaning the code.
|
2016-11-11 19:06:34 +00:00 |
|
Mehdi Goli
|
3be3963021
|
Adding EIGEN_STRONG_INLINE back; using size() instead of dimensions.TotalSize() on Tensor.
|
2016-11-10 19:16:31 +00:00 |
|
Mehdi Goli
|
12387abad5
|
adding the missing in eigen_assert!
|
2016-11-10 18:58:08 +00:00 |
|
Mehdi Goli
|
2e704d4257
|
Adding Memset; optimising MecopyDeviceToHost by removing double copying;
|
2016-11-10 18:45:12 +00:00 |
|