Benoit Steiner
|
b5e3285e16
|
Test broadcasting on OpenCL devices with 64 bit indexing
|
2016-11-18 13:44:20 -08:00 |
|
Benoit Steiner
|
7335c49204
|
Fixed the cxx11_tensor_device_sycl test
|
2016-11-18 12:37:13 -08:00 |
|
Mehdi Goli
|
15e226d7d3
|
Adding Benoit's changes to TensorDeviceSycl.h
|
2016-11-18 16:34:54 +00:00 |
|
Mehdi Goli
|
622805a0c5
|
Modifying TensorDeviceSycl.h to always create buffers of type uint8_t and convert them to the actual type at execution on the device; adding the queue interface class to separate the lifespan of the sycl queue, and of the buffers created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all the devices supported by Sycl (CPU, GPU, and Host).
|
2016-11-18 16:20:42 +00:00 |
|
Luke Iwanski
|
5159675c33
|
Added isnan, isfinite and isinf for SYCL device. Plus test for that.
|
2016-11-18 16:01:48 +00:00 |
|
Luke Iwanski
|
927bd62d2a
|
Now testing out (+=, =) in.FUNC() and out (+=, =) out.FUNC()
|
2016-11-18 11:16:42 +00:00 |
|
Benoit Steiner
|
553f50b246
|
Added a way to detect errors generated by the opencl device from the host
|
2016-11-17 21:51:48 -08:00 |
|
Benoit Steiner
|
4349fc640e
|
Created a test to check that the sycl runtime can successfully report errors (like division by 0).
Small cleanup
|
2016-11-17 20:27:54 -08:00 |
|
Benoit Steiner
|
004344cf54
|
Avoid calling log(0) or 1/0
|
2016-11-17 11:56:44 -08:00 |
|
Luke Iwanski
|
7878756dea
|
Fixed existing test.
|
2016-11-17 17:46:55 +00:00 |
|
Luke Iwanski
|
c5130dedbe
|
Specialised basic math functions for SYCL device.
|
2016-11-17 11:47:13 +00:00 |
|
Mehdi Goli
|
05e8c2a1d9
|
Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl.
|
2016-11-14 18:13:53 +00:00 |
|
Mehdi Goli
|
f8ca893976
|
Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing.
|
2016-11-14 17:51:57 +00:00 |
|
Mehdi Goli
|
3be3963021
|
Adding EIGEN_STRONG_INLINE back; using size() instead of dimensions.TotalSize() on Tensor.
|
2016-11-10 19:16:31 +00:00 |
|
Mehdi Goli
|
2e704d4257
|
Adding Memset; optimising memcpyDeviceToHost by removing double copying.
|
2016-11-10 18:45:12 +00:00 |
|
Benoit Steiner
|
75c080b176
|
Added a test to validate memory transfers between host and sycl device
|
2016-11-09 06:23:42 -08:00 |
|
Benoit Steiner
|
db3903498d
|
Merged in benoitsteiner/opencl (pull request PR-246)
Improved support for OpenCL
|
2016-11-08 22:28:44 +00:00 |
|
Mehdi Goli
|
d57430dd73
|
Converting all sycl buffers to uninitialised device-only buffers; adding memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creation to the device based on Benoit's suggestion; removing the sycl-specific condition for returning m_result in TensorReduction.h according to Benoit's suggestion.
|
2016-11-08 17:08:02 +00:00 |
|
Benoit Steiner
|
ad086b03e4
|
Removed unnecessary statement
|
2016-11-05 12:43:27 -07:00 |
|
Gael Guennebaud
|
55b4fd1d40
|
Extend mpreal unit test to check LLT with complexes.
|
2016-11-05 11:28:53 +01:00 |
|
Mehdi Goli
|
0ebe3808ca
|
Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size.
|
2016-11-04 18:18:19 +00:00 |
|
Benoit Steiner
|
d5f88e2357
|
Sharded the tensor_image_patch test to help it run on low power devices
|
2016-10-27 21:48:21 -07:00 |
|
Benoit Steiner
|
0b4b0f11e8
|
Fixed a few more compilation warnings
|
2016-10-28 04:01:01 +00:00 |
|
Benoit Steiner
|
306daa24a3
|
Fixed a compilation warning
|
2016-10-28 03:50:31 +00:00 |
|
Benoit Steiner
|
8471cf1996
|
Fixed compilation warning
|
2016-10-28 03:46:08 +00:00 |
|
Benoit Steiner
|
cf20b30d65
|
Merge latest updates from trunk
|
2016-10-20 09:42:05 -07:00 |
|
Tal Hadad
|
15eca2432a
|
Euler tests: Tighter precision when no roll exists and clean code.
|
2016-10-18 23:24:57 +03:00 |
|
Tal Hadad
|
6f4f12d1ed
|
Add isApprox() and cast() functions.
test cases included
|
2016-10-17 22:23:47 +03:00 |
|
Tal Hadad
|
7402cfd4cc
|
Add safety for near pole cases and test them better.
|
2016-10-17 20:42:08 +03:00 |
|
Tal Hadad
|
58f5d7d058
|
Fix calc bug, docs and better testing.
Test code changes:
* better coded
* rand and manual numbers
* singularity checking
|
2016-10-16 14:39:26 +03:00 |
|
Tal Hadad
|
078a202621
|
Merge Hongkai Dai's correct range calculation, and remove ranges from API.
Docs updated.
|
2016-10-14 16:03:28 +03:00 |
|
Luke Iwanski
|
e742da8b28
|
Merged ComputeCpp into default.
|
2016-10-14 13:36:51 +01:00 |
|
Mehdi Goli
|
524fa4c46f
|
Reducing the code by generalising sycl backend functions/structs.
|
2016-10-14 12:09:55 +01:00 |
|
Hongkai Dai
|
014d9f1d9b
|
implement euler angles with the right ranges
|
2016-10-13 14:45:51 -07:00 |
|
Benoit Steiner
|
d0ee2267d6
|
Relaxed the resizing checks so that they don't fail with gcc >= 5.3
|
2016-10-13 10:59:46 -07:00 |
|
Benoit Steiner
|
7e4a6754b2
|
Merged eigen/eigen into default
|
2016-10-12 22:42:33 -07:00 |
|
Benoit Steiner
|
5266ff8966
|
Cleaned up a regression test
|
2016-10-08 19:12:44 +00:00 |
|
Benoit Steiner
|
5c68051cd7
|
Merge the content of the ComputeCpp branch into the default branch
|
2016-10-07 11:04:16 -07:00 |
|
RJ Ryan
|
bfc264abe8
|
Add a test that GPU complex product reductions match CPU reductions.
|
2016-10-06 11:10:14 -07:00 |
|
Benoit Steiner
|
d7f9679a34
|
Fixed a couple of compilation warnings
|
2016-10-05 15:00:32 -07:00 |
|
Benoit Steiner
|
ae1385c7e4
|
Pull the latest updates from trunk
|
2016-10-05 14:54:36 -07:00 |
|
Benoit Steiner
|
73b0012945
|
Fixed compilation warnings
|
2016-10-05 14:24:24 -07:00 |
|
Benoit Steiner
|
4387433acf
|
Increased the robustness of the reduction tests on fp16
|
2016-10-05 10:42:41 -07:00 |
|
Benoit Steiner
|
aad20d700d
|
Increase the tolerance to numerical noise.
|
2016-10-05 10:39:24 -07:00 |
|
Benoit Steiner
|
616a7a1912
|
Improved support for compiling CUDA code with clang as the host compiler
|
2016-10-03 17:09:33 -07:00 |
|
Benoit Steiner
|
422530946f
|
Renamed the SYCL tests to follow the standard naming convention.
|
2016-09-30 08:22:10 -07:00 |
|
Benoit Steiner
|
2bda1b0d93
|
Updated the tensor sum and mean reducer to enable them to process complex numbers on cuda gpus.
|
2016-09-28 17:08:41 -07:00 |
|
RJ Ryan
|
608b1acd6d
|
Don't use c++11 features and fix include.
|
2016-09-20 07:49:05 -07:00 |
|
RJ Ryan
|
b2c6dc48d9
|
Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op.
|
2016-09-20 07:18:20 -07:00 |
|
Luke Iwanski
|
c771df6bc3
|
Updated the owners of the file.
|
2016-09-19 14:09:25 +01:00 |
|