Antonio Sánchez
e5794873cb
Replace assert with eigen_assert.
2022-10-04 17:11:23 +00:00
Chao Chen
5ffe7b92e0
[ROCm] fixed gpuGetDevice unused message
2022-09-20 21:38:20 +00:00
chuckyschluz
8acbf5c11c
re-enable pow for complex types
2022-08-26 17:29:02 -04:00
Charles Schlosser
76a669fb45
add fixed power unary operation
2022-08-16 21:32:36 +00:00
Antonio Sánchez
b8e93bf589
Eliminate bool bitwise warnings.
2022-08-09 22:42:30 +00:00
Antonio Sánchez
bb51d9f4fa
Fix ODR violations.
2022-07-09 04:56:36 +00:00
Antonio Sánchez
8c2e0e3cb8
Fix ambiguous comparisons for c++20 (again again)
2022-06-07 17:06:17 +00:00
Antonio Sánchez
76cf6204f3
Revert "Fix c++20 ambiguity of comparisons."
...
This reverts commit 4f6354128ff1dd17bc40cfa64b99ae613d05785a
2022-06-04 02:32:10 +00:00
Antonio Sánchez
4f6354128f
Fix c++20 ambiguity of comparisons.
2022-06-03 05:11:07 +00:00
Antonio Sánchez
477eb7f630
Revert "Avoid ambiguous Tensor comparison operators for C++20 compatibility"
...
This reverts commit 5c2179b6c30921d1c5ed5d6762be1f0e0e14493d
2022-05-24 16:09:59 +00:00
Mehdi Goli
c5a5ac680c
[SYCL] SYCL-2020 range does not have default constructor.
2022-05-24 03:11:46 +00:00
Benjamin Kramer
5c2179b6c3
Avoid ambiguous Tensor comparison operators for C++20 compatibility
2022-05-23 17:36:03 +00:00
Chip Kerchner
aa8b7e2c37
Add subMappers to Power GEMM packing - simplifies the address calculations (10% faster)
2022-05-23 15:18:29 +00:00
Mehdi Goli
cbe03f3531
[SYCL] Extending SYCL queue interface extension.
2022-05-23 14:45:27 +00:00
Eisuke Kawashima
ac5c83a3f5
unset executable flag
2022-05-22 22:47:43 +09:00
Tobias Wood
a9868bd5be
Add arg() to tensor
2022-05-20 03:33:01 +00:00
Rasmus Munk Larsen
73d65dbc43
Update README.md. Remove obsolete comment about RowMajor not being fully supported.
2022-05-06 18:19:35 +00:00
Antonio Sánchez
f7b31f864c
Revert "Replace call to FixedDimensions() with a singleton instance of"
...
This reverts commit 19e6496ce0c52fef33265bca54285ba77b2155be
2022-04-10 15:30:33 +00:00
Tobias Schlüter
f3ba220c5d
Remove EIGEN_EMPTY_STRUCT_CTOR
2022-04-08 18:27:26 +00:00
Antonio Sánchez
5ed7a86ae9
Fix MSVC+CUDA issues.
2022-04-08 18:05:32 +00:00
Erik Schultheis
64909b82bd
static const class members turned into constexpr
2022-04-04 17:33:33 +00:00
Antonio Sanchez
9bc9992dd3
Eliminate trace unused warning.
2022-03-29 22:04:50 +00:00
Romain Biessy
f2a3e03e9b
Fix usages of wrong namespace
2022-03-21 15:07:53 +00:00
Erik Schultheis
421cbf0866
Replace Eigen type metaprogramming with corresponding std types and make use of alias templates
2022-03-16 16:43:40 +00:00
Antonio Sánchez
008ff3483a
Fix broken tensor executor test, allow tensor packets of size 1.
2022-03-07 20:30:37 +00:00
Antonio Sanchez
bded5028a5
Fix ODR failures in TensorRandom.
2022-02-11 23:28:33 -08:00
Rasmus Munk Larsen
18eab8f997
Add convenience method constexpr std::size_t size() const
to Eigen::IndexList
2022-02-12 04:23:03 +00:00
Antonio Sánchez
9441d94dcc
Revert "Make fixed-size Matrix and Array trivially copyable after C++20"
...
This reverts commit 47eac21072ec8c225d03c139dd9632bc5c19fd83
2022-02-05 04:40:29 +00:00
Antonio Sánchez
cafeadffef
Fix ODR violations.
2022-02-04 19:01:07 +00:00
Erik Schultheis
970640519b
Cleanup
2022-01-21 01:48:59 +00:00
Kolja Brix
8d81a2339c
Reduce usage of reserved names
2022-01-10 20:53:29 +00:00
Matthias Möller
c9df98b071
Fix Gcc8.5 warning about missing base class initialisation ( #2404 )
2022-01-07 19:16:53 +00:00
Lingzhu Xiang
47eac21072
Make fixed-size Matrix and Array trivially copyable after C++20
...
Making them trivially copyable allows using std::memcpy() without undefined
behaviors.
Only Matrix and Array with trivially copyable DenseStorage are marked as
trivially copyable with an additional type trait.
As described in http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p0848r3.html
it requires extremely verbose SFINAE to make the special member functions of
fixed-size Matrix and Array trivial, unless C++20 concepts are available to
simplify the selection of trivial special member functions given template
parameters. Therefore only make this feature available to compilers that support
C++20 P0848R3.
Fix #1855 .
2022-01-07 19:04:35 +00:00
Erik Schultheis
c20e908ebc
turn some macros intro constexpr functions
2021-12-10 19:27:01 +00:00
Erik Schultheis
c35679af27
fixed customIndices2Array forgetting first index
2021-12-10 16:41:59 +00:00
Erik Schultheis
e4c40b092a
disambiguate overloads for empty index list
2021-12-07 19:40:09 +00:00
Erik Schultheis
cc11e240ac
Some further cleanup
2021-12-06 18:01:15 +00:00
Erik Schultheis
2f65ec5302
fixed leftover else branch
2021-12-02 18:13:19 +00:00
Erik Schultheis
ec2fd0f7ed
Require recent GCC and MSCV and removed EIGEN_HAS_CXX14
and some other feature test macros
2021-12-01 00:48:34 +00:00
Erik Schultheis
f33a31b823
removed EIGEN_HAS_CXX11_* and redundant EIGEN_COMP_CXXVER checks
2021-11-29 19:18:57 +00:00
Erik Schultheis
ec4efbd696
remove EIGEN_HAS_CXX11
2021-11-24 20:08:49 +00:00
Rasmus Munk Larsen
cfdb3ce3f0
Fix warnings about shadowing definitions.
2021-11-23 14:34:47 -08:00
Rasmus Munk Larsen
5e89573e2a
Implement Eigen::array<...>::reverse_iterator if std::reverse_iterator exists.
2021-11-20 00:22:46 +00:00
Rasmus Munk Larsen
11cb7b8372
Add basic iterator support for Eigen::array to ease transition to std::array in third-party libraries.
2021-11-19 05:14:30 +00:00
Antonio Sanchez
c107bd6102
Fix errors for windows build.
2021-11-19 04:23:25 +00:00
Rasmus Munk Larsen
96aeffb013
Make the new TensorIO implementation work with TensorMap with const elements.
2021-11-17 18:16:04 -08:00
Rasmus Munk Larsen
824d06eb36
Include <numeric> to get std::iota.
2021-11-18 00:47:18 +00:00
Antonio Sanchez
ffb78e23a1
Fix tensor broadcast off-by-one error.
...
Caught by JAX unit tests. Triggered if broadcast is smaller than packet
size.
2021-11-16 17:37:38 +00:00
cpp977
f73c95c032
Reimplemented the Tensor stream output.
2021-11-16 17:36:58 +00:00
Ben Barsdell
50df8d3d6d
Avoid integer overflow in EigenMetaKernel indexing
...
- The current implementation computes `size + total_threads`, which can
overflow and cause CUDA_ERROR_ILLEGAL_ADDRESS when size is close to
the maximum representable value.
- The num_blocks calculation can also overflow due to the implementation
of divup().
- This patch prevents these overflows and allows the kernel to work
correctly for the full representable range of tensor sizes.
- Also adds relevant tests.
2021-11-05 16:39:37 +11:00