Frédéric Chapoton
|
6331da95eb
|
fixing a lot of typos
|
2024-07-30 22:15:49 +00:00 |
|
Charles Schlosser
|
fb95e90f7f
|
Add truncation op
|
2024-04-29 23:45:49 +00:00 |
|
Antonio Sánchez
|
dcceb9afec
|
Unbork avx512 preduce_mul on MSVC.
|
2024-04-26 15:28:03 +00:00 |
|
Damiano Franzò
|
888fca0e2b
|
Simd sincos double
|
2024-04-15 21:12:32 +00:00 |
|
Charles Schlosser
|
6ad2ccea4e
|
Eigen pblend
|
2024-04-15 16:19:53 +00:00 |
|
Charles Schlosser
|
9099c5eac7
|
Handle missing AVX512 intrinsic
|
2024-04-14 16:41:23 +00:00 |
|
Charles Schlosser
|
122befe54c
|
Fix "unary minus operator applied to unsigned type, result still unsigned" on MSVC and other stupid warnings
|
2024-04-12 19:35:04 +00:00 |
|
Rasmus Munk Larsen
|
3c9109238f
|
Add support for Packet8l to AVX512.
|
2024-04-09 22:58:44 +00:00 |
|
Rasmus Munk Larsen
|
b86641a4c2
|
Add support for casting between double and int64_t for SSE and AVX2.
|
2024-03-22 22:32:29 +00:00 |
|
Tobias Wood
|
f38e16c193
|
Apply clang-format
|
2023-11-29 11:12:48 +00:00 |
|
Antonio Sánchez
|
6e4d5d4832
|
Add IWYU private pragmas to internal headers.
|
2023-08-21 16:25:22 +00:00 |
|
Rasmus Munk Larsen
|
df1049ddf4
|
Small packet math cleanup.
|
2023-04-04 16:14:32 +00:00 |
|
Rasmus Munk Larsen
|
1a5dfd7c0f
|
Fix incorrect casting in AVX512DQ path.
|
2023-03-27 09:28:06 -07:00 |
|
Rasmus Munk Larsen
|
b8b8a26145
|
Add more missing vectorized casts for int on x86, and remove redundant unit tests
|
2023-03-24 16:02:00 +00:00 |
|
Rasmus Munk Larsen
|
0488b708b4
|
Vectorize tensor.isnan() by using typed predicates.
|
2023-03-16 04:04:22 +00:00 |
|
Antonio Sánchez
|
394aabb0a3
|
Fix failing MSVC tests due to compiler bugs.
|
2023-03-10 22:36:57 +00:00 |
|
Rasmus Munk Larsen
|
1c0a6cf228
|
Get rid of EIGEN_HAS_AVX512_MATH workaround.
|
2023-02-23 23:16:41 +00:00 |
|
Rasmus Munk Larsen
|
ce62177b5b
|
Vectorize atanh & add a missing definition and unit test for atan.
|
2023-02-21 03:14:05 +00:00 |
|
Sean McBride
|
d70b4864d9
|
issue #2581: review and cleanup of compiler version checks
|
2023-01-17 18:58:34 +00:00 |
|
Antonio Sánchez
|
8588d8c74b
|
Correct pnegate for floating-point zero.
|
2022-11-15 18:07:23 +00:00 |
|
Charles Schlosser
|
82b152dbe7
|
Add signbit function
|
2022-11-04 00:31:20 +00:00 |
|
Antonio Sánchez
|
e5794873cb
|
Replace assert with eigen_assert.
|
2022-10-04 17:11:23 +00:00 |
|
Rasmus Munk Larsen
|
c475228b28
|
Vectorize atan() for double.
|
2022-10-01 01:49:30 +00:00 |
|
Rasmus Munk Larsen
|
7b2901e2aa
|
Add vectorized integer division for int32 with AVX512, AVX or SSE.
|
2022-09-21 00:27:23 +00:00 |
|
Rasmus Munk Larsen
|
bd393e15c3
|
Vectorize acos, asin, and atan for float.
|
2022-08-29 19:49:33 +00:00 |
|
Charles Schlosser
|
e5af9f87f2
|
Vectorize pow for integer base / exponent types
|
2022-08-29 19:23:54 +00:00 |
|
Matthew Sterrett
|
7a3b667c43
|
Add support for AVX512-FP16 for vectorizing half precision math
|
2022-08-17 18:15:21 +00:00 |
|
Matthew Sterrett
|
39fcc89798
|
Removed unnecessary checks for FP16C
|
2022-08-16 18:14:41 +00:00 |
|
aaraujom
|
d49ede4dc4
|
Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
|
2022-05-28 02:00:21 +00:00 |
|
Antonio Sánchez
|
9b9496ad98
|
Revert "Add AVX512 optimizations for matrix multiply"
This reverts commit 25db0b4a824ba9a092bbb514fbada51bf9d37a18
|
2022-05-13 18:50:33 +00:00 |
|
aaraujom
|
25db0b4a82
|
Add AVX512 optimizations for matrix multiply
|
2022-05-12 23:41:19 +00:00 |
|
Antonio Sánchez
|
07db964bde
|
Restrict new AVX512 trsm to AVX512VL, rename files for consistency.
|
2022-04-14 16:58:32 +00:00 |
|
Antonio Sánchez
|
9a14d91a99
|
Fix AVX512 builds with MSVC.
|
2022-03-18 16:04:53 +00:00 |
|
b-shi
|
518fc321cb
|
AVX512 Optimizations for Triangular Solve
|
2022-03-16 18:04:50 +00:00 |
|
Antonio Sánchez
|
e7f4a901ee
|
Define EIGEN_HAS_AVX512_MATH in PacketMath.
|
2022-02-04 22:25:52 +00:00 |
|
Antonio Sánchez
|
96da541cba
|
Fix AVX512 math function consistency, enable for ICC.
|
2022-02-04 19:35:18 +00:00 |
|
Rasmus Munk Larsen
|
51311ec651
|
Remove inline assembly for FMA (AVX) and add remaining extensions as packet ops: pmsub, pnmadd, and pnmsub.
|
2022-01-26 04:25:41 +00:00 |
|
Rasmus Munk Larsen
|
ea2c02060c
|
Add reciprocal packet op and fast specializations for float with SSE, AVX, and AVX512.
|
2022-01-21 23:49:18 +00:00 |
|
Ilya Tokar
|
a0fc640c18
|
Add support for packets of int64 on x86
|
2022-01-21 19:55:23 +00:00 |
|
Kolja Brix
|
afa616bc9e
|
Fix some typos found
|
2021-09-23 15:22:00 +00:00 |
|
Rasmus Munk Larsen
|
7b975acb1f
|
Remove unused variable.
|
2021-09-16 20:27:13 +00:00 |
|
Rasmus Munk Larsen
|
92849d814b
|
Remove unused variable.
|
2021-09-16 20:21:31 +00:00 |
|
Rasmus Munk Larsen
|
d7d0bf832d
|
Issue an error in case of direct inclusion of internal headers.
|
2021-09-10 19:12:26 +00:00 |
|
Antonio Sanchez
|
3d4ba855e0
|
Fix AVX integer packet issues.
Most are instances of AVX2 functions not protected by
`EIGEN_VECTORIZE_AVX2`. There was also a missing semi-colon
for AVX512.
|
2021-09-01 14:14:43 -07:00 |
|
Jakub Lichman
|
dc5b1f7d75
|
AVX512 and AVX2 support for Packet16i and Packet8i added
|
2021-08-25 19:38:23 +00:00 |
|
Gauri Deshpande
|
e6a5a594a7
|
remove denormal flushing in fp32tobf16 for avx & avx512
|
2021-08-09 22:15:21 +00:00 |
|
Jakub Lichman
|
d87648a6be
|
Tests added and AVX512 bug fixed for pcmp_lt_or_nan
|
2021-04-25 20:58:56 +00:00 |
|
Jakub Lichman
|
2b1dfd1ba0
|
HasExp added for AVX512 Packet8d
|
2021-04-20 19:07:58 +00:00 |
|
Antonio Sanchez
|
1d79c68ba0
|
Fix ldexp for AVX512 (#2215)
Wrong shuffle was used. Need to interleave low/high halves with a
`permute` instruction.
Fixes #2215.
|
2021-04-20 16:25:22 +00:00 |
|
Christoph Hertzberg
|
69a4f70956
|
Revert "Uses _mm512_abs_pd for Packet8d pabs"
This reverts commit f019b97aca82071f35726b1aaebf1c598770f0f5
|
2021-03-23 18:52:19 +00:00 |
|