Mark D Ryan 3c9add6598 Remove reinterpret_cast from AVX512 complex implementation
The reinterpret_casts used in ptranspose(PacketBlock<Packet8cf,4>&)
ptranspose(PacketBlock<Packet8cf,8>&) don't appear to be working
correctly.  They're used to convert the kernel parameters to
PacketBlock<Packet8d,T>& so that the complex number versions of
ptranspose can be written using the existing double implementations.
Unfortunately, they don't seem to work and are responsible for 9 unit
test failures in the AVX512 build of tensorflow master.  This commit
fixes the issue by manually initialising PacketBlock<Packet8d,T>
variables with the contents of the kernel parameter before calling
the double version of ptranspose, and then copying the resulting
values back into the kernel parameter before returning.
2019-01-11 14:02:09 +01:00
..
2018-08-01 21:34:47 -04:00
2018-07-17 10:29:12 +02:00
2018-09-18 04:15:01 -04:00
2018-03-11 10:01:44 -04:00
2018-12-11 22:07:20 +01:00
2017-03-07 11:25:58 +01:00
2018-11-14 14:43:18 +01:00
2018-06-29 11:28:36 +02:00
2018-09-18 04:15:01 -04:00