Antonio Sanchez 1fd5ce1002 For GpuDevice::fill, use a single memset if all bytes are equal.
The original `fill` implementation introduced a 5x performance regression
on my NVIDIA Quadro K1200. @rohitsan reported up to a 100x regression for
HIP. This restores performance.
2021-07-10 13:37:16 +00:00
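
The idea in the commit message can be illustrated with a small host-side sketch. This is not Eigen's actual `GpuDevice::fill` code: the names `all_bytes_equal` and `fill_sketch` are hypothetical, and `std::memset` stands in for whatever device-side memset the real implementation calls. It only shows the check "are all bytes of the fill value equal?" and the fallback to an element-wise fill when they are not.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstring>
#include <type_traits>

// Hypothetical helper: true when every byte of `value` is identical,
// e.g. int 0, int -1 (0xFF bytes), or float 0.0f.
template <typename T>
bool all_bytes_equal(const T& value) {
  static_assert(std::is_trivially_copyable<T>::value,
                "byte-wise inspection requires a trivially copyable type");
  unsigned char bytes[sizeof(T)];
  std::memcpy(bytes, &value, sizeof(T));
  for (std::size_t i = 1; i < sizeof(T); ++i) {
    if (bytes[i] != bytes[0]) return false;
  }
  return true;
}

// Hypothetical host-side stand-in for a device fill.
// Returns true if the single-memset fast path was taken.
template <typename T>
bool fill_sketch(T* begin, T* end, const T& value) {
  if (all_bytes_equal(value)) {
    unsigned char byte;
    std::memcpy(&byte, &value, 1);
    // One memset over the whole range replaces the per-element fill.
    std::memset(begin, byte,
                static_cast<std::size_t>(end - begin) * sizeof(T));
    return true;
  }
  // General case: the bytes differ (e.g. float 1.0f), so fill per element.
  std::fill(begin, end, value);
  return false;
}
```

Zero is probably the most common fill value, and all of its bytes are equal, so the typical case would take the fast path. A value such as `1.0f` has differing bytes and would still go through the element-wise fill.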
2021-02-19 22:26:56 +00:00
2016-08-30 10:01:53 +02:00
FFT
2021-02-27 18:44:26 +01:00
2018-09-20 18:30:10 +02:00