Benoit Steiner
|
36f5a10198
|
Properly gate the definition of the error and gamma functions for fp16
|
2016-04-13 18:44:48 -07:00 |
|
Benoit Steiner
|
d6105b53b8
|
Added basic implementation of the lgamma, digamma, igamma, igammac, polygamma, and zeta function for fp16
|
2016-04-13 15:26:02 -07:00 |
|
Benoit Steiner
|
87ca15c4e8
|
Added support for sin, cos, tan, and tanh on fp16
|
2016-04-13 14:12:38 -07:00 |
|
Benoit Steiner
|
473c8380ea
|
Added constructors to convert unsigned integers into fp16
|
2016-04-13 11:03:37 -07:00 |
|
Benoit Steiner
|
8bfe739cd2
|
Updated the AVX512 PacketMath to properly leverage the AVX512DQ instructions
|
2016-04-11 18:40:16 -07:00 |
|
Benoit Steiner
|
d6e596174d
|
Pull latest updates from upstream
|
2016-04-11 17:20:17 -07:00 |
|
Benoit Steiner
|
833efb39bf
|
Added epsilon, dummy_precision, infinity and quiet_NaN NumTraits for fp16
|
2016-04-11 11:03:56 -07:00 |
|
Benoit Steiner
|
995f202cea
|
Disabled the use of half2 on cuda devices of compute capability < 5.3
|
2016-04-08 14:43:36 -07:00 |
|
Benoit Steiner
|
8d22967bd9
|
Initial support for taking the power of fp16
|
2016-04-08 14:22:39 -07:00 |
|
Benoit Steiner
|
3394379319
|
Fixed the packet_traits for half floats.
|
2016-04-08 13:33:59 -07:00 |
|
Benoit Jacob
|
cd2b667ac8
|
Add references to filed LLVM bugs
|
2016-04-08 08:12:47 -04:00 |
|
Benoit Steiner
|
737644366f
|
Move the functions operating on fp16 out of the std namespace and into the Eigen::numext namespace
|
2016-04-07 11:40:15 -07:00 |
|
Benoit Steiner
|
df838736e2
|
Fixed compilation warning triggered by msvc
|
2016-04-06 20:48:55 -07:00 |
|
Benoit Steiner
|
14ea7c7ec7
|
Fixed packet_traits<half>
|
2016-04-06 19:30:21 -07:00 |
|
Benoit Steiner
|
532fdf24cb
|
Added support for hardware conversion between fp16 and full floats whenever
possible.
|
2016-04-06 17:11:31 -07:00 |
|
Benoit Steiner
|
58c1dbff19
|
Made the fp16 code more portable.
|
2016-04-06 13:44:08 -07:00 |
|
Benoit Steiner
|
cf7e73addd
|
Added some missing conversions to the Half class, and fixed the implementation of the < operator on cuda devices.
|
2016-04-06 09:59:51 -07:00 |
|
Benoit Steiner
|
10bdd8e378
|
Merged in tillahoffmann/eigen (pull request PR-173)
Added zeta function of two arguments and polygamma function
|
2016-04-06 09:40:17 -07:00 |
|
Benoit Steiner
|
72abfa11dd
|
Added support for isfinite on fp16
|
2016-04-06 09:07:30 -07:00 |
|
Konstantinos Margaritis
|
2bba4ee2cf
|
Merged kmargar/eigen/tip into default
|
2016-04-05 22:22:08 +03:00 |
|
Konstantinos Margaritis
|
317384b397
|
complete the port, remove float support
|
2016-04-05 14:56:45 -04:00 |
|
tillahoffmann
|
726bd5f077
|
Merged eigen/eigen into default
|
2016-04-05 18:21:05 +01:00 |
|
Konstantinos Margaritis
|
bc0ad363c6
|
add remaining includes
|
2016-04-05 06:01:17 -04:00 |
|
Konstantinos Margaritis
|
2d41dc9622
|
complete int/double specialized traits for ZVector
|
2016-04-05 06:00:51 -04:00 |
|
Benoit Jacob
|
158fea0f5e
|
bug #1190 - Don't trust __ARM_FEATURE_FMA on Clang/ARM
|
2016-04-04 16:42:40 -04:00 |
|
Benoit Jacob
|
03f2997a11
|
bug #1191 - Prevent Clang/ARM from rewriting VMLA into VMUL+VADD
|
2016-04-04 16:41:47 -04:00 |
|
tillahoffmann
|
49960adbdd
|
Merged eigen/eigen into default
|
2016-04-01 14:36:15 +01:00 |
|
Till Hoffmann
|
57239f4a81
|
Added polygamma function.
|
2016-04-01 14:35:21 +01:00 |
|
Till Hoffmann
|
dd5d390daf
|
Added zeta function.
|
2016-04-01 13:32:29 +01:00 |
|
Benoit Steiner
|
0ea7ab4f62
|
Hashing was only officially introduced in c++11. Therefore only define an implementation of the hash function for float16 if c++11 is enabled.
|
2016-03-31 14:44:55 -07:00 |
|
Benoit Steiner
|
92b7f7b650
|
Improved code formating
|
2016-03-31 13:09:58 -07:00 |
|
Benoit Steiner
|
f197813f37
|
Added the ability to hash a fp16
|
2016-03-31 13:09:23 -07:00 |
|
Benoit Steiner
|
c36ab19902
|
Added __ldg primitive for fp16.
|
2016-03-31 10:55:03 -07:00 |
|
Benoit Steiner
|
b575fb1d02
|
Added NumTraits for half floats
|
2016-03-31 10:43:59 -07:00 |
|
Benoit Steiner
|
8c8a79cec1
|
Fixed a typo
|
2016-03-31 10:33:32 -07:00 |
|
Benoit Jacob
|
01b5333e44
|
bug #1186 - vreinterpretq_u64_f64 fails to build on Android/Aarch64/Clang toolchain
|
2016-03-30 11:02:33 -04:00 |
|
Benoit Steiner
|
e02b784ec3
|
Added support for standard mathematical functions and trancendentals(such as exp, log, abs, ...) on fp16
|
2016-03-29 09:20:36 -07:00 |
|
Konstantinos Margaritis
|
01e7298fe6
|
actually include ZVector files, passes most basic tests (float still fails)
|
2016-03-28 10:58:02 -04:00 |
|
Benoit Steiner
|
7a570e50ef
|
Fixed contractions of fp16
|
2016-03-23 16:00:06 -07:00 |
|
Benoit Steiner
|
fc3660285f
|
Made type conversion explicit
|
2016-03-23 09:56:50 -07:00 |
|
Benoit Steiner
|
0e68882604
|
Added the ability to divide a half float by an index
|
2016-03-23 09:46:42 -07:00 |
|
Benoit Steiner
|
6971146ca9
|
Added more conversion operators for half floats
|
2016-03-23 09:44:52 -07:00 |
|
Benoit Steiner
|
f9ad25e4d8
|
Fixed contractions of 16 bit floats
|
2016-03-22 09:30:23 -07:00 |
|
Benoit Steiner
|
134d750eab
|
Completed the implementation of vectorized type casting of half floats.
|
2016-03-18 13:36:28 -07:00 |
|
Benoit Steiner
|
7bd551b3a9
|
Make all the conversions explicit
|
2016-03-18 12:20:08 -07:00 |
|
Benoit Steiner
|
7b98de1f15
|
Implemented some of the missing type casting for half floats
|
2016-03-17 21:45:45 -07:00 |
|
Christoph Hertzberg
|
46aa9772fc
|
Merged in ebrevdo/eigen (pull request PR-169)
Bugfixes to cuda tests, igamma & igammac implemented, & tests for digamma, igamma, igammac on CPU & GPU.
|
2016-03-16 21:59:08 +01:00 |
|
Benoit Steiner
|
5a51366ea5
|
Fixed a typo.
|
2016-03-14 09:25:16 -07:00 |
|
Benoit Steiner
|
fcf59e1c37
|
Properly gate the use of cuda intrinsics in the code
|
2016-03-14 09:13:44 -07:00 |
|
Benoit Steiner
|
97a1f1c273
|
Make sure we only use the half float intrinsic when compiling with a version of CUDA that is recent enough to provide them
|
2016-03-14 08:37:58 -07:00 |
|