Benoit Steiner
|
cf7e73addd
|
Added some missing conversions to the Half class, and fixed the implementation of the < operator on cuda devices.
|
2016-04-06 09:59:51 -07:00 |
|
Benoit Steiner
|
10bdd8e378
|
Merged in tillahoffmann/eigen (pull request PR-173)
Added zeta function of two arguments and polygamma function
|
2016-04-06 09:40:17 -07:00 |
|
Benoit Steiner
|
72abfa11dd
|
Added support for isfinite on fp16
|
2016-04-06 09:07:30 -07:00 |
|
Konstantinos Margaritis
|
2bba4ee2cf
|
Merged kmargar/eigen/tip into default
|
2016-04-05 22:22:08 +03:00 |
|
Konstantinos Margaritis
|
317384b397
|
complete the port, remove float support
|
2016-04-05 14:56:45 -04:00 |
|
tillahoffmann
|
726bd5f077
|
Merged eigen/eigen into default
|
2016-04-05 18:21:05 +01:00 |
|
Till Hoffmann
|
a350c25a39
|
Added accuracy comments.
|
2016-04-05 18:20:40 +01:00 |
|
Konstantinos Margaritis
|
bc0ad363c6
|
add remaining includes
|
2016-04-05 06:01:17 -04:00 |
|
Konstantinos Margaritis
|
2d41dc9622
|
complete int/double specialized traits for ZVector
|
2016-04-05 06:00:51 -04:00 |
|
Konstantinos Margaritis
|
988344daf1
|
enable the other includes as well
|
2016-04-05 05:59:30 -04:00 |
|
Benoit Jacob
|
158fea0f5e
|
bug #1190 - Don't trust __ARM_FEATURE_FMA on Clang/ARM
|
2016-04-04 16:42:40 -04:00 |
|
Benoit Jacob
|
03f2997a11
|
bug #1191 - Prevent Clang/ARM from rewriting VMLA into VMUL+VADD
|
2016-04-04 16:41:47 -04:00 |
|
Till Hoffmann
|
b97911dd18
|
Refactored code into type-specific helper functions.
|
2016-04-04 19:16:03 +01:00 |
|
Benoit Steiner
|
c4179dd470
|
Updated the scalar_abs_op struct to make it compatible with cuda devices.
|
2016-04-04 11:11:51 -07:00 |
|
Benoit Steiner
|
1108b4f218
|
Fixed the signature of numext::abs to make it compatible with complex numbers
|
2016-04-04 11:09:25 -07:00 |
|
Till Hoffmann
|
80eba21ad0
|
Merge upstream.
|
2016-04-01 18:18:49 +01:00 |
|
Till Hoffmann
|
3cb0a237c1
|
Fixed suggestions by Eugene Brevdo.
|
2016-04-01 17:51:39 +01:00 |
|
tillahoffmann
|
49960adbdd
|
Merged eigen/eigen into default
|
2016-04-01 14:36:15 +01:00 |
|
Till Hoffmann
|
57239f4a81
|
Added polygamma function.
|
2016-04-01 14:35:21 +01:00 |
|
Till Hoffmann
|
dd5d390daf
|
Added zeta function.
|
2016-04-01 13:32:29 +01:00 |
|
Benoit Steiner
|
0ea7ab4f62
|
Hashing was only officially introduced in c++11. Therefore only define an implementation of the hash function for float16 if c++11 is enabled.
|
2016-03-31 14:44:55 -07:00 |
|
Benoit Steiner
|
92b7f7b650
|
Improved code formatting
|
2016-03-31 13:09:58 -07:00 |
|
Benoit Steiner
|
f197813f37
|
Added the ability to hash a fp16
|
2016-03-31 13:09:23 -07:00 |
|
Benoit Steiner
|
4c859181da
|
Made it possible to use the NumTraits for complex and Array in a cuda kernel.
|
2016-03-31 12:48:38 -07:00 |
|
Benoit Steiner
|
c36ab19902
|
Added __ldg primitive for fp16.
|
2016-03-31 10:55:03 -07:00 |
|
Benoit Steiner
|
b575fb1d02
|
Added NumTraits for half floats
|
2016-03-31 10:43:59 -07:00 |
|
Benoit Steiner
|
8c8a79cec1
|
Fixed a typo
|
2016-03-31 10:33:32 -07:00 |
|
Benoit Steiner
|
4f1a7e51c1
|
Pull math functions from the global namespace only when compiling cuda code with nvcc. When compiling with clang, we want to use the std namespace.
|
2016-03-30 17:59:49 -07:00 |
|
Benoit Steiner
|
bc68fc2fe7
|
Enable constant expressions when compiling cuda code with clang.
|
2016-03-30 17:58:32 -07:00 |
|
Benoit Jacob
|
01b5333e44
|
bug #1186 - vreinterpretq_u64_f64 fails to build on Android/Aarch64/Clang toolchain
|
2016-03-30 11:02:33 -04:00 |
|
Benoit Steiner
|
1841d6d4c3
|
Added missing cuda template specializations for numext::ceil
|
2016-03-29 13:29:34 -07:00 |
|
Benoit Steiner
|
e02b784ec3
|
Added support for standard mathematical functions and transcendentals (such as exp, log, abs, ...) on fp16
|
2016-03-29 09:20:36 -07:00 |
|
Benoit Steiner
|
c38295f0a0
|
Added support for fmod
|
2016-03-28 15:53:02 -07:00 |
|
Konstantinos Margaritis
|
01e7298fe6
|
actually include ZVector files, passes most basic tests (float still fails)
|
2016-03-28 10:58:02 -04:00 |
|
Konstantinos Margaritis
|
f48011119e
|
Merged eigen/eigen into default
|
2016-03-28 01:48:45 +03:00 |
|
Konstantinos Margaritis
|
ed6b9d08f1
|
some primitives ported, but missing intrinsics and crash with asm() are a problem
|
2016-03-27 18:47:49 -04:00 |
|
Benoit Steiner
|
65716e99a5
|
Improved the cost estimate of the quotient op
|
2016-03-25 11:13:53 -07:00 |
|
Benoit Steiner
|
d94f6ba965
|
Started to model the cost of divisions more accurately.
|
2016-03-25 11:02:56 -07:00 |
|
Benoit Steiner
|
2e4e4cb74d
|
Use numext::abs instead of abs to avoid incorrect conversion to integer of the argument
|
2016-03-23 16:57:12 -07:00 |
|
Benoit Steiner
|
81d340984a
|
Removed executable bit from header files
|
2016-03-23 16:15:02 -07:00 |
|
Benoit Steiner
|
bff8cbad06
|
Removed executable bit from header files
|
2016-03-23 16:14:23 -07:00 |
|
Benoit Steiner
|
7a570e50ef
|
Fixed contractions of fp16
|
2016-03-23 16:00:06 -07:00 |
|
Benoit Steiner
|
fc3660285f
|
Made type conversion explicit
|
2016-03-23 09:56:50 -07:00 |
|
Benoit Steiner
|
0e68882604
|
Added the ability to divide a half float by an index
|
2016-03-23 09:46:42 -07:00 |
|
Benoit Steiner
|
6971146ca9
|
Added more conversion operators for half floats
|
2016-03-23 09:44:52 -07:00 |
|
Benoit Steiner
|
f9ad25e4d8
|
Fixed contractions of 16 bit floats
|
2016-03-22 09:30:23 -07:00 |
|
Benoit Steiner
|
134d750eab
|
Completed the implementation of vectorized type casting of half floats.
|
2016-03-18 13:36:28 -07:00 |
|
Benoit Steiner
|
7bd551b3a9
|
Make all the conversions explicit
|
2016-03-18 12:20:08 -07:00 |
|
Benoit Steiner
|
7b98de1f15
|
Implemented some of the missing type casting for half floats
|
2016-03-17 21:45:45 -07:00 |
|
Christoph Hertzberg
|
46aa9772fc
|
Merged in ebrevdo/eigen (pull request PR-169)
Bugfixes to cuda tests, igamma & igammac implemented, & tests for digamma, igamma, igammac on CPU & GPU.
|
2016-03-16 21:59:08 +01:00 |
|