Benoit Steiner
|
60d9df11c1
|
Fixed the computation of leading zeros when compiling with msvc.
|
2016-03-04 16:27:02 -08:00 |
|
Benoit Steiner
|
2c50fc878e
|
Fixed a typo
|
2016-03-04 14:09:38 -08:00 |
|
Benoit Steiner
|
d69946183d
|
Updated the TensorIntDivisor code to work properly on LLP64 systems
|
2016-02-08 21:03:59 -08:00 |
|
Benoit Steiner
|
547a8608e5
|
Fixed the implementation of Eigen::internal::count_leading_zeros for MSVC.
Also updated the code to silence bogux warnings generated by nvcc when compilining this function.
|
2015-11-23 12:17:45 -08:00 |
|
Benoit Steiner
|
383d1cc2ed
|
Added proper support for fast 64bit integer division on CUDA
|
2015-11-20 11:09:46 -08:00 |
|
Benoit Steiner
|
1dd444ea71
|
Avoid using the version of TensorIntDiv optimized for 32-bit integers when the divisor can be equal to one since it isn't supported.
|
2015-11-18 11:37:58 -08:00 |
|
Christoph Hertzberg
|
1bdd06a199
|
Fix some trivial warnings
|
2015-08-19 21:38:18 +02:00 |
|
Benoit Steiner
|
a5dc49e7e8
|
Fixed 2 compilation warnings generated by llvm
|
2015-07-29 15:06:08 -07:00 |
|
Benoit Steiner
|
0570594f2c
|
Fixed a few compilation warnings triggered by clang
|
2015-07-29 11:48:38 -07:00 |
|
Benoit Steiner
|
099597406f
|
Simplified and generalized the DividerTraits code
|
2015-07-29 10:02:42 -07:00 |
|
Gael Guennebaud
|
6db3a557f4
|
Add missing specialization of struct DividerTraits<long>
|
2015-07-29 11:38:53 +02:00 |
|
Benoit Steiner
|
4200bdec24
|
Extended the range of value inputs for TensorIntDiv to support tensors with more than 4 billion elements.
|
2015-07-22 17:02:30 -07:00 |
|
Benoit Steiner
|
3912ca0d53
|
Fixed a bug in the integer division code that caused some large numerators to be incorrectly handled
|
2015-07-13 11:14:59 -07:00 |
|
Benoit Steiner
|
a93af65938
|
Improved and cleaned up the 2d patch extraction code
|
2015-07-07 08:52:14 -07:00 |
|
vanhoucke
|
4cc0c961f3
|
Fix undefined behavior.
|
2015-06-19 15:46:46 +00:00 |
|
Benoit Steiner
|
a81d17b73a
|
Added new version of the TensorIntDiv class optimized for 32 bit signed integers. It saves 1 register on CPU and 2 on GPU.
|
2015-05-19 13:59:52 -07:00 |
|
Benoit Steiner
|
ae73859a0a
|
Fixed incorrect assertion
|
2015-02-28 08:02:02 -08:00 |
|
Benoit Steiner
|
bb483313f6
|
Fixed another batch of compilation warnings
|
2015-02-28 02:32:46 -08:00 |
|
Benoit Steiner
|
f074bb4b5f
|
Fixed another compilation problem with TensorIntDiv.h
|
2015-02-26 11:14:23 -08:00 |
|
Benoit Steiner
|
bffb6bdf45
|
Made TensorIntDiv.h compile with MSVC
|
2015-02-25 23:54:43 -08:00 |
|
Benoit Steiner
|
27f3fb2bcc
|
Fixed another clang warning
|
2015-02-25 22:54:20 -08:00 |
|
Benoit Steiner
|
99d75235a9
|
Misc improvements and cleanups
|
2014-10-13 17:02:09 -07:00 |
|
Benoit Steiner
|
33c702c79f
|
Added support for fast integer divisions by a constant
Sped up tensor slicing by a factor of 3 by using these fast integer divisions.
|
2014-08-14 22:13:21 -07:00 |
|