aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported
Commit message (Collapse)AuthorAge
* Enable the random number generators when compiling with visual studioGravatar Benoit Steiner2016-03-09
|
* Fixed the integer division code on windowsGravatar Benoit Steiner2016-03-09
|
* Fixed static assertionGravatar Benoit Steiner2016-03-08
|
* Replace std::vector with our own implementation, as using the stl when ↵Gravatar Benoit Steiner2016-03-08
| | | | compiling with nvcc and avx enabled leads to many issues.
* Simplified the full reduction codeGravatar Benoit Steiner2016-03-08
|
* Fixed the tensor generator codeGravatar Benoit Steiner2016-03-08
|
* Fixed the tensor concatenation codeGravatar Benoit Steiner2016-03-08
|
* Fixed the tensor layout swapping codeGravatar Benoit Steiner2016-03-08
|
* Fixed the tensor chipping code.Gravatar Benoit Steiner2016-03-08
|
* Decoupled the packet type definition from the definition of the tensor ops. ↵Gravatar Benoit Steiner2016-03-08
| | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit.
* Use NumTraits::highest() and NumTraits::lowest() instead of the ↵Gravatar Benoit Steiner2016-03-07
| | | | std::numeric_limits to make the tensor min and max functors more CUDA friendly.
* Added the ability to pad a tensor using a non-zero valueGravatar Benoit Steiner2016-03-07
|
* Fix a couple of typos in the code.Gravatar Benoit Steiner2016-03-07
|
* Added a test to validate the behavior of some of the tensor syntactic sugar.Gravatar Benoit Steiner2016-03-07
|
* Added missing includeGravatar Benoit Steiner2016-03-06
|
* Don't try to compile the uint128 test with compilers that don't support uint127Gravatar Benoit Steiner2016-03-06
|
* Don't warn that msvc 2015 isn't c++11 compliant just because it doesn't ↵Gravatar Benoit Steiner2016-03-06
| | | | claim to be.
* Turn on some of the cxx11 features when compiling with visual studio 2015Gravatar Benoit Steiner2016-03-05
|
* Don't test our 128bit emulation code when compiling with msvcGravatar Benoit Steiner2016-03-05
|
* Avoid using initializer lists in test since not all version of msvc support themGravatar Benoit Steiner2016-03-05
|
* Use EIGEN_PI instead of redefining our own constant PIGravatar Benoit Steiner2016-03-05
|
* Use the CMAKE_CXX_STANDARD variable to turn on cxx11Gravatar Benoit Steiner2016-03-04
|
* Don't rely on the M_PI constant since not all compilers provide it.Gravatar Benoit Steiner2016-03-04
|
* Fixed the computation of leading zeros when compiling with msvc.Gravatar Benoit Steiner2016-03-04
|
* MSVC uses __uint128 while other compilers use __uint128_t to encode 128bit ↵Gravatar Benoit Steiner2016-03-04
| | | | unsigned integers. Make the cxx11_tensor_uint128.cpp test work in both cases.
* Fixed syntax errorGravatar Benoit Steiner2016-03-04
|
* Added missing includeGravatar Benoit Steiner2016-03-04
|
* Don't use implicit type conversions in initializer lists since not all ↵Gravatar Benoit Steiner2016-03-04
| | | | compilers support them.
* Made the contraction test more portableGravatar Benoit Steiner2016-03-04
|
* Fixed a typoGravatar Benoit Steiner2016-03-04
|
* Added tests to cover the new rounding, flooring and ceiling tensor operations.Gravatar Benoit Steiner2016-03-03
|
* Added support for rounding, flooring, and ceiling to the tensor apiGravatar Benoit Steiner2016-03-03
|
* Added a test to validate the conversion of half floats into floats on Kepler ↵Gravatar Benoit Steiner2016-03-03
| | | | | | GPUs. Restricted the testing of the random number generation code to GPU architecture greater than or equal to 3.5.
* Improved the performance of large outer reductions on cudaGravatar Benoit Steiner2016-02-29
|
* Made the signature of the inner and outer reducers consistentGravatar Benoit Steiner2016-02-29
|
* Optimized the performance of narrow reductions on CUDA devicesGravatar Benoit Steiner2016-02-29
|
* Print some information to stderr when a CUDA kernel failsGravatar Benoit Steiner2016-02-27
|
* Properly vectorized the random number generatorsGravatar Benoit Steiner2016-02-26
|
* Made the TensorIndexList usable on GPU without having to use the ↵Gravatar Benoit Steiner2016-02-26
| | | | -relaxed-constexpr compilation flag
* Reverted previous commit since it caused more problems than it solvedGravatar Benoit Steiner2016-02-26
|
* Fixed handling of long doubles on aarch64Gravatar Benoit Steiner2016-02-26
|
* Made the CUDA architecture level a build setting.Gravatar Benoit Steiner2016-02-25
|
* Fixed a typo in the reduction code that could prevent large full reductionsx ↵Gravatar Benoit Steiner2016-02-24
| | | | from running properly on old cuda devices.
* Marked the And and Or reducers as stateless.Gravatar Benoit Steiner2016-02-24
|
* Updated the padding code to work with half floatsGravatar Benoit Steiner2016-02-23
|
* Deleted the coordinate based evaluation of tensor expressions, since it's ↵Gravatar Benoit Steiner2016-02-22
| | | | hardly ever used and started to cause some issues with some versions of xcode.
* include <iostream> in the tensor header since we now use it to better report ↵Gravatar Benoit Steiner2016-02-22
| | | | cuda initialization errors
* Fixed compilation warning generated by clangGravatar Benoit Steiner2016-02-21
|
* Pulled latest updates from trunkGravatar Benoit Steiner2016-02-21
|\
* | Added the ability to compute the absolute value of a half floatGravatar Benoit Steiner2016-02-21
| |