Commit message (Author, Age)
...
* | | MSVC uses __uint128 while other compilers use __uint128_t to encode 128bit unsigned integers. Make the cxx11_tensor_uint128.cpp test work in both cases. (Benoit Steiner, 2016-03-04)
| | |
* | | Fixed syntax error (Benoit Steiner, 2016-03-04)
| | |
* | | Added missing include (Benoit Steiner, 2016-03-04)
| | |
* | | Don't use implicit type conversions in initializer lists since not all compilers support them. (Benoit Steiner, 2016-03-04)
| | |
* | | Made the contraction test more portable (Benoit Steiner, 2016-03-04)
| | |
* | | Fixed a typo (Benoit Steiner, 2016-03-04)
| | |
| | * Initial implementation of igamma and igammac. (Eugene Brevdo, 2016-03-03)
| |/
* | Added tests to cover the new rounding, flooring and ceiling tensor operations. (Benoit Steiner, 2016-03-03)
| |
* | Added support for rounding, flooring, and ceiling to the tensor api (Benoit Steiner, 2016-03-03)
| |
* | Added a test to validate the conversion of half floats into floats on Kepler GPUs. Restricted the testing of the random number generation code to GPU architecture greater than or equal to 3.5. (Benoit Steiner, 2016-03-03)
| |
* | Enable partial support for half floats on Kepler GPUs. (Benoit Steiner, 2016-03-03)
| |
* | Enable the conversion between floats and half floats on older GPUs that support it. (Benoit Steiner, 2016-03-03)
| |
* | Merged in ebrevdo/eigen (pull request PR-167) (Benoit Steiner, 2016-03-03)
|\|   Add infinity() support to numext::numeric_limits, use it in lgamma. I tested the code on my gtx-titan-black gpu, and it appears to work as expected.
| * Small bugfix to numeric_limits for CUDA. (Eugene Brevdo, 2016-03-02)
| |
| * Add infinity() support to numext::numeric_limits, use it in lgamma. (Eugene Brevdo, 2016-03-02)
| |   This makes the infinity access a __device__ function, removing nvcc warnings.
* | bug #537: fix compilation with Apples's compiler (Gael Guennebaud, 2016-03-02)
| |
* | Pulled latest updates from trunk (Benoit Steiner, 2016-03-01)
|\ \
| * | Compilation fix (Gael Guennebaud, 2016-03-01)
| | |
| * | Compilation fix (Gael Guennebaud, 2016-03-01)
| | |
* | | Improved the performance of large outer reductions on cuda (Benoit Steiner, 2016-02-29)
|/ /
* | Added benchmarks for full reduction (Benoit Steiner, 2016-02-29)
| |
* | Made the signature of the inner and outer reducers consistent (Benoit Steiner, 2016-02-29)
| |
* | Optimized the performance of narrow reductions on CUDA devices (Benoit Steiner, 2016-02-29)
| |
* | Fix shortcoming in fixed-value deduction of startRow/startCol (Gael Guennebaud, 2016-02-29)
| |
* | Print some information to stderr when a CUDA kernel fails (Benoit Steiner, 2016-02-27)
| |
* | Improved the README (Benoit Steiner, 2016-02-27)
| |
* | bug #1172: make valuePtr and innderIndexPtr properly return null for empty matrices. (Gael Guennebaud, 2016-02-27)
| |
* | Properly vectorized the random number generators (Benoit Steiner, 2016-02-26)
| |
* | Made the TensorIndexList usable on GPU without having to use the -relaxed-constexpr compilation flag (Benoit Steiner, 2016-02-26)
| |
* | Added benchmarks for type casting of float16 (Benoit Steiner, 2016-02-26)
| |
* | Added benchmarks for fp16 (Benoit Steiner, 2016-02-26)
| |
* | Reverted previous commit since it caused more problems than it solved (Benoit Steiner, 2016-02-26)
| |
* | Fixed handling of long doubles on aarch64 (Benoit Steiner, 2016-02-26)
| |
* | Made the CUDA architecture level a build setting. (Benoit Steiner, 2016-02-25)
| |
* | Fixed a typo in the reduction code that could prevent large full reductionsx from running properly on old cuda devices. (Benoit Steiner, 2016-02-24)
| |
* | Marked the And and Or reducers as stateless. (Benoit Steiner, 2016-02-24)
| |
* | merge (Gael Guennebaud, 2016-02-23)
|\ \
* | | Fix startRow()/startCol() for dense Block with direct access: (Gael Guennebaud, 2016-02-23)
| | |   the initial implementation failed for empty rows/columns for which are ambiguous.
| * | Updated the padding code to work with half floats (Benoit Steiner, 2016-02-23)
| | |
| * | Extended the tensor benchmark suite to support types other than floats (Benoit Steiner, 2016-02-23)
| | |
| * | Updated the tensor benchmarking code to work with compilers that don't support cxx11. (Benoit Steiner, 2016-02-23)
| | |
| * | Deleted the coordinate based evaluation of tensor expressions, since it's hardly ever used and started to cause some issues with some versions of xcode. (Benoit Steiner, 2016-02-22)
| | |
| * | Declare the half float type as arithmetic. (Benoit Steiner, 2016-02-22)
| | |
| * | include <iostream> in the tensor header since we now use it to better report cuda initialization errors (Benoit Steiner, 2016-02-22)
| | |
| * | Fixed compilation warning generated by clang (Benoit Steiner, 2016-02-21)
| | |
| * | Implemented the ptranspose function on half floats (Benoit Steiner, 2016-02-21)
| | |
| * | Pulled latest updates from trunk (Benoit Steiner, 2016-02-21)
| |\ \
| * | | Added the ability to compute the absolute value of a half float (Benoit Steiner, 2016-02-21)
| | | |
| | * | Added some debugging information to the test to figure out why it fails sometimes (Benoit Steiner, 2016-02-21)
| | | |
| | * | Optimized casting of tensors in the case where the casting happens to be a no-op (Benoit Steiner, 2016-02-21)
| |/ /