aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Expand)AuthorAge
...
* | Added benchmarks for full reductionGravatar Benoit Steiner2016-02-29
* | Made the signature of the inner and outer reducers consistentGravatar Benoit Steiner2016-02-29
* | Optimized the performance of narrow reductions on CUDA devicesGravatar Benoit Steiner2016-02-29
* | Fix shortcoming in fixed-value deduction of startRow/startColGravatar Gael Guennebaud2016-02-29
* | Print some information to stderr when a CUDA kernel failsGravatar Benoit Steiner2016-02-27
* | Improved the READMEGravatar Benoit Steiner2016-02-27
* | bug #1172: make valuePtr and innderIndexPtr properly return null for empty ma...Gravatar Gael Guennebaud2016-02-27
* | Properly vectorized the random number generatorsGravatar Benoit Steiner2016-02-26
* | Made the TensorIndexList usable on GPU without having to use the -relaxed-con...Gravatar Benoit Steiner2016-02-26
* | Added benchmarks for type casting of float16Gravatar Benoit Steiner2016-02-26
* | Added benchmarks for fp16Gravatar Benoit Steiner2016-02-26
* | Reverted previous commit since it caused more problems than it solvedGravatar Benoit Steiner2016-02-26
* | Fixed handling of long doubles on aarch64Gravatar Benoit Steiner2016-02-26
* | Made the CUDA architecture level a build setting.Gravatar Benoit Steiner2016-02-25
* | Fixed a typo in the reduction code that could prevent large full reductionsx ...Gravatar Benoit Steiner2016-02-24
* | Marked the And and Or reducers as stateless.Gravatar Benoit Steiner2016-02-24
* | mergeGravatar Gael Guennebaud2016-02-23
|\ \
* | | Fix startRow()/startCol() for dense Block with direct access:Gravatar Gael Guennebaud2016-02-23
| * | Updated the padding code to work with half floatsGravatar Benoit Steiner2016-02-23
| * | Extended the tensor benchmark suite to support types other than floatsGravatar Benoit Steiner2016-02-23
| * | Updated the tensor benchmarking code to work with compilers that don't suppor...Gravatar Benoit Steiner2016-02-23
| * | Deleted the coordinate based evaluation of tensor expressions, since it's har...Gravatar Benoit Steiner2016-02-22
| * | Declare the half float type as arithmetic.Gravatar Benoit Steiner2016-02-22
| * | include <iostream> in the tensor header since we now use it to better report ...Gravatar Benoit Steiner2016-02-22
| * | Fixed compilation warning generated by clangGravatar Benoit Steiner2016-02-21
| * | Implemented the ptranspose function on half floatsGravatar Benoit Steiner2016-02-21
| * | Pulled latest updates from trunkGravatar Benoit Steiner2016-02-21
| |\ \
| * | | Added the ability to compute the absolute value of a half floatGravatar Benoit Steiner2016-02-21
| | * | Added some debugging information to the test to figure out why it fails somet...Gravatar Benoit Steiner2016-02-21
| | * | Optimized casting of tensors in the case where the casting happens to be a no-opGravatar Benoit Steiner2016-02-21
| |/ /
| * / Prevent unecessary Index to int conversionsGravatar Benoit Steiner2016-02-21
|/ /
* | Moved some of the fp16 operators outside the Eigen namespace to workaround so...Gravatar Benoit Steiner2016-02-20
* | Fixed the float16 tensor test.Gravatar Benoit Steiner2016-02-20
* | Get rid of duplicate code.Gravatar Rasmus Munk Larsen2016-02-19
* | Speed up tensor FFT by up ~25-50%.Gravatar Rasmus Munk Larsen2016-02-19
* | mergeGravatar Gael Guennebaud2016-02-19
|\ \
* | | Add COD and BDCSVD in list of benched solvers.Gravatar Gael Guennebaud2016-02-19
* | | Extend unit test to stress smart_copy with empty input/output.Gravatar Gael Guennebaud2016-02-19
* | | bug #1170: skip calls to memcpy/memmove for empty imput.Gravatar Gael Guennebaud2016-02-19
| * | Print an error message to stderr when the initialization of the CUDA runtime ...Gravatar Benoit Steiner2016-02-19
* | | Fix nesting type and complete reflection methods of Block expressions.Gravatar Gael Guennebaud2016-02-19
* | | Add typedefs for the return type of all block methods.Gravatar Gael Guennebaud2016-02-19
| * | Updated the contraction code to make it compatible with half floats.Gravatar Benoit Steiner2016-02-19
| * | Added support for tensor reductions on half floatsGravatar Benoit Steiner2016-02-19
| * | Implemented the scalar division of 2 half floatsGravatar Benoit Steiner2016-02-19
| * | Added the ability to query the minor version of a cuda deviceGravatar Benoit Steiner2016-02-19
| * | Started to work on contractions and reductions using half floatsGravatar Benoit Steiner2016-02-19
| * | Don't make the array constructors explicitGravatar Benoit Steiner2016-02-19
| * | Added support for operators +=, -=, *= and /= on CUDA half floatsGravatar Benoit Steiner2016-02-19
| * | Implemented protate() for CUDAGravatar Benoit Steiner2016-02-19