Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Increased the robustness of the reduction tests on fp16 | Benoit Steiner | 2016-10-05 |
| | |||
* | Increase the tolerance to numerical noise. | Benoit Steiner | 2016-10-05 |
| | |||
* | ::rand() returns a signed integer on win32 | Benoit Steiner | 2016-10-05 |
| | |||
* | Fixed a typo that impacts windows builds | Benoit Steiner | 2016-10-05 |
| | |||
* | Silenced compilation warning | Benoit Steiner | 2016-10-04 |
| | |||
* | Cleanup the cuda executor code. | Benoit Steiner | 2016-10-04 |
| | |||
* | Cleaned up the random number generation code. | Benoit Steiner | 2016-10-04 |
| | |||
* | Improved support for compiling CUDA code with clang as the host compiler | Benoit Steiner | 2016-10-03 |
| | |||
* | Updated the tensor sum and mean reducer to enable them to process complex ↵ | Benoit Steiner | 2016-09-28 |
| | | | | numbers on cuda gpus. | ||
* | Made the initialization of a CUDA device thread safe. | Benoit Steiner | 2016-09-26 |
| | |||
* | Made the gaussian generator usable on GPU | Benoit Steiner | 2016-09-22 |
| | |||
* | Don't use c++11 features and fix include. | RJ Ryan | 2016-09-20 |
| | |||
* | Add CUDA-specific std::complex<T> specializations for scalar_sum_op, ↵ | RJ Ryan | 2016-09-20 |
| | | | | scalar_difference_op, scalar_product_op, and scalar_quotient_op. | ||
* | Merged hongkai-dai/eigen/tip into default (bug #1298) | Gael Guennebaud | 2016-09-19 |
|\ | |||
* | | Deleted some unecessary and confusing EIGEN_DEVICE_FUNC | Benoit Steiner | 2016-09-19 |
| | | |||
| * | remove ternary operator in euler angles | Hongkai Dai | 2016-09-19 |
|/ | |||
* | Made AutoDiffJacobian more intuitive to use and updated for C++11 | Emil Fresk | 2016-09-16 |
| | | | | | | | | | | | | Changes: * Removed unnecessary types from the Functor by inferring from its types * Removed inputs() function reference, replaced with .rows() * Updated the forward constructor to use variadic templates * Added optional parameters to the Fuctor for passing parameters, control signals, etc * Has been tested with fixed size and dynamic matricies Ammendment by chtz: overload operator() for compatibility with not fully conforming compilers | ||
* | Fix order of "static inline". | Gael Guennebaud | 2016-09-16 |
| | |||
* | Added missing EIGEN_DEVICE_FUNC qualifiers | Benoit Steiner | 2016-09-14 |
| | |||
* | Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= ↵ | Benoit Steiner | 2016-09-12 |
| | | | | 3.0) since the test instantiate contractions that require a modern gpu. | ||
* | CUDA contractions require arch >= 3.0: don't compile the cuda contraction ↵ | Benoit Steiner | 2016-09-12 |
| | | | | tests on older architectures. | ||
* | Fixed a bug impacting some outer reductions on GPU | Benoit Steiner | 2016-09-12 |
| | |||
* | Added the ability to compute the absolute value of a complex number on GPU, ↵ | Benoit Steiner | 2016-09-12 |
| | | | | as well as a test to catch the problem. | ||
* | Merged latest updates from trunk | Benoit Steiner | 2016-09-12 |
|\ | |||
* | | Properly size the list of waiters | Benoit Steiner | 2016-09-12 |
| | | |||
* | | Added a resize method to MaxSizeVector | Benoit Steiner | 2016-09-12 |
| | | |||
| * | bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with ↵ | Gael Guennebaud | 2016-09-08 |
| | | | | | | | | some specializations in arch/SSE and arch/AVX) | ||
| * | bug #1288: fix memory leak in arpack wrapper. | Gael Guennebaud | 2016-09-05 |
| | | |||
| * | Fix compilation when cuda_fp16.h does not exist. | Gael Guennebaud | 2016-09-05 |
| | | |||
* | | Fixed a regression test | Benoit Steiner | 2016-09-02 |
| | | |||
* | | Use MaxSizeVector instead of std::vector: xcode sometimes assumes that ↵ | Benoit Steiner | 2016-09-02 |
|/ | | | | std::vector allocates aligned memory and therefore issues aligned instruction to initialize it. This can result in random crashes when compiling with AVX instructions enabled. | ||
* | Pulled latest update from trunk | Benoit Steiner | 2016-09-02 |
|\ | |||
* | | Made the index type an explicit template parameter to help some compilers ↵ | Benoit Steiner | 2016-09-02 |
| | | | | | | | | compile the code. | ||
| * | Merged in rmlarsen/eigen (pull request PR-222) | Benoit Steiner | 2016-09-02 |
|/| | | | | | | | Fix CUDA build broken by changes to min and max reduction. | ||
| * | Fix CUDA build broken by changes to min and max reduction. | Rasmus Munk Larsen | 2016-09-02 |
| | | |||
* | | Fix the cxx11_tensor_cuda.cu test on 32bit platforms. | Benoit Steiner | 2016-09-02 |
| | | |||
* | | Added a test for nullary expressions on CUDA | Benoit Steiner | 2016-09-01 |
| | | | | | | | | Also check that we can mix 64 and 32 bit indices in the same compilation unit | ||
* | | Updated the contraction code to support constant inputs. | Benoit Steiner | 2016-09-01 |
|/ | |||
* | Adjust Tensor module wrt recent change in nullary functor | Gael Guennebaud | 2016-09-01 |
| | |||
* | Fix compilation with CUDA 8 | Gael Guennebaud | 2016-09-01 |
| | |||
* | Fix bugs to make min- and max reducers with correctly with IEEE infinities. | Rasmus Munk Larsen | 2016-08-31 |
| | |||
* | merge EulerAngles module | Gael Guennebaud | 2016-08-30 |
|\ | |||
* | | Include missing forward declaration of SparseMatrix | Gael Guennebaud | 2016-08-29 |
| | | |||
* | | Fix compilation with cuda 8 | Gael Guennebaud | 2016-08-29 |
| | | |||
* | | bug #1167: simplify installation of header files using cmake's ↵ | Gael Guennebaud | 2016-08-29 |
| | | | | | | | | install(DIRECTORY ...) command. | ||
* | | enable vectorization path when testing half on cuda, and add test for log1p | Gael Guennebaud | 2016-08-26 |
| | | |||
* | | Add missing log1p method | Gael Guennebaud | 2016-08-26 |
| | | |||
* | | Made the cost model cwiseMax and cwiseMin methods consts to help the PowerPC ↵ | Benoit Steiner | 2016-08-18 |
| | | | | | | | | cuda compiler compile this code. | ||
* | | Force the inlining of a simple accessor. | Benoit Steiner | 2016-08-18 |
| | | |||
* | | Merged in ibab/eigen/double-tensor-reduction (pull request PR-216) | Benoit Steiner | 2016-08-18 |
|\ \ | | | | | | | | | | Enable efficient Tensor reduction for doubles on the GPU (continued) |