Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Fully support complex types in SumReducer and MeanReducer when building for ↵ | 2016-10-06 | |
| | | | | CUDA by using scalar_sum_op and scalar_product_op instead of operator+ and operator*. | ||
* | Increased the robustness of the reduction tests on fp16 | 2016-10-05 | |
| | |||
* | Increase the tolerance to numerical noise. | 2016-10-05 | |
| | |||
* | ::rand() returns a signed integer on win32 | 2016-10-05 | |
| | |||
* | Fixed a typo that impacts windows builds | 2016-10-05 | |
| | |||
* | Silenced compilation warning | 2016-10-04 | |
| | |||
* | Cleanup the cuda executor code. | 2016-10-04 | |
| | |||
* | Cleaned up the random number generation code. | 2016-10-04 | |
| | |||
* | Improved support for compiling CUDA code with clang as the host compiler | 2016-10-03 | |
| | |||
* | Updated the tensor sum and mean reducer to enable them to process complex ↵ | 2016-09-28 | |
| | | | | numbers on cuda gpus. | ||
* | Made the initialization of a CUDA device thread safe. | 2016-09-26 | |
| | |||
* | Made the gaussian generator usable on GPU | 2016-09-22 | |
| | |||
* | Don't use c++11 features and fix include. | 2016-09-20 | |
| | |||
* | Add CUDA-specific std::complex<T> specializations for scalar_sum_op, ↵ | 2016-09-20 | |
| | | | | scalar_difference_op, scalar_product_op, and scalar_quotient_op. | ||
* | Merged hongkai-dai/eigen/tip into default (bug #1298) | 2016-09-19 | |
|\ | |||
* | | Deleted some unecessary and confusing EIGEN_DEVICE_FUNC | 2016-09-19 | |
| | | |||
| * | remove ternary operator in euler angles | 2016-09-19 | |
|/ | |||
* | Made AutoDiffJacobian more intuitive to use and updated for C++11 | 2016-09-16 | |
| | | | | | | | | | | | | Changes: * Removed unnecessary types from the Functor by inferring from its types * Removed inputs() function reference, replaced with .rows() * Updated the forward constructor to use variadic templates * Added optional parameters to the Fuctor for passing parameters, control signals, etc * Has been tested with fixed size and dynamic matricies Ammendment by chtz: overload operator() for compatibility with not fully conforming compilers | ||
* | Fix order of "static inline". | 2016-09-16 | |
| | |||
* | Added missing EIGEN_DEVICE_FUNC qualifiers | 2016-09-14 | |
| | |||
* | Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= ↵ | 2016-09-12 | |
| | | | | 3.0) since the test instantiate contractions that require a modern gpu. | ||
* | CUDA contractions require arch >= 3.0: don't compile the cuda contraction ↵ | 2016-09-12 | |
| | | | | tests on older architectures. | ||
* | Fixed a bug impacting some outer reductions on GPU | 2016-09-12 | |
| | |||
* | Added the ability to compute the absolute value of a complex number on GPU, ↵ | 2016-09-12 | |
| | | | | as well as a test to catch the problem. | ||
* | Merged latest updates from trunk | 2016-09-12 | |
|\ | |||
* | | Properly size the list of waiters | 2016-09-12 | |
| | | |||
* | | Added a resize method to MaxSizeVector | 2016-09-12 | |
| | | |||
| * | bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with ↵ | 2016-09-08 | |
| | | | | | | | | some specializations in arch/SSE and arch/AVX) | ||
| * | bug #1288: fix memory leak in arpack wrapper. | 2016-09-05 | |
| | | |||
| * | Fix compilation when cuda_fp16.h does not exist. | 2016-09-05 | |
| | | |||
* | | Fixed a regression test | 2016-09-02 | |
| | | |||
* | | Use MaxSizeVector instead of std::vector: xcode sometimes assumes that ↵ | 2016-09-02 | |
|/ | | | | std::vector allocates aligned memory and therefore issues aligned instruction to initialize it. This can result in random crashes when compiling with AVX instructions enabled. | ||
* | Pulled latest update from trunk | 2016-09-02 | |
|\ | |||
* | | Made the index type an explicit template parameter to help some compilers ↵ | 2016-09-02 | |
| | | | | | | | | compile the code. | ||
| * | Merged in rmlarsen/eigen (pull request PR-222) | 2016-09-02 | |
|/| | | | | | | | Fix CUDA build broken by changes to min and max reduction. | ||
| * | Fix CUDA build broken by changes to min and max reduction. | 2016-09-02 | |
| | | |||
* | | Fix the cxx11_tensor_cuda.cu test on 32bit platforms. | 2016-09-02 | |
| | | |||
* | | Added a test for nullary expressions on CUDA | 2016-09-01 | |
| | | | | | | | | Also check that we can mix 64 and 32 bit indices in the same compilation unit | ||
* | | Updated the contraction code to support constant inputs. | 2016-09-01 | |
|/ | |||
* | Adjust Tensor module wrt recent change in nullary functor | 2016-09-01 | |
| | |||
* | Fix compilation with CUDA 8 | 2016-09-01 | |
| | |||
* | Fix bugs to make min- and max reducers with correctly with IEEE infinities. | 2016-08-31 | |
| | |||
* | merge EulerAngles module | 2016-08-30 | |
|\ | |||
* | | Include missing forward declaration of SparseMatrix | 2016-08-29 | |
| | | |||
* | | Fix compilation with cuda 8 | 2016-08-29 | |
| | | |||
* | | bug #1167: simplify installation of header files using cmake's ↵ | 2016-08-29 | |
| | | | | | | | | install(DIRECTORY ...) command. | ||
* | | enable vectorization path when testing half on cuda, and add test for log1p | 2016-08-26 | |
| | | |||
* | | Add missing log1p method | 2016-08-26 | |
| | | |||
* | | Made the cost model cwiseMax and cwiseMin methods consts to help the PowerPC ↵ | 2016-08-18 | |
| | | | | | | | | cuda compiler compile this code. | ||
* | | Force the inlining of a simple accessor. | 2016-08-18 | |
| | |