Commit message | Author | Age | ||
---|---|---|---|---|
... | ||||
| * | | Manually define int16_t and uint16_t when compiling with Visual Studio | 2016-10-08 | ||
| | | | ||||
* | | | Merge the content of the ComputeCpp branch into the default branch | 2016-10-07 | ||
|\ \ \ | | |/ | |/| | ||||
| | * | Fully support complex types in SumReducer and MeanReducer when building for CUDA by using scalar_sum_op and scalar_product_op instead of operator+ and operator*. | 2016-10-06 | ||
| |/ |/| | ||||
| * | Pull the latest updates from trunk | 2016-10-05 | ||
| |\ | ||||
| * | | Fixed compilation warning | 2016-10-05 | ||
| | | | ||||
* | | | ::rand() returns a signed integer on win32 | 2016-10-05 | ||
| | | | ||||
* | | | Fixed a typo that impacts windows builds | 2016-10-05 | ||
| |/ |/| | ||||
* | | Silenced compilation warning | 2016-10-04 | ||
| | | ||||
* | | Cleanup the cuda executor code. | 2016-10-04 | ||
| | | ||||
* | | Cleaned up the random number generation code. | 2016-10-04 | ||
| | | ||||
* | | Updated the tensor sum and mean reducer to enable them to process complex numbers on cuda gpus. | 2016-09-28 | ||
| | | ||||
| * | Converting alias template to nested struct in order to be compatible with CXX-03 | 2016-09-27 | ||
| | | ||||
* | | Made the initialization of a CUDA device thread safe. | 2016-09-26 | ||
| | | ||||
| * | Made TensorEvalTo compatible with c++0x again. | 2016-09-23 | ||
| | | ||||
| * | Deleted unused TensorMap data member | 2016-09-23 | ||
| | | ||||
* | | Made the gaussian generator usable on GPU | 2016-09-22 | ||
| | | ||||
* | | Deleted some unnecessary and confusing EIGEN_DEVICE_FUNC | 2016-09-19 | ||
| | | ||||
| * | Merged with default. | 2016-09-19 | ||
| |\ | ||||
| * | | Partial OpenCL support via SYCL compatible with ComputeCpp CE. | 2016-09-19 | ||
| | | | ||||
* | | | Fix order of "static inline". | 2016-09-16 | ||
| | | | ||||
* | | | Added missing EIGEN_DEVICE_FUNC qualifiers | 2016-09-14 | ||
| |/ |/| | ||||
* | | Fixed a bug impacting some outer reductions on GPU | 2016-09-12 | ||
| | | ||||
* | | Merged latest updates from trunk | 2016-09-12 | ||
|\ \ | ||||
* | | | Properly size the list of waiters | 2016-09-12 | ||
| | | | ||||
* | | | Added a resize method to MaxSizeVector | 2016-09-12 | ||
| | | | ||||
| * | | bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with some specializations in arch/SSE and arch/AVX) | 2016-09-08 | ||
| | | | ||||
* | | | Use MaxSizeVector instead of std::vector: xcode sometimes assumes that std::vector allocates aligned memory and therefore issues aligned instruction to initialize it. This can result in random crashes when compiling with AVX instructions enabled. | 2016-09-02 | ||
|/ / | ||||
* | | Pulled latest update from trunk | 2016-09-02 | ||
|\ \ | ||||
* | | | Made the index type an explicit template parameter to help some compilers compile the code. | 2016-09-02 | ||
| | | | ||||
| * | | Merged in rmlarsen/eigen (pull request PR-222) | 2016-09-02 | ||
|/| | | | | | | | | | | | Fix CUDA build broken by changes to min and max reduction. | |||
| * | | Fix CUDA build broken by changes to min and max reduction. | 2016-09-02 | ||
| | | | ||||
* | | | Updated the contraction code to support constant inputs. | 2016-09-01 | ||
|/ / | ||||
* | | Adjust Tensor module wrt recent change in nullary functor | 2016-09-01 | ||
| | | ||||
* | | Fix bugs to make min- and max reducers work correctly with IEEE infinities. | 2016-08-31 | ||
| | | ||||
* | | bug #1167: simplify installation of header files using cmake's install(DIRECTORY ...) command. | 2016-08-29 | ||
| | | ||||
* | | Add missing log1p method | 2016-08-26 | ||
| | | ||||
* | | Made the cost model cwiseMax and cwiseMin methods consts to help the PowerPC cuda compiler compile this code. | 2016-08-18 | ||
| | | ||||
* | | Force the inlining of a simple accessor. | 2016-08-18 | ||
| | | ||||
* | | Merged in ibab/eigen/double-tensor-reduction (pull request PR-216) | 2016-08-18 | ||
|\ \ | | | | | | | | | | Enable efficient Tensor reduction for doubles on the GPU (continued) | |||
| * | | Fix remaining CUDA >= 300 checks | 2016-08-18 | ||
| | | | ||||
| * | | Add the necessary CUDA >= 300 checks back | 2016-08-18 | ||
| | | | ||||
* | | | Properly detect the type of the result of a contraction. | 2016-08-16 | ||
| |/ |/| | ||||
* | | Use array_prod instead of calling TotalSize since TotalSize is only available on DSize. | 2016-08-15 | ||
| | | ||||
* | | Fixed a bug in the documentation. | 2016-08-12 | ||
| | | ||||
* | | Don't attempt to optimize partial reductions when the optimized implementation doesn't buy anything. | 2016-08-08 | ||
| | | ||||
| * | Remove CUDA >= 300 checks and enable outer reduction for doubles | 2016-08-06 | ||
| | | ||||
| * | Merge upstream changes | 2016-08-05 | ||
| |\ | |/ |/| | ||||
| * | Make use of atomicExch for atomicExchCustom | 2016-08-05 | ||
| | | ||||
* | | Merged in ibab/eigen (pull request PR-206) | 2016-08-03 | ||
|\ \ | | | | | | | | | | Expose real and imag methods on Tensors | |||
* | | | CUDA_ARCH isn't always defined, so avoid relying on it too much when figuring out which implementation to use for reductions. Instead rely on the device to tell us on which hardware version we're running. | 2016-08-03 | ||
| | | | ||||