Commit message | Author | Age | |
---|---|---|---|
* | Enabled the use of fixed dimensions from within a cuda kernel. | Benoit Steiner | 2016-01-11 |
| | |||
* | Deleted unused variable. | Benoit Steiner | 2016-01-11 |
| | |||
* | Silenced an nvcc compilation warning | Benoit Steiner | 2016-01-11 |
| | |||
* | Silenced several compilation warnings triggered by nvcc. | Benoit Steiner | 2016-01-11 |
| | |||
* | Merged in jeremy_barnes/eigen/shader-model-3.0 (pull request PR-152). Alternative way of forcing instantiation of device kernels without causing warnings or requiring device to device kernel invocations. | Benoit Steiner | 2016-01-11 |
|\ | |||
* | | Fixed a bug in the dispatch of optimized reduction kernels. | Benoit Steiner | 2016-01-11 |
| | | |||
* | | Re-enabled the optimized reduction CUDA code. | Benoit Steiner | 2016-01-11 |
| | | |||
| * | Cleaned up double-defined macro from last commit | Jeremy Barnes | 2016-01-10 |
| | | |||
| * | Alternative way of forcing instantiation of device kernels without causing warnings or requiring device to device kernel invocations. This allows TensorFlow to work on SM 3.0 (i.e., Amazon EC2) machines. | Jeremy Barnes | 2016-01-10 |
|/ | |||
* | merge | Gael Guennebaud | 2016-01-09 |
|\ | |||
* | | bug #1144: fix regression in x=y+A*x (aliasing), and move evaluator_traits::AssumeAliasing to evaluator_assume_aliasing. | Gael Guennebaud | 2016-01-09 |
| | | |||
| * | Simplified the dispatch code. | Benoit Steiner | 2016-01-08 |
| | | |||
| * | Made it possible to use arrays of size 0 on CUDA devices | Benoit Steiner | 2016-01-08 |
| | | |||
| * | Reworked the dispatch of optimized cuda reduction kernels to work around an nvcc bug that prevented the code from compiling in optimized mode in some cases | Benoit Steiner | 2016-01-08 |
| | | |||
| * | Prevent nvcc from miscompiling the cuda metakernel. Unfortunately this reintroduces some compilation warnings but it's much better than having to deal with random assertion failures. | Benoit Steiner | 2016-01-08 |
| | | |||
* | | extend matlab conversion table | Gael Guennebaud | 2016-01-08 |
|/ | |||
* | Removed a couple of partial specializations that confuse nvcc and result in errors such as this: error: more than one partial specialization matches the template argument list of class "Eigen::internal::get<3, Eigen::internal::numeric_list<std::size_t, 1UL, 1UL, 1UL, 1UL>>" "Eigen::internal::get<n, Eigen::internal::numeric_list<T, a, as...>>" "Eigen::internal::get<n, Eigen::internal::numeric_list<T, as...>>" | Benoit Steiner | 2016-01-07 |
| | |||
* | Fixed a typo. | Benoit Steiner | 2016-01-06 |
| | |||
* | Optimized the performance of broadcasting of scalars. | Benoit Steiner | 2016-01-06 |
| | |||
* | rm remaining debug code | Gael Guennebaud | 2016-01-06 |
| | |||
* | bug #1143: Work-around gcc bug | Christoph Hertzberg | 2016-01-06 |
| | |||
* | Improved the performance of reductions on CUDA devices | Benoit Steiner | 2016-01-04 |
| | |||
* | Added a 'divup' util to compute the ceiling of the quotient of two integers | Benoit Steiner | 2016-01-04 |
| | |||
* | Improve inline documentation of SparseCompressedBase and its derived classes | Gael Guennebaud | 2016-01-03 |
| | |||
* | Fix numerous doxygen shortcomings, and work around some clang -Wdocumentation warnings | Gael Guennebaud | 2016-01-01 |
| | |||
* | Mark AlignedBit and EvalBeforeNestingBit with deprecated attribute, and remove the remaining usages of EvalBeforeNestingBit. | Gael Guennebaud | 2015-12-30 |
| | |||
* | Work around clang -Wdocumentation warning about "/*<" | Gael Guennebaud | 2015-12-30 |
| | |||
* | Work around "empty paragraph" warning with clang -Wdocumentation | Gael Guennebaud | 2015-12-30 |
| | |||
* | Add missing doc of Derived template parameter | Gael Guennebaud | 2015-12-30 |
| | |||
* | Fix numerous doxygen issues in auto-link generation | Gael Guennebaud | 2015-12-30 |
| | |||
* | Fix links to Eigen2-to-Eigen3 porting helpers | Gael Guennebaud | 2015-12-30 |
| | |||
* | Recent versions of doxygen misparsed Eigen/* headers | Gael Guennebaud | 2015-12-30 |
| | |||
* | Add missing snippets for erf/erfc/lgamma functions. | Gael Guennebaud | 2015-12-30 |
| | |||
* | Add missing snippet for sign/cwiseSign functions. | Gael Guennebaud | 2015-12-30 |
| | |||
* | Add missing ctor from uint | Gael Guennebaud | 2015-12-30 |
| | |||
* | bug #1141: add missing initialization of CholmodBase::m_*IsOk | Gael Guennebaud | 2015-12-29 |
| | |||
* | Work around compilers that do not even define _mm256_set_m128. | Gael Guennebaud | 2015-12-24 |
| | |||
* | Don't attempt to vectorize mean reductions of integers since we can't use SSE or AVX instructions to divide 2 integers. | Benoit Steiner | 2015-12-22 |
| | |||
* | Optimized the configuration of the outer reduction cuda kernel | Benoit Steiner | 2015-12-22 |
| | |||
* | Added missing define | Benoit Steiner | 2015-12-22 |
| | |||
* | Made sure the optimized gpu reduction code is actually compiled. | Benoit Steiner | 2015-12-22 |
| | |||
* | Optimized outer reduction on GPUs. | Benoit Steiner | 2015-12-22 |
| | |||
* | Made it possible to run the lgamma, erf, and erfc functors on a CUDA gpu. | Benoit Steiner | 2015-12-21 |
| | |||
* | Added missing const | Benoit Steiner | 2015-12-21 |
| | |||
* | Merged in connor-k/eigen (pull request PR-149). [doc] Remove extra ';' in Advanced Initialization sample | Benoit Steiner | 2015-12-21 |
|\ | |||
* | | Fixed a typo in previous change. | Benoit Steiner | 2015-12-21 |
| | | |||
* | | Added support for CUDA architectures that don't support 3.5 capabilities | Benoit Steiner | 2015-12-21 |
| | | |||
| * | [doc] Remove extra ';' in Tutorial_AdvancedInitialization_Join.cpp | connor-k | 2015-12-21 |
|/ | |||
* | Fixed a typo. | Benoit Steiner | 2015-12-18 |
| | |||
* | Add alignment requirement for local buffer used by the slicing op. | Benoit Steiner | 2015-12-18 |
| |