aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
* Enabled the use of fixed dimensions from within a cuda kernel.Gravatar Benoit Steiner2016-01-11
|
* Deleted unused variable.Gravatar Benoit Steiner2016-01-11
|
* Silenced a nvcc compilation warningGravatar Benoit Steiner2016-01-11
|
* Silenced several compilation warnings triggered by nvcc.Gravatar Benoit Steiner2016-01-11
|
* Merged in jeremy_barnes/eigen/shader-model-3.0 (pull request PR-152)Gravatar Benoit Steiner2016-01-11
|\ | | | | | | Alternative way of forcing instantiation of device kernels without causing warnings or requiring device to device kernel invocations.
* | Fixed a bug in the dispatch of optimized reduction kernels.Gravatar Benoit Steiner2016-01-11
| |
* | Re-enabled the optimized reduction CUDA code.Gravatar Benoit Steiner2016-01-11
| |
| * Cleaned up double-defined macro from last commitGravatar Jeremy Barnes2016-01-10
| |
| * Alternative way of forcing instantiation of device kernels withoutGravatar Jeremy Barnes2016-01-10
|/ | | | | | causing warnings or requiring device to device kernel invocations. This allows Tensorflow to work on SM 3.0 (ie, Amazon EC2) machines.
* mergeGravatar Gael Guennebaud2016-01-09
|\
* | bug #1144: fix regression in x=y+A*x (aliasing), and move ↵Gravatar Gael Guennebaud2016-01-09
| | | | | | | | evaluator_traits::AssumeAliasing to evaluator_assume_aliasing.
| * Simplified the dispatch code.Gravatar Benoit Steiner2016-01-08
| |
| * Made it possible to use array of size 0 on CUDA devicesGravatar Benoit Steiner2016-01-08
| |
| * Reworked the dispatch of optimized cuda reduction kernels to workaround a ↵Gravatar Benoit Steiner2016-01-08
| | | | | | | | nvcc bug that prevented the code from compiling in optimized mode in some cases
| * Prevent nvcc from miscompiling the cuda metakernel. Unfortunately this ↵Gravatar Benoit Steiner2016-01-08
| | | | | | | | reintroduces some compulation warnings but it's much better than having to deal with random assertion failures.
* | extend matlab conversion tableGravatar Gael Guennebaud2016-01-08
|/
* Removed a couple of partial specialization that confuse nvcc and result in ↵Gravatar Benoit Steiner2016-01-07
| | | | | | | | errors such as this: error: more than one partial specialization matches the template argument list of class "Eigen::internal::get<3, Eigen::internal::numeric_list<std::size_t, 1UL, 1UL, 1UL, 1UL>>" "Eigen::internal::get<n, Eigen::internal::numeric_list<T, a, as...>>" "Eigen::internal::get<n, Eigen::internal::numeric_list<T, as...>>"
* Fixed a typo.Gravatar Benoit Steiner2016-01-06
|
* Optimized the performance of broadcasting of scalars.Gravatar Benoit Steiner2016-01-06
|
* rm remaining debug codeGravatar Gael Guennebaud2016-01-06
|
* bug #1143: Work-around gcc bugGravatar Christoph Hertzberg2016-01-06
|
* Improved the performance of reductions on CUDA devicesGravatar Benoit Steiner2016-01-04
|
* Added a 'divup' util to compute the floor of the quotient of two integersGravatar Benoit Steiner2016-01-04
|
* Improve inline documentation of SparseCompressedBase and its derived classesGravatar Gael Guennebaud2016-01-03
|
* Fix numerous doxygen shortcomings, and workaround some clang -Wdocumentation ↵Gravatar Gael Guennebaud2016-01-01
| | | | warnings
* Mark AlignedBit and EvalBeforeNestingBit with deprecated attribute, and ↵Gravatar Gael Guennebaud2015-12-30
| | | | remove the remaining usages of EvalBeforeNestingBit.
* Workaround clang -Wdocumentation warning about "/*<"Gravatar Gael Guennebaud2015-12-30
|
* Workaround "empty paragraph" warning with clang -WdocumentationGravatar Gael Guennebaud2015-12-30
|
* Add missing doc of Derived template parameterGravatar Gael Guennebaud2015-12-30
|
* Fix numerous doxygen issues in auto-link generationGravatar Gael Guennebaud2015-12-30
|
* Fix links to Eigen2-to-Eigen3 porting helpersGravatar Gael Guennebaud2015-12-30
|
* Recent versions of doxygen miss-parsed Eigen/* headersGravatar Gael Guennebaud2015-12-30
|
* Add missing snippets for erf/erfc/lgamma functions.Gravatar Gael Guennebaud2015-12-30
|
* Add missing snippet for sign/cwiseSign functions.Gravatar Gael Guennebaud2015-12-30
|
* Add missing ctor from uintGravatar Gael Guennebaud2015-12-30
|
* bug #1141: add missing initialization of CholmodBase::m_*IsOkGravatar Gael Guennebaud2015-12-29
|
* Workaround compilers that do not even define _mm256_set_m128.Gravatar Gael Guennebaud2015-12-24
|
* Don't attempt to vectorize mean reductions of integers since we can't useGravatar Benoit Steiner2015-12-22
| | | | SSE or AVX instructions to divide 2 integers.
* Optimized the configuration of the outer reduction cuda kernelGravatar Benoit Steiner2015-12-22
|
* Added missing defineGravatar Benoit Steiner2015-12-22
|
* Made sure the optimized gpu reduction code is actually compiled.Gravatar Benoit Steiner2015-12-22
|
* Optimized outer reduction on GPUs.Gravatar Benoit Steiner2015-12-22
|
* Made it possible to run the lgamma, erf, and erfc functors on a CUDA gpu.Gravatar Benoit Steiner2015-12-21
|
* Added missing constGravatar Benoit Steiner2015-12-21
|
* Merged in connor-k/eigen (pull request PR-149)Gravatar Benoit Steiner2015-12-21
|\ | | | | | | [doc] Remove extra ';' in Advanced Initialization sample
* | Fixed a typo in previous change.Gravatar Benoit Steiner2015-12-21
| |
* | Added support for CUDA architectures that don's support for 3.5 capabilitiesGravatar Benoit Steiner2015-12-21
| |
| * [doc] Remove extra ';' in Tutorial_AdvancedInitialization_Join.cppGravatar connor-k2015-12-21
|/
* Fixed a typo.Gravatar Benoit Steiner2015-12-18
|
* Add alignment requirement for local buffer used by the slicing op.Gravatar Benoit Steiner2015-12-18
|