| Commit message (Collapse) | Author | Age |
|
|
|
| |
manageable.
|
| |
|
|
|
|
| |
the evaluation of an expression.
|
| |
|
| |
|
|
|
|
| |
reduction kernel.
|
| |
|
| |
|
| |
|
|
|
|
| |
by a factor of 3 or more. This helps speed up LSTM neural networks.
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
|\
| |
| |
| | |
Alternative way of forcing instantiation of device kernels without causing warnings or requiring device-to-device kernel invocations.
|
| | |
|
| | |
|
| | |
|
|/
|
|
|
|
| |
causing warnings or requiring device-to-device kernel invocations.
This allows TensorFlow to work on SM 3.0 (i.e., Amazon EC2) machines.
|
| |
|
| |
|
|
|
|
| |
nvcc bug that prevented the code from compiling in optimized mode in some cases
|
|
|
|
| |
reintroduces some compilation warnings but it's much better than having to deal with random assertion failures.
|
|
|
|
|
|
|
|
| |
errors such as this:
error: more than one partial specialization matches the template argument list of class "Eigen::internal::get<3, Eigen::internal::numeric_list<std::size_t, 1UL, 1UL, 1UL, 1UL>>"
"Eigen::internal::get<n, Eigen::internal::numeric_list<T, a, as...>>"
"Eigen::internal::get<n, Eigen::internal::numeric_list<T, as...>>"
|
| |
|
| |
|
| |
|
| |
|
|
|
|
| |
warnings
|
| |
|
|
|
|
| |
SSE or AVX instructions to divide two integers.
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
|
|
|
| |
(fix ambiguous template instantiation)
|
| |
|
|
|
|
| |
to call them from a CUDA kernel.
|
|\
| |
| |
| | |
Add special functions to Eigen: lgamma, erf, erfc.
|
| | |
|