Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Added an option to enable the use of the F16C instruction set | 2016-04-21 | |
| | |||
* | Use EIGEN_THREAD_YIELD instead of std::this_thread::yield to make the code ↵ | 2016-04-21 | |
| | | | | more portable. | ||
* | Don't crash when attempting to reduce empty tensors. | 2016-04-20 | |
| | |||
* | Added more tests | 2016-04-20 | |
| | |||
* | Don't attempt to leverage the _cvtss_sh and _cvtsh_ss instructions when ↵ | 2016-04-20 | |
| | | | | compiling with clang since it's unclear which versions of clang actually support these instruction. | ||
* | Started to implement a portable way to yield. | 2016-04-19 | |
| | |||
* | Made sure all the required header files are included when trying to use fp16 | 2016-04-19 | |
| | |||
* | Implemented a more portable version of thread local variables | 2016-04-19 | |
| | |||
* | Fixed a few typos | 2016-04-19 | |
| | |||
* | Fixed a compilation error with nvcc 7. | 2016-04-19 | |
| | |||
* | Simplified the code that launches cuda kernels. | 2016-04-19 | |
| | |||
* | Don't take the address of a kernel on CUDA devices that don't support this ↵ | 2016-04-19 | |
| | | | | feature. | ||
* | Use numext::ceil instead of std::ceil | 2016-04-19 | |
| | |||
* | Avoid an unnecessary copy of the evaluator. | 2016-04-19 | |
| | |||
* | Fixed 2 recent regression tests | 2016-04-19 | |
| | |||
* | Use DenseIndex in the MeanReducer to avoid overflows when processing very ↵ | 2016-04-19 | |
| | | | | large tensors. | ||
* | Worked around the lack of a rand_r function on windows systems | 2016-04-17 | |
| | |||
* | Worked around the lack of a rand_r function on windows systems | 2016-04-17 | |
| | |||
* | Enable lazy-coeff-based-product for vector*(1x1) products | 2016-04-16 | |
| | |||
* | Move the evalGemm method into the TensorContractionEvaluatorBase class to ↵ | 2016-04-15 | |
| | | | | make it accessible from both the single and multithreaded contraction evaluators. | ||
* | Deleted extraneous comma. | 2016-04-15 | |
| | |||
* | Deleted unnecessary variable | 2016-04-15 | |
| | |||
* | Fixed a few compilation warnings | 2016-04-15 | |
| | |||
* | Merged in rmlarsen/eigen (pull request PR-178) | 2016-04-15 | |
|\ | | | | | | | Eigen Tensor cost model part 2: Thread scheduling for standard evaluators and reductions. | ||
* | | bug #1203: by-pass large stack-allocation in stableNorm if ↵ | 2016-04-15 | |
| | | | | | | | | EIGEN_STACK_ALLOCATION_LIMIT is too small | ||
| * | Get rid of void* casting when calling EvalRange::run. | 2016-04-15 | |
| | | |||
* | | Fixed compilation errors with msvc | 2016-04-15 | |
| | | |||
* | | Improved the matrix multiplication blocking in the case where mr is not a ↵ | 2016-04-15 | |
| | | | | | | | | power of 2 (e.g on Haswell CPUs). | ||
* | | Fix trmv for mixing types. | 2016-04-15 | |
| | | |||
* | | Added ability to access the cache sizes from the tensor devices | 2016-04-14 | |
| | | |||
* | | Added support for exclusive or | 2016-04-14 | |
| | | |||
| * | Eigen Tensor cost model part 2: Thread scheduling for standard evaluators ↵ | 2016-04-14 | |
| | | | | | | | | and reductions. The cost model is turned off by default. | ||
* | | Added missing definition of PacketSize in the gpu evaluator of convolution | 2016-04-14 | |
| | | |||
* | | Merged in rmlarsen/eigen (pull request PR-177) | 2016-04-14 | |
|\| | | | | | | | Eigen Tensor cost model part 1. | ||
* | | Enabled the new threadpool tests | 2016-04-14 | |
| | | |||
* | | Cleanup | 2016-04-14 | |
| | | |||
* | | Prepared the migration to the new non blocking thread pool | 2016-04-14 | |
| | | |||
| * | Improvements to cost model. | 2016-04-14 | |
| | | |||
* | | Merged latest updates from trunk | 2016-04-14 | |
|\ \ | |||
* | | | Added tests for the non blocking thread pool | 2016-04-14 | |
| | | | |||
* | | | Added a more scalable non blocking thread pool | 2016-04-14 | |
| | | | |||
| | * | Merge upstream updates. | 2016-04-14 | |
| | |\ | | |/ | |/| | |||
| | * | Eigen cost model part 1. This implements a basic recursive framework to ↵ | 2016-04-14 | |
| | | | | | | | | | | | | estimate the cost of evaluating tensor expressions. | ||
| * | | Add extreme values to the imaginary part for SVD unit tests. | 2016-04-14 | |
| | | | |||
| * | | Improve numerical robustness of JacoviSVD: | 2016-04-14 | |
|/ / | | | | | | | | | - avoid noise amplification in complex to real conversion - compare off-diagonal entries to the current biggest diagonal entry: no need to bother about a 2x2 block containing ridiculously small entries compared to the rest of the matrix. | ||
* | | Force the inlining of the << operator on half floats | 2016-04-14 | |
| | | |||
* | | Inline the << operator on half floats | 2016-04-14 | |
| | | |||
* | | Silenced a compilation warning | 2016-04-14 | |
| | | |||
* | | Added tests to validate flooring and ceiling of fp16 | 2016-04-14 | |
| | | |||
* | | Added simple test for numext::sqrt and numext::pow on fp16 | 2016-04-14 | |
| | |