Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Manually define int16_t and uint16_t when compiling with Visual Studio | 2016-10-08 | |
| | |||
* | Reenabled the use of variadic templates on tegra x1 provides that the latest ↵ | 2016-10-08 | |
| | | | | version (i.e. JetPack 2.3) is used. | ||
* | Cleaned up a regression test | 2016-10-08 | |
| | |||
* | Remove static qualifier of free-functions (inline is enough and this helps ↵ | 2016-10-07 | |
| | | | | ICC to find the right overload) | ||
* | Merged in rryan/eigen/tensorfunctors (pull request PR-233) | 2016-10-06 | |
|\ | | | | | | | Fully support complex types in SumReducer and MeanReducer when building for CUDA by using scalar_sum_op and scalar_product_op instead of operator+ and operator*. | ||
| * | Add a test that GPU complex product reductions match CPU reductions. | 2016-10-06 | |
| | | |||
| * | Fully support complex types in SumReducer and MeanReducer when building for ↵ | 2016-10-06 | |
| | | | | | | | | CUDA by using scalar_sum_op and scalar_product_op instead of operator+ and operator*. | ||
* | | Added missing AVX intrinsics for fp16: in particular, implemented predux ↵ | 2016-10-06 | |
|/ | | | | which is required by the matrix-vector code. | ||
* | Fix compilation of qr.inverse() for column and full pivoting variants. | 2016-10-06 | |
| | |||
* | Increased the robustness of the reduction tests on fp16 | 2016-10-05 | |
| | |||
* | Increase the tolerance to numerical noise. | 2016-10-05 | |
| | |||
* | ::rand() returns a signed integer on win32 | 2016-10-05 | |
| | |||
* | Fixed a typo that impacts windows builds | 2016-10-05 | |
| | |||
* | Silenced compilation warning | 2016-10-04 | |
| | |||
* | Properly characterize the CUDA packet primitives for fp16 as device only | 2016-10-04 | |
| | |||
* | Cleanup the cuda executor code. | 2016-10-04 | |
| | |||
* | Cleaned up the random number generation code. | 2016-10-04 | |
| | |||
* | Use explicit type casting to generate packets of zeros. | 2016-10-04 | |
| | |||
* | Improved support for compiling CUDA code with clang as the host compiler | 2016-10-03 | |
| | |||
* | Added support for constand std::complex numbers on GPU | 2016-10-03 | |
| | |||
* | bug #1317: fix performance regression with some Block expressions and clang ↵ | 2016-10-01 | |
| | | | | | | by helping it to remove dead code. The trick is to get rid of the nested expression in the evaluator by copying only the required information (here, the strides). | ||
* | bug #1310: workaround a compilation regression from 3.2 regarding triangular ↵ | 2016-09-30 | |
| | | | | * homogeneous | ||
* | Fix angle range | 2016-09-30 | |
| | |||
* | Remove std:: prefix | 2016-09-30 | |
| | |||
* | bug #1312: Quaternion to AxisAngle conversion now ensures the angle will be ↵ | 2016-09-29 | |
| | | | | in the range [-pi,pi]. This also increases accuracy when q.w is negative. | ||
* | bug #1308: fix compilation of some small products involving nullary-expressions. | 2016-09-29 | |
| | |||
* | Updated the list of warnings to reflect the new message ids introduced in ↵ | 2016-09-28 | |
| | | | | cuda 8.0 | ||
* | Updated the tensor sum and mean reducer to enable them to process complex ↵ | 2016-09-28 | |
| | | | | numbers on cuda gpus. | ||
* | Merged in sergiu/eigen (pull request PR-229) | 2016-09-27 | |
|\ | | | | | | | Disabled MSVC level 4 warning C4714 | ||
* | | Add debug info. | 2016-09-26 | |
| | | |||
* | | bug #1311: fix alignment logic in some cases of ↵ | 2016-09-26 | |
| | | | | | | | | (scalar*small).lazyProduct(small) | ||
* | | Made the initialization of a CUDA device thread safe. | 2016-09-26 | |
| | | |||
* | | bug #1308: fix compilation of vector * rowvector::nullary. | 2016-09-25 | |
| | | |||
| * | disabled MSVC level 4 warning C4714 | 2016-09-25 | |
| | | | | | | | | | | The level 4 warning (/W4) warns about functions marked as __forceinline not inlined, and generates a lot of noise. | ||
* | | bug #1304: fix Projective * scaling and Projective *= scaling | 2016-09-23 | |
| | | |||
* | | Add missing file. | 2016-09-23 | |
| | | |||
* | | Made the gaussian generator usable on GPU | 2016-09-22 | |
| | | |||
* | | Added a specialization of Eigen::numext::real and Eigen::numext::imag for ↵ | 2016-09-22 | |
| | | | | | | | | std::complex<T> to be used when compiling a cuda kernel. This is unfortunately necessary to be able to process complex numbers from a CUDA kernel on MacOS. | ||
* | | Added tag 3.3-rc1 for changeset 77e27fbeee7acb289d7df809fc09a8cc8ee94eb7 | 2016-09-22 | |
|/ | |||
* | bump to 3.3-rc1 | 2016-09-22 | |
| | |||
* | merge | 2016-09-22 | |
|\ | |||
* \ | merge | 2016-09-22 | |
|\ \ | |||
* | | | Backout changeset ce3557ca69742af477546d031d644a6dab1ff614 | 2016-09-22 | |
| | | | |||
| * | | Add a nullary-functor example performing index-based sub-matrices. | 2016-09-22 | |
| | | | |||
| | * | Calls x.imag() instead of imag(x) when x is a complex number since the former | 2016-09-22 | |
| | | | | | | | | | | | | is a constexpr while the later isn't. This fixes compilation errors triggered by nvcc on Mac. | ||
| * | | Bypass identity reflectors. | 2016-09-22 | |
| |/ | |||
| * | fix documentation of LinSpaced | 2016-09-22 | |
| | | | | | | | | The index of the highest value in a LinSpace is size-1. | ||
| * | Add a note regarding gcc bug #72867 | 2016-09-22 | |
| | | |||
| * | Do not manually add absolute path to boost-library. | 2016-09-22 | |
| | | | | | | | | Also set C++ standard for blaze to C++14 | ||
| * | fix typo | 2016-09-21 | |
| | |