aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported
Commit message (Collapse)AuthorAge
* Fix shorten-64-to-32 warning in TensorContractionThreadPoolGravatar Eugene Zhulenev2019-01-11
|
* Fix shorten-64-to-32 warning in TensorContractionThreadPoolGravatar Eugene Zhulenev2019-01-10
|
* bug #1654: fix compilation with cuda and no c++11Gravatar Gael Guennebaud2019-01-09
|
* Optimize evalShardedByInnerDimGravatar Eugene Zhulenev2019-01-08
|
* Fix shorten-64-to-32 warning. Use regular memcpy if num_threads==0.Gravatar Rasmus Munk Larsen2018-12-12
|
* Remove debug code.Gravatar Gael Guennebaud2018-12-09
|
* Various fixes in polynomial solver and its unit tests:Gravatar Gael Guennebaud2018-12-09
| | | | | | - cleanup noise in imaginary part of real roots - take into account the magnitude of the derivative to check roots. - use <= instead of < at appropriate places
* Merged in markdryan/eigen/avx512-contraction-2 (pull request PR-554)Gravatar Rasmus Munk Larsen2018-12-05
|\ | | | | | | | | | | Fix tensor contraction on AVX512 builds Approved-by: Rasmus Munk Larsen <rmlarsen@google.com>
| * Fix evalShardedByInnerDim for AVX512 buildsGravatar Mark D Ryan2018-12-05
| | | | | | | | | | | | | | | | | | | | | | | | evalShardedByInnerDim ensures that the values it passes for start_k and end_k to evalGemmPartialWithoutOutputKernel are multiples of 8 as the kernel does not work correctly when the values of k are not multiples of the packet_size. While this precaution works for AVX builds, it is insufficient for AVX512 builds where the maximum packet size is 16. The result is slightly incorrect float32 contractions on AVX512 builds. This commit fixes the problem by ensuring that k is always a multiple of the packet_size if the packet_size is > 8.
* | Fixed most conversion warnings in MatrixFunctions moduleGravatar Christoph Hertzberg2018-11-20
| |
* | Merged in rmlarsen/eigen2 (pull request PR-543)Gravatar Rasmus Munk Larsen2018-11-13
|\ \ | | | | | | | | | | | | | | | Add parallel memcpy to TensorThreadPoolDevice in Eigen, but limit the number of threads to 4, beyond which we just seem to be wasting CPU cycles as the threads contend for memory bandwidth. Approved-by: Eugene Zhulenev <ezhulenev@google.com>
| * | Remove accidental changes.Gravatar Rasmus Munk Larsen2018-11-12
| | |
| * | Add parallel memcpy to TensorThreadPoolDevice in Eigen, but limit the number ↵Gravatar Rasmus Munk Larsen2018-11-12
| | | | | | | | | | | | of threads to 4, beyond which we just seem to be wasting CPU cycles as the threads contend for memory bandwidth.
* | | [PATCH 1/2] Misc. typosGravatar luz.paz"2018-09-18
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | From 68d431b4c14ad60a778ee93c1f59ecc4b931950e Mon Sep 17 00:00:00 2001 Found via `codespell -q 3 -I ../eigen-word-whitelist.txt` where the whitelists consists of: ``` als ans cas dum lastr lowd nd overfl pres preverse substraction te uint whch ``` --- CMakeLists.txt | 26 +++++++++---------- Eigen/src/Core/GenericPacketMath.h | 2 +- Eigen/src/SparseLU/SparseLU.h | 2 +- bench/bench_norm.cpp | 2 +- doc/HiPerformance.dox | 2 +- doc/QuickStartGuide.dox | 2 +- .../Eigen/CXX11/src/Tensor/TensorChipping.h | 6 ++--- .../Eigen/CXX11/src/Tensor/TensorDeviceGpu.h | 2 +- .../src/Tensor/TensorForwardDeclarations.h | 4 +-- .../src/Tensor/TensorGpuHipCudaDefines.h | 2 +- .../Eigen/CXX11/src/Tensor/TensorReduction.h | 2 +- .../CXX11/src/Tensor/TensorReductionGpu.h | 2 +- .../test/cxx11_tensor_concatenation.cpp | 2 +- unsupported/test/cxx11_tensor_executor.cpp | 2 +- 14 files changed, 29 insertions(+), 29 deletions(-)
| * Fix tensor contraction for AVX512 machinesGravatar Mark D Ryan2018-07-31
|/ | | | | | | | | This patch modifies the TensorContraction class to ensure that the kc_ field is always a multiple of the packet_size, if the packet_size is > 8. Without this change spatial convolutions in Tensorflow do not work properly as the code that re-arranges the input matrices can assert if kc_ is not a multiple of the packet_size. This leads to a unit test failure, //tensorflow/python/kernel_tests:conv_ops_test, on AVX512 builds of tensorflow.
* A few small fixes to a) prevent throwing in ctors and dtors of the threading ↵Gravatar Rasmus Munk Larsen2018-11-09
| | | | code, and b) supporting matrix exponential on platforms with 113 bits of mantissa for long doubles.
* Merged in ezhulenev/eigen-02 (pull request PR-534)Gravatar Rasmus Munk Larsen2018-10-25
|\ | | | | | | Fix cxx11_tensor_{block_access, reduction} tests
| * Fix cxx11_tensor_{block_access, reduction} testsGravatar Eugene Zhulenev2018-10-25
| |
* | Fix most Doxygen warnings. Also add links to stable documentation from ↵Gravatar Christoph Hertzberg2018-10-19
| | | | | | | | | | | | | | unsupported modules (by using the corresponding Doxytags file). Manually grafted from d107a371c61b764c73fd1570b1f3ed1c6400dd7e
* | bug #1606: Explicitly set the standard before ↵Gravatar Christoph Hertzberg2018-10-19
| | | | | | | | | | | | | | find_package(StandardMathLibrary). Also replace EIGEN_COMPILER_SUPPORT_CXX11 in favor of EIGEN_COMPILER_SUPPORT_CPP11. Grafted manually from a4afa90d161faab385a77f0e2764fb13ff3b9484
* | Fix GPU build due to gpu_assert not always being defined.Gravatar Rasmus Munk Larsen2018-10-18
|/
* Move from rvalue arguments in ThreadPool enqueue* methodsGravatar Eugene Zhulenev2018-10-16
|
* Reduce thread scheduling overhead in parallelForGravatar Eugene Zhulenev2018-10-16
|
* Merged in ezhulenev/eigen-02 (pull request PR-528)Gravatar Rasmus Munk Larsen2018-10-16
|\ | | | | | | | | | | [TensorBlockIO] Check if it's allowed to squeeze inner dimensions Approved-by: Rasmus Munk Larsen <rmlarsen@google.com>
| * Check if it's allowed to squueze inner dimensions in TensorBlockIOGravatar Eugene Zhulenev2018-10-15
| |
* | Iterative solvers: unify and fix handling of multiple rhs.Gravatar Gael Guennebaud2018-10-15
| | | | | | | | m_info was not properly computed and the logic was repeated in several places.
* | DGMRES: fix null rhs, fix restart, fix m_isDeflInitialized for multiple solveGravatar Gael Guennebaud2018-10-15
|/
* relax number of iterations checks to avoid false negativesGravatar Gael Guennebaud2018-10-15
|
* Make sparse_basic includable from sparse_extra, but disable it since ↵Gravatar Gael Guennebaud2018-10-11
| | | | sparse_basic(DynamicSparseMatrix) does not compile at all anyways
* Fix a lot of Doxygen warnings in Tensor moduleGravatar Christoph Hertzberg2018-10-09
|
* fix mpreal for mpfr<4.0.0Gravatar Gael Guennebaud2018-10-09
|
* Fix out-of bounds access in TensorArgMax.h.Gravatar Rasmus Munk Larsen2018-10-08
|
* Fix contraction test.Gravatar Rasmus Munk Larsen2018-10-08
|
* typoGravatar Gael Guennebaud2018-10-08
|
* fix warning in mpreal.hGravatar Gael Guennebaud2018-10-08
|
* Update included mpreal header to 3.6.5 and fix deprecated warnings.Gravatar Gael Guennebaud2018-10-08
|
* Workaround stupid warningGravatar Gael Guennebaud2018-10-08
|
* Fix shadow warningGravatar Christoph Hertzberg2018-10-02
|
* Move struct outside of method for C++03 compatibility.Gravatar Christoph Hertzberg2018-10-02
|
* Make code compile in C++03 mode againGravatar Christoph Hertzberg2018-10-02
|
* Fix conversion warning ... againGravatar Christoph Hertzberg2018-10-02
|
* Merged in deven-amd/eigen/HIP_fixes (pull request PR-518)Gravatar Christoph Hertzberg2018-10-01
|\ | | | | | | PR with HIP specific fixes (for the eigen nightly regression failures in HIP mode)
| * This commit contains the following (HIP specific) updates:Gravatar Deven Desai2018-10-01
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - unsupported/Eigen/CXX11/src/Tensor/TensorReductionGpu.h Changing "pass-by-reference" argument to be "pass-by-value" instead (in a __global__ function decl). "pass-by-reference" arguments to __global__ functions are unwise, and will be explicitly flagged as errors by the newer versions of HIP. - Eigen/src/Core/util/Memory.h - unsupported/Eigen/CXX11/src/Tensor/TensorContraction.h Changes introduced in recent commits breaks the HIP compile. Adding EIGEN_DEVICE_FUNC attribute to some functions and calling ::malloc/free instead of the corresponding std:: versions to get the HIP compile working again - unsupported/Eigen/CXX11/src/Tensor/TensorReduction.h Change introduced a recent commit breaks the HIP compile (link stage errors out due to failure to inline a function). Disabling the recently introduced code (only for HIP compile), to get the eigen nightly testing going again. Will submit another PR once we have te proper fix. - Eigen/src/Core/util/ConfigureVectorization.h Enabling GPU VECTOR support when HIP compiler is in use (for both the host and device compile phases)
* | Merged eigen/eigen into defaultGravatar Rasmus Munk Larsen2018-09-28
|\ \
* | | Get rid of unused variable warning.Gravatar Rasmus Munk Larsen2018-09-28
| | |
| * | Fix bug in copy optimization in Tensor slicing.Gravatar Eugene Zhulenev2018-09-28
|/ /
* | Fix a few warnings and rename a variable to not shadow "last".Gravatar Rasmus Munk Larsen2018-09-28
| |
* | Merged in ezhulenev/eigen-01 (pull request PR-514)Gravatar Rasmus Munk Larsen2018-09-28
|\ \ | | | | | | | | | Add tests for evalShardedByInnerDim contraction + fix bugs
| * | Add tests for evalShardedByInnerDim contraction + fix bugsGravatar Eugene Zhulenev2018-09-28
| |/
* | Fix integer conversion warningsGravatar Christoph Hertzberg2018-09-28
| |