aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorContractionThreadPool.h
diff options
context:
space:
mode:
authorGravatar Mark D Ryan <mark.d.ryan@intel.com>2018-07-31 09:33:37 +0100
committerGravatar Mark D Ryan <mark.d.ryan@intel.com>2018-07-31 09:33:37 +0100
commit6f5b126e6d23f1339d15b26fe87916132397d619 (patch)
tree0dc235c360eb5e5c7576db2f5de37a8201a9dcd6 /unsupported/Eigen/CXX11/src/Tensor/TensorContractionThreadPool.h
parent77b447c24e3344e43ff64eb932d4bb35a2db01ce (diff)
Fix tensor contraction for AVX512 machines
This patch modifies the TensorContraction class to ensure that the kc_ field is always a multiple of the packet_size, if the packet_size is > 8. Without this change spatial convolutions in Tensorflow do not work properly as the code that re-arranges the input matrices can assert if kc_ is not a multiple of the packet_size. This leads to a unit test failure, //tensorflow/python/kernel_tests:conv_ops_test, on AVX512 builds of tensorflow.
Diffstat (limited to 'unsupported/Eigen/CXX11/src/Tensor/TensorContractionThreadPool.h')
0 files changed, 0 insertions, 0 deletions