Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Roll back changes to core. Move include of TensorFunctors.h up to satisfy ↵ | 2016-05-17 | |
| | | | | dependence in TensorCostModel.h. | ||
* | Improvements to parallelFor. | 2016-05-12 | |
| | | | | Move some scalar functors from TensorFunctors. to Eigen core. | ||
* | Fix missing inclusion of Eigen/Core | 2016-04-27 | |
| | |||
* | Refactor the unsupported CXX11/Core module to internal headers only. | 2016-04-26 | |
| | |||
* | Merged in rmlarsen/eigen (pull request PR-177) | 2016-04-14 | |
|\ | | | | | | | Eigen Tensor cost model part 1. | ||
* | | Prepared the migration to the new non blocking thread pool | 2016-04-14 | |
| | | |||
| * | Eigen cost model part 1. This implements a basic recursive framework to ↵ | 2016-04-14 | |
|/ | | | | estimate the cost of evaluating tensor expressions. | ||
* | Use a single Barrier instead of a collection of Notifications to reduce the ↵ | 2016-03-22 | |
| | | | | thread synchronization overhead | ||
* | Made sure to include the <random> header file when compiling with visual studio | 2016-03-09 | |
| | |||
* | include <iostream> in the tensor header since we now use it to better report ↵ | 2016-02-22 | |
| | | | | cuda initialization errors | ||
* | Created a mechanism to enable contraction mappers to determine the best ↵ | 2016-01-22 | |
| | | | | blocking strategy. | ||
* | Moved the contraction mapping code to its own file to make the code more ↵ | 2016-01-19 | |
| | | | | manageable. | ||
* | Fixed some compilation error triggered by the tensor code with msvc 2008 | 2015-12-16 | |
| | |||
* | Made it possible to refer t oa GPUDevice from code compile with a regular ↵ | 2015-11-23 | |
| | | | | C++ compiler | ||
* | Split TensorDeviceType.h in 3 files to make it more manageable | 2015-11-20 | |
| | |||
* | Added support for 128bit integers on CUDA devices. | 2015-11-19 | |
| | |||
* | Code cleanup | 2015-11-06 | |
| | |||
* | Added missing glue logic | 2015-10-22 | |
| | |||
* | Tightened the definition of isOfNormalIndex to take into account integer ↵ | 2015-10-14 | |
| | | | | | | | types in addition to arrays of indices Only compile the custom index code when EIGEN_HAS_SFINAE is defined. For the time beeing, EIGEN_HAS_SFINAE is a synonym for EIGEN_HAS_VARIADIC_TEMPLATES, but this might evolve in the future. Moved some code around. | ||
* | added CustomIndex capability only to Tensor and not yet to TensorBase. | 2015-10-09 | |
| | | | | | | | | | | | | | | using Sfinae and is_base_of to select correct template which converts to array<Index,NumIndices> user: Gabriel Nützi <gnuetzi@gmx.ch> branch 'default' added unsupported/Eigen/CXX11/src/Tensor/TensorMetaMacros.h added unsupported/test/cxx11_tensor_customIndex.cpp changed unsupported/Eigen/CXX11/Tensor changed unsupported/Eigen/CXX11/src/Tensor/Tensor.h changed unsupported/Eigen/CXX11/src/Tensor/TensorMeta.h changed unsupported/test/CMakeLists.txt | ||
* | Added support for argmax/argmin | 2015-08-31 | |
| | |||
* | Use standard include syntax in Tensor module (<> for include-path and "" for ↵ | 2015-08-18 | |
| | | | | relative path) | ||
* | Added support for tensor inflation. | 2015-07-16 | |
| | |||
* | Added support for user defined custom tensor op. | 2015-06-30 | |
| | |||
* | Added support for 3D patch extraction | 2015-06-30 | |
| | |||
* | Moved some utilities to TensorMeta.h to make it easier to reuse them accross ↵ | 2015-06-29 | |
| | | | | | | several tensor operations. Created the TensorDimensionList class to encode the list of all the dimensions of a tensor of rank n. This could be done using TensorIndexList, however TensorIndexList require cxx11 which isn't yet supported as widely as we'd like. | ||
* | Added missing include files | 2015-05-28 | |
| | |||
* | Moved away from std::async and std::future as the underlying mechnism for ↵ | 2015-05-20 | |
| | | | | | | the thread pool device. On several platforms, the functions passed to std::async are not scheduled in the order in which they are given to std::async, which leads to massive performance issues in the contraction code. Instead we now have a custom thread pool that ensures that the functions are picked up by the threads in the pool in the order in which they are enqueued in the pool. | ||
* | Added the ability to generate a tensor from a custom user defined ↵ | 2015-04-22 | |
| | | | | | | 'generator'. This simplifies the creation of constant tensors initialized using specific regular patterns. Created a gaussian window generator as a first use case. | ||
* | Pulled latest changes from trunk | 2015-04-14 | |
|\ | |||
| * | Fixed the order of 2 #includes | 2015-04-06 | |
| | | |||
* | | Pulled latest updates from trunk | 2015-04-01 | |
|\| | |||
| * | Cleaned up the TensorDevice code a little bit. | 2015-03-25 | |
| | | |||
* | | Added support for vectorized type casting of tensors | 2015-02-27 | |
|/ | |||
* | Updated the list of include files | 2015-01-14 | |
| | |||
* | Added support for extraction of patches from images | 2014-11-13 | |
| | |||
* | Added support for static list of indices | 2014-11-12 | |
| | |||
* | Added support for tensor references | 2014-10-28 | |
| | |||
* | Made the blocking computation aware of the l3 cache | 2014-10-15 | |
| | | | | Also optimized the blocking parameters to take into account the number of threads used for a computation | ||
* | Added support for promises | 2014-10-15 | |
| | | | | Started to improve multithreaded contractions | ||
* | Misc improvements and cleanups | 2014-10-13 | |
| | |||
* | Added support for patch extraction | 2014-10-13 | |
| | |||
* | Added ability to print a tensor using an iostream. | 2014-10-10 | |
| | |||
* | Added support for tensor chips | 2014-10-10 | |
| | |||
* | Improved support for CUDA devices. | 2014-10-03 | |
| | | | | Improved contractions on GPU | ||
* | Added support for tensor reductions and concatenations | 2014-10-01 | |
| | |||
* | Added support for broadcasting | 2014-08-20 | |
| | |||
* | Added support for fast integer divisions by a constant | 2014-08-14 | |
| | | | | Sped up tensor slicing by a factor of 3 by using these fast integer divisions. | ||
* | Added support for padding, stridding, and shuffling | 2014-08-14 | |
| | |||
* | Reworked the expression evaluation mechanism in order to make it possible to ↵ | 2014-06-13 | |
| | | | | | | | | efficiently compute convolutions and contractions in the future: * The scheduling of computation is moved out the the assignment code and into a new TensorExecutor class * The assignment itself is now a regular node on the expression tree * The expression evaluators start by recursively evaluating all their subexpressions if needed |