Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Improved AVX512 configuration | Benoit Steiner | 2016-11-03 |
| | |||
* | Improved AVX512 support | Benoit Steiner | 2016-11-03 |
| | |||
* | Merged eigen/eigen into default | Benoit Steiner | 2016-11-03 |
|\ | |||
| * | bug #1337: improve doc of homogeneous() and hnormalized() | Gael Guennebaud | 2016-11-03 |
| | | |||
| * | bug #1330: Cholmod supports double precision only, so let's trigger a static ↵ | Gael Guennebaud | 2016-11-03 |
| | | | | | | | | assertion if the scalar type does not match this requirement. | ||
| * | Merged in benoitsteiner/opencl (pull request PR-244) | Benoit Steiner | 2016-11-02 |
| |\ | | | | | | | | | | Disable vectorization on device only when compiling for sycl | ||
| | * | Disable vectorization on device only when compiling for sycl | Benoit Steiner | 2016-11-02 |
| | | | |||
| * | | bug #1004: improve accuracy of LinSpaced for abs(low) >> abs(high). | Gael Guennebaud | 2016-11-02 |
| | | | |||
| * | | Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX. | Gael Guennebaud | 2016-11-02 |
| |/ | |||
| * | Gate the code that refers to cuda fp16 primitives more thoroughly | Benoit Steiner | 2016-11-01 |
| | | |||
| * | Fix regression in X = (X*X.transpose())/s with X rectangular by deferring ↵ | Gael Guennebaud | 2016-10-26 |
| | | | | | | | | resizing of the destination after the creation of the evaluator of the source expression. | ||
| * | add a generic EIGEN_HAS_CXX11 | Gael Guennebaud | 2016-10-26 |
| | | |||
| * | Fix warning with ICC | Gael Guennebaud | 2016-10-26 |
| | | |||
| * | Fix ICC warnings | Gael Guennebaud | 2016-10-25 |
| | | |||
| * | Add missing inline keywords | Gael Guennebaud | 2016-10-25 |
| | | |||
| * | Fixed a typo | Benoit Steiner | 2016-10-25 |
| | | |||
| * | bug #1004: one more rewrite of LinSpaced for floating point numbers to ↵ | Gael Guennebaud | 2016-10-25 |
| | | | | | | | | | | | | | | | | guarantee both interpolation and monotonicity. This version simply does low+i*step plus a branch to return high if i==size-1. Vectorization is accomplished with a branch and the help of pinsertlast. Some quick benchmark revealed that the overhead is really marginal, even when filling small vectors. | ||
| * | Add a pinsertlast function replacing the last entry of a packet by a scalar. | Gael Guennebaud | 2016-10-25 |
| | | | | | | | | (useful to vectorize LinSpaced) | ||
| * | bug #1333: fix bad usage of const_cast_derived. Better use .data() for that ↵ | Gael Guennebaud | 2016-10-24 |
| | | | | | | | | purpose. | ||
| * | Cleanup ArrayWrapper and MatrixWrapper by removing redundant accessors. | Gael Guennebaud | 2016-10-24 |
| | | |||
| * | bug #1004: remove the inaccurate "sequential" path for LinSpaced, mark ↵ | Gael Guennebaud | 2016-10-24 |
| | | | | | | | | | | | | respective function as deprecated, and enforce strict interpolation of the higher range using a correction term. Now, even with floating point precision, both the 'low' and 'high' bounds are exactly reproduced at i=0 and i=size-1 respectively. | ||
| * | Merged in benoitsteiner/opencl (pull request PR-238) | Benoit Steiner | 2016-10-24 |
| |\ | | | | | | | | | | Added support for OpenCL to the Tensor Module | ||
| * | | bug #698: rewrite LinSpaced for integer scalar types to avoid overflow and ↵ | Gael Guennebaud | 2016-10-24 |
| | | | | | | | | | | | | | | | | | | | | | guarantee an even spacing when possible. Otherwise, the "high" bound is implicitly lowered to the largest value allowing for an even distribution. This changeset also disable vectorization for this integer path. | ||
| * | | bug #1328: workaround a compilation issue with gcc 4.2 | Gael Guennebaud | 2016-10-20 |
| | | | |||
| | * | Merge latest updates from trunk | Benoit Steiner | 2016-10-20 |
| | |\ | | |/ | |/| | |||
| * | | Fixed a few typos in the ternary tensor expressions types | Benoit Steiner | 2016-10-19 |
| | | | |||
| | * | Fixing the typo regarding missing #if needed for proper handling of ↵ | Mehdi Goli | 2016-10-16 |
| | | | | | | | | | | | | exceptions in Eigen/Core. | ||
| | * | Merged ComputeCpp to default. | Luke Iwanski | 2016-10-14 |
| | |\ | |||
| | | * | Applyiing Benoit's comment to return the missing line back in Eigen/Core | Mehdi Goli | 2016-10-14 |
| | | | | |||
| * | | | Fix previous merge. | Gael Guennebaud | 2016-10-14 |
| | | | | |||
| * | | | Merged in rmlarsen/eigen2 (pull request PR-232) | Gael Guennebaud | 2016-10-14 |
| |\ \ \ | | | | | | | | | | | | | | | | Improve performance of parallelized matrix multiply for rectangular matrices | ||
| | | * | | Merged ComputeCpp into default. | Luke Iwanski | 2016-10-14 |
| | | |\| | |||
| | | | * | Reducing the code by generalising sycl backend functions/structs. | Mehdi Goli | 2016-10-14 |
| | | | | | |||
| * | | | | Merged in lukier/eigen (pull request PR-234) | Benoit Steiner | 2016-10-13 |
| |\ \ \ \ | | | | | | | | | | | | | | | | | | | Enabling CUDA in Geometry | ||
| | * | | | | Fixes for min and abs after Benoit's comments, switched to numext. | Robert Lukierski | 2016-10-13 |
| | | | | | | |||
| * | | | | | Patch to allow VS2015 & CUDA 8.0 to compile with Eigen included. I'm not sure | Avi Ginsburg | 2016-10-13 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | whether to limit the check to this compiler combination (` || (EIGEN_COMP_MSVC == 1900 && __CUDACC_VER__) `) or to leave it as it is. I also don't know if this will have any affect on including Eigen in device code (I'm not in my current project). | ||
| | | | * | | Merged eigen/eigen into default | Benoit Steiner | 2016-10-12 |
| | | | |\ \ | | |_|_|/ / | |/| | | | | |||
* | | | | | | Deleted redundant implementation of predux | Benoit Steiner | 2016-10-12 |
| | | | | | | |||
| * | | | | | Remove double ;; | Gael Guennebaud | 2016-10-12 |
| | | | | | | |||
* | | | | | | Merged eigen/eigen into default | Benoit Steiner | 2016-10-12 |
|\| | | | | | |||
* | | | | | | Take advantage of AVX512 instructions whenever possible to speedup the ↵ | Benoit Steiner | 2016-10-12 |
| | | | | | | | | | | | | | | | | | | | | | | | | processing of 16 bit floats. | ||
| * | | | | | Fix SPQR for rectangular matrices | Gael Guennebaud | 2016-10-12 |
| | | | | | | |||
| | * | | | | Fixes min() warnings. | Robert Lukierski | 2016-10-12 |
| | | | | | | |||
| * | | | | | Merged in rmlarsen/eigen (pull request PR-230) | Gael Guennebaud | 2016-10-12 |
| |\ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | Fix a bug in psqrt for SSE and AVX when EIGEN_FAST_MATH=1 | ||
| | | * | | | | Adding EIGEN_DEVICE_FUNC in the Geometry module. | Robert Lukierski | 2016-10-12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Additional CUDA necessary fixes in the Core (mostly usage of EIGEN_USING_STD_MATH). | ||
| | * | | | | | Fix copy-paste error: Must use _mm256_cmp_ps for AVX. | Rasmus Munk Larsen | 2016-10-12 |
| | | | | | | | |||
| * | | | | | | bug #1325: fix compilation on NEON with clang | Gael Guennebaud | 2016-10-12 |
| | |/ / / / | |/| | | | | |||
| * | | | | | Reenabled the use of variadic templates on tegra x1 provides that the latest ↵ | Benoit Steiner | 2016-10-08 |
| | | | | | | | | | | | | | | | | | | | | | | | | version (i.e. JetPack 2.3) is used. | ||
| | | | * | | Merge the content of the ComputeCpp branch into the default branch | Benoit Steiner | 2016-10-07 |
| | | | |\| | |||
| * | | | | | Remove static qualifier of free-functions (inline is enough and this helps ↵ | Gael Guennebaud | 2016-10-07 |
| | |_|/ / | |/| | | | | | | | | | | | | | ICC to find the right overload) |