aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch
Commit message (Collapse)AuthorAge
* use ::hlog if available.Gravatar Gael Guennebaud2016-08-29
|
* bug #1167: simplify installation of header files using cmake's ↵Gravatar Gael Guennebaud2016-08-29
| | | | install(DIRECTORY ...) command.
* Implement pmadd for float and double to make it consistent with the ↵Gravatar Gael Guennebaud2016-08-23
| | | | vectorized path when FMA is available.
* Factorize the 4 copies of tanh implementations, make numext::tanh consistent ↵Gravatar Gael Guennebaud2016-08-23
| | | | with array::tanh, enable fast tanh in fast-math mode only.
* Fix compilation with MSVC by using our portable numext::log1p implementation.Gravatar Gael Guennebaud2016-08-22
|
* bug #1270: bypass custom asm for pmadd and recent clang versionGravatar Gael Guennebaud2016-08-22
|
* Fix compilation on CUDA 8 by removing call to h2log1pGravatar Igor Babuschkin2016-08-15
|
* Add log1p support for CUDA and half floatsGravatar Igor Babuschkin2016-08-08
|
* Fixed the constructors of the new half_base class.Gravatar Benoit Steiner2016-08-04
|
* Fixed the isnan, isfinite and isinf operations on GPUGravatar Benoit Steiner2016-08-04
|
* Move Eigen::half_impl::half to Eigen::half while preserving the free ↵Gravatar Gael Guennebaud2016-08-04
| | | | functions to the Eigen::half_impl namespace together with ADL
* half implementation has been moved to half_impl namespaceGravatar Benoit Steiner2016-07-29
|
* bug #1266: half implementation has been moved to half_impl namespaceGravatar Christoph Hertzberg2016-07-29
|
* Fix CUDA compilationGravatar Gael Guennebaud2016-07-22
|
* More cleaning in half:Gravatar Gael Guennebaud2016-07-22
| | | | | | - put its definition and functions in its own half_impl namespace such that the free function does not polute the Eigen namespace while still making them visible for half through ADL. - expose Eigen::half throguh a using statement - move operator<< from std to half_float namespace
* Fix CUDA compilationGravatar Gael Guennebaud2016-07-21
|
* Add missing log10 and random generator for half.Gravatar Gael Guennebaud2016-07-21
|
* Some cleanup in Halh: standard functions should be defined in the namespace ↵Gravatar Gael Guennebaud2016-07-21
| | | | of the class half to make ADL work, and thus the global is* functions can be removed.
* Move CUDA's special functions to SpecialFunctions module.Gravatar Gael Guennebaud2016-07-11
|
* mergeGravatar Gael Guennebaud2016-07-11
|\
| * Merged kmargar/eigen/tip into default, Altivec/VSX port should be working ok ↵Gravatar Konstantinos Margaritis2016-07-10
| |\ | | | | | | | | | now.
| | * minor fixes for big endian altivec/vsxGravatar Konstantinos Margaritis2016-07-10
| | |
* | | bug #1232: refactor special functions as a new SpecialFunctions module, ↵Gravatar Gael Guennebaud2016-07-08
|/ / | | | | | | currently in unsupported/.
* | Undo changes in AltiVec --- I don't have any way to test there.Gravatar Benoit Jacob2016-06-28
| |
* | Avoid global variables with static constructors in NEON/Complex.hGravatar Benoit Jacob2016-06-28
| |
| * fix compilation with clang 3.9, fix performance with pset1, use vector ↵Gravatar Konstantinos Margaritis2016-06-23
| | | | | | | | operators instead of intrinsics in some cases
| * mostly cleanups and modernizing codeGravatar Konstantinos Margaritis2016-06-19
| |
| * mostly cleanups and modernizing codeGravatar Konstantinos Margaritis2016-06-19
| |
| * fixed pexp<Packet2d>, was failing testsGravatar Konstantinos Margaritis2016-06-19
| |
* | bug #1240: Remove any assumption on NEON vector types.Gravatar Gael Guennebaud2016-06-09
| |
* | Fix compile errors initializing packets on ARM DS-5 5.20Gravatar Sean Templeton2016-06-03
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The ARM DS-5 5.20 compiler fails compiling with the following errors: "src/Core/arch/NEON/PacketMath.h", line 113: Error: #146: too many initializer values Packet4f countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/PacketMath.h", line 118: Error: #146: too many initializer values Packet4i countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); ^ "src/Core/arch/NEON/Complex.h", line 30: Error: #146: too many initializer values static uint32x4_t p4ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET4(0x00000000, 0x80000000, 0x00000000, 0x80000000); ^ "src/Core/arch/NEON/Complex.h", line 31: Error: #146: too many initializer values static uint32x2_t p2ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET2(0x00000000, 0x80000000); ^ The vectors are implemented as two doubles, hence the too many initializer values error. Changed the code to use intrinsic load functions which all compilers implementing NEON should have.
* | Enable the vectorization of adds and mults of fp16Gravatar Benoit Steiner2016-06-07
| |
* | Add TernaryFunctors and the betainc SpecialFunction.Gravatar Eugene Brevdo2016-06-02
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.
* | Improved support for CUDA 8.0Gravatar Benoit Steiner2016-05-31
| |
* | Disable the use of MMX instructions since the code is broken on many platformsGravatar Benoit Steiner2016-05-27
| |
* | Deleted extra namespaceGravatar Benoit Steiner2016-05-26
| |
* | Disable usage of MMX with msvc.Gravatar Gael Guennebaud2016-05-26
| |
* | Add missing inclusion of mmintrin.hGravatar Gael Guennebaud2016-05-26
| |
* | Silenced a compilation warningGravatar Benoit Steiner2016-05-25
| |
* | Specify the rounding mode in the correct locationGravatar Benoit Steiner2016-05-25
| |
* | Explicitly specify the rounding mode when converting floats to fp16Gravatar Benoit Steiner2016-05-25
| |
* | Disable the use of MMX instructions on x86_64 since too many compilers only ↵Gravatar Benoit Steiner2016-05-25
| | | | | | | | support them in 32bit mode
* | Fix compilation with ICC.Gravatar Gael Guennebaud2016-05-25
| |
* | Cleaned up the fp16 code a little moreGravatar Benoit Steiner2016-05-24
| |
* | Cleaned up the fp16 codeGravatar Benoit Steiner2016-05-24
| |
* | Remove now-unused protate PacketMath funcGravatar Benoit Jacob2016-05-24
| |
* | Don't attempt to use MMX instructions with visualstudio since they're only ↵Gravatar Benoit Steiner2016-05-24
| | | | | | | | partially supported.
* | Worked around missing clang intrinsicGravatar Benoit Steiner2016-05-24
| |
* | Use the generic ploadquad intrinsics since it does the jobGravatar Benoit Steiner2016-05-24
| |
* | Worked around missing clang intrinsicsGravatar Benoit Steiner2016-05-24
| |