Commit message | Author | Age | |
---|---|---|---|
* | use ::hlog if available. | 2016-08-29 | |
| | |||
* | bug #1167: simplify installation of header files using cmake's install(DIRECTORY ...) command. | 2016-08-29 | |
| | |||
* | Implement pmadd for float and double to make it consistent with the vectorized path when FMA is available. | 2016-08-23 | |
| | |||
* | Factorize the 4 copies of tanh implementations, make numext::tanh consistent with array::tanh, enable fast tanh in fast-math mode only. | 2016-08-23 | |
| | |||
* | Fix compilation with MSVC by using our portable numext::log1p implementation. | 2016-08-22 | |
| | |||
* | bug #1270: bypass custom asm for pmadd and recent clang version | 2016-08-22 | |
| | |||
* | Fix compilation on CUDA 8 by removing call to h2log1p | 2016-08-15 | |
| | |||
* | Add log1p support for CUDA and half floats | 2016-08-08 | |
| | |||
* | Fixed the constructors of the new half_base class. | 2016-08-04 | |
| | |||
* | Fixed the isnan, isfinite and isinf operations on GPU | 2016-08-04 | |
| | |||
* | Move Eigen::half_impl::half to Eigen::half while preserving the free functions to the Eigen::half_impl namespace together with ADL | 2016-08-04 | |
| | |||
* | half implementation has been moved to half_impl namespace | 2016-07-29 | |
| | |||
* | bug #1266: half implementation has been moved to half_impl namespace | 2016-07-29 | |
| | |||
* | Fix CUDA compilation | 2016-07-22 | |
| | |||
* | More cleaning in half: | 2016-07-22 | |
| | - put its definition and functions in its own half_impl namespace such that the free functions do not pollute the Eigen namespace while still making them visible for half through ADL. - expose Eigen::half through a using statement. - move operator<< from std to half_float namespace. | ||
* | Fix CUDA compilation | 2016-07-21 | |
| | |||
* | Add missing log10 and random generator for half. | 2016-07-21 | |
| | |||
* | Some cleanup in Half: standard functions should be defined in the namespace of the class half to make ADL work, and thus the global is* functions can be removed. | 2016-07-21 | |
| | |||
* | Move CUDA's special functions to SpecialFunctions module. | 2016-07-11 | |
| | |||
* | merge | 2016-07-11 | |
|\ | |||
| * | Merged kmargar/eigen/tip into default, Altivec/VSX port should be working ok now. | 2016-07-10 | |
| |\ | |||
| | * | minor fixes for big endian altivec/vsx | 2016-07-10 | |
| | | | |||
* | | bug #1232: refactor special functions as a new SpecialFunctions module, currently in unsupported/. | 2016-07-08 | |
|/ / | |||
* | | Undo changes in AltiVec --- I don't have any way to test there. | 2016-06-28 | |
| | | |||
* | | Avoid global variables with static constructors in NEON/Complex.h | 2016-06-28 | |
| | | |||
| * | fix compilation with clang 3.9, fix performance with pset1, use vector operators instead of intrinsics in some cases | 2016-06-23 | |
| | | |||
| * | mostly cleanups and modernizing code | 2016-06-19 | |
| | | |||
| * | mostly cleanups and modernizing code | 2016-06-19 | |
| | | |||
| * | fixed pexp<Packet2d>, was failing tests | 2016-06-19 | |
| | | |||
* | | bug #1240: Remove any assumption on NEON vector types. | 2016-06-09 | |
| | | |||
* | | Fix compile errors initializing packets on ARM DS-5 5.20 | 2016-06-03 | |
| | | The ARM DS-5 5.20 compiler fails compiling with the following errors: "src/Core/arch/NEON/PacketMath.h", line 113: Error: #146: too many initializer values: Packet4f countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); "src/Core/arch/NEON/PacketMath.h", line 118: Error: #146: too many initializer values: Packet4i countdown = EIGEN_INIT_NEON_PACKET4(0, 1, 2, 3); "src/Core/arch/NEON/Complex.h", line 30: Error: #146: too many initializer values: static uint32x4_t p4ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET4(0x00000000, 0x80000000, 0x00000000, 0x80000000); "src/Core/arch/NEON/Complex.h", line 31: Error: #146: too many initializer values: static uint32x2_t p2ui_CONJ_XOR = EIGEN_INIT_NEON_PACKET2(0x00000000, 0x80000000); The vectors are implemented as two doubles, hence the too many initializer values error. Changed the code to use intrinsic load functions which all compilers implementing NEON should have. | ||
* | | Enable the vectorization of adds and mults of fp16 | 2016-06-07 | |
| | | |||
* | | Add TernaryFunctors and the betainc SpecialFunction. | 2016-06-02 | |
| | | TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu. Collapsed revision: * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe, and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments. | ||
* | | Improved support for CUDA 8.0 | 2016-05-31 | |
| | | |||
* | | Disable the use of MMX instructions since the code is broken on many platforms | 2016-05-27 | |
| | | |||
* | | Deleted extra namespace | 2016-05-26 | |
| | | |||
* | | Disable usage of MMX with msvc. | 2016-05-26 | |
| | | |||
* | | Add missing inclusion of mmintrin.h | 2016-05-26 | |
| | | |||
* | | Silenced a compilation warning | 2016-05-25 | |
| | | |||
* | | Specify the rounding mode in the correct location | 2016-05-25 | |
| | | |||
* | | Explicitly specify the rounding mode when converting floats to fp16 | 2016-05-25 | |
| | | |||
* | | Disable the use of MMX instructions on x86_64 since too many compilers only support them in 32bit mode | 2016-05-25 | |
| | | |||
* | | Fix compilation with ICC. | 2016-05-25 | |
| | | |||
* | | Cleaned up the fp16 code a little more | 2016-05-24 | |
| | | |||
* | | Cleaned up the fp16 code | 2016-05-24 | |
| | | |||
* | | Remove now-unused protate PacketMath func | 2016-05-24 | |
| | | |||
* | | Don't attempt to use MMX instructions with visualstudio since they're only partially supported. | 2016-05-24 | |
| | | |||
* | | Worked around missing clang intrinsic | 2016-05-24 | |
| | | |||
* | | Use the generic ploadquad intrinsics since it does the job | 2016-05-24 | |
| | | |||
* | | Worked around missing clang intrinsics | 2016-05-24 | |
| | |