| Commit message (Expand) | Author | Age |
* | Revert "Fix rint for SSE/NEON." | Antonio Sánchez | 2021-03-03 |
* | Fix rint for SSE/NEON. | Antonio Sanchez | 2021-03-03 |
* | Add print for SSE/NEON, use NEON rounding intrinsics if available. | Antonio Sanchez | 2021-02-27 |
* | Make half/bfloat16 constructor take inputs by value, fix powerpc test. | Antonio Sanchez | 2021-02-27 |
* | Fix double-promotion warnings | Christoph Hertzberg | 2021-02-27 |
* | Fix NEON sqrt for 32-bit, add prsqrt. | Antonio Sanchez | 2021-02-26 |
* | Fix floor/ceil for NEON fp16. | Antonio Sanchez | 2021-02-25 |
* | Fix SSE/NEON pfloor/pceil for saturated values. | Antonio Sanchez | 2021-02-25 |
* | Fix clang compile when no MMA flags are set. Simplify MMA compiler detection. | Chip-Kerchner | 2021-02-24 |
* | Having forward template function declarations in a P10 file causes bad code i... | Chip-Kerchner | 2021-02-24 |
* | Fixes to support old and new versions of the compilers for built-ins. Cast t... | Chip-Kerchner | 2021-02-24 |
* | Disable fast psqrt for NEON. | Antonio Sanchez | 2021-02-23 |
* | Fix some CUDA warnings. | Antonio Sanchez | 2021-02-24 |
* | Accurate pow, part 2. This change adds specializations of log2 and exp2 for d... | Rasmus Munk Larsen | 2021-02-23 |
* | Fix compilation errors with later versions of GCC and use of MMA. | Chip-Kerchner | 2021-02-22 |
* | Fixes Bug #1925. Packets should be passed by const reference, even to inline ... | Christoph Hertzberg | 2021-02-20 |
* | Use the Cephes double subtraction trick in pexp<float> even when FMA is avail... | Rasmus Munk Larsen | 2021-02-18 |
* | Fix uninitialized warning on AVX. | Antonio Sanchez | 2021-02-17 |
* | Fixed performance issues for VSX and P10 MMA in general_matrix_matrix_product | Chip Kerchner | 2021-02-17 |
* | New accurate algorithm for pow(x,y). This version is accurate to 1.4 ulps for... | Rasmus Munk Larsen | 2021-02-17 |
* | Updated pfrexp implementation. | Antonio Sanchez | 2021-02-17 |
* | missing method in packetmath.h void ptranspose(PacketBlock<Packet16uc, 4>& ke... | Ashutosh Sharma | 2021-02-16 |
* | Use vrsqrts for rsqrt Newton iterations. | Antonio Sanchez | 2021-02-11 |
* | Adjust bounds for pexp_float/double | Antonio Sanchez | 2021-02-10 |
* | Fix ldexp implementations. | Antonio Sanchez | 2021-02-10 |
* | loop less ptranspose | Ashutosh Sharma | 2021-02-10 |
* | Add more tests for pow and fix a corner case for huge exponent where the resu... | Rasmus Munk Larsen | 2021-02-05 |
* | Fix excessive GEBP register spilling for 32-bit NEON. | Antonio Sanchez | 2021-02-03 |
* | Eliminate implicit conversions from float to double. | Antonio Sanchez | 2021-02-01 |
* | Fix altivec packetmath. | Antonio Sanchez | 2021-01-28 |
* | Fix clang compilation for AltiVec from previous check-in | Chip Kerchner | 2021-01-28 |
* | Include `<cstdint>` in one place, remove custom typedefs | Antonio Sanchez | 2021-01-26 |
* | Fix sqrt, ldexp and frexp compilation errors. | Chip Kerchner | 2021-01-25 |
* | Fix pow and other cwise ops for half/bfloat16. | Antonio Sanchez | 2021-01-22 |
* | Specialize std::complex operators for use on GPU device. | Antonio Sanchez | 2021-01-22 |
* | Add support for Arm SVE | David Tellenbach | 2021-01-21 |
* | Fix pfrexp/pldexp for half. | Antonio Sanchez | 2021-01-21 |
* | Vectorize `pow(x, y)`. This closes https://gitlab.com/libeigen/eigen/-/issues... | Rasmus Munk Larsen | 2021-01-18 |
* | Improved std::complex sqrt and rsqrt. | Antonio Sanchez | 2021-01-17 |
* | 1)provide a better generic paddsub op implementation | Guoqiang QI | 2021-01-13 |
* | Only specialize complex `sqrt_impl` for CUDA if not MSVC. | Antonio Sanchez | 2021-01-11 |
* | Fix MSVC complex sqrt and packetmath test. | Antonio Sanchez | 2021-01-08 |
* | Add CUDA complex sqrt. | Antonio Sanchez | 2020-12-22 |
* | * Add iterative psqrt<double> for AVX and SSE when FMA is available. This pro... | Rasmus Munk Larsen | 2020-12-16 |
* | Add an additional step of Newton-Raphson for `psqrt<double>` on Arm, which ot... | Rasmus Munk Larsen | 2020-12-15 |
* | Remove comma at the end of enumeration list to silence C++03 warnings | David Tellenbach | 2020-12-13 |
* | Fix implicit cast to double. | Antonio Sanchez | 2020-12-12 |
* | Fix NEON pmax<PropagateNumbers,Packet4bf>. | Antonio Sanchez | 2020-12-11 |
* | Fix typo in AVX512 packet math. | Antonio Sanchez | 2020-12-11 |
* | Remove unused macro in Half.h | David Tellenbach | 2020-12-12 |