Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | fix plog(+inf) with AVX512 | 2019-01-09 | |
| | |||
* | Add psin/pcos on AVX512 -> almost for free, at last! | 2018-11-30 | |
| | |||
* | Fix float-to-double warning | 2018-10-16 | |
| | |||
* | Fix avx512 plog(NaN) to return NaN instead of +inf | 2018-10-11 | |
| | |||
* | Enable avx512 plog with clang | 2018-10-11 | |
| | |||
* | Re-enable FMA for fast sqrt functions | 2018-07-30 | |
| | |||
* | Fix AVX512 implementations of psqrt | 2018-06-25 | |
| | | | | | | | | | | | | | This commit fixes the AVX512 implementations of psqrt in the same way that 3ed67cb0bb4af65fbf243df598604a8c7630bf7d fixed the AVX2 version of this function. The AVX512 versions of psqrt incorrectly return -0.0 for negative values, instead of NaN. Fixing the issues requires adding some additional instructions that slow down the algorithms. A similar test to the one used in 3ed67cb0bb4af65fbf243df598604a8c7630bf7d shows that the corrected Packet16f code runs at 73% of the speed of the existing code, while the corrected Packed8d function runs at 68% of the original. | ||
* | fix AVX512 plog | 2018-04-20 | |
| | |||
* | AVX512: _mm512_rsqrt28_ps is available for AVX512ER only | 2018-04-03 | |
| | |||
* | AVX512: fix psqrt and prsqrt | 2018-04-03 | |
| | |||
* | Disabled some of the AVX512 primitives on compilers that don't support them | 2016-04-29 | |
| | |||
* | Commented out the version of pexp<Packet8d> since it fails to compile with ↵ | 2016-02-04 | |
| | | | | gcc 5.3 | ||
* | Added implementations of pexp, plog, psqrt, and prsqrt optimized for AVX512 | 2016-02-04 | |