Fast atan and atan2 functions. #8388

Open
wants to merge 35 commits into
base: main

Commits on Aug 13, 2024

  1. 69052fa
  2. e82d9ff
  3. 11b442c
  4. dee28bc
  5. Correct attribution.

    mcourteaux committed Aug 13, 2024
    362f0ea
  6. Clang-format

    mcourteaux committed Aug 13, 2024
    1bd7f7a
  7. 4f1e851
  8. f10396b
  9. de9d3b7
  10. d8e3225
  11. Bugfix fast_atan approximation.

    Fix correctness test to exceed the range (-1, 1) to test (-4, 4). Cleanup code/comments. Test performance for all approximations.
    mcourteaux committed Aug 13, 2024
    3bcd1a7
  12. Cleanup

    mcourteaux committed Aug 13, 2024
    2aa0c7e
  13. fd088f8
  14. 62534d7
  15. Skip test for WebGPU.

    mcourteaux committed Aug 13, 2024
    c76e719
  16. fc25944
  17. Feedback Steven.

    mcourteaux committed Aug 13, 2024
    b5d0cad
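
Commit 11 above widens the correctness test from (-1, 1) to (-4, 4). A polynomial fitted to atan on [-1, 1] is only accurate inside that interval, so inputs outside it need some form of range reduction. The following is a minimal sketch of one common approach, assuming the identity atan(x) = sign(x)·π/2 − atan(1/x) for |x| > 1. The function names and the polynomial coefficients are illustrative assumptions, not the ones used in this PR.

```cpp
#include <algorithm>
#include <cmath>

// Illustrative odd polynomial for atan on [-1, 1], evaluated in Horner form.
// The coefficients are an assumption for this sketch, not the PR's values.
inline float atan_poly(float x) {
    const float x2 = x * x;
    return x * (0.99997726f +
           x2 * (-0.33262347f +
           x2 * (0.19354346f +
           x2 * (-0.11643287f +
           x2 * (0.05265332f +
           x2 * (-0.01172120f))))));
}

// Hypothetical wrapper: use the polynomial directly on [-1, 1] and the
// reciprocal identity elsewhere, so the argument passed to atan_poly
// always lies in [-1, 1].
inline float fast_atan_sketch(float x) {
    const float half_pi = 1.57079632679f;
    if (std::fabs(x) <= 1.0f) {
        return atan_poly(x);
    }
    // atan(x) = sign(x) * pi/2 - atan(1/x) for |x| > 1
    return std::copysign(half_pi, x) - atan_poly(1.0f / x);
}

// Largest absolute deviation from std::atan on a uniform grid over [lo, hi].
inline float max_abs_error(float lo, float hi, int steps) {
    float worst = 0.0f;
    for (int i = 0; i <= steps; ++i) {
        const float x = lo + (hi - lo) * static_cast<float>(i) / static_cast<float>(steps);
        worst = std::max(worst, std::fabs(fast_atan_sketch(x) - std::atan(x)));
    }
    return worst;
}
```

Calling `max_abs_error(-4.0f, 4.0f, 8000)` exercises both branches. A test restricted to (-1, 1) would never reach the reciprocal branch, which is presumably why the wider range was needed to catch the bug.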

Commits on Aug 14, 2024

  1. 4d61c6a
  2. ff28b99

Commits on Sep 15, 2024

  1. Partially apply clang-tidy fixes we don't enforce yet (halide#8376)

    * Partially apply clang-tidy fixes we don't use yet
    
    - Put a bunch of stuff into anonymous namespaces
    - Delete some redundant casts (e.g. casting an int to int)
    - Add some const refs to avoid copies
    - Remove meaningless inline qualifiers on in-class definitions and
    constexpr functions
    - Remove return-with-value from functions returning void
    - Delete a little dead code
    - Use std::min/max where appropriate
    - Don't use a variable after std::forwarding it. It may have been moved
    from.
    - Use std::string::empty instead of comparing length to zero
    
    * Undo unintentional formatting change
    
    * Restore some necessary casts
    
    * Add NOLINT to silence older clang-tidy
    abadams authored and mcourteaux committed Sep 15, 2024
    5a435f0
  2. Fix bundling error on buildbots (halide#8392)

    LLVM as it is built on the buildbots depends on `-lrt`, which is not a
    target. Filter out non-target dependencies from consideration.
    alexreinking authored and mcourteaux committed Sep 15, 2024
    a4544be
  3. 624f737
  4. Fix _Float16 detection on ARM64 GCC<13 (halide#8401)

    GCC 12 only supports _Float16 on x86. Support for ARM was added in
    GCC 13. This causes a build failure in the manylinux_2_28 images.
    alexreinking authored and mcourteaux committed Sep 15, 2024
    5ca88b7
  5. Update README.md (halide#8404)

    The instructions for which llvm to acquire were stale
    abadams authored and mcourteaux committed Sep 15, 2024
    238f73c
  6. b09f611
  7. 0614530
  8. ae6dac4
  9. 30b5938
  10. Merge pull request halide#8412

    * Update pip package metadata
    
    * Link to the CMake package docs from Doxygen
    
    * Fix invalid Doxygen annotation in Serialization.h
    alexreinking authored and mcourteaux committed Sep 15, 2024
    6f0da12
  11. Fix classifier spelling (halide#8413)

    PyPI rejected this because of a spacing issue.
    alexreinking authored and mcourteaux committed Sep 15, 2024
    44651f9
  12. 636ad8f
  13. 51824df
  14. Include our Markdown documentation in the Doxygen site. (halide#8417)

    A few quirks in the Markdown parser were worked around here.
    
    The most notable is that the sequence `]:` causes Doxygen to interpret a
    would-be link as a trailing reference even if it is not at the start of
    a line. Duplicating the single bracket reference is a portable
    workaround, i.e.
    
        [winget]  ~>  [winget][winget]
    
    It also doesn't stop interpreting `@` directives inside inline code, so
    it warns about our use of the `@` as a decorator symbol inside
    Python.md.
    alexreinking authored and mcourteaux committed Sep 15, 2024
    c9b2a76
  15. Add missing backslash (halide#8419)

    abadams authored and mcourteaux committed Sep 15, 2024
    a8966e9
  16. Reschedule the matrix multiply performance app (halide#8418)

    Someone was using this as a reference expert schedule, but it was
    stale and a bit simplistic for large matrices. I rescheduled it
    to get a better fraction of peak. This also now demonstrates how
    to use rfactor to block an sgemm over the k axis.
    abadams authored and mcourteaux committed Sep 15, 2024
    9bcb9b7
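
The last commit above mentions blocking an sgemm over the k axis. As a rough illustration of that idea only, the sketch below uses plain C++ rather than a Halide schedule: the reduction over k is split into blocks, each block produces a partial sum, and the partial sums are accumulated into the output. The function name, the row-major layout, and the `kb` block-size parameter are assumptions made for this example; none of it is taken from the rescheduled app.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper: C (M x N) = A (M x K) * B (K x N), all row-major.
// The k loop is split into blocks of size kb; each block contributes a
// partial sum that is then added into C.
inline std::vector<float> matmul_k_blocked(const std::vector<float> &A,
                                           const std::vector<float> &B,
                                           std::size_t M, std::size_t N,
                                           std::size_t K, std::size_t kb) {
    kb = std::max<std::size_t>(kb, 1);  // guard against a zero block size
    std::vector<float> C(M * N, 0.0f);
    for (std::size_t k0 = 0; k0 < K; k0 += kb) {
        const std::size_t k1 = std::min(k0 + kb, K);
        for (std::size_t i = 0; i < M; ++i) {
            for (std::size_t j = 0; j < N; ++j) {
                // Partial reduction over the current k block only.
                float partial = 0.0f;
                for (std::size_t k = k0; k < k1; ++k) {
                    partial += A[i * K + k] * B[k * N + j];
                }
                // Merge the block's partial result into the output.
                C[i * N + j] += partial;
            }
        }
    }
    return C;
}
```

Because floating-point addition is not associative, a blocked reduction can differ from an unblocked one in the last bits. That reordering is the trade made for better locality on large matrices.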