Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CODE SHARING] Insertions of custom LLVM IR and AMDGCN codes to triton #610

Draft
wants to merge 10 commits into
base: sjw-pipeline-infra
Choose a base branch
from

Commits on Jun 25, 2024

  1. [AMD-Pipeliner] Transition stream-pipeline to new SW pipelining infra…

    …structure
    
        - Copied scheduler from MatmulLoopPipeline (much could be consolidated)
        - Enable register buffering (even though may increases register pressure)
        - Enable num_stages=2+, including multi-buffering, and make `2` the default
        - updated tutorial for new tuning default
        - added lit	tests
    sjw36 committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    69b1ba3 View commit details
    Browse the repository at this point in the history
  2. [AMD-Reorder] Move tt.loads as early as possible

    - Also move independent(from loop-carried buffer) `triton_gpu.local_store` as early as possible
    sjw36 committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    b7a0d69 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    299c6c4 View commit details
    Browse the repository at this point in the history
  4. * updated test

    sjw36 committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    3d7beed View commit details
    Browse the repository at this point in the history
  5. * Find insertion point for loads/local_stores as early as possible

    - check for last atomic (sync?)
    - also check for other accesses to the source
    sjw36 committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    82ad03d View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    f586572 View commit details
    Browse the repository at this point in the history
  7. * fixed pruning

    sjw36 committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    ed7ef15 View commit details
    Browse the repository at this point in the history

Commits on Jun 26, 2024

  1. * updated test

    sjw36 committed Jun 26, 2024
    Configuration menu
    Copy the full SHA
    950bfa7 View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2024

  1. Configuration menu
    Copy the full SHA
    13b20fe View commit details
    Browse the repository at this point in the history

Commits on Jul 9, 2024

  1. Configuration menu
    Copy the full SHA
    4916575 View commit details
    Browse the repository at this point in the history