How can the L2 arithmetic intensity be less than the HBM AI? · Issue #690 · ROCm/rocprofiler-compute · GitHub
Describe your question
I have the following roofline:
This gives me a lower L2 AI than HBM AI. Since all loads/stores that go through L2 should also go through HBM, shouldn't we have AI L1 >= AI L2 >= AI HBM?
Additionally, how should one interpret the L1/L2 AI dots? Does a dot farther to the left mean the caches are used better (more loads are served from the caches/LDS)? If the L1 dot is farther left than the L2 dot, does that mean we make use of L1, and if the L2 dot is farther left than the HBM dot, does that mean we make use of L2?
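For reference, here is a minimal sketch of the quantity being compared, assuming the usual roofline definition: arithmetic intensity at a memory level is the FLOP count divided by the bytes moved at that level. The byte counts below are made-up illustrative numbers, not profiler output, and `arithmetic_intensity` is a hypothetical helper, not part of rocprofiler-compute.

```python
def arithmetic_intensity(flops, bytes_moved):
    """FLOPs per byte at one memory level (assumed roofline definition)."""
    return flops / bytes_moved


# Hypothetical kernel: the same FLOP count is divided by the traffic
# observed at each level. The numbers are invented for illustration only.
flops = 1.0e9
bytes_by_level = {
    "L1": 8.0e9,   # assumed: every load/store request reaches L1
    "L2": 2.0e9,   # assumed: only L1 misses reach L2
    "HBM": 0.5e9,  # assumed: only L2 misses reach HBM
}

ai = {level: arithmetic_intensity(flops, b) for level, b in bytes_by_level.items()}

for level, value in ai.items():
    print(f"AI {level}: {value:.3f} FLOP/byte")
```

Under these assumptions, a level that sees more traffic gets a smaller AI and so plots farther left on the roofline. Whether a real kernel follows this ordering depends on how the profiler counts bytes at each level, which this sketch does not model.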
Additional context
No response