8000 [Bug]: KeyError · Issue #625 · ROCm/rocprofiler-compute · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[Bug]: KeyError #625

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
dezhiAmd opened this issue Mar 22, 2025 · 1 comment
Open

[Bug]: KeyError #625

dezhiAmd opened this issue Mar 22, 2025 · 1 comment
Assignees
Labels
bug Something isn't working triage

Comments

@dezhiAmd
Copy link
dezhiAmd commented Mar 22, 2025

Describe the bug

Repo process:

  1. Go to the sample directory (rocprofiler-compute_source/sample)

  2. Run the command:
    rocprof-compute profile -n test1 -k run_forward$async_dispatch_48_attention_4x10x4096x64xf8E4M3FNUZ_generic --no-roof -- ./occupancy

  3. Bear in mind that there is no kernel named "run_forward$async_dispatch_48_attention_4x10x4096x64xf8E4M3FNUZ_generic", this test might just exit early instead of running all 15 tests using 15 input files

When I try to profile my application using kernel filter like this:

In the end, timestamps.csv is almost empty (with Dispatch_ID,Kernel_Name)
The error message on screen is:

Traceback (most recent call last):
  File "/rocprofiler-compute_source/install/bin/rocprof-compute", line 156, in <module>
    main()
    ~~~~^^
  File "/rocprofiler-compute_source/install/bin/rocprof-compute", line 144, in main
    rocprof_compute.run_profiler()
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^
  File "/rocprofiler-compute_source/install/libexec/rocprofiler-compute/utils/utils.py", line 53, in wrap_function
    result = function(*args, **kwargs)
  File "/rocprofiler-compute_source/install/libexec/rocprofiler-compute/rocprof_compute_base.py", line 357, in run_profiler
    profiler.post_processing()
    ~~~~~~~~~~~~~~~~~~~~~~~~^^
  File "/rocprofiler-compute_source/install/libexec/rocprofiler-compute/utils/utils.py", line 53, in wrap_function
    result = function(*args, **kwargs)
  File "/rocprofiler-compute_source/install/libexec/rocprofiler-compute/rocprof_compute_profile/profiler_rocprof_v1.py", line 100, in post_processing
    self.join_prof()
    ~~~~~~~~~~~~~~^^
  File "/rocprofiler-compute_source/install/libexec/rocprofiler-compute/utils/utils.py", line 53, in wrap_function
    result = function(*args, **kwargs)
  File "/rocprofiler-compute_source/install/libexec/rocprofiler-compute/rocprof_compute_profile/profiler_base.py", line 121, in join_prof
    key = _df.groupby(["Kernel_Name", "Grid_Size"]).cumcount()
          ~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/root/.pyenv/versions/3.13t-dev/lib/python3.13t/site-packages/pandas/core/frame.py", line 9183, in groupby
    return DataFrameGroupBy(
        obj=self,
    ...<7 lines>...
        dropna=dropna,
    )
  File "/root/.pyenv/versions/3.13t-dev/lib/python3.13t/site-packages/pandas/core/groupby/groupby.py", line 1329, in __init__
    grouper, exclusions, obj = get_grouper(
                               ~~~~~~~~~~~^
        obj,
        ^^^^
    ...<5 lines>...
        dropna=self.dropna,
        ^^^^^^^^^^^^^^^^^^^
    )
    ^
  File "/root/.pyenv/versions/3.13t-dev/lib/python3.13t/site-packages/pandas/core/groupby/grouper.py", line 1043, in get_grouper
    raise KeyError(gpr)
KeyError: 'Grid_Size'

Linux Distribution

Ubuntu22.04

ROCm Compute Profiler Version

git hash 40ad99e

GPU

AMD MI300X

ROCm Version

ROCM6.3.4

Cluster name (if applicable)

No response

Reproducer

  1. Run command:
    rocprof-compute profile -n test1 -k run_forward$async_dispatch_48_attention_4x10x4096x64xf8E4M3FNUZ_generic --no-roof -- ./occupancy

Expected behavior

Report no kernel named "run_forward$async_dispatch_48_attention_4x10x4096x64xf8E4M3FNUZ_generic"

Relevant log output

Screenshots

No response

Additional Context

No response

@dezhiAmd dezhiAmd added bug Something isn't working triage labels Mar 22, 2025
@dezhiAmd
Copy link
Author

dezhliao@amd.com
SharkAI team for more details

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working triage
Projects
None yet
Development

No branches or pull requests

2 participants
0