Kernel filtering with Omniperf profile · Issue #325 · ROCm/rocprofiler-compute · GitHub
Kernel filtering with Omniperf profile #325
Closed
@ausellis0

Describe the bug
On the 2.x branch, omniperf profile with the kernel filtering option -k does not limit which kernels are profiled. After running omniperf analyze, all kernels are still present in the results. Filtering at profile time is needed for applications that launch a very large number of kernels and dispatches. The -k option does work as expected for omniperf analyze.

Development Environment:

  • Linux Distribution: RHEL/8.9 (TOSS)
  • Omniperf Version: 2.0.0-RC1 (6222138)
  • GPU: MI300A
  • Cluster (if applicable): LLNL System

To Reproduce
Steps to reproduce the behavior:

git clone https://github.com/ROCm/HIP-Examples.git
cd HIP-Examples/add4

./buildit.sh

ROCPROF=${ROCM_PATH}/bin/rocprofv2 omniperf profile -n add4_test -k "add" -- ./gpu-stream-hip
omniperf analyze -p workloads/add4_test/MI300A_A1
# All kernels are present in Top Stats, and all metrics are essentially the same as when `omniperf profile` is run without any filtering.

Expected behavior
omniperf profile with -k is expected to collect data only for the kernels passed to the flag, and to skip all other kernels.
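
The expected behavior can be checked mechanically. The following is a minimal sketch, not part of Omniperf: it assumes the profile output can be read as CSV text with one row per dispatch and a kernel-name column (the column name `Kernel_Name`, the helper `unexpected_kernels`, and the sample kernel names are all hypothetical), and it assumes -k is meant to match by substring. It returns the kernel names that should have been filtered out.

```python
import csv
import io


def unexpected_kernels(csv_text, kernel_filter, column="Kernel_Name"):
    """Return the sorted kernel names in csv_text that do NOT contain kernel_filter.

    Assumptions (not confirmed by the issue): the data is CSV with a header row,
    `column` holds the kernel name, and the filter matches by substring.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    # Collapse repeated dispatches of the same kernel into one entry.
    names = {row[column] for row in reader}
    return sorted(name for name in names if kernel_filter not in name)


# Hypothetical sample data; real kernel names and columns will differ.
sample = '''Dispatch_ID,Kernel_Name
0,"add4(float*, float*)"
1,"copy(float*, float*)"
2,"add4(float*, float*)"
'''

# With a working profile-time filter for "add", this list would be empty.
print(unexpected_kernels(sample, "add"))
```

An empty list would indicate that only kernels matching the filter were collected. Any names printed are kernels that were profiled despite the filter, which is the behavior reported here.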
