Median Filter on HOST and HIP #559

r-abishek · 2025-06-04T04:34:17Z

8000

Adds RPP Median Filter for HOST and HIP with support for U8/F16/F32/I8 and NCHW-NHWC toggle support
Adds relevant QA/Unit/Performance tests

add doc images

templated median filter compute in HIP backend

…tations

into hk/median_filter

RPP Median Filter on HOST and HIP

kiritigowda · 2025-06-04T21:47:00Z

@r-abishek - can you resolve the merge conflicts?

Copilot

Pull Request Overview

Adds a median filter augmentation across HOST and HIP backends, with full support for U8/F16/F32/I8 data types and NCHW/NHWC layouts, and updates the test suite to exercise the new functionality.

Registers MEDIAN_FILTER in the augmentation map, enums, and test-case sets.
Implements rppt_median_filter_host and rppt_median_filter_gpu, plus the CPU/GPU kernel code and header declarations.
Updates test scripts and mappings (common.py, runImageTests.py) and adds QA/unit/performance tests for the new filter.

Reviewed Changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
utilities/test_suite/rpp_test_suite_image.h	Registers `MEDIAN_FILTER` and updates parameter/case sets
utilities/test_suite/common.py	Adds `median_filter` mapping and updates filter category lists
utilities/test_suite/HOST/runImageTests.py	Includes `median_filter` in kernel-size test loops
utilities/test_suite/HOST/Tensor_image_host.cpp	Adds `MEDIAN_FILTER` switch case with host API call
utilities/test_suite/HIP/runImageTests.py	Includes `median_filter` in HIP test loops
utilities/test_suite/HIP/Tensor_image_hip.cpp	Adds `MEDIAN_FILTER` switch case with GPU API call
src/modules/tensor/rppt_tensor_filter_augmentations.cpp	Implements host and GPU median filter functions
src/modules/tensor/cpu/kernel/median_filter.cpp	Adds generic median filter kernel implementation
src/include/tensor/host_tensor_executors.hpp	Declares `median_filter_generic_host_tensor` template
src/include/tensor/hip_tensor_executors.hpp	Declares `hip_exec_median_filter_tensor` template
api/rppt_tensor_filter_augmentations.h	Documents and prototypes median filter APIs

Comments suppressed due to low confidence (1)

src/modules/tensor/cpu/kernel/median_filter.cpp:63

C++ does not support variable-length arrays; consider using std::vector blockData(kernelSizeSquared * channels) or allocating a fixed-size buffer to avoid undefined behavior.

T blockData[kernelSizeSquared * channels];

src/modules/tensor/cpu/kernel/median_filter.cpp

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

codecov · 2025-06-11T21:02:26Z

Codecov Report

Attention: Patch coverage is 99.20319% with 4 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...odules/tensor/rppt_tensor_filter_augmentations.cpp	97.17%	3 Missing ⚠️
src/modules/tensor/hip/kernel/median_filter.cpp	99.64%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #559      +/-   ##
===========================================
+ Coverage    87.60%   87.66%   +0.05%     
===========================================
  Files          185      187       +2     
  Lines        78747    79249     +502     
===========================================
+ Hits         68984    69467     +483     
- Misses        9763     9782      +19

Files with missing lines	Coverage Δ
src/modules/tensor/cpu/kernel/median_filter.cpp	`100.00% <100.00%> (ø)`
src/modules/tensor/hip/kernel/median_filter.cpp	`99.64% <99.64%> (ø)`
...odules/tensor/rppt_tensor_filter_augmentations.cpp	`97.56% <97.17%> (-0.19%)`	⬇️

... and 2 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

kiritigowda · 2025-06-12T17:32:04Z

@r-abishek - can you resolve the conflicts

rrawther · 2025-06-13T17:15:43Z

+            Rpp32u srcIdx = row * srcDescPtr->strides.hStride + col * srcDescPtr->strides.wStride;
+
+            // Copy pixel values for all channels
+            for (Rpp32s ch = 0; ch < channels; ch++)


consider optimizing this loop code try
"
memcpy(&blockData[index], &srcPtrTemp[srcIdx], channels);
index += channels;
"
OR with #pragma unroll before line #79. Try both and see which results in more perf

rrawther · 2025-06-13T17:18:46Z

src/modules/tensor/cpu/kernel/median_filter.cpp

+        for (Rpp32s j = -padLength; j <= padLength; j++)
+        {
+            // Clamp the row and column to image boundaries (nearest-neighbor padding)
+            Rpp32s row = std::max(0, std::min(rowIdx + i, heightLimit));


can we compute the min and max outside the for loops since padlength is predefined

rrawther · 2025-06-13T17:28:33Z

src/modules/tensor/cpu/kernel/median_filter.cpp

+            channelBlock[i] = blockData[i * channels + ch];
+
+        // Sort the data to compute median
+        std::sort(channelBlock, channelBlock + kernelSizeSquared);


have you considered using nth_element instead of std::sort? It is too slow

int mid = kernelSizeSquared / 2;
std::nth_element(channelBlock, channelBlock + mid, channelBlock + kernelSizeSquared);
uint8_t median = channelBlock[mid];

rrawther · 2025-06-13T17:29:41Z

src/modules/tensor/cpu/kernel/median_filter.cpp

+                    T *dstPtrTemp = dstPtrRow;
+                    for(Rpp32s j = 0; j < roi.xywhROI.roiWidth; j++)
+                    {
+                        median_filter_generic_tensor(srcPtrChannel, dstPtrTemp, i, j, kernelSizeSquared, padLength, roi.xywhROI.roiHeight - 1, roi.xywhROI.roiWidth - 1, 1, srcDescPtr, dstDescPtr);


median_filter_generic_tensor has to be optimized as per suggestions

rrawther

@AryanSalmanpour : Can you please add your review for the HIP code. I just skimmed through it

rrawther · 2025-06-13T17:34:31Z

src/modules/tensor/hip/kernel/median_filter.cpp

+        // Sorting network for 3x3 (9 elements) median
+        #define SWAP(i, j) if (window[i] > window[j]) { float tmp = window[i]; window[i] = window[j]; window[j] = tmp; }
+
+        SWAP(1, 2); SWAP(4, 5); SWAP(7, 8); SWAP(0, 1);


why this is skipping alternate SWAPs. like SWAP(2, 3), SWAP(5,6) etc

This is a fixed sorting network for 9 elements. Some swaps like SWAP(2, 3) or SWAP(5, 6) are skipped because they're not needed — the sorting is still correct with fewer steps.

rrawther · 2025-06-13T17:36:59Z

src/modules/tensor/hip/kernel/median_filter.cpp

+    }
+    else
+    {
+        // Partial selection sort for median - sufficient to find median without full sorting


Is this an approximate algorithm?

kiritigowda · 2025-06-13T18:46:37Z

@r-abishek need the conflicts resolved to run CI

Median Filter : Review comments and Conflicts resolution

r-abishek · 2025-06-18T00:57:02Z

@kiritigowda conflicts resolved

AryanSalmanpour · 2025-06-25T14:36:28Z

src/modules/tensor/hip/kernel/median_filter.cpp

+// -------------------- median_filter device helpers --------------------
+
+template<int kernelSize>
+__device__ float compute_median(float *window)


@r-abishek We have a built-in function for computing the median: __builtin_amdgcn_fmed3f. Please see if you can use it here.

We have tried using the __builtin_amdgcn_fmed3f function for median computation. However, this function can only compute the median of three values, whereas a 3×3 filter requires finding the median of nine values.
We attempted sequential calls with combinations of the 9 elements, but faced accuracy issues and were unable to match the golden outputs in QA tests.
This function provides a significant performance boost and may still be useful for approximate or relaxed-accuracy cases.

@r-abishek @HazarathKumarM Please take a look at the MIVisionX HIP kernel, which utilizes the __builtin_amdgcn_fmed3f function for computing a 3x3 median filter. https://github.com/ROCm/MIVisionX/blob/develop/amd_openvx/openvx/hipvx/filter_kernels.cpp#L508

@AryanSalmanpour Thanks! This is working very well for 3x3 specifically. Testing and sending this and few more updates soon.
Unfortunately not for 5,7,9 sizes though

rrawther

I think some comments are still not addressed

HazarathKumarM and others added 16 commits February 24, 2025 19:11

Add Median Filter implementation on HOST and HIP

be2de9c

fix the mismatches on the HIP backend

8011754

fix QA mismatches

de0f507

Merge remote-tracking branch 'develop' into hk/median_filter

113cebd

resolve build errors

7d6ef30

add doc images

resolve review comments

46a47f1

Add comments in the function

7ca3771

templated median filter compute in HIP backend

Merge remote-tracking branch 'develop' into hk/median_filter

055e89e

pre computed kernelSquare and padlength params

f94d703

Added heightlimit and widthLimt variables and remove additional compu…

ffc2ff2

…tations

refactored median_filter_compute function

fcacc25

updated the sorting logic

8f26819

Merge branch 'develop' into hk/median_filter

731ea0f

Revert the median filter case addition in the new func group list

1749187

Merge branch 'hk/median_filter' of https://github.com/HazarathKumarM/rpp

749644d

into hk/median_filter

Merge pull request #404 from HazarathKumarM/hk/median_filter

d98da21

RPP Median Filter on HOST and HIP

r-abishek requested a review from Copilot June 4, 2025 04:34

r-abishek added enhancement New feature or request ci:precheckin labels Jun 4, 2025

This comment was marked as outdated.

Sign in to view

kiritigowda self-assigned this Jun 4, 2025

kiritigowda requested a review from rrawther June 4, 2025 04:40

Merge branch 'develop' into ar/opt_median_filter

20323f2

kiritigowda requested a review from Copilot June 6, 2025 22:27

Copilot AI reviewed Jun 6, 2025

View reviewed changes

src/modules/tensor/cpu/kernel/median_filter.cpp Outdated Show resolved Hide resolved

r-abishek and others added 3 commits June 9, 2025 14:03

typo fix

cd85f42

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Merge branch 'develop' into ar/opt_median_filter

197d4a8

Merge branch 'develop' into ar/opt_median_filter

b8bf63c

Merge branch 'develop' into ar/opt_median_filter

f3fb249

rrawther reviewed Jun 13, 2025

View reviewed changes

rrawther requested changes Jun 13, 2025

View reviewed changes

rrawther requested a review from AryanSalmanpour June 17, 2025 00:42

HazarathKumarM and others added 5 commits June 17, 2025 15:43

Merge remote-tracking branch 'tot/develop' into ar/opt_median_filter

e528078

Address review comments

c1aa418

Merge branch 'develop' into ar/opt_median_filter

b0fbe18

Merge branch 'ar/opt_median_filter' into hk/median_filter_branch

5050d5f

Merge pull request #450 from HazarathKumarM/hk/median_filter_branch

0a51a4b

Median Filter : Review comments and Conflicts resolution

Merge branch 'develop' into ar/opt_median_filter

deb010a

kiritigowda requested a review from rrawther June 20, 2025 18:50

Merge branch 'develop' into ar/opt_median_filter

204c055

AryanSalmanpour reviewed Jun 25, 2025

View reviewed changes

kiritigowda added 2 commits June 26, 2025 22:46

Merge branch 'develop' into ar/opt_median_filter

e6205d8

Merge branch 'develop' into ar/opt_median_filter

42dbe61

rrawther requested changes Jul 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Median Filter on HOST and HIP #559

Median Filter on HOST and HIP #559

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Median Filter on HOST and HIP #559

Are you sure you want to change the base?

Median Filter on HOST and HIP #559

Conversation

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!