Skip to content

[DO NOT REVIEW]: Support NCHWc activations x NCHWc weights kernel on non-Windows ARM64 platforms#27790

Open
hariharans29 wants to merge 42 commits intomainfrom
hari/nchwc_expt_arm_1
Open

[DO NOT REVIEW]: Support NCHWc activations x NCHWc weights kernel on non-Windows ARM64 platforms#27790
hariharans29 wants to merge 42 commits intomainfrom
hari/nchwc_expt_arm_1

Conversation

@hariharans29
Copy link
Member

@hariharans29 hariharans29 commented Mar 21, 2026

Description

It takes inspiration from the existing NCHW activations x NCHWc weights path and processes 2 output locations in the interior non-padded hot path for 2 filters and 1 output location for tail. It splits up filter count 3 as 2+1 passes and filter count 4 as 2+2 passes. This does much better than the existing C++ implementation that only processes 1 output pixel per loop. For thew padded output pixels, we still fall back to the C++ path.

This is essentially a dupe of #27788 and on comparing perf, the kernel in #27788 does better on the shareable model referenced in that PR by about 5% on a Graviton instance. Hence, we will take that in over investing time in improving this implementation further.

Motivation and Context

Initial untuned NCHWc A x NCHWc B ARM64 kernel

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can commit the suggested changes from lintrunner.

hariharans29 and others added 7 commits March 20, 2026 20:25
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
hariharans29 and others added 14 commits March 20, 2026 20:26
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant