Metal backend: Materialize non-packed tensor views in reinterpret_tensor by manuelcandales · Pull Request #19033 · pytorch/executorch

manuelcandales · 2026-04-21T19:48:09Z

AOTI generates reinterpret_tensor views with non-packed strides (e.g. chunk/split for RoPE rotation) that have holes in memory. ExecuTorch's make_tensor_ptr requires densely packed layouts.

When aoti_torch__reinterpret_tensor encounters non-packed strides, it now allocates a new contiguous Metal buffer and copies the elements into it using strided reads from the source.
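To make "non-packed" concrete, here is a minimal Python sketch of one way to detect holes. It assumes the criterion is that the view spans more storage slots than it has elements. The helper names `storage_extent` and `is_packed` are illustrative and are not taken from the PR's C++ code, which is not shown on this page.

```python
from math import prod


def storage_extent(sizes, strides):
    """Number of storage slots spanned from the first to the last element of a view."""
    if any(size == 0 for size in sizes):
        return 0  # an empty view touches no storage
    return 1 + sum((size - 1) * stride for size, stride in zip(sizes, strides))


def is_packed(sizes, strides):
    """Assumed criterion: a view is packed when it spans exactly numel slots."""
    return storage_extent(sizes, strides) == prod(sizes)


# A contiguous [2, 8] tensor has strides (8, 1): 16 elements over 16 slots.
full_is_packed = is_packed((2, 8), (8, 1))

# Chunking it in half along the last dimension keeps strides (8, 1) but shrinks
# the shape to [2, 4]: 8 elements spread over 12 slots, so the view has holes.
chunk_is_packed = is_packed((2, 4), (8, 1))
```

Under this criterion a transposed but dense layout, such as shape (2, 3) with strides (1, 2), still counts as packed, since it covers its storage range without gaps.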

Authored with Claude.

pytorch-bot · 2026-04-21T19:48:13Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19033


❌ 1 Cancelled Job, 2 Unrelated Failures

As of commit cf71b8e with merge base 1d37abd:

CANCELLED JOB - The following job was cancelled. Please retry:

pull / unittest-buck / macos / macos-job (gh)
##[error]The operation was canceled.

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-04-21T19:48:57Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copilot

Pull request overview

This PR updates the Metal AOTI runtime shim to handle reinterpret_tensor views that have non-packed (holey) strides by materializing them into a newly allocated contiguous Metal buffer, aligning with ExecuTorch tensor construction requirements.

Changes:

- Added is_packed_strides() to detect when a strided view has “holes” (storage extent > numel).
- Added materialize_packed() to allocate a new Metal buffer and copy elements from the strided source view into a packed layout.
- Updated aoti_torch__reinterpret_tensor to materialize non-packed views, compute contiguous strides for the new buffer, and update ownership/refcount bookkeeping accordingly.
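The copy and stride recomputation described in this summary can be sketched in Python as follows. This is only a model of the index arithmetic on a flat list: the real shim works on Metal buffers in C++, and the signatures below (including the `offset` parameter) are assumptions made for illustration, not the PR's actual API.

```python
import itertools


def contiguous_strides(sizes):
    """Row-major strides for a densely packed tensor of the given shape."""
    strides = [0] * len(sizes)
    running = 1
    for i in reversed(range(len(sizes))):
        strides[i] = running
        running *= max(sizes[i], 1)
    return strides


def materialize_packed(storage, sizes, strides, offset=0):
    """Gather a strided view of `storage` into a new packed list (hypothetical helper)."""
    packed = []
    # itertools.product varies the last index fastest, which matches row-major order.
    for index in itertools.product(*(range(size) for size in sizes)):
        src = offset + sum(i * stride for i, stride in zip(index, strides))
        packed.append(storage[src])
    return packed, contiguous_strides(sizes)


# Second half of a contiguous [2, 8] tensor: shape [2, 4], strides (8, 1), offset 4.
source = list(range(16))
packed, new_strides = materialize_packed(source, (2, 4), (8, 1), offset=4)
```

In this model the materialized result owns its own storage, which is why the real change also has to adjust ownership and refcount bookkeeping for the new buffer.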



metascroy

Stamping, but maybe consider adopting SlimTensor to address this

manuelcandales requested a review from metascroy April 21, 2026 19:48

manuelcandales requested a review from shoumikhin as a code owner April 21, 2026 19:48

Copilot AI review requested due to automatic review settings April 21, 2026 19:48

meta-cla Bot added the CLA Signed label Apr 21, 2026

manuelcandales removed the request for review from shoumikhin April 21, 2026 19:48

Copilot started reviewing on behalf of manuelcandales April 21, 2026 19:48

manuelcandales force-pushed the manuel/metal-materialize-non-packed-views branch from 6a23ab6 to 0f6aae2 April 21, 2026 19:51

Copilot AI reviewed Apr 21, 2026


Comment thread backends/apple/metal/runtime/shims/memory.cpp Outdated

Comment thread backends/apple/metal/runtime/shims/memory.cpp

Comment thread backends/apple/metal/runtime/shims/memory.cpp

manuelcandales force-pushed the manuel/metal-materialize-non-packed-views branch from 0f6aae2 to cf71b8e April 21, 2026 20:35

metascroy approved these changes Apr 21, 2026


manuelcandales merged commit 6be4fb5 into main Apr 22, 2026
200 of 203 checks passed

manuelcandales deleted the manuel/metal-materialize-non-packed-views branch April 22, 2026 03:27
