Preserve caller-provided non-contiguous strides in make_tensor_ptr by manuelcandales · Pull Request #19006 · pytorch/executorch

manuelcandales · 2026-04-20T19:59:14Z

AOTI generates reinterpret_tensor views with non-contiguous strides (e.g. chunk/split for RoPE rotation). Previously make_tensor_ptr asserted that all strides must match contiguous layout, which crashed on these views. Preserve the caller-provided strides when they represent a legitimate non-contiguous view.

Also apply the same fix to the duplicate logic in wasm_bindings.cpp, and update tests that expected the old assertion to fire.

AOTI generates reinterpret_tensor views with non-contiguous strides (e.g. chunk/split for RoPE rotation). Previously make_tensor_ptr asserted that all strides must match contiguous layout, which crashed on these views. Preserve the caller-provided strides when they represent a legitimate non-contiguous view. Also apply the same fix to the duplicate logic in wasm_bindings.cpp, and update tests that expected the old assertion to fire. Co-authored-by: Claude <noreply@anthropic.com>

pytorch-bot · 2026-04-20T19:59:18Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19006

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 2 Cancelled Jobs, 5 Unrelated Failures

As of commit 76d354f with merge base 401ea8e ():

NEW FAILURES - The following jobs have failed:

Cadence Build & Test / cpu-test / test-aot / test-aot (gh)
backends/cadence/aot/tests/test_replace_ops_passes.py::TestReplaceOpsPasses::test_replace_transposed_conv_with_linear_1
pull / test-qnn-buck-build-linux / linux-job (gh)
RuntimeError: Command docker exec -t f01b08cd7f81e8e36a0687af67cbe8d2e36d3b2a9cb050066d2a159ebf903d2a /exec failed with exit code 3
pull / unittest-editable / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_mv2_model

CANCELLED JOBS - The following jobs were cancelled. Please retry:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-openvino-linux / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
pull / test-qnn-models-linux / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
pull / test-sqnr-static-llm-qnn-linux (smollm2_135m) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
pull / unittest / windows / windows-job (gh) (matched win rule in flaky-rules.json)
##[error]The operation was canceled.

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-04-20T19:59:57Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Allow make_tensor_ptr to accept legitimate non-contiguous stride layouts (e.g., reinterpret_tensor views emitted by AOTI) instead of asserting on non-contiguous strides, and mirror the behavior in the WASM bindings.

Changes:

Preserve caller-provided non-contiguous strides; only canonicalize to computed strides when the layout is already contiguous (modulo size-1 dims).
Apply the same stride-preservation logic in extension/wasm/wasm_bindings.cpp.
Update tensor pointer tests to validate preserved non-contiguous strides instead of expecting assertion deaths.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
extension/wasm/wasm_bindings.cpp	Stops forcing computed contiguous strides; preserves caller-provided non-contiguous strides.
extension/tensor/tensor_ptr.cpp	Updates `make_tensor_ptr` to keep caller-provided non-contiguous strides.
extension/tensor/test/tensor_ptr_test.cpp	Adjusts expectations to accept non-contiguous strides and removes prior death tests.
extension/tensor/test/tensor_ptr_maker_test.cpp	Updates `from_blob` tests to accept non-contiguous strides.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-20T20:07:27Z

  if (!strides.empty()) {
+    bool is_contiguous = true;
    for (size_t i = 0; i < dim; i++) {
-      ET_CHECK_MSG(
-          strides[i] == computed_strides[i] || sizes[i] == 1,
-          "invalid strides for dim %zu: %" ET_PRI_SIZES_AND_STRIDES
-          "!= %" ET_PRI_SIZES_AND_STRIDES
-          " while its size is %" ET_PRI_SIZES_AND_STRIDES " != 1",
-          i,
-          strides[i],
-          computed_strides[i],
-          sizes[i]);
+      if (strides[i] != computed_strides[i] && sizes[i] != 1) {
+        is_contiguous = false;
+        break;
+      }
    }
+    if (is_contiguous) {
+      strides = std::move(computed_strides);
+    }
+    // else: keep the caller-provided non-contiguous strides (e.g. from
+    // reinterpret_tensor views like chunk/split).
+  } else {
+    strides = std::move(computed_strides);
  }


With this change, make_tensor_ptr can accept dim_order and strides that disagree (i.e., strides not matching dim_order_to_stride even up to size-1 dims). That makes the precedence/contract between dim_order and strides ambiguous and can break any code that assumes dim_order is consistent with the tensor’s stride order. Consider defining an explicit rule, e.g. (mandatory): when caller provides non-empty strides, either (a) derive/normalize dim_order from the provided strides (similar to the existing dim_order.empty() path), or (b) validate that the provided dim_order matches the ordering implied by the provided strides (ignoring size-1 dims) and reject otherwise. Update the tests accordingly (the new TensorMismatchStridesAndDimOrder behavior currently relies on this ambiguity).

Copilot · 2026-04-20T20:07:27Z

  if (!strides.empty()) {
+    bool is_contiguous = true;
    for (size_t i = 0; i < dim; i++) {
-      ET_CHECK_MSG(
-          strides[i] == computed_strides[i] || sizes[i] == 1,
-          "invalid strides for dim %zu: %" ET_PRI_SIZES_AND_STRIDES
-          "!= %" ET_PRI_SIZES_AND_STRIDES
-          " while its size is %" ET_PRI_SIZES_AND_STRIDES " != 1",
-          i,
-          strides[i],
-          computed_strides[i],
-          sizes[i]);
+      if (strides[i] != computed_strides[i] && sizes[i] != 1) {
+        is_contiguous = false;
+        break;
+      }
    }
+    if (is_contiguous) {
+      strides = std::move(computed_strides);
+    }


This removes the only enforcement that strides are a safe/expected layout. As a result, callers can now pass arbitrary strides (including negative/zero strides, or extremely large strides) which can lead to out-of-bounds accesses or integer overflow when computing element offsets/required storage size downstream. At minimum, add validation for obviously unsafe stride patterns (e.g., non-negative strides for non-broadcast dims if negative strides aren’t supported, and overflow-safe computation of the maximum addressable offset for the view). If the API can’t validate against the actual backing allocation size, consider adding an overload that takes the buffer length in bytes (or an explicit storage descriptor) so from_blob/make_tensor_ptr can enforce that the view is within bounds.

Copilot · 2026-04-20T20:07:28Z

  if (!strides.empty()) {
+    bool is_contiguous = true;
    for (size_t i = 0; i < sizes.size(); i++) {
-      THROW_IF_FALSE(
-          strides[i] == computed_strides[i] || sizes[i] == 1,
-          "invalid strides for dim %zu: %" ET_PRI_SIZES_AND_STRIDES
-          "!= %" ET_PRI_SIZES_AND_STRIDES
-          " while its size is %" ET_PRI_SIZES_AND_STRIDES " != 1",
-          i,
-          strides[i],
-          computed_strides[i],
-          sizes[i]);
+      if (strides[i] != computed_strides[i] && sizes[i] != 1) {
+        is_contiguous = false;
+        break;
+      }
+    }
+    if (is_contiguous) {
+      strides = std::move(computed_strides);
    }
+  } else {
+    strides = std::move(computed_strides);
  }


This duplicates the updated stride canonicalization logic from tensor_ptr.cpp, which risks behavior drift between WASM and non-WASM paths over time. Consider extracting a shared helper (e.g., a small utility that takes sizes, computed_strides, and optional strides and returns the normalized/preserved strides plus any validation) and using it in both places.

Copilot · 2026-04-20T20:07:28Z

+  // dim_order={1,0} implies strides={1,3}, but caller provides {1,4}.
+  // Non-contiguous strides are preserved for reinterpret_tensor views.


The test comment frames this as a reinterpret_tensor-style view, but it constructs a fresh tensor from raw {sizes, dim_order, strides} where dim_order and strides disagree. This is a different scenario than preserving strides from a view and can confuse readers about the intended contract. Suggest clarifying the comment to describe the actual case being tested (mismatched dim_order/strides), and/or adjusting the test to use a reinterpret_tensor-like construction path if that’s the intended coverage.

Suggested change

// dim_order={1,0} implies strides={1,3}, but caller provides {1,4}.

// Non-contiguous strides are preserved for reinterpret_tensor views.

// This directly constructs a tensor from explicit sizes, dim_order, and

// strides. dim_order={1,0} would normally imply strides={1,3}, but the

// caller provides mismatched strides {1,4}; verify make_tensor_ptr preserves

// the provided strides rather than inferring them from dim_order.

manuelcandales requested review from JacobSzwejbka and mergennachin April 20, 2026 19:59

manuelcandales requested a review from shoumikhin as a code owner April 20, 2026 19:59

Copilot AI review requested due to automatic review settings April 20, 2026 19:59

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 20, 2026

Copilot AI reviewed Apr 20, 2026

View reviewed changes

Copilot started reviewing on behalf of manuelcandales April 20, 2026 20:24 View session

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserve caller-provided non-contiguous strides in make_tensor_ptr#19006

Preserve caller-provided non-contiguous strides in make_tensor_ptr#19006
manuelcandales wants to merge 1 commit intomainfrom
manuel/make-tensor-ptr-strides

manuelcandales commented Apr 20, 2026

Uh oh!

pytorch-bot Bot commented Apr 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 20, 2026

Uh oh!

Copilot AI Apr 20, 2026

Uh oh!

Copilot AI Apr 20, 2026

Uh oh!

Copilot AI Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		// dim_order={1,0} implies strides={1,3}, but caller provides {1,4}.
		// Non-contiguous strides are preserved for reinterpret_tensor views.

-  // dim_order={1,0} implies strides={1,3}, but caller provides {1,4}.
-  // Non-contiguous strides are preserved for reinterpret_tensor views.
+  // This directly constructs a tensor from explicit sizes, dim_order, and
+  // strides. dim_order={1,0} would normally imply strides={1,3}, but the
+  // caller provides mismatched strides {1,4}; verify make_tensor_ptr preserves
+  // the provided strides rather than inferring them from dim_order.

Conversation

manuelcandales commented Apr 20, 2026

Uh oh!

pytorch-bot Bot commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19006

❌ 3 New Failures, 2 Cancelled Jobs, 5 Unrelated Failures

Uh oh!

github-actions Bot commented Apr 20, 2026

This PR needs a release notes: label

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot Bot commented Apr 20, 2026 •

edited

Loading

This PR needs a `release notes:` label