Conversation

daisyden
Contributor

Fix #2081

@Copilot Copilot AI review requested due to automatic review settings October 16, 2025 08:43
Contributor

@Copilot Copilot AI left a comment


Pull Request Overview

Ports a CUDA-specific large-index warp softmax test to XPU, adding large tensor coverage for 64-bit indexing correctness.

  • Adds import for largeTensorTest and introduces a new large-shape log_softmax gradient correctness test.
  • Registers the test dynamically on TestNNDeviceType with multiple large memory size decorators.
Comments suppressed due to low confidence (1)

test/xpu/test_nn_xpu.py:1

  • [nitpick] The decorator iterates over dtype (torch.float, torch.half) but the input tensor is always created as torch.float16, causing the float case to test a half input with a float output cast instead of a true float input path. For clearer coverage of dtype-dependent kernel behavior, construct x with the selected dtype (e.g., x = torch.randn(shape, device=device, dtype=dtype, requires_grad=True)) unless there is a documented reason to force half.
# Owner(s): ["module: intel"]
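The dtype-parametrized construction the nitpick suggests can be sketched as follows. This is a minimal illustration with a hypothetical `make_input` helper and small CPU shapes, not the actual test:

```python
import torch

# Hypothetical helper illustrating the reviewer's suggestion: build the
# input with the parametrized dtype instead of hard-coding torch.float16.
def make_input(shape, device="cpu", dtype=torch.float32):
    return torch.randn(shape, device=device, dtype=dtype, requires_grad=True)

# Each dtype in the decorator's list now exercises its own input path.
for dtype in (torch.float, torch.half):
    x = make_input((2, 4), dtype=dtype)
    assert x.dtype == dtype
```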


@largeTensorTest("64GB", "cpu")
def _test_warp_softmax_64bit_indexing(self, device, dtype):
    def run_test(*shape):
        x = torch.randn(shape, device="xpu", dtype=torch.float16, requires_grad=True)

Copilot AI Oct 16, 2025


The test receives a device argument (from the instantiated device-type tests and the second @largeTensorTest decorator specifying 'cpu'), but the tensor is hard-coded with device='xpu'. This prevents the CPU-annotated variant from actually exercising a CPU path and can cause mismatched expectations. Use the passed-in device variable for allocation: x = torch.randn(shape, device=device, dtype=torch.float16, requires_grad=True).

Suggested change
-    x = torch.randn(shape, device="xpu", dtype=torch.float16, requires_grad=True)
+    x = torch.randn(shape, device=device, dtype=torch.float16, requires_grad=True)
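Putting the review points together, the gradient correctness check can be sketched at small scale (assumed shapes and a CPU device so it runs without a 64 GB allocation; this is not the actual 64-bit-indexing test). For a row of size n, the gradient of the summed log_softmax has the closed form 1 - n * softmax(x):

```python
import torch
import torch.nn.functional as F

# Small-scale sketch (assumed shapes/device): verify the log_softmax
# gradient against its analytic closed form.
def check_log_softmax_grad(shape, device="cpu", dtype=torch.float32):
    x = torch.randn(shape, device=device, dtype=dtype, requires_grad=True)
    out = F.log_softmax(x, dim=-1)
    out.backward(torch.ones_like(out))
    n = shape[-1]
    # d/dx_i sum_j log_softmax(x)_j = 1 - n * softmax(x)_i, per row
    with torch.no_grad():
        expected = 1.0 - n * F.softmax(x, dim=-1)
    return torch.allclose(x.grad, expected, atol=1e-5)

assert check_log_softmax_grad((4, 8))
```

The real test additionally needs the large shapes from issue #2081 and the passed-in `device`/`dtype` arguments so each instantiated variant exercises its intended path.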




Development

Successfully merging this pull request may close these issues.

New test cases failures with 20250918 pytorch
