[ONNX] Update saved exported program in debugging report if the exporting passes run_decomposition() #148617

titaiwangms · 2025-03-05T22:43:53Z

Before this PR, if the export gets past run_decomposition(), the report still shows the exported_program from before decomposition. This makes it harder for users to inspect the exported program that is actually translated into the ONNX graph.
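The intended behavior can be sketched in plain Python. This is only an illustration under stated assumptions, not the exporter's code: a "program" is modeled as a list of operator names, and `export_with_report`, `decompose`, and `translate` are hypothetical stand-ins. The point it shows is that the program saved for the report is refreshed once decomposition succeeds, so a later translation failure is reported against the decomposed program.

```python
def export_with_report(program, decompose, translate):
    """Return (program_saved_for_report, error_or_None).

    Hypothetical helper: `program` is a list of operator names standing in
    for an exported program.
    """
    saved = program  # program captured by the initial export step
    try:
        program = decompose(program)
        # Decomposition passed: refresh the saved program so the report
        # shows what the translation step actually receives.
        saved = program
        translate(program)
    except Exception as exc:
        return saved, exc
    return saved, None


def decompose(program):
    # Assumed rewrite, mirroring the report: aten.to.dtype -> aten._to_copy.default
    return [
        "aten._to_copy.default" if op == "aten.to.dtype" else op
        for op in program
    ]


def translate(program):
    # Stand-in for the failing translation step in the example report.
    raise RuntimeError("No ONNX function found for aten.slice.Tensor")


original = ["aten.to.dtype", "aten.slice.Tensor", "aten.slice.Tensor"]
saved, error = export_with_report(original, decompose, translate)
print(saved)
print(type(error).__name__)
```

If decomposition itself fails, `saved` is never refreshed, so the report would still show the program from before decomposition.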

The following example is what we see before this PR:

# PyTorch ONNX Conversion Report

✅ Obtain model graph with torch.export.export(..., strict=False)
⚪ Obtain model graph with torch.export.export(..., strict=True)
⚪ Obtain model graph with torch.jit.trace
✅ Decompose operators for ONNX compatibility
❌ Translate the graph into ONNX
⚪ Run onnx.checker on the ONNX model
⚪ Execute the model with ONNX Runtime
⚪ Validate model output accuracy


## Error messages

```pytb


Traceback (most recent call last):

  File "/home/titaiwang/pytorch/torch/onnx/_internal/exporter/_core.py", line 707, in _translate_fx_graph
    _handle_call_function_node_with_lowering(

  File "/home/titaiwang/pytorch/torch/onnx/_internal/exporter/_core.py", line 486, in _handle_call_function_node_with_lowering
    raise _errors.DispatchError(

torch.onnx._internal.exporter._errors.DispatchError: No ONNX function found for <OpOverload(op='aten.slice', overload='Tensor')>. Failure message: No decompositions registered for the complex-valued input


The above exception was the direct cause of the following exception:


Traceback (most recent call last):

  File "/home/titaiwang/pytorch/torch/onnx/_internal/exporter/_core.py", line 1371, in export
    onnx_program = _exported_program_to_onnx_program(
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

  File "/home/titaiwang/pytorch/torch/onnx/_internal/exporter/_core.py", line 1007, in _exported_program_to_onnx_program
    values = _translate_fx_graph(
             ^^^^^^^^^^^^^^^^^^^^

  File "/home/titaiwang/pytorch/torch/onnx/_internal/exporter/_core.py", line 733, in _translate_fx_graph
    raise _errors.ConversionError(

torch.onnx._internal.exporter._errors.ConversionError: Error when translating node %slice_1 : [num_users=1] = call_function[target=torch.ops.aten.slice.Tensor](args = (%_to_copy, 0, 0, 9223372036854775807), kwargs = {}). See the stack trace for more information.
```

## Exported program

ExportedProgram:
    class GraphModule(torch.nn.Module):
        def forward(self, x: "f32[3, 4]"):
             # File: /home/titaiwang/pytorch/test_slice_complex.py:6 in forward, code: x_complex = x.to(torch.complex64)
            to: "c64[3, 4]" = torch.ops.aten.to.dtype(x, torch.complex64);  x = None
            
             # File: /home/titaiwang/pytorch/test_slice_complex.py:8 in forward, code: return x_complex[:, :2]
            slice_1: "c64[3, 4]" = torch.ops.aten.slice.Tensor(to, 0, 0, 9223372036854775807);  to = None
            slice_2: "c64[3, 2]" = torch.ops.aten.slice.Tensor(slice_1, 1, 0, 2);  slice_1 = None
            return (slice_2,)
            
Graph signature: ExportGraphSignature(input_specs=[InputSpec(kind=<InputKind.USER_INPUT: 1>, arg=TensorArgument(name='x'), target=None, persistent=None)], output_specs=[OutputSpec(kind=<OutputKind.USER_OUTPUT: 1>, arg=TensorArgument(name='slice_2'), target=None)])
Range constraints: {}

## Analysis

PyTorch ONNX Conversion Analysis

Model Information

The model has 0 parameters and 0 buffers (non-trainable parameters).
Number of parameters per dtype:

defaultdict(<class 'int'>, {})

Number of buffers per dtype:

defaultdict(<class 'int'>, {})

Inputs:

x: TensorMetadata(shape=torch.Size([3, 4]), dtype=torch.float32, requires_grad=False, stride=(4, 1), memory_format=torch.contiguous_format, is_quantized=False, qparams={})

Outputs:

slice_2: TensorMetadata(shape=torch.Size([3, 2]), dtype=torch.complex64, requires_grad=False, stride=(4, 1), memory_format=None, is_quantized=False, qparams={})

The FX graph has 5 nodes in total. Number of FX nodes per op:

placeholder: 1
call_function: 3
output: 1

Of the call_function nodes, the counts of operators used are:

aten.slice.Tensor: 2
aten.to.dtype: 1

ONNX Conversion Information

The model contains operators the dispatcher could not find registered ONNX decompositions for. This may be due to missing implementations, decompositions not registered correctly, or a bug in the dispatcher.

Errors grouped by operator:

aten.to.dtype: No decompositions registered for the real-valued input. Example node: %to : [num_users=1] = call_function[target=torch.ops.aten.to.dtype](args = (%x, torch.complex64), kwargs = {}). All nodes: [to]
aten.slice.Tensor: No decompositions registered for the complex-valued input. Example node: %slice_1 : [num_users=1] = call_function[target=torch.ops.aten.slice.Tensor](args = (%to, 0, 0, 9223372036854775807), kwargs = {}). All nodes: [slice_1, slice_2]

Decomposition comparison

Ops exist only in the ExportedProgram before decomposition: ['aten.to.dtype']

Ops exist only in the ExportedProgram after decomposition: ['aten._to_copy.default']
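The comparison above amounts to a set difference between the operator targets found before and after decomposition. A minimal sketch in plain Python follows. The helper name `compare_ops` is hypothetical. The operator list before decomposition is taken from the report; the list after decomposition is an assumption inferred from the comparison and the error message. This is not the exporter's actual implementation.

```python
def compare_ops(before, after):
    """Return (ops only in `before`, ops only in `after`), each sorted."""
    before_set, after_set = set(before), set(after)
    return sorted(before_set - after_set), sorted(after_set - before_set)


# Operator targets of the call_function nodes before decomposition (from the report).
ops_before = ["aten.to.dtype", "aten.slice.Tensor", "aten.slice.Tensor"]
# Assumed operator targets after decomposition.
ops_after = ["aten._to_copy.default", "aten.slice.Tensor", "aten.slice.Tensor"]

only_before, only_after = compare_ops(ops_before, ops_after)
print("Ops exist only in the ExportedProgram before decomposition:", only_before)
print("Ops exist only in the ExportedProgram after decomposition:", only_after)
```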

pytorch-bot · 2025-03-05T22:43:57Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148617

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit da2d4c2 with merge base 38479e4:
💚 Looks good so far! There are no failures yet. 💚

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

⏳ trunk / libtorch-linux-focal-cuda12.4-py3.10-gcc9-debug / build (gh) (#148495)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

justinchuby · 2025-03-05T23:17:31Z

Initially, I actually wanted to retain the original ep because I want to see what happens with decomposition. Maybe we can retain both?

titaiwangms · 2025-03-05T23:25:56Z

> Initially, I actually wanted to retain the original ep because I want to see what happens with decomposition. Maybe we can retain both?

Does the "Decomposition comparison" section cover your case? I think that once decomposition has succeeded and translation is what fails, users are most interested in the most recent exported program. Showing both eps seems too long, though.

titaiwangms · 2025-03-06T03:13:20Z

@pytorchbot merge

pytorchmergebot · 2025-03-06T03:15:02Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

update ep in report

da2d4c2

titaiwangms requested review from justinchuby, shubhambhokare1 and wschin as code owners March 5, 2025 22:43

pytorch-bot bot added the release notes: onnx label Mar 5, 2025

titaiwangms added the topic: improvements label Mar 5, 2025

pytorchbot added the open source label Mar 5, 2025

justinchuby approved these changes Mar 6, 2025

pytorch-bot bot added the ciflow/trunk label Mar 6, 2025

pytorchmergebot added the merging label Mar 6, 2025

pytorchmergebot added the Merged label Mar 6, 2025

pytorchmergebot closed this in e7bc1d1 Mar 6, 2025

pytorchmergebot removed the merging label Mar 6, 2025
