Changing logic to deal with graphs with derived quantization spec #16357
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16357

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 unrelated failure) As of commit bafed32 with merge base 6785e1f, one job is marked as UNSTABLE, possibly due to flakiness on trunk.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
…torch#16357)

Summary:
We want to add a test for `default_addmm_A8W8` to fully finish testing `CadenceDefaultQuantizer`. However, there are a couple of changes we need to make to the testing function.

## Change 1: We allow passing `None` in the list of `QuantizationSpec`

This is because the addmm op has 3 inputs: `bias`, `mat1`, `mat2`. The bias uses a `DerivedQuantizationSpec`, which is dynamically constructed with references to the actual graph nodes (`mat1` and `mat2`). We can't construct an identical `DerivedQuantizationSpec` in the test because we'd need to reference the exact same node objects that the quantizer creates internally. Since we can't compare it directly, we use `None` to skip validation for that input. If `mat1` and `mat2` are quantized correctly, the derived bias spec will be correct too (see the sketch after this description).

https://www.internalfb.com/code/fbsource/[2cfdb40fd8b628da2f46366115516408cfb9f50f]/xplat/executorch/backends/cadence/aot/quantizer/patterns.py?lines=91-103

## Change 2: We changed how we iterate through `input_qspec_map`

`input_qspec_map` is a dictionary mapping input nodes to their qspecs. The iteration order depends on insertion order, which follows how the quantizer processes `PartitionAnchors`. Each `QuantizationPattern` implements a `get_anchors()` method that returns a `PartitionAnchors` describing which arguments are inputs, weights, biases, and nodes. This is relevant because for `addmm`, the `PartitionAnchors` lists them as `inputs=[(node, 1)], weights=[(node, 2)], biases=[(node, 0, ...)]`. So the map might iterate in the order `mat1, mat2, bias` (arg indices 1, 2, 0) rather than `bias, mat1, mat2` (arg indices 0, 1, 2). This means that our previous way of iterating wouldn't work. Thus, we now iterate as follows:

```python
for input_node, input_qspec in annotation.input_qspec_map.items():
    # Find the index of this input node in the op's args
    arg_index = None
    for i, arg in enumerate(op_node.args):
        if arg is input_node:
            arg_index = i
            break
    self.assertIsNotNone(
        arg_index,
        f"Input node {input_node} not found in op_node.args",
    )
    # Skip comparison if expected qspec is None (e.g., for DerivedQuantizationSpec)
    if expected_input_qspecs[arg_index] is not None:
        self.assertEqual(
            input_qspec,
            expected_input_qspecs[arg_index],
            f"Input qspec mismatch at arg index {arg_index}",
        )
```

The new code looks up which argument index each `input_node` corresponds to by searching through `op_node.args`, rather than assuming the enumeration index `i` matches the argument position.

Differential Revision: D88955761
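For context on why the bias spec can't be rebuilt in the test, here is a minimal sketch of how a derived bias spec is typically constructed, assuming the `DerivedQuantizationSpec` API from `torch.ao.quantization.quantizer`. The helper name `make_bias_qspec` and the qparams derivation are illustrative, not the actual code in patterns.py:

```python
import torch
from torch.ao.quantization.quantizer import DerivedQuantizationSpec


def make_bias_qspec(addmm_node):
    # addmm's args are (bias, mat1, mat2); the bias spec is derived from
    # the observers attached to the mat1 and mat2 edges.
    mat1, mat2 = addmm_node.args[1], addmm_node.args[2]

    def derive_qparams(obs_or_fqs):
        # Common convention: bias scale = activation scale * weight scale,
        # with zero point fixed at 0 for a symmetric int32 bias.
        act_scale, _ = obs_or_fqs[0].calculate_qparams()
        weight_scale, _ = obs_or_fqs[1].calculate_qparams()
        return act_scale * weight_scale, torch.zeros_like(act_scale, dtype=torch.int32)

    # derived_from holds references to the *live* graph nodes (mat1, mat2),
    # which is exactly why an identical spec cannot be reconstructed in the
    # test: the test never sees those node objects.
    return DerivedQuantizationSpec(
        derived_from=[(mat1, addmm_node), (mat2, addmm_node)],
        derive_qparams_fn=derive_qparams,
        dtype=torch.int32,
        quant_min=-(2**31),
        quant_max=2**31 - 1,
        qscheme=torch.per_tensor_affine,
    )
```

Since equality of a `DerivedQuantizationSpec` hinges on those node references, skipping it with `None` and asserting on `mat1`/`mat2` instead checks everything the test can meaningfully check.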
Pull request overview
This PR adds test coverage for several Cadence quantizers, including `CadenceDefaultQuantizer`, by enhancing the testing framework to handle graphs with derived quantization specifications. The key change is allowing `None` in the expected input quantization specs to skip validation of dynamically constructed `DerivedQuantizationSpec` objects, and updating the iteration logic to handle non-sequential argument ordering in `PartitionAnchors`.
- Added test coverage for 5 previously untested quantizer classes: `CadenceDefaultQuantizer`, `CadenceWakeWordQuantizer`, `CadenceWith16BitConvActivationsQuantizer`, `CadenceWithLayerNormQuantizer`, and `CadenceWithSoftmaxQuantizer`
- Enhanced the testing framework to support `DerivedQuantizationSpec` by allowing `None` values in expected input specs (see the sketch after this list)
- Updated iteration logic to map input nodes to argument positions dynamically, handling cases where the quantizer's internal ordering differs from argument position order
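To make the `None` convention concrete, here is a minimal sketch of what the expected-spec list for an `addmm` test could look like. The observer choice and quantization ranges are assumptions for illustration, not the actual specs the Cadence quantizer uses:

```python
import torch
from torch.ao.quantization.observer import MinMaxObserver
from torch.ao.quantization.quantizer import QuantizationSpec

# Expected qspecs indexed by addmm's argument positions:
#   args[0] = bias -> DerivedQuantizationSpec, skipped via None
#   args[1] = mat1 -> int8 spec
#   args[2] = mat2 -> int8 spec
int8_qspec = QuantizationSpec(
    dtype=torch.int8,
    observer_or_fake_quant_ctr=MinMaxObserver,
    quant_min=-128,
    quant_max=127,
    qscheme=torch.per_tensor_affine,
)
expected_input_qspecs = [None, int8_qspec, int8_qspec]
```

The iteration logic quoted in the description then skips index 0 and compares indices 1 and 2, regardless of the order in which `input_qspec_map` yields the inputs.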
Summary: Add annotation tests for CadenceWith16BitConvActivationsQuantizer covering both conv1d and conv2d operations. Differential Revision: D88895865
Differential Revision: D88896712
Differential Revision: D88898823
Differential Revision: D88898933
Differential Revision: D88899457
…torch#16357)

Summary: Pull Request resolved: pytorch#16357 (duplicate of the PR description above)

Reviewed By: hsharma35

Differential Revision: D88955761
9f4be46 to bafed32 (compare)