
Conversation

@mdazz (Contributor) commented Sep 5, 2025:

This commit teaches AtenToDtypeOp::fold to constant-fold dtype conversions when the operand is a splat DenseElementsAttr.

Folding is done according to torch's rounding behavior, i.e.

  • Bool: 0 and -0.0 → false; nonzero/NaN/±Inf → true.
  • Float → Int: round toward zero.
  • Int → Float: sign-aware, rmNearestTiesToEven.
  • Float ↔ Float: use builtin mlir::FloatType::getFloatSemantics().
  • Int ↔ Int: use zextOrTrunc / sextOrTrunc based on source signedness.

Folding is only performed when non_blocking == false, copy == false, and memory_format is None.
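
For reference, the rounding behavior above can be checked directly against PyTorch; a minimal illustrative sketch (not part of the patch) of the cases the folder is meant to mirror:

```python
import torch

# Bool: zero and -0.0 convert to False; any nonzero value, NaN, and +/-Inf to True.
assert not torch.tensor(-0.0).to(torch.bool).item()
assert torch.tensor(float("nan")).to(torch.bool).item()

# Float -> Int: round toward zero (truncation).
assert torch.tensor(2.9).to(torch.int32).item() == 2
assert torch.tensor(-2.9).to(torch.int32).item() == -2

# Int -> Float: round to nearest, ties to even; 2**24 + 1 is not representable in f32.
assert torch.tensor(2**24 + 1, dtype=torch.int64).to(torch.float32).item() == 2.0**24
```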

@mdazz (Author) commented Sep 5, 2025:

Not sure who can review this; maybe you would know, @vivekkhandelwal1 @zjgarvey?

// int32 splat → float32
%int_splat = torch.vtensor.literal(dense<42> : tensor<2x3xsi32>) : !torch.vtensor<[2,3],si32>
%int6 = torch.constant.int 6 // torch.float32
// CHECK: %[[R1:.*]] = torch.vtensor.literal({{.*}} : tensor<2x3xf32>) : !torch.vtensor<[2,3],f32>
Reviewer (Member) commented:
Can you put in the actual value, which I think will be 42.0 here?


// int64 splat (max int32) → int32 (trunc)
%int64_splat = torch.vtensor.literal(dense<2147483647> : tensor<10xsi64>) : !torch.vtensor<[10],si64>
Reviewer (Member) commented:
Can this value be int32max + 1, to ensure that truncation does happen in the IR being locked down?
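
For context, a quick PyTorch check (illustrative only; the exact literal in the locked-down IR is what should be confirmed) of what int32max + 1 becomes when truncated to int32:

```python
import torch

# int32max + 1 = 2**31 does not fit in int32; the conversion truncates the bits,
# so the splat in the folded IR should wrap around to int32min.
src = torch.tensor(2**31, dtype=torch.int64)
assert src.to(torch.int32).item() == -(2**31)   # -2147483648
```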

// float32 splat → float64
%float32_splat = torch.vtensor.literal(dense<2.71828> : tensor<5x5xf32>) : !torch.vtensor<[5,5],f32>
%int7 = torch.constant.int 7 // torch.float64
// CHECK: %[[R4:.*]] = torch.vtensor.literal({{.*}} : tensor<5x5xf64>) : !torch.vtensor<[5,5],f64>
Reviewer (Member) commented:
Let's capture the actual value here too, and in other such places.
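
One way to obtain the exact f64 literal without transcribing digits by hand; an illustrative sketch, not part of the PR:

```python
import torch

# The f64 value to lock down is the f32-rounded 2.71828 widened (losslessly) to f64;
# printing it avoids guessing the digits manually.
v = torch.tensor(2.71828, dtype=torch.float32).to(torch.float64)
print(v.item())  # the literal to capture in the CHECK line
```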

@mdazz (Author) replied:
Done for all testcases.

// int32 splat → float32
%int_splat = torch.vtensor.literal(dense<42> : tensor<2x3xsi32>) : !torch.vtensor<[2,3],si32>
%int6 = torch.constant.int 6 // torch.float32
// CHECK: %[[R1:.*]] = torch.vtensor.literal({{.*}} : tensor<2x3xf32>) : !torch.vtensor<[2,3],f32>
Reviewer (Member) commented:
Since we are not locking down the values returned in the output IR, I think we should also add CHECK-NOT: torch.aten.to.dtype to ensure that the op is actually being folded.

@@ -1762,6 +1762,78 @@ func.func @torch.aten.to.dtype$no_fold$unk_dtype(%arg0: !torch.tensor) -> !torch
return %0 : !torch.tensor
}

// CHECK-LABEL: @torch.aten.to.dtype$fold_splat(
Reviewer (Member) commented:
Can you add some e2e tests to ensure that torch's rounding logic is accurately captured in this implementation?
Also, please fix the CI failures; we cannot merge until the CI pipelines are green.

@mdazz (Author) replied:
I added e2e tests in test_conversions.py. Note that one test in the fx_importer_stablehlo group fails due to a bug in the StableHLO conversion: StableHLO sign-extends all integer constants regardless of whether they are signed, and now that we fold constants it encounters an unsigned int that it wrongly sign-extends. I have a PR that fixes it, so I guess we'll need to wait for that to go in first.
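
To illustrate the mismatch described above, with a hypothetical unsigned 8-bit splat value (not the actual failing test):

```python
# An unsigned 8-bit splat with the high bit set, e.g. bit pattern 0b11001000 (= 200).
raw = 0b11001000

zero_extended = raw                                    # 200  (correct for ui8)
sign_extended = raw - 0x100 if raw & 0x80 else raw     # -56  (the erroneous result)
assert (zero_extended, sign_extended) == (200, -56)
```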

@mdazz (Author) commented:
@sahas3 #4313 was merged, and the CI now passes for this PR :)

@mdazz force-pushed the mdazz/add-todtype-folder branch 2 times, most recently from 9b8168c to 42edabd on September 15, 2025 at 10:06
@mdazz force-pushed the mdazz/add-todtype-folder branch from 42edabd to 3abbd48 on September 16, 2025 at 08:45