
ttir.convolution decomposition may insert unnecessary permute #1751

Closed
LPanosTT opened this issue Jan 10, 2025 · 3 comments
LPanosTT (Contributor) commented Jan 10, 2025

When decomposing ttir.convolution to ttir.conv2d, if the weights and/or input are already in channel-last format, we do not need to permute them. However, ttir.permute ops are still added. For example:

auto weight = rewriter.create<ttir::PermuteOp>(
        op.getLoc(), weightDPSOutput.getType(), adaptor.getWeight(),
        weightDPSOutput, kernelPermutation);

This permute is a no-op when kernelPermutation = {0, 1, 2, 3}. The redundant op would be solvable with a simple canonicalization pattern, but we probably shouldn't knowingly introduce ops that do nothing just because they'll be erased later.

We could add a bool checkIsNopPermutation(SmallVector<int> permutation) helper and replace the assignment of weight with something like:

...
auto kernelPermutation =
    generateConvKernelPermutation(op, conv2dKernelLayout);
mlir::Value weight = adaptor.getWeight();
if (!checkIsNopPermutation(kernelPermutation)) {
    auto weightType =
        mlir::cast<RankedTensorType>(adaptor.getWeight().getType());
    auto weightOutputShape = ::ttmlir::utils::applyPermutation(
        weightType.getShape(), kernelPermutation);
    auto weightDPSOutput = rewriter.create<tensor::EmptyOp>(
        op.getLoc(), weightOutputShape, weightType.getElementType());
    // Only insert the permute when it actually reorders dimensions.
    weight = rewriter.create<ttir::PermuteOp>(
        op.getLoc(), weightDPSOutput.getType(), weight,
        weightDPSOutput, kernelPermutation);
}
...

We could do the same with the input and output.
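
For reference, a minimal sketch of what the helper could look like. The ArrayRef<int64_t> signature and the exact name are assumptions for illustration, not a decided interface:

// Hypothetical helper, sketched for illustration: returns true when the
// permutation maps every dimension to itself, i.e. the permute is a no-op.
static bool checkIsNopPermutation(llvm::ArrayRef<int64_t> permutation) {
  for (int64_t i = 0; i < static_cast<int64_t>(permutation.size()); ++i) {
    if (permutation[i] != i) {
      return false;
    }
  }
  return true;
}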

LPanosTT assigned LPanosTT and azecevicTT and unassigned LPanosTT Jan 10, 2025
LPanosTT (Contributor, Author) commented Jan 10, 2025

Assigned @azecevicTT because you were the last person who edited this part of the code. But if you want me to take it on, I'll do it.

azecevicTT (Contributor) commented Jan 13, 2025

Yeah, I added that in #1670. I didn't want to merge it too early because of the holidays, so I will leave it open for one or two more days for review.
By the way, this is a very common theme, especially with data movement ops during conversion. All of them have well-defined identities, so identities should be folded by the respective op, not in the conversion of some other op. First, it's very error-prone to do this directly during conversion. Second, if the op changes its interface (like a broadcast op recently), the condition for folding can change and the folding can become invalid; if it's done in one place, it's much easier to spot an error (and any such folding should be covered by a test case).
The same goes for direct erasure of ops during conversions. Ideally, every op should be responsible for itself and expose relevant information to the global context (like NoMemoryEffect/Pure). Otherwise, we are unknowingly creating strong interdependencies between the ops themselves.
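
For illustration, here is a rough sketch of the kind of op-local identity fold being described, written as an MLIR fold hook on ttir::PermuteOp. The getPermutation()/getInput() accessors and the FoldAdaptor signature are assumptions about how the op might be defined, not the actual tt-mlir code:

// Hypothetical fold hook: an identity permutation forwards its input, so
// redundant permutes fold away wherever they are created, not only inside
// one particular conversion pattern.
mlir::OpFoldResult PermuteOp::fold(FoldAdaptor adaptor) {
  llvm::ArrayRef<int64_t> permutation = getPermutation();
  for (int64_t i = 0; i < static_cast<int64_t>(permutation.size()); ++i) {
    if (permutation[i] != i) {
      return nullptr; // Not an identity permutation; nothing to fold.
    }
  }
  return getInput(); // Identity permutation: replace the op with its input.
}

With a fold like this in place (plus a test case covering it), conversion patterns can create the permute unconditionally and rely on the op itself to remove the identity case.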

LPanosTT (Contributor, Author) commented
Makes sense to me. Thanks!
