add names to generated IR #969

ChuanqiXu9 · 2024-10-14T06:53:56Z

Currently, when we generate LLVM IR from CIR, the values in the generated code are named only by numbers, which is pretty hard to read. When we lower through the traditional pipeline, the value names are preserved and clearly visible, which is very helpful for debugging.

smeenai · 2024-10-14T17:53:10Z

@lanza and I have discussed this too. This will also be pretty important in the future when we want to start running some original CodeGen tests with ClangIR, since they assume value names will be present.
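To illustrate why such tests depend on names, here is a minimal sketch. Both IR snippets are hand-written for illustration (not captured compiler output), and the check pattern is a hypothetical stand-in for a CodeGen test line rather than one taken from an existing test.

```python
import re

# Illustrative LLVM IR for `int id(int x) { return x; }` with value names,
# roughly what the traditional pipeline prints when names are kept.
NAMED_IR = """\
define i32 @id(i32 %x) {
entry:
  %x.addr = alloca i32, align 4
  store i32 %x, ptr %x.addr, align 4
  %0 = load i32, ptr %x.addr, align 4
  ret i32 %0
}
"""

# The same function with every value numbered instead of named.
NUMBERED_IR = """\
define i32 @id(i32 %0) {
  %2 = alloca i32, align 4
  store i32 %0, ptr %2, align 4
  %3 = load i32, ptr %2, align 4
  ret i32 %3
}
"""

# Hypothetical check pattern that refers to a value by name.
CHECK = re.compile(r"%x\.addr = alloca i32")


def check_matches(ir: str) -> bool:
    """Return True if the name-based pattern is found in the IR text."""
    return CHECK.search(ir) is not None
```

A pattern written against `%x.addr` matches the named output but not the numbered one, so a test written this way would fail unless the names are emitted.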

From what I understand, MLIR doesn't have the concept of value names the same way LLVM does; they're not stored in the bitcode format, for example, and just generated by the AsmPrinter on the fly instead. We'll probably need to extend the LLVMIR dialect to support this; one idea we were thinking of was an optional attribute on all operations to name the result (similar to how cir.alloca stores the variable names) that can be accessed during translation. It'd need to be discussed with upstream MLIR though.
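A rough sketch of the attribute idea, as a toy model only: `Op`, `result_name`, and `translate` are hypothetical names invented here and do not correspond to any existing MLIR or LLVMIR dialect API. The translation step uses the optional name when it is present and falls back to a counter otherwise.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Op:
    """Toy stand-in for a single-result operation (not a real MLIR class)."""
    opcode: str
    operands: List["Op"] = field(default_factory=list)
    result_name: Optional[str] = None  # hypothetical optional naming attribute


def translate(ops: List[Op]) -> List[str]:
    """Print one line per op, preferring the name attribute over a number."""
    names = {}      # id(op) -> value name chosen for its result
    used = set()    # names already handed out
    counter = 0
    lines = []
    for op in ops:
        if op.result_name is not None:
            # Make a repeated name unique by appending a numeric suffix.
            name, suffix = op.result_name, 0
            while name in used:
                suffix += 1
                name = f"{op.result_name}{suffix}"
        else:
            # No attribute: fall back to the next unused number.
            while str(counter) in used:
                counter += 1
            name = str(counter)
        used.add(name)
        names[id(op)] = name
        args = ", ".join(f"%{names[id(o)]}" for o in op.operands)
        lines.append(f"%{name} = {op.opcode} {args}".rstrip())
    return lines


slot = Op("alloca", result_name="x.addr")
val = Op("load", [slot])                      # unnamed, gets a number
total = Op("add", [val, val], result_name="sum")
print("\n".join(translate([slot, val, total])))
```

Keeping the attribute optional means operations without it behave exactly as they do today, which is the property the proposal above relies on.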

Lancern · 2024-10-16T05:00:08Z

Besides SSA value names, what about basic block names? MLIR does not support block names either.

smeenai · 2024-10-16T17:14:31Z

Good point, I'd forgotten about those. I'm not sure at all about the right approach for those, but hopefully the MLIR folks will have ideas.

bcardosolopes · 2024-10-16T18:21:32Z

One interesting bit is that operations with multiple return values actually get names based on tablegen, so some of the mechanism is already there.

This discussion has happened in MLIR-related forums before. I don't have a digest to give, but it might be worth asking there to get an informed view of the status quo.

smeenai · 2024-10-17T04:57:44Z

I found some relevant reading: https://discourse.llvm.org/t/rfc-better-support-for-dialects-that-want-to-make-ssa-names-load-bearing/674 and https://discourse.llvm.org/t/names-in-ssa-values-for-debuggability/1041. In particular llvm/llvm-project@596da62 actually landed, but I'm not sure what the API looks like in practice.

bcardosolopes · 2024-10-17T21:50:48Z

Haven't looked in depth, but seems like it requires changing the printers to custom ones? This is from 2020, so maybe there's something more streamlined we could use?

smeenai added the "IR difference" label (a difference in ClangIR-generated LLVM IR that could complicate reusing original CodeGen tests) on Oct 29, 2024
