To help with debugging, it would be great to use compilation groups to indicate the work each TRT engine is doing. (This is a good tutorial on the topic: https://jott.live/markdown/Writing%20a%20Toy%20Backend%20Compiler%20for%20PyTorch)
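
A rough illustration of the idea, as a pure-Python toy model rather than actual TorchScript or TensorRT code: consecutive ops that the backend supports are folded into one labelled group node, so a graph dump shows which ops each engine covers. The `trt::CompilationGroup` label, the `SUPPORTED` set, and the helpers `group_ops` and `dump` are all hypothetical names chosen for this sketch.

```python
from dataclasses import dataclass, field

# Hypothetical set of ops the backend can convert (not the real converter list).
SUPPORTED = {"aten::mul", "aten::add", "aten::relu"}


@dataclass
class Group:
    kind: str                      # label shown in the graph dump
    ops: list = field(default_factory=list)


def group_ops(ops, supported=SUPPORTED):
    """Fold each run of consecutive supported ops into one labelled group."""
    out = []
    for op in ops:
        if op in supported:
            if out and isinstance(out[-1], Group):
                out[-1].ops.append(op)      # extend the current group
            else:
                out.append(Group("trt::CompilationGroup", [op]))
        else:
            out.append(op)                  # unsupported op stays as-is
    return out


def dump(nodes):
    """Render the node list, listing the ops inside each group."""
    lines = []
    for n in nodes:
        if isinstance(n, Group):
            lines.append(f"{n.kind}  # contains: {', '.join(n.ops)}")
        else:
            lines.append(n)
    return "\n".join(lines)


graph = ["aten::mul", "aten::add", "aten::size", "aten::relu"]
grouped = group_ops(graph)
print(dump(grouped))
```

With a dump like this, someone debugging can see at a glance that the first engine covers `aten::mul` and `aten::add`, instead of seeing an opaque engine node.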