Interpreter to JIT/AOT calls #115375

janvorli · 2025-05-07T18:42:01Z

This change adds support for making calls from the interpreter to JIT/AOT generated code. For each target method, it parses the signature and creates a list of hand written asm routines that transfer the arguments from the interpreter stack to the CPU registers / stack based on the native calling convention, call the target method and then places the return value to the interpreter stack. This list is cached in the MethodDescData so that for repeated calls to the same method, it doesn't need to be re-generated.

For example, let's say that interpreter needs to call a JIT/AOT generated method with the following signature on Windows x64:

long M(int a, int b, long c, double d)

The interpreter stack stores these arguments aligned to 8 byte slots like this:

Slot 0: a
Slot 1: b
Slot 2: c
Slot 3: d

The CallStubGenerator::GenerateCallStub would then generate the following list of routines:

Load_RCX_RDX_R8
Load_XMM3
<target method address>

The interpreter calls the CallJittedMethodRetI8 function, because the return type is long. The arguments passed to this function are the list of routines above, pointer to slot 0 mentioned above, pointer to the return value slot on the interpreter stack and the size of the stack arguments the target method uses (0 in this case, as all arguments are passed in registers).
The CallJittedMethodRetI8 calls the first routine in the list
The Load_RCX_RDX_R8 loads RCX from slot 0, RDX from slot 1, R8 from slot 2 and then jumps to the next routine, which is the Load_XMM3
The Load_XMM3 loads XMM3 from the slot 3 and jumps to the next routine, which is the target method
After the target method returns, the control flow gets back to the CallJittedMethodRetI8. That function stores the RAX that contains the result to the return value slot on the interpreter stack and returns back to the interpreter.

Copilot

Pull Request Overview

This PR implements support for making calls from the interpreter to JIT/AOT generated code by generating and caching call stubs for target methods. The key changes include:

Adding new fields and methods in MethodDesc to store and manage call stub headers.
Implementing a new routine in interpexec.cpp to invoke compiled methods via generated call stubs.
Adding new assembly routines in AsmHelpers.asm and updating CMakeLists.txt to include the call stub generator sources.

Reviewed Changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
src/coreclr/vm/method.hpp	Added call stub header field and new API for call stub management
src/coreclr/vm/method.cpp	Implemented SetCallStubHeader and GetCallStubHeader methods
src/coreclr/vm/interpexec.cpp	Added InvokeCompiledMethod and updated interpreter branch to handle JIT/AOT calls
src/coreclr/vm/callstubgenerator.h	Introduced the definition for call stub generation
src/coreclr/vm/amd64/AsmHelpers.asm	Added multiple assembly routines to move arguments and perform calls
src/coreclr/vm/CMakeLists.txt	Updated build configuration to include call stub generator sources

Copilot · 2025-05-07T18:42:31Z

src/coreclr/vm/interpexec.cpp

+                    EECodeInfo codeInfo((PCODE)targetIp);
+                    if (!codeInfo.IsValid())
+                    {
+                        printf("Attempted to execute native code from interpreter.\n");


Using printf and assert directly for error handling may not be robust in production. Consider replacing with structured error logging or exception handling to better manage execution in release builds.

dotnet-policy-service · 2025-05-07T18:42:35Z

Tagging subscribers to this area: @BrzVlad, @janvorli, @kg
See info in area-owners.md if you want to be subscribed.

src/coreclr/vm/interpexec.cpp

kg · 2025-05-08T07:35:54Z

This was very easy to adapt for p/invoke, I like how you engineered it.

src/coreclr/vm/interpexec.cpp

janvorli · 2025-05-13T22:30:23Z

The PR is now ready for review.

src/coreclr/vm/callstubgenerator.cpp

This change adds support for making calls from the interpreter to JIT/AOT generated code. For each target method, it parses the signature and creates a list of hand written asm routines that transfer the arguments from the interpreter stack to the CPU registers / stack based on the native calling convention, call the target method and then places the return value to the interpreter stack. This list is cached in the MethodDescData so that for repeated calls to the same method, it doesn't need to be re-generated.

…commit

src/coreclr/vm/callstubgenerator.cpp

src/coreclr/vm/interpexec.cpp

src/coreclr/vm/method.cpp

src/coreclr/vm/interpexec.cpp

jkotas · 2025-05-14T22:11:27Z

src/coreclr/vm/interpexec.cpp

+                    if (!codeInfo.IsValid())
+                    {
+                        EEPOLICY_HANDLE_FATAL_ERROR_WITH_MESSAGE(COR_E_EXECUTIONENGINE, W("Attempted to execute native code from interpreter"));
+                    }
+                    else if (codeInfo.GetCodeManager() != ExecutionManager::GetInterpreterCodeManager())


Suggested change

if (!codeInfo.IsValid())

{

EEPOLICY_HANDLE_FATAL_ERROR_WITH_MESSAGE(COR_E_EXECUTIONENGINE, W("Attempted to execute native code from interpreter"));

}

else if (codeInfo.GetCodeManager() != ExecutionManager::GetInterpreterCodeManager())

if (!codeInfo.IsValid() || codeInfo.GetCodeManager() != ExecutionManager::GetInterpreterCodeManager())

I do not see why we need to block native code (FCalls?) here. Should this be like this?

(I agree with TODO that this needs to be faster.)

This was initially a temporary check to make sure something unexpected doesn't leak in here. As the comment mentions, I want to get rid of the codeInfo construction here soon and move to tagged pointer for the interpreter code so that we don't need to burn time looking up the code ranges. Then this will go away.
As for FCalls, I have made a local change in a testing branch yesterday to make them work, but they need to be handled explicitly, as their GetNativeCode returns NULL, so I have used the TryGetMultiCallableAddrOfCode. Maybe we can use that function instead of the GetNativeCode for all calls, but I need to refresh my memory on the differences.

jkotas

LGTM. Somebody from the interpreter v-team should sign-off as well.

BrzVlad · 2025-05-19T07:15:00Z

src/coreclr/vm/interpexec.cpp

+                    }
+                    else if (codeInfo.GetCodeManager() != ExecutionManager::GetInterpreterCodeManager())
+                    {
+                        MethodDesc *pMD = codeInfo.GetMethodDesc();


I think it would be great if we didn't have to work with MethodDesc at all for the fast invocation path. Also if these transition thunks would be shared per signature and not owned by each method independently, meaning the compiled targetIp would be passed explicitly. So a call pseudocode could look like:

obtain targetIp; if (targetIp is not interp) callStubInvoke = check call site cache if (!callStubInvoke) build CallStubGenerator set callStubInvoke and write cache callStubInvoke->Invoke(targetIp, ...) else interp call

I have considered generating the thunk per (normalized) signature, but decided to leave it as a possible future optimization. I don't have a good idea yet on how to make normalized signature comparison and generate cache key for a signature that would be significantly faster than generating it per method. And the size of the thunk is very small, so I am not sure how much we would save space-wise. Anyways, it is still worth looking into at some point.

BrzVlad

Nice

janvorli added this to the 10.0.0 milestone May 7, 2025

janvorli self-assigned this May 7, 2025

Copilot AI review requested due to automatic review settings May 7, 2025 18:42

janvorli requested review from BrzVlad and kg as code owners May 7, 2025 18:42

janvorli added NO-MERGE The PR is not ready for merge yet (see discussion for detailed reasons) NO-REVIEW Experimental/testing PR, do NOT review it area-CodeGen-Interpreter-coreclr labels May 7, 2025

Copilot AI reviewed May 7, 2025

View reviewed changes

janvorli mentioned this pull request May 7, 2025

CoreCLR Interpreter #112158

Open

66 tasks

janvorli changed the title ~~[WIP] Intepreter to JIT/AOT calls~~ [WIP] Interpreter to JIT/AOT calls May 7, 2025

kg reviewed May 8, 2025

View reviewed changes

src/coreclr/vm/interpexec.cpp Show resolved Hide resolved

kg mentioned this pull request May 8, 2025

Interpreter P/Invoke support #115393

Merged

janvorli force-pushed the interpreter-to-jit-calls branch from 5eb2f82 to d375289 Compare May 12, 2025 23:54

kg reviewed May 13, 2025

View reviewed changes

src/coreclr/vm/interpexec.cpp Outdated Show resolved Hide resolved

build-analysis bot mentioned this pull request May 13, 2025

CI flakiness: mono interpreter build getting killed #114123

Open

janvorli removed NO-MERGE The PR is not ready for merge yet (see discussion for detailed reasons) NO-REVIEW Experimental/testing PR, do NOT review it labels May 13, 2025

janvorli changed the title ~~[WIP] Interpreter to JIT/AOT calls~~ Interpreter to JIT/AOT calls May 13, 2025

janvorli requested a review from jkotas May 13, 2025 22:29

jkotas reviewed May 13, 2025

View reviewed changes

src/coreclr/vm/callstubgenerator.cpp Outdated Show resolved Hide resolved

jkotas reviewed May 13, 2025

View reviewed changes

src/coreclr/vm/callstubgenerator.cpp Outdated Show resolved Hide resolved

janvorli added 5 commits May 14, 2025 15:25

Calling convention testing

27bac16

Cleanup, comments and apple arm64 fix

53806f8

Fix test build break and cleanup Apple arm64 stack args handling

6ac574e

Fix build break

8956ddf

janvorli added 3 commits May 14, 2025 15:25

Fix Unix x64 build break

d787c4c

Fix some contracts and a bug in args by ref introduced in a previous …

59360a6

…commit

Move to allocations from LoaderHeap

ed393bc

janvorli force-pushed the interpreter-to-jit-calls branch from 9ac91be to ed393bc Compare May 14, 2025 14:47