Bootstrapping the effort to test large pages #51028

cshung · 2021-04-09T21:17:51Z

In 2019, we introduced large page support in the GC. However, the feature is not regularly tested, and every once in a while we have found it broken by careless changes (and then fixed it).

It is time for us to introduce automated testing to avoid future regressions. I have got some initial pointers from @Maoni0 and @safern. Here I am just bootstrapping the effort; as one can see, the change is mostly empty, since I am not sure what to do there yet.

From the GC's perspective, large page support only changes how the GC interacts with the operating system when acquiring memory pages. Therefore, it is not particularly meaningful to run many tests. A handful of tests that exercise the various hard limit configs and do some simple allocations would do.
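To make "a handful of tests" a bit more concrete, here is a rough sketch of how the environment for each run could be built. `COMPlus_GCLargePages` follows the GCLargePages setting discussed in this PR; the `GCHeapHardLimit*` names are taken from the documented GC settings, the limit values are made up, and `large_page_env` is a hypothetical helper, so all of this should be checked against the runtime. It also assumes numeric `COMPlus_` values are written in hex without a `0x` prefix.

```python
# Illustrative hard limit configurations (values are arbitrary examples).
# The setting names are assumptions based on the documented GC settings.
HARD_LIMIT_CONFIGS = [
    {"GCHeapHardLimit": 0x20000000},
    {
        "GCHeapHardLimitSOH": 0x10000000,
        "GCHeapHardLimitLOH": 0x08000000,
        "GCHeapHardLimitPOH": 0x08000000,
    },
]


def large_page_env(hard_limits, base_env=None):
    """Return a copy of base_env with large pages and the given limits set.

    Hypothetical helper: it only builds a dictionary of environment
    variables and does not launch anything.
    """
    env = dict(base_env) if base_env is not None else {}
    env["COMPlus_GCLargePages"] = "1"
    for name, value in hard_limits.items():
        # Assumption: COMPlus_ numeric values are hex without a 0x prefix.
        env["COMPlus_" + name] = format(value, "x")
    return env


if __name__ == "__main__":
    for config in HARD_LIMIT_CONFIGS:
        print(large_page_env(config))
```

Each resulting dictionary could then be passed as the environment of a simple allocation test, one run per hard limit config.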

Here are the two things that need to happen:

Run the tests only on Windows amd64 machines that have SeLockMemoryPrivilege enabled (I know some infrastructure work has already been done to enable the privilege), and
Specify some COMPlus variables so that tests can pick them up and do the large page testing when we need it.
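The first condition can be sketched as a small check. This is only an illustration, not what the pipeline does: it assumes the privilege shows up as a line starting with `SeLockMemoryPrivilege` in the output of `whoami /priv`, and the function names are hypothetical. It only checks that the privilege is listed for the account, not its current state.

```python
import platform
import subprocess

PRIVILEGE = "SeLockMemoryPrivilege"


def holds_privilege(whoami_output, privilege=PRIVILEGE):
    """Return True if `privilege` is listed in `whoami /priv` style output."""
    for line in whoami_output.splitlines():
        fields = line.split()
        if fields and fields[0] == privilege:
            return True
    return False


def should_run_large_page_tests(system, machine, whoami_output):
    """Gate: Windows, amd64, and the privilege is listed for the account."""
    is_amd64 = machine.upper() in ("AMD64", "X86_64")
    return system == "Windows" and is_amd64 and holds_privilege(whoami_output)


def current_machine_qualifies():
    """Apply the gate to the machine this script runs on."""
    if platform.system() != "Windows":
        return False
    result = subprocess.run(
        ["whoami", "/priv"], capture_output=True, text=True, check=False
    )
    return should_run_large_page_tests(
        platform.system(), platform.machine(), result.stdout
    )


if __name__ == "__main__":
    print(current_machine_qualifies())
```

Keeping the parsing separate from the `whoami` call makes the gate easy to exercise with canned output on any machine.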

ghost · 2021-04-09T21:17:55Z

I couldn't figure out the best area label to add to this PR. If you have write-permissions please help me learn by adding exactly one area label.

ghost · 2021-04-09T21:18:20Z

Tagging subscribers to this area: @dotnet/gc
See info in area-owners.md if you want to be subscribed.

Issue Details

The key challenge to this work is that the tests will require special machines (e.g. on Windows, we will need the SeLockMemoryPrivilege). That is why I need to make these changes.

Author:	cshung
Assignees:	-
Labels:	`area-GC-coreclr`
Milestone:	-

Maoni0 · 2021-04-09T21:25:44Z

I don't think I understand this part -

The key challenge to this work is that the tests will require special machines (e.g. on Windows, we will need the SeLockMemoryPrivilege). That is why I need to make these changes.

This change was already there, as I mentioned on Teams. The work that's left is to add the GCLargePages env var so tests can pick it up and run with it.

src/tests/Common/testenvironment.proj

eng/pipelines/libraries/run-test-job.yml

eng/pipelines/coreclr/gcstress-extra.yml

safern · 2021-04-09T21:43:02Z

cc: @BruceForstall

BruceForstall

I don't think you should add this to the "gcstress-extra" job. You should create a new job. E.g., there already is gc-longrunning and gc-simulator (I don't know if they actually run).

You should change eng\pipelines\common\templates\runtimes\run-test-job.yml. The eng\pipelines\libraries\run-test-job.yml file is used (by coreclr testing) for running coreclr stress tests over the libraries test assets.

src/tests/Common/testenvironment.proj

safern

You probably want to add a helixQueueGroup for this as well, so that we only run these tests in the Windows Helix queue that has this support enabled.

https://github.com/dotnet/runtime/blob/7810035f31bfd4e2f5d83cc405cf9307d346d77c/eng/pipelines/coreclr/templates/helix-queues-setup.yml#L117

eng/pipelines/coreclr/gc-largepages.yml

eng/pipelines/common/templates/runtimes/run-test-job.yml

cshung · 2021-04-14T18:20:36Z

We will continue this work in #51255.

cshung added the area-GC-coreclr label Apr 9, 2021

safern reviewed Apr 9, 2021: src/tests/Common/testenvironment.proj (outdated)

safern reviewed Apr 9, 2021: eng/pipelines/libraries/run-test-job.yml (outdated)

safern reviewed Apr 9, 2021: eng/pipelines/coreclr/gcstress-extra.yml (outdated)

BruceForstall reviewed Apr 9, 2021: src/tests/Common/testenvironment.proj (outdated)

safern reviewed Apr 9, 2021: eng/pipelines/coreclr/gc-largepages.yml (outdated)

cshung added 6 commits April 12, 2021 14:34

Bootstrapping the effort to test large pages

d2164ad

Code review feedback

0c27bbb

Bruce's comments

fbe19e6

Start with some guess work

a388fbd

Looks like some helix queue change is required?

8095c0f

Experiment with changes in helix-queues-setup

6bc2ab5

cshung force-pushed the public/large-pages-testing branch from 6713137 to 6bc2ab5 Compare April 12, 2021 21:44

safern reviewed Apr 12, 2021: eng/pipelines/common/templates/runtimes/run-test-job.yml (outdated)

cshung added 2 commits April 12, 2021 17:32

Remove unused comment

f67fd96

Fix pipeline to build test on Windows

18442fb

cshung mentioned this pull request Apr 14, 2021

Large page testing #51255

Closed

cshung closed this Apr 14, 2021

cshung deleted the public/large-pages-testing branch April 14, 2021 18:20

ghost locked as resolved and limited conversation to collaborators May 14, 2021

karelz added this to the 6.0.0 milestone May 20, 2021
