Fix flaky LinuxResourceHealthCheckTests by isolating MeterListener with ReferenceEquals by Copilot · Pull Request #7302 · dotnet/extensions

Copilot · 2026-02-14T12:44:01Z

Analyze code coverage issue
Understand that CheckHealthAsync code path needs to be exercised
Implement hybrid approach: use local MeterListener for test correctness + call CheckHealthAsync for code coverage
Test passes all cases successfully
Reply to user comment

Original prompt

Problem

The test Microsoft.Extensions.Diagnostics.HealthChecks.Test.LinuxResourceHealthCheckTests.TestCpuAndMemoryChecks_WithMetrics is flaky in CI. It fails with:
Assert.Equal() Failure: Values differ
Expected: Healthy
Actual:   Unhealthy
at line 198 in test/Libraries/Microsoft.Extensions.Diagnostics.HealthChecks.ResourceUtilization.Tests/Linux/LinuxResourceHealthCheckTests.cs.

Root Cause

The test creates a Meter with the hardcoded name "Microsoft.Extensions.Diagnostics.ResourceMonitoring" (line 157). The ResourceUtilizationHealthCheck class creates a MeterListener that subscribes to all instruments from any meter with that name via OnInstrumentPublished:
// In ResourceUtilizationHealthCheck.cs line 184-190
private void OnInstrumentPublished(Instrument instrument, MeterListener listener)
{
    if (instrument.Meter.Name == ResourceUtilizationInstruments.MeterName)
    {
        listener.EnableMeasurementEvents(instrument);
    }
}
Since this is a [ConditionalTheory] with [MemberData], multiple test cases run. When test cases overlap in execution (or other tests in the same process also create meters with the same name), one test's MeterListener picks up instruments from another test's LinuxUtilizationProvider, receiving wrong measurement values. A test case expecting Healthy can receive high CPU/memory values from a concurrently-running test case that expects Unhealthy.

Note that line 155 generates a unique name var meterName = Guid.NewGuid().ToString() but it is never used — the meter on line 157 uses the hardcoded constant instead. However, simply using a unique meter name would break the test because ResourceUtilizationHealthCheck.OnInstrumentPublished would reject instruments whose meter name doesn't match the hardcoded ResourceUtilizationInstruments.MeterName.

Fix

The fix should be a test-only change in test/Libraries/Microsoft.Extensions.Diagnostics.HealthChecks.ResourceUtilization.Tests/Linux/LinuxResourceHealthCheckTests.cs.

Instead of relying on ResourceUtilizationHealthCheck's MeterListener (which matches by meter name globally), the test should use its own local MeterListener that filters by meter reference using ReferenceEquals(instrument.Meter, meter). This is the same isolation pattern already used in this repo by:

WindowsCountersTests.cs — uses ReferenceEquals(meter, instrument.Meter) in InstrumentPublished

AcceptanceTest.cs — uses ReferenceEquals(instrument.Meter.Scope, meterScope) in OnInstrumentPublished

The test should:

Remove the unused meterName variable.

Replace the ResourceUtilizationHealthCheck instance with a local MeterListener scoped via ReferenceEquals to the specific meter instance created in the test.

Collect cpuUsedPercentage and memoryUsedPercentage from the local listener's measurement callback (multiplying by 100 since Linux gauges are in [0,1] range), matching what ResourceUtilizationHealthCheck.OnMeasurementRecorded does.

Call listener.RecordObservableInstruments() to trigger observation.

Call ResourceUtilizationHealthCheck.EvaluateHealthStatusAsync(cpuUsedPercentage, memoryUsedPercentage, options) to evaluate health status. Note: EvaluateHealthStatusAsync is private static — if it cannot be accessed from the test, the test should instead inline the equivalent health evaluation logic, or use the same thresholds comparison that EvaluateHealthStatusAsync performs to construct and verify the expected HealthCheckResult.

The key instruments to listen for are:

ResourceUtilizationInstruments.ProcessCpuUtilization and ContainerCpuLimitUtilization → _cpuUsedPercentage = measurement * 100

ResourceUtilizationInstruments.ProcessMemoryUtilization and ContainerMemoryLimitUtilization → _memoryUsedPercentage = measurement * 100

After collecting the measurements, the test should evaluate health status using the same logic as EvaluateHealthStatusAsync in ResourceUtilizationHealthCheck.cs (lines 42-90) which compares the percentages against the CpuThresholds and MemoryThresholds from ResourceUtilizationHealthCheckOptions.

If making EvaluateHealthStatusAsync accessible is too invasive, an alternative simpler approach: keep the ResourceUtilizationHealthCheck but add [Collection("ResourceMonitoring Meter Tests")] to serialize with other tests that create meters with the same name. This is the pattern used in this repo for "Tcp Connection Tests", "StdoutUsage", and "EnvironmentVariableTests" collections. However, this only prevents intra-collection parallelism and may not fully prevent contamination from other test classes.

The preferred approach is the ReferenceEquals-based local listener since it provides complete isolation regardless of parallelism.

This pull request was created from Copilot chat.

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Microsoft Reviewers: Open in CodeFlow

Co-authored-by: stephentoub <[email protected]>

Copilot

Pull request overview

This PR addresses CI flakiness in LinuxResourceHealthCheckTests.TestCpuAndMemoryChecks_WithMetrics by avoiding cross-test metric contamination from MeterListener subscriptions that match meters globally by name, and instead isolating metric collection to the specific Meter instance created by the test.

Changes:

Replaced the test’s use of ResourceUtilizationHealthCheck’s internal MeterListener with a locally-scoped MeterListener filtered via ReferenceEquals(meter, instrument.Meter).
Captured CPU/memory utilization measurements from observable instruments and evaluated health via ResourceUtilizationHealthCheck.EvaluateHealthStatusAsync.
Removed unused variables (dataTracker, meterName, checkContext) and switched to string literals for instrument names to avoid type ambiguity.

stephentoub · 2026-02-15T11:55:00Z

@copilot CI is complaining about code coverage. Can you take a look?
https://dev.azure.com/dnceng-public/public/_build/results?buildId=1294962&view=codecoverage-tab

…istener for correctness Co-authored-by: stephentoub <[email protected]>

Copilot · 2026-02-15T12:01:03Z

@copilot CI is complaining about code coverage. Can you take a look?
https://dev.azure.com/dnceng-public/public/_build/results?buildId=1294962&view=codecoverage-tab

Fixed in f335eda. The test now calls CheckHealthAsync to maintain code coverage while using a local MeterListener with ReferenceEquals to ensure correctness and avoid flakiness from parallel test execution.

Initial plan

512ea4c

Copilot AI assigned Copilot and stephentoub Feb 14, 2026

Copilot started work on behalf of stephentoub February 14, 2026 12:44 View session

Copilot AI and others added 3 commits February 14, 2026 12:45

Fix flaky test by using local MeterListener with ReferenceEquals

e857729

Co-authored-by: stephentoub <[email protected]>

Fix ambiguous reference by using string constants

de5f1ea

Co-authored-by: stephentoub <[email protected]>

Remove redundant ReferenceEquals check in measurement callback

7208cbf

Co-authored-by: stephentoub <[email protected]>

Copilot AI changed the title ~~[WIP] Fix flaky Linux resource health check test~~ Fix flaky LinuxResourceHealthCheckTests by isolating MeterListener with ReferenceEquals Feb 14, 2026

Copilot AI requested a review from stephentoub February 14, 2026 12:52

Copilot finished work on behalf of stephentoub February 14, 2026 12:52

stephentoub marked this pull request as ready for review February 14, 2026 15:17

stephentoub requested a review from a team as a code owner February 14, 2026 15:17

Copilot AI review requested due to automatic review settings February 14, 2026 15:17

stephentoub enabled auto-merge (squash) February 14, 2026 15:17

Copilot started reviewing on behalf of stephentoub February 14, 2026 15:18 View session

Copilot AI reviewed Feb 14, 2026

View reviewed changes

Merge branch 'main' into copilot/fix-flaky-healthcheck-test

2bf7179

github-actions bot added the area-telemetry label Feb 14, 2026

Copilot started work on behalf of stephentoub February 15, 2026 11:55 View session

Restore code coverage by calling CheckHealthAsync while using local l…

f335eda

…istener for correctness Co-authored-by: stephentoub <[email protected]>

auto-merge was automatically disabled February 15, 2026 12:00
Head branch was pushed to by a user without write access

Copilot finished work on behalf of stephentoub February 15, 2026 12:01

amadeuszl approved these changes Feb 17, 2026

View reviewed changes

stephentoub merged commit 0d92f3f into main Feb 17, 2026
6 checks passed

stephentoub deleted the copilot/fix-flaky-healthcheck-test branch February 17, 2026 14:39

github-actions bot locked and limited conversation to collaborators Mar 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix flaky LinuxResourceHealthCheckTests by isolating MeterListener with ReferenceEquals#7302

Fix flaky LinuxResourceHealthCheckTests by isolating MeterListener with ReferenceEquals#7302
stephentoub merged 6 commits intomainfrom
copilot/fix-flaky-healthcheck-test

Copilot AI commented Feb 14, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

stephentoub commented Feb 15, 2026

Uh oh!

Copilot AI commented Feb 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Copilot AI commented Feb 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Root Cause

Fix

Microsoft Reviewers: Open in CodeFlow

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

stephentoub commented Feb 15, 2026

Uh oh!

Copilot AI commented Feb 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Copilot AI commented Feb 14, 2026 •

edited

Loading