[RFC] Tests using the Everything server #582
Closed
+497
−35
Add Playwright e2e tests which connect to the reference Everything server.
This implementation is in a fork:
Motivation and Context
The Inspector lacks automated testing against real MCP servers, making it difficult to catch regressions and validate new functionality. (Much of the UI depends on a connected server.)
The Everything server provides example implementations of many MCP protocol features.
Together, these create an opportunity for a feedback loop to drive MCP specification validation and compliance:
I think there is a lot of potential in this direction.
I'm particularly interested in validation and compliance of MCP clients and servers, and in how I can help. I'm aware of the focus on validation in the Roadmap and of the SDK compliance spec schema.
How Has This Been Tested?
Running in GitHub Actions, sample run
Request for Comments
Seeking feedback on the concept. But specifically:
Current scope
structuredContent (implements the new MCP 2025-06-18 specification feature)
Current limitations
structuredContent tool (can't npx ... from a sub-package branch)
Next Steps
npx modelcontextprotocol/server-everything (once structuredContent tool is merged; otherwise, change the tested tool)
Future Test Coverage
Elicitation, Roots, Change Notifications in various contexts
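For reviewers unfamiliar with the structuredContent feature listed under Current scope, here is a minimal sketch of the kind of check an e2e test could run on a tool result once it has been read back from the Inspector. It is not the code in this PR. The `ToolResult` shape is trimmed to the fields used here, `checkStructuredResult` and `canonical` are hypothetical helpers, and the `temperature`/`conditions` fields are invented sample data. It assumes, per my reading of the 2025-06-18 spec, that a tool returning structuredContent should also mirror the same JSON in a text content block.

```typescript
// Hypothetical, trimmed-down shapes: only the fields this check needs.
interface TextContent {
  type: "text";
  text: string;
}

interface ToolResult {
  content: TextContent[];
  structuredContent?: Record<string, unknown>;
}

// Serialize with sorted object keys so that key order does not affect equality.
function canonical(value: unknown): string {
  if (Array.isArray(value)) {
    return `[${value.map(canonical).join(",")}]`;
  }
  if (value !== null && typeof value === "object") {
    const obj = value as Record<string, unknown>;
    const keys = Object.keys(obj).sort();
    return `{${keys.map((k) => `${JSON.stringify(k)}:${canonical(obj[k])}`).join(",")}}`;
  }
  return JSON.stringify(value);
}

// Returns a list of problems; an empty list means the result looks consistent.
function checkStructuredResult(result: ToolResult, requiredKeys: string[]): string[] {
  const structured = result.structuredContent;
  if (structured === undefined) {
    return ["structuredContent is missing"];
  }
  const problems: string[] = [];
  for (const key of requiredKeys) {
    if (!(key in structured)) {
      problems.push(`missing required key: ${key}`);
    }
  }
  // Assumed backwards-compatibility rule: some text block carries the same JSON.
  const mirrored = result.content.some((block) => {
    try {
      return canonical(JSON.parse(block.text)) === canonical(structured);
    } catch {
      return false; // the text block is not JSON at all
    }
  });
  if (!mirrored) {
    problems.push("no text block mirrors structuredContent");
  }
  return problems;
}

// Invented sample data, for illustration only.
const sample: ToolResult = {
  content: [{ type: "text", text: '{"conditions":"cloudy","temperature":21}' }],
  structuredContent: { temperature: 21, conditions: "cloudy" },
};
console.log(checkStructuredResult(sample, ["temperature", "conditions"]));
```

Returning a list of problems rather than throwing on the first mismatch keeps the helper usable from any test runner and lets a single assertion report every discrepancy at once.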