A Model Context Protocol (MCP) server implementation for Selenium WebDriver automation using Python. This server allows you to control web browsers programmatically through Claude's desktop application.
- Browser automation with Chrome and Firefox support
- Comprehensive set of Selenium WebDriver operations
- Easy integration with Claude desktop application
- Support for various browser interactions:
- Navigation
- Element interaction (click, send keys, etc.)
- Screenshot capture
- Window and iframe handling
- Local storage management
- And much more!
- Python 3.7+
- Chrome or Firefox browser installed
- Claude desktop application
- Clone this repository:
git clone https://github.com/Jyothishkumarav/selenium-mcp-server-python.git
cd selenium-mcp-server-python
- Install the required dependencies:
pip install -r requirements.txt
- Install the MCP server in Claude:
mcp install server.py
- Start the server:
python server.py
- In Claude desktop application:
- Go to Settings > Developer Options
- You should see "MCP-Selenium" listed as an available server
- Enable the server to start using Selenium automation features
The server provides the following automation capabilities:
-
Browser Management:
- Start browser sessions
- Close sessions
- Switch between windows
-
Navigation:
- Navigate to URLs
- Refresh pages
- Wait for page loads
-
Element Interaction:
- Find elements
- Click elements
- Send keys
- Clear fields
- Double click
- Right click
-
Element Information:
- Get element text
- Get element attributes
- Check element presence
- Check element visibility
- Check element selection
-
Page Operations:
- Take screenshots
- Get page content
- Scroll pages
- Manage local storage
-
Frame Handling:
- Switch to iframes
- Return to default content
Here's a simple example of how to use the server with Claude:
- Start a browser session
- Navigate to a website
- Find and interact with elements
- Take screenshots
- Close the session
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.