Current Version: v1.5.1-dev.3 | Updated: January 2025
WebAI-MCP is a comprehensive browser automation platform that provides AI agents with deep insights into browser behavior and user experience. This guide covers everything you need to know as a user.
MCP Client → webai-mcp → webai-server → Chrome Extension
(AI) → (MCP) → (Bridge) → (Browser)
Always gets the latest version, no permanent installation:
# 1. Start WebAI Server (run in separate terminal)
npx @cpjet64/webai-server@dev
# 2. Configure your IDE with MCP server
# Add to your IDE's MCP configuration: npx @cpjet64/webai-mcp@devInstall once, run anytime:
# 1. Install globally
npm install -g @cpjet64/webai-mcp@dev
npm install -g @cpjet64/webai-server@dev
# 2. Run anytime with simple commands
webai-server
# MCP server runs automatically when your IDE calls itFor developers who want to modify the code:
# 1. Clone the repository
git clone https://github.com/cpjet64/webai-mcp.git
cd webai-mcp
# 2. Install dependencies
npm run install:all
# 3. Build packages
npm run build:all
# 4. Run locally
cd webai-server && npm start
# Configure IDE with: node /path/to/webai-mcp/webai-mcp/dist/mcp-server.js-
Download Extension:
- Download from Chrome Web Store (coming soon)
- Or load unpacked from
chrome-extension/directory
-
Load Unpacked Extension (Development):
1. Open Chrome → Extensions → Developer mode ON 2. Click "Load unpacked" 3. Select the chrome-extension/ folder 4. Extension should appear in toolbar -
Verify Installation:
- Extension icon appears in Chrome toolbar
- Right-click on any webpage → "Inspect" → "WebAI" tab should be visible
- Open Extension Panel: Click the WebAI extension icon
- Configure Auto-paste (optional):
- Check "Enable Auto-paste"
- Select target IDE: Cursor, VS Code, Zed, Claude Desktop, or Custom
- Test Connection: Take a screenshot to verify everything works
takeScreenshot- High-quality screenshots with auto-pastegetConsoleLogs- Browser console monitoringgetNetworkLogs- Network request trackinginspectElementsBySelector- Element inspection with styleswipeLogs- Clear browser logs
getCookies- Browser cookies accessgetLocalStorage- localStorage data retrievalgetSessionStorage- sessionStorage data retrieval
clickElement- Enhanced clicking with coordinatesfillInput- Input field manipulationselectOption- Dropdown selectionsubmitForm- Form submission
refreshBrowser- Advanced page refreshgetSelectedElement- Currently selected element
runAccessibilityAudit- A11y compliance checkingrunPerformanceAudit- Performance analysisrunSEOAudit- SEO optimization analysisrunBestPracticesAudit- Web best practices
checkVersionCompatibility- Component compatibility checkgetVersionInfo- Comprehensive version detailsgetVersions- Quick version overview
Auto-paste automatically copies screenshots to your clipboard and pastes them directly into your IDE with a descriptive message. This eliminates the manual copy-paste workflow when sharing screenshots with AI assistants.
- Cursor - Process detection + AppleScript automation
- VS Code - Process detection + AppleScript automation
- Zed - Process detection + AppleScript automation
- Claude Desktop - Process detection + AppleScript automation
- Custom Apps - User-defined application names
- Cursor - Process name:
Cursor.exe - VS Code - Process name:
Code.exe - VS Code Insiders - Process name:
Code - Insiders.exe - Zed - Process name:
Zed.exe - Claude Desktop - Process name:
Claude.exe - Custom Applications - User-defined process names
- Cursor - Window detection + xdotool automation
- VS Code - Window detection + xdotool automation
- Zed - Window detection + xdotool automation
- Claude Desktop - Window detection + xdotool automation
- Custom Apps - User-defined application names
- Enable in Extension: Check "Enable Auto-paste" in the Chrome extension panel
- Select Target IDE: Choose from dropdown or select "Custom Application"
- Test: Take a screenshot and verify it appears in your IDE
- ✅ Windows 10/11 - Full support
- ✅ PowerShell 5.1+ - Included with Windows
- ✅ System.Windows.Forms - Built-in .NET assembly
- ✅ System.Drawing - Built-in .NET assembly
# Test if PowerShell can run automation scripts
Get-ExecutionPolicy
# Should show: RemoteSigned, Unrestricted, or BypassIf restricted, run as Administrator:
Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser# Check running IDE processes
Get-Process | Where-Object {$_.ProcessName -like "*Code*" -or $_.ProcessName -like "*Cursor*" -or $_.ProcessName -like "*Zed*"}- Open your target IDE (VS Code, Cursor, etc.)
- Open Chrome with the extension
- Navigate to any webpage
- Take a screenshot using the extension
- Verify the image appears in your IDE with "here is the screenshot" text
- Open multiple IDEs (VS Code + Cursor)
- Set target IDE in extension settings
- Take screenshot
- Verify it pastes to the correct IDE
- Minimize your IDE
- Take screenshot
- Verify IDE window is restored and receives paste
Ask your AI assistant: "What versions of WebAI-MCP components are currently installed?"
Example Output:
📋 WebAI-MCP Component Versions
===============================
• MCP Server: 1.5.1-dev.3 ✅
• WebAI Server: 1.5.1-dev.3 ✅
• Chrome Extension: 1.5.1 ✅
🖥️ System: win32 (x64)
📦 Node.js: v20.11.0 | NPM: 10.2.4
Ask your AI assistant: "Are there any updates available for WebAI-MCP?"
Ask your AI assistant: "Check if my WebAI-MCP components are compatible"
Solutions:
- Verify server is running: Check if
webai-serverprocess is active - Check ports: Server runs on ports 3025-3027
- Restart server: Stop and restart
webai-server - Check firewall: Ensure ports 3025-3027 are not blocked
Solutions:
- Reload extension: Chrome → Extensions → Reload WebAI extension
- Check permissions: Ensure extension has necessary permissions
- Restart Chrome: Close and reopen Chrome browser
- Check DevTools: Look for WebAI tab in Chrome DevTools
Solutions:
- Check auto-paste setting: Ensure "Enable Auto-paste" is checked
- Verify target IDE: Confirm correct IDE is selected
- Check IDE is running: Ensure target IDE process is active
- Test manually: Try copying/pasting manually first
Solutions:
- Verify selector: Check CSS selector is correct
- Wait for page load: Ensure page is fully loaded
- Check element visibility: Element must be visible on page
- Try different selector: Use more specific or alternative selectors
# Check if your IDE is actually running
Get-Process | Where-Object {$_.ProcessName -like "*YourIDE*"}- Check file permissions on screenshot folder
- Verify PowerShell execution policy
- Try running as Administrator
- Disable focus stealing prevention: Windows Settings → System → Focus assist → Off
- Check if IDE has multiple windows open
- Try clicking on IDE window manually first
- Check browser performance: Close unnecessary tabs
- Verify system resources: Ensure adequate RAM/CPU
- Update Chrome: Use latest Chrome version
- Restart components: Restart webai-server and Chrome extension
- Clear browser cache: Clear Chrome cache and cookies
- Check for memory leaks: Monitor system resources
- Keep components updated: Regularly check for updates
- Use latest Chrome: Keep Chrome browser updated
- Monitor system resources: Ensure adequate RAM/CPU
- Regular restarts: Restart components if experiencing issues
- Review permissions: Only grant necessary permissions
- Monitor data access: Be aware of what data is being accessed
- Use privacy modes: Configure privacy filtering as needed
- Use version tools: Regularly check component compatibility
- Test thoroughly: Verify functionality after updates
- Report issues: Submit bug reports with detailed information
- GitHub Repository: https://github.com/cpjet64/webai-mcp
- Issue Tracker: GitHub Issues
- NPM Packages: @cpjet64/webai-mcp | @cpjet64/webai-server
- Chinese Documentation: 简体中文文档
"Take a screenshot of the current page"
"Capture a screenshot and paste it to my IDE"
"Screenshot the current browser tab"
"Click the submit button"
"Fill the email field with test@example.com"
"Select 'Option 2' from the dropdown menu"
"Submit the contact form"
"Show me the console logs"
"What network requests were made?"
"Analyze the page performance"
"Check this page for accessibility issues"
"What cookies are set on this page?"
"Show me the localStorage data"
"What's stored in sessionStorage?"
// Enhanced element clicking with coordinates and options
clickElement({
selector: "#submit-button", // CSS selector
coordinates: { x: 100, y: 200 }, // Optional coordinates
waitForElement: true, // Wait for element visibility
timeout: 5000, // Timeout in milliseconds
scrollIntoView: true // Scroll element into view
})
// Input field manipulation
fillInput({
selector: "input[name='email']", // CSS selector for input
text: "user@example.com", // Text to enter
clearFirst: true, // Clear existing value first
waitForElement: true, // Wait for element visibility
timeout: 5000, // Timeout in milliseconds
triggerEvents: true // Trigger input/change events
})
// Dropdown selection
selectOption({
selector: "select[name='country']", // CSS selector for select
value: "US", // Value of option to select
text: "United States", // OR visible text of option
index: 0, // OR index of option (0-based)
waitForElement: true, // Wait for element visibility
timeout: 5000 // Timeout in milliseconds
})
// Form submission
submitForm({
formSelector: "#contact-form", // CSS selector for form
submitButtonSelector: "button[type='submit']", // OR submit button selector
waitForElement: true, // Wait for element visibility
timeout: 5000, // Timeout in milliseconds
waitForNavigation: false // Wait for navigation after submit
})// Advanced browser refresh with comprehensive options
refreshBrowser({
waitForLoad: true, // Wait for page to fully load
timeout: 10000, // Timeout for page load
preserveScrollPosition: false, // Preserve scroll position
clearCache: false // Clear browser cache before refresh
})// Comprehensive page auditing
runAccessibilityAudit() // Check WCAG compliance
runPerformanceAudit() // Analyze page performance
runSEOAudit() // SEO optimization analysis
runBestPracticesAudit() // Web development best practices- Level 1 (Hide Nothing): All data visible
- Level 2 (Hide Sensitive): Passwords, tokens, keys filtered
- Level 3 (Hide All Data): Maximum privacy protection
settings = {
allowAutoPaste: true, // Enable/disable auto-paste
targetIDE: "cursor", // Selected IDE
customAppName: "MyIDE" // Custom app name (if targetIDE === "custom")
}- Full page capture: Entire webpage including scrolled content
- Visible area only: Current viewport
- Element-specific: Capture specific elements with highlighting
{
"mcpServers": {
"webai-mcp": {
"command": "npx",
"args": ["@cpjet64/webai-mcp@dev"]
}
}
}{
"mcpServers": {
"webai-mcp": {
"command": "npx",
"args": ["@cpjet64/webai-mcp@dev"]
}
}
}{
"mcpServers": {
"webai-mcp": {
"command": "npx",
"args": ["@cpjet64/webai-mcp@dev"]
}
}
}- Node.js: 18.0.0 or higher
- Chrome: Version 90 or higher
- RAM: 4GB minimum, 8GB recommended
- CPU: Dual-core minimum
- Close unnecessary browser tabs: Reduces memory usage
- Regular component restarts: Prevents memory leaks
- Monitor system resources: Use task manager to check usage
- Update regularly: Keep all components current
- Use
runPerformanceAuditto identify bottlenecks - Monitor network requests with
getNetworkLogs - Check console for errors with
getConsoleLogs
This complete user guide provides everything you need to successfully install, configure, and use WebAI-MCP for enhanced AI-browser interactions.