English · 简体中文 · 日本語 · Español · Português
github_example_720p.mp4
Native Browser Automation Platform with AI Integration
- Complete Browser Control: 26+ HTTP API endpoints for full-featured browser automation
- Built-in AI Agent: Direct conversational interface for browser automation tasks
- Universal AI Tool Integration: Native MCP & Skills protocol support - compatible with any AI tool that supports these standards
- Visual Script Recording: Record browser actions, edit visually, and replay with precision
- Flexible Export Options: Convert recorded scripts to MCP commands or Skills files for AI tool integration
- Intelligent Data Extraction: LLM-powered semantic extraction supporting OpenAI, Claude, DeepSeek, and more
- Session Management: Robust cookie and storage handling for stable, authenticated browsing sessions
- Google Chrome or Chromium installed and accessible in your environment.
You can now combine any scripts into a SKILL.md.
Using npm:
npm install -g browserwing
browserwing --port 8080Using pnpm:
pnpm add -g browserwing
browserwing --port 8080The npm package automatically tests GitHub and Gitee mirrors during installation and selects the fastest one.
If you encounter a "killed" error when running, fix it with:
xattr -d com.apple.quarantine $(which browserwing)See the macOS Installation Fix Guide for details.
Using Homebrew (macOS/Linux):
# Coming soon
brew install browserwingLinux / macOS:
curl -fsSL https://raw.githubusercontent.com/browserwing/browserwing/main/install.sh | bashWindows (PowerShell):
iwr -useb https://raw.githubusercontent.com/browserwing/browserwing/main/install.ps1 | iexThe script automatically:
- Detects your OS/architecture
- Tests GitHub and Gitee mirrors, selects the fastest one
- Downloads and extracts the binary
- Adds to PATH
Then start BrowserWing:
browserwing --port 8080
# Open http://localhost:8080 in your browserNote for users in China: The installation script automatically uses Gitee mirror if GitHub is slow.
Download the prebuilt binary for your OS from Releases:
# Linux/macOS
chmod +x ./browserwing
./browserwing --port 8080
# Windows (PowerShell)
./browserwing.exe --port 8080# Install deps (Go + pnpm required)
make install
# Build integrated binary (frontend embedded)
make build-embedded
./build/browserwing --port 8080
# Or build all targets and packages
make build-all
make packageThree Ways to Use BrowserWing:
Configure BrowserWing as an MCP server in any MCP-compatible AI tool:
{
"mcpServers": {
"browserwing": {
"url": "http://localhost:8080/api/v1/mcp/message"
}
}
}Paste this configuration into your AI tool's MCP settings to enable browser automation capabilities.
Download and import the Skills file into any AI tool that supports the Skills protocol:
- Start BrowserWing
- Download SKILL.md from the repository
- Import into your AI tool's Skills settings
- Start automating with natural language commands
Example:
"Navigate to example.com, search for 'AI tools', and extract the top 5 results"
Use BrowserWing's built-in AI Agent for immediate browser automation:
- Open BrowserWing web interface at
http://localhost:8080 - Navigate to "AI Agent" section
- Configure your LLM (OpenAI, Claude, DeepSeek, etc.)
- Start conversational browser automation
Export Custom Scripts:
# Export your recorded scripts as Skills or MCP commands
curl -X POST 'http://localhost:8080/api/v1/scripts/export/skill' \
-H 'Content-Type: application/json' \
-d '{"script_ids": []}' \
-o MY_CUSTOM_SCRIPTS.mdProfessional Browser Automation with AI Integration
- Universal Protocol Support: Native MCP & Skills implementation works with any compatible AI tool
- Complete Automation API: 26+ HTTP endpoints providing comprehensive browser control capabilities
- Flexible Integration Options: Use as MCP server, Skills file, or standalone AI Agent
- Visual Workflow Builder: Record, edit, and replay browser actions without writing code
- Token-Efficient Design: Optimized for LLM usage with fast performance and minimal token consumption
- Production-Ready: Stable session management, cookie handling, and error recovery
- Extensible Architecture: Convert recorded scripts to reusable MCP commands or Skills files
- Multi-LLM Support: Works with OpenAI, Anthropic, DeepSeek, and other providers
- Enterprise Use Cases: Data extraction, RPA, testing, monitoring, and agent-driven automation
-
Choose Integration Method
- Copy MCP server configuration for AI tool integration
- Download Skills file for Skills-compatible AI tools
- Or use built-in AI Agent for immediate access
-
Configure Your AI Tool
- Import MCP configuration or Skills file into your preferred AI tool
- Configure LLM settings (API keys, model selection)
- Verify connection to BrowserWing
-
Start Automating
- Control browser through natural language commands
- Record custom scripts for repeated tasks
- Export scripts as MCP commands or Skills for reuse
For Browser Automation:
- Launch and manage multiple browser instances
- Configure profiles, proxies, and browser settings
- Handle cookies and authentication sessions
- Execute complex interaction sequences
For Script Recording:
- Capture clicks, inputs, navigation, and waits
- Edit actions visually in the script editor
- Test and debug with step-by-step replay
- Add variables and conditional logic
For AI Integration:
- Convert scripts to MCP commands or Skills files
- Integrate with multiple LLM providers
- Use semantic extraction for data parsing
- Build agent-driven automation workflows
BrowserWing exposes 26+ RESTful endpoints for programmatic browser control:
Navigation & Control
- Navigate to URLs, go back/forward, refresh pages
- Manage browser windows and tabs
- Handle page loading and timeouts
Element Interaction
- Click, type, select, and hover actions
- File uploads and form submissions
- Keyboard shortcuts and key presses
Data Extraction
- Extract text, HTML, and attributes
- Semantic content analysis with LLM
- Screenshot capture (full page or element)
Advanced Operations
- Execute custom JavaScript
- Manage cookies and local storage
- Batch operations for efficiency
- Wait conditions and element visibility
Complete Documentation: See docs/EXECUTOR_HTTP_API.md for detailed endpoint specifications
- Issues and PRs are welcome. Please include clear steps to reproduce or a concise rationale.
- For feature ideas, open a discussion with use cases and expected outcomes.
Discord: https://discord.gg/BkqcApRj twitter: https://x.com/chg80333
- Inspired by modern browser automation, agentic workflows, and MCP.
- MIT License. See
LICENSE.
- Do not use for illegal purposes or to violate site terms.
- Intended for personal learning and legitimate automation only.

