Skip to content

BrowserWing turns your browser actions into MCP commands Or Claude Skill, allowing AI agents to control browsers efficiently and reliably. Say goodbye to slow, token-heavy LLM interactions — let agents call commands directly for faster automation. Perfect for AI-driven tasks, browser automation, and boosting productivity.

License

Notifications You must be signed in to change notification settings

browserwing/browserwing

Repository files navigation

BrowserWing

Go React TypeScript Vite pnpm MCP

English · 简体中文 · 日本語 · Español · Português

browserwing.com

github_example_720p.mp4

Highlights

Native Browser Automation Platform with AI Integration

  • Complete Browser Control: 26+ HTTP API endpoints for full-featured browser automation
  • Built-in AI Agent: Direct conversational interface for browser automation tasks
  • Universal AI Tool Integration: Native MCP & Skills protocol support - compatible with any AI tool that supports these standards
  • Visual Script Recording: Record browser actions, edit visually, and replay with precision
  • Flexible Export Options: Convert recorded scripts to MCP commands or Skills files for AI tool integration
  • Intelligent Data Extraction: LLM-powered semantic extraction supporting OpenAI, Claude, DeepSeek, and more
  • Session Management: Robust cookie and storage handling for stable, authenticated browsing sessions

Requirements

  • Google Chrome or Chromium installed and accessible in your environment.

Screenshots

BrowserWing Homepage

Turn Scripts Into Claude Skill

You can now combine any scripts into a SKILL.md.

BrowserWing Skill

Quick Start

Option A — Install via Package Manager (recommended)

Using npm:

npm install -g browserwing
browserwing --port 8080

Using pnpm:

pnpm add -g browserwing
browserwing --port 8080

The npm package automatically tests GitHub and Gitee mirrors during installation and selects the fastest one.

⚠️ macOS Users:
If you encounter a "killed" error when running, fix it with:

xattr -d com.apple.quarantine $(which browserwing)

See the macOS Installation Fix Guide for details.

Using Homebrew (macOS/Linux):

# Coming soon
brew install browserwing

Option B — One-Line Install Script

Linux / macOS:

curl -fsSL https://raw.githubusercontent.com/browserwing/browserwing/main/install.sh | bash

Windows (PowerShell):

iwr -useb https://raw.githubusercontent.com/browserwing/browserwing/main/install.ps1 | iex

The script automatically:

  • Detects your OS/architecture
  • Tests GitHub and Gitee mirrors, selects the fastest one
  • Downloads and extracts the binary
  • Adds to PATH

Then start BrowserWing:

browserwing --port 8080
# Open http://localhost:8080 in your browser

Note for users in China: The installation script automatically uses Gitee mirror if GitHub is slow.

Option C — Manual Download

Download the prebuilt binary for your OS from Releases:

# Linux/macOS
chmod +x ./browserwing
./browserwing --port 8080

# Windows (PowerShell)
./browserwing.exe --port 8080

Option D — Build from Source

# Install deps (Go + pnpm required)
make install

# Build integrated binary (frontend embedded)
make build-embedded
./build/browserwing --port 8080

# Or build all targets and packages
make build-all
make package

Quick Integration with AI Tools

Three Ways to Use BrowserWing:

1. MCP Server Integration

Configure BrowserWing as an MCP server in any MCP-compatible AI tool:

{
  "mcpServers": {
    "browserwing": {
      "url": "http://localhost:8080/api/v1/mcp/message"
    }
  }
}

Paste this configuration into your AI tool's MCP settings to enable browser automation capabilities.

2. Skills File Integration

Download and import the Skills file into any AI tool that supports the Skills protocol:

  1. Start BrowserWing
  2. Download SKILL.md from the repository
  3. Import into your AI tool's Skills settings
  4. Start automating with natural language commands

Example:

"Navigate to example.com, search for 'AI tools', and extract the top 5 results"

3. Direct AI Agent Interface

Use BrowserWing's built-in AI Agent for immediate browser automation:

  1. Open BrowserWing web interface at http://localhost:8080
  2. Navigate to "AI Agent" section
  3. Configure your LLM (OpenAI, Claude, DeepSeek, etc.)
  4. Start conversational browser automation

Export Custom Scripts:

# Export your recorded scripts as Skills or MCP commands
curl -X POST 'http://localhost:8080/api/v1/scripts/export/skill' \
  -H 'Content-Type: application/json' \
  -d '{"script_ids": []}' \
  -o MY_CUSTOM_SCRIPTS.md

Why BrowserWing

Professional Browser Automation with AI Integration

  • Universal Protocol Support: Native MCP & Skills implementation works with any compatible AI tool
  • Complete Automation API: 26+ HTTP endpoints providing comprehensive browser control capabilities
  • Flexible Integration Options: Use as MCP server, Skills file, or standalone AI Agent
  • Visual Workflow Builder: Record, edit, and replay browser actions without writing code
  • Token-Efficient Design: Optimized for LLM usage with fast performance and minimal token consumption
  • Production-Ready: Stable session management, cookie handling, and error recovery
  • Extensible Architecture: Convert recorded scripts to reusable MCP commands or Skills files
  • Multi-LLM Support: Works with OpenAI, Anthropic, DeepSeek, and other providers
  • Enterprise Use Cases: Data extraction, RPA, testing, monitoring, and agent-driven automation

Usage Guide

Getting Started in Three Steps

  1. Choose Integration Method

    • Copy MCP server configuration for AI tool integration
    • Download Skills file for Skills-compatible AI tools
    • Or use built-in AI Agent for immediate access
  2. Configure Your AI Tool

    • Import MCP configuration or Skills file into your preferred AI tool
    • Configure LLM settings (API keys, model selection)
    • Verify connection to BrowserWing
  3. Start Automating

    • Control browser through natural language commands
    • Record custom scripts for repeated tasks
    • Export scripts as MCP commands or Skills for reuse

Advanced Workflows

For Browser Automation:

  • Launch and manage multiple browser instances
  • Configure profiles, proxies, and browser settings
  • Handle cookies and authentication sessions
  • Execute complex interaction sequences

For Script Recording:

  • Capture clicks, inputs, navigation, and waits
  • Edit actions visually in the script editor
  • Test and debug with step-by-step replay
  • Add variables and conditional logic

For AI Integration:

  • Convert scripts to MCP commands or Skills files
  • Integrate with multiple LLM providers
  • Use semantic extraction for data parsing
  • Build agent-driven automation workflows

HTTP API Reference

BrowserWing exposes 26+ RESTful endpoints for programmatic browser control:

Navigation & Control

  • Navigate to URLs, go back/forward, refresh pages
  • Manage browser windows and tabs
  • Handle page loading and timeouts

Element Interaction

  • Click, type, select, and hover actions
  • File uploads and form submissions
  • Keyboard shortcuts and key presses

Data Extraction

  • Extract text, HTML, and attributes
  • Semantic content analysis with LLM
  • Screenshot capture (full page or element)

Advanced Operations

  • Execute custom JavaScript
  • Manage cookies and local storage
  • Batch operations for efficiency
  • Wait conditions and element visibility

Complete Documentation: See docs/EXECUTOR_HTTP_API.md for detailed endpoint specifications

Contributing

  • Issues and PRs are welcome. Please include clear steps to reproduce or a concise rationale.
  • For feature ideas, open a discussion with use cases and expected outcomes.

Community

Discord: https://discord.gg/BkqcApRj twitter: https://x.com/chg80333

Acknowledgements

  • Inspired by modern browser automation, agentic workflows, and MCP.

License

  • MIT License. See LICENSE.

Disclaimer

  • Do not use for illegal purposes or to violate site terms.
  • Intended for personal learning and legitimate automation only.

About

BrowserWing turns your browser actions into MCP commands Or Claude Skill, allowing AI agents to control browsers efficiently and reliably. Say goodbye to slow, token-heavy LLM interactions — let agents call commands directly for faster automation. Perfect for AI-driven tasks, browser automation, and boosting productivity.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published