RAW HTTP — NO HEADLESS BROWSER OVERHEADMARKDOWN · JSON · HTML · LLM-READY FORMATSMCP SERVER FOR AI AGENTSTLS FINGERPRINT IMPERSONATIONEXTRACT · SUMMARIZE · DIFF · BRANDSITEMAP DISCOVERY & DEEP CRAWLINGSELF-HOST OR USE OUR CLOUD APIBUILT IN RUST — FAST BY DEFAULTDEEP RESEARCH — AI SYNTHESIZES REPORTS FROM 50+ SOURCESWEB SEARCH — QUERY AND SCRAPE SEARCH RESULTS IN ONE CALLAGENT SCRAPE — GIVE A GOAL, AI EXTRACTS WHAT YOU NEEDURL MONITORING — WATCH PAGES FOR CHANGES WITH WEBHOOKSBONUS CREDITS — EARN FREE CREDITS BY STARRING AND REFERRINGRAW HTTP — NO HEADLESS BROWSER OVERHEADMARKDOWN · JSON · HTML · LLM-READY FORMATSMCP SERVER FOR AI AGENTSTLS FINGERPRINT IMPERSONATIONEXTRACT · SUMMARIZE · DIFF · BRANDSITEMAP DISCOVERY & DEEP CRAWLINGSELF-HOST OR USE OUR CLOUD APIBUILT IN RUST — FAST BY DEFAULTDEEP RESEARCH — AI SYNTHESIZES REPORTS FROM 50+ SOURCESWEB SEARCH — QUERY AND SCRAPE SEARCH RESULTS IN ONE CALLAGENT SCRAPE — GIVE A GOAL, AI EXTRACTS WHAT YOU NEEDURL MONITORING — WATCH PAGES FOR CHANGES WITH WEBHOOKSBONUS CREDITS — EARN FREE CREDITS BY STARRING AND REFERRING

MCP SERVER

Give AI agents the web.

The Model Context Protocol connects Claude, Cursor, Windsurf, OpenCode, Codex, Antigravity, and any MCP-compatible AI agent directly to webclaw's extraction engine. One JSON config. Full web access. 12 tools your agent can call.

Quick setup

Pick your client. Paste the config. Done.

{
  "mcpServers": {
    "webclaw": {
      "command": "webclaw-mcp",
      "env": {
        "WEBCLAW_API_KEY": "<YOUR_API_KEY>"
      }
    }
  }
}

Installation

Two ways to get the binary.

Pre-built binary

bash
curl -fsSL https://github.com/0xMassi/webclaw/releases/latest/download/webclaw-mcp-aarch64-apple-darwin \
  -o webclaw-mcp && chmod +x webclaw-mcp

From source

bash
cargo install --git https://github.com/0xMassi/webclaw webclaw-mcp

Configuration

All configuration is through environment variables. Set them in your MCP client config or your shell.

WEBCLAW_API_KEY

Enables cloud fallback for bot-protected sites (Cloudflare, DataDome, AWS WAF)

OPENAI_API_KEY / ANTHROPIC_API_KEY

Enables extract and summarize tools (LLM features)

OLLAMA_HOST

Use local Ollama for LLM features (default: localhost:11434)

proxies.txt

Place a proxies.txt file in working directory for automatic proxy rotation

12 tools

Every tool your agent needs to read, extract, and monitor the web.

scrape

Extract content from any URL

url, format (markdown | text | llm | json | links | rawHtml | attributes | query)

crawl

Crawl entire sites with BFS traversal

url, depth, max_pages

map

Discover all URLs via sitemap parsing

url, include, exclude

batch

Extract multiple URLs concurrently

urls[], format, concurrency

extract

LLM-powered structured data extraction

url, prompt, schema

summarize

AI-generated page summaries

url, max_length

diff

Track content changes between snapshots

url, previous_snapshot

brand

Extract brand identity from any site

url (returns colors, fonts, logos)

search

Web search via Serper with optional scraping

query, num_results, scrape_results

research

Async deep research with multi-source analysis

query, depth, max_sources

get_research

Poll research job status and retrieve results

job_id

Smart Fetch
Local-first architecture: 80% of scrapes cost zero API credits. The cloud API only activates for bot-protected or JS-rendered pages. Your agent gets full web access while keeping costs near zero.

Related guides

Read more on how the MCP server fits the wider stack.

Ready to connect?

Get an API key for cloud features, or use the MCP server fully local with zero configuration.