The platform

One engine. Three ways to ship.

Webclaw is one Rust extraction engine, served as a hosted Cloud API, a single-binary CLI, and an MCP server for AI agents. Same clean, LLM-ready output everywhere, with SDKs for Python, TypeScript and Go.

REST

Cloud API

Web extraction, as a service.

A REST API for production apps. Automatic bot protection, JS rendering, LLM-optimized output, and structured extraction. One key, every format.

14 endpoints, including scrape, crawl, search, extract, batch and research
Bot protection and JS rendering handled for you
Python, TypeScript and Go SDKs
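
As a rough illustration of the one-key REST flow described above, the sketch below builds (but does not send) a scrape request using only Python's standard library. The base URL, the `/v1/scrape` path, the bearer-token header and the `url` and `formats` body fields are all assumptions made for illustration, not confirmed parts of the API; the API reference has the real names.

```python
import json
import urllib.request

# Hypothetical values: the real base URL, path and field names may differ.
API_BASE = "https://api.example.com"
SCRAPE_PATH = "/v1/scrape"


def build_scrape_request(api_key: str, url: str, formats: list[str]) -> urllib.request.Request:
    """Build a POST request asking for `url` in the given output formats."""
    body = json.dumps({"url": url, "formats": formats}).encode("utf-8")
    return urllib.request.Request(
        API_BASE + SCRAPE_PATH,
        data=body,
        method="POST",
        headers={
            # Assumed auth scheme: a single API key sent as a bearer token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


request = build_scrape_request("YOUR_API_KEY", "https://example.com", ["markdown"])
print(request.get_method(), request.full_url)
```

Once the placeholders are replaced with real values, passing the request to `urllib.request.urlopen` would send it; the official SDKs presumably wrap this same call in typed helpers.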


Binary

CLI Tool

Extraction from your terminal.

A single Rust binary that scrapes, crawls and extracts structured data, and also runs web searches, deep research and AI-guided agents. Runs anywhere.

One binary, zero dependencies
Every output format, pipe-friendly for scripts
Local LLM features via Ollama, or self-host the whole stack
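
To make "pipe-friendly" concrete, here is a minimal sketch of a script that could sit on the receiving end of a pipe and outline a page from its markdown output. It assumes only that the CLI can write markdown to standard output; the function name `headings_from_stream` is hypothetical, and the regex is a simple heuristic that would also match `#` lines inside fenced code blocks.

```python
import io
import re

# Matches ATX-style markdown headings such as "## Pricing".
HEADING = re.compile(r"^(#{1,6})\s+(.*\S)\s*$")


def headings_from_stream(stream) -> list[tuple[int, str]]:
    """Return (level, title) pairs for each markdown heading in a text stream."""
    found = []
    for line in stream:
        match = HEADING.match(line)
        if match:
            found.append((len(match.group(1)), match.group(2)))
    return found


# In a real pipeline, pass sys.stdin instead of this in-memory sample.
sample = io.StringIO("# Title\n\nSome text.\n\n## Section one\n")
for level, title in headings_from_stream(sample):
    print("  " * (level - 1) + title)
```

Swapping the sample for `sys.stdin` turns this into a filter that can follow the CLI in a shell pipeline.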


Agents

MCP Server

Give AI agents the web.

A Model Context Protocol server that connects Claude, Cursor, Windsurf, Codex and any other MCP client to webclaw's extraction engine. One JSON config, full web access.

14 tools your agent can call over stdio
Works with Claude Desktop, Cursor, Windsurf, OpenCode, Codex
Real-time web access with no custom middleware
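
The "one JSON config" idea can be sketched as follows. The `mcpServers` layout is the convention Claude Desktop and similar clients use for stdio servers; the launch command `webclaw-mcp` and the `WEBCLAW_API_KEY` variable are assumptions for illustration and should be checked against the MCP server docs.

```python
import json

# Hypothetical launch command and variable name; substitute the real ones.
SERVER_COMMAND = "webclaw-mcp"
API_KEY_ENV = "WEBCLAW_API_KEY"


def mcp_config(command: str, env: dict[str, str]) -> dict:
    """Return a config in the `mcpServers` layout used by Claude Desktop and similar clients."""
    return {
        "mcpServers": {
            "webclaw": {
                "command": command,  # the client starts this process and talks to it over stdio
                "args": [],
                "env": env,
            }
        }
    }


config = mcp_config(SERVER_COMMAND, {API_KEY_ENV: "YOUR_API_KEY"})
print(json.dumps(config, indent=2))
```

The printed JSON is what would go into the client's MCP configuration file; where that file lives differs from client to client.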


Under the hood

Same engine, every surface.

Whether you call the API, the CLI, or the MCP server, the same Rust extraction pipeline does the work, so the output and the behavior are identical across all three.

LLM-ready output

Clean markdown, JSON, text and an LLM-optimized format that cuts token count by roughly 90% compared with raw HTML.

Bot protection handled

Challenge pages, CAPTCHAs and fingerprinting cleared transparently, no per-site config.

Fast by default

Sub-200ms median on static pages; JS rendering only kicks in when a page actually needs it.

Nine output formats

Markdown, JSON, text, links, raw HTML, screenshots and more, requested per call.

Built in Rust

A single fast engine under every surface, with no heavy infrastructure to babysit.

Open source

AGPL-3.0. Run the API, CLI and MCP server on your own hardware with no limits.

Official SDKs for Python, TypeScript and Go.

Type-safe clients over the same REST API, so you can call every endpoint in the language you already ship in.


Pick your surface. Ship today.

Cancel anytime. API, CLI, or MCP, the engine is the same.
