REST API
The webclaw REST API gives you programmatic access to the full extraction engine. POST endpoints accept a JSON body, and every endpoint returns JSON.
Base URL
Use the cloud endpoint for managed infrastructure, or point at your own instance when self-hosting.
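As a rough illustration of switching between the cloud endpoint and a self-hosted instance, the sketch below resolves the base URL once and joins endpoint paths onto it. The cloud URL is a placeholder (the real value is not given in this section), and `WEBCLAW_BASE_URL` is a hypothetical variable name chosen for this example, not a documented setting.

```python
import os

# Placeholder only: substitute the real cloud base URL from your dashboard or docs.
CLOUD_BASE_URL = "https://cloud.example.invalid"


def resolve_base_url(env=None):
    """Return the base URL, preferring a self-hosted override when one is set.

    WEBCLAW_BASE_URL is a hypothetical variable name used for illustration.
    """
    env = os.environ if env is None else env
    override = env.get("WEBCLAW_BASE_URL", "").strip()
    base = override or CLOUD_BASE_URL
    # Drop any trailing slash so paths can be appended predictably.
    return base.rstrip("/")


def endpoint_url(base_url, path):
    """Join a base URL and an endpoint path with exactly one slash between them."""
    return base_url.rstrip("/") + "/" + path.lstrip("/")
```

Resolving the base URL in one place keeps the rest of a client identical for cloud and self-hosted deployments.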
Authentication
Requests are authenticated with an API key sent in the Authorization header. The only exception is a self-hosted server started without a key.
Cloud: Create API keys from your dashboard at webclaw.io. Keys are prefixed with wc_.
Self-hosted: Pass --api-key when starting the server, or set the WEBCLAW_API_KEY environment variable. If neither is set, the server runs without authentication.
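A minimal sketch of building the header follows. It assumes the key is sent as a Bearer token, which this section does not confirm, so check the scheme against your deployment. The helper names are hypothetical.

```python
def auth_headers(api_key):
    """Build request headers carrying the API key.

    Assumption: the key is sent as "Bearer <key>". The section only says the
    key travels in the Authorization header, not which scheme is used.
    """
    if not api_key:
        # A self-hosted server started without a key runs unauthenticated,
        # so no header is needed in that case.
        return {}
    return {"Authorization": f"Bearer {api_key}"}


def looks_like_cloud_key(api_key):
    """Check the wc_ prefix that cloud-issued keys carry."""
    return api_key.startswith("wc_")
```

The prefix check is only a quick sanity test for configuration mistakes. It says nothing about whether the key is valid.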
Request format
All POST endpoints accept a JSON body. Set the Content-Type header to application/json.
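The following sketch shows one way to assemble such a request with Python's standard library. It only builds the request object and sends nothing. The `url` field in the usage line is an assumed body parameter for illustration, since this section does not list request fields.

```python
import json
import urllib.request


def build_post(url, payload, headers=None):
    """Build a POST request with a JSON-encoded body.

    The request is only constructed here; pass it to urllib.request.urlopen
    to actually send it.
    """
    body = json.dumps(payload).encode("utf-8")
    req = urllib.request.Request(url, data=body, method="POST")
    req.add_header("Content-Type", "application/json")
    # Extra headers (for example, Authorization) are merged in by the caller.
    for name, value in (headers or {}).items():
        req.add_header(name, value)
    return req


# Usage (the host and the "url" body field are assumptions):
# req = build_post("http://localhost:3000/v1/scrape", {"url": "https://example.com"})
```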
Response format
All responses are JSON. Successful responses return the data directly, and error responses share one consistent shape.
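The error shape itself is not reproduced in this section, so the sketch below does not assume any field names. It assumes only that errors arrive with a non-2xx HTTP status, which is itself an assumption, and it hands the parsed body back to the caller. `ApiError` and `parse_response` are hypothetical names.

```python
import json


class ApiError(Exception):
    """Raised for non-2xx responses; carries the status and the decoded body."""

    def __init__(self, status, body):
        super().__init__(f"HTTP {status}: {body!r}")
        self.status = status
        self.body = body


def parse_response(status, raw_body):
    """Decode a response body and raise ApiError on a non-2xx status.

    Assumption: errors are signalled by the HTTP status code. No error field
    names are assumed; the decoded body is passed through untouched.
    """
    if raw_body:
        try:
            data = json.loads(raw_body)
        except ValueError:
            # Keep non-JSON bodies (for example, a proxy error page) as text.
            data = raw_body.decode("utf-8", "replace") if isinstance(raw_body, bytes) else raw_body
    else:
        data = None
    if 200 <= status < 300:
        return data
    raise ApiError(status, data)
```

Keeping the decoded body on the exception lets calling code inspect whatever fields the server actually returns.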
Rate limiting
Cloud API rate limits are based on your plan tier. Self-hosted instances have no rate limits by default. See the Cloud API page for plan details.
Endpoints
The API exposes the following endpoints:
| Method | Path | Description |
|---|---|---|
| POST | /v1/scrape | Single URL extraction |
| POST | /v1/crawl | Start async crawl |
| GET | /v1/crawl/{id} | Poll crawl status |
| POST | /v1/batch | Multi-URL extraction |
| POST | /v1/map | Sitemap discovery |
| POST | /v1/extract | LLM JSON extraction |
| POST | /v1/summarize | LLM summarization |
| POST | /v1/diff | Content change tracking |
| POST | /v1/brand | Brand identity extraction |
| GET | /health | Health check + Ollama status |
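The crawl endpoints in the table form an asynchronous pair: POST /v1/crawl starts a job and GET /v1/crawl/{id} is polled for its status. A minimal polling loop might look like the sketch below. The fetch function and the completion check are supplied by the caller because the response fields are not documented in this section. All names here are hypothetical.

```python
import time
from urllib.parse import quote


def crawl_status_path(crawl_id):
    """Fill the {id} placeholder of /v1/crawl/{id}, escaping the identifier."""
    return "/v1/crawl/" + quote(str(crawl_id), safe="")


def poll_crawl(fetch_status, crawl_id, is_done,
               interval=2.0, max_attempts=30, sleep=time.sleep):
    """Call fetch_status(crawl_id) until is_done(status) returns True.

    fetch_status and is_done are caller-supplied: the first performs the GET
    request, the second decides completion from whatever the server returns.
    Raises TimeoutError once max_attempts polls have been made.
    """
    for attempt in range(max_attempts):
        status = fetch_status(crawl_id)
        if is_done(status):
            return status
        # Do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError(f"crawl {crawl_id!r} not finished after {max_attempts} polls")
```

Injecting `sleep` and `fetch_status` keeps the loop free of network and timing dependencies, which makes it easy to test. A fixed interval is the simplest choice, and a production client may prefer a growing delay between polls.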