LLM-ready formats
Choose markdown, JSON, llm, text, or raw HTML per request. No HTML parsing on your side.
POST /v1/scrape
Turn any URL into clean, LLM-ready content with one call.
Send a URL, get back markdown, JSON, structured text, or raw HTML, stripped of nav, ads, and boilerplate. The engine is built in Rust, returns static pages in around 118ms, and handles JavaScript rendering and bot protection automatically.
Boilerplate, nav, and ads are stripped so your model spends tokens on signal, not chrome.
JavaScript rendering and bot-protection bypass kick in automatically when a page needs them.
Title, description, language, status, and timing come back alongside the content on every call.
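As a rough illustration of reading the metadata that comes back with the content, the sketch below maps a response body onto a small typed record. The page does not document the response schema, so every key name here (`content`, `metadata`, `title`, `description`, `language`, `status`, `timing_ms`) is an assumption made for the example and should be checked against the real API reference.

```python
from dataclasses import dataclass
from typing import Any, Mapping, Optional


@dataclass
class ScrapeResult:
    """Content plus the metadata fields the page says accompany every call."""
    content: str
    title: Optional[str]
    description: Optional[str]
    language: Optional[str]
    status: Optional[int]
    timing_ms: Optional[float]


def parse_scrape_response(body: Mapping[str, Any]) -> ScrapeResult:
    # Assumption: metadata is nested under a "metadata" key; the real
    # response layout may differ.
    meta = body.get("metadata") or {}
    return ScrapeResult(
        content=body.get("content", ""),
        title=meta.get("title"),
        description=meta.get("description"),
        language=meta.get("language"),
        status=meta.get("status"),
        timing_ms=meta.get("timing_ms"),
    )


# Hypothetical response body, used only to exercise the parser.
sample = {
    "content": "# Example Domain",
    "metadata": {
        "title": "Example Domain",
        "description": None,
        "language": "en",
        "status": 200,
        "timing_ms": 118.0,
    },
}
result = parse_scrape_response(sample)
```

Keeping the metadata in a typed record makes it easy to filter on `status` or `language` before the content is passed to a model or index.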
01 POST the URL plus an optional format. No headless browser to run, no proxies to manage.
02 The engine picks a fingerprint and escalates to JS rendering or anti-bot handling only if the page needs it.
03 HTML is parsed and converted to your chosen format, with boilerplate removed.
04 Clean content plus metadata comes back, ready to pass to any LLM, vector store, or pipeline.
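From the caller's side, the four steps above reduce to one POST. The sketch below builds that request with the Python standard library. Only the `/v1/scrape` path comes from this page; the base URL, the bearer-token header, and the `url` and `format` body fields are placeholders assumed for illustration.

```python
import json
import urllib.request

# Assumption: placeholder host. Substitute the real API base URL.
BASE_URL = "https://api.example.com"


def build_scrape_request(
    url: str, fmt: str = "markdown", api_key: str = "YOUR_API_KEY"
) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/scrape."""
    # Assumption: the body is JSON with "url" and "format" fields.
    payload = json.dumps({"url": url, "format": fmt}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + "/v1/scrape",
        data=payload,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer-token auth; the real scheme may differ.
            "Authorization": f"Bearer {api_key}",
        },
    )


def scrape(url: str, fmt: str = "markdown", api_key: str = "YOUR_API_KEY") -> dict:
    """Send the request and decode the JSON response body."""
    req = build_scrape_request(url, fmt, api_key)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Splitting request construction from sending keeps the payload easy to inspect or log before any network call is made.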
Turn any website into LLM-ready markdown, JSON, or structured data. Handles protected sites and returns static pages in around 118ms.
POST the URL to /v1/scrape with the markdown format. webclaw fetches the page, strips nav, ads, and boilerplate, and returns GitHub-flavored markdown ready for an LLM, with no HTML parsing on your side.
Yes. The engine detects when a page needs JavaScript rendering and escalates automatically, so single-page apps and client-rendered content come back fully rendered.
Yes. Bot-protection bypass runs automatically when a page needs it, so protected pages return real content instead of a block page.
markdown, json, llm, text, and raw html. Pick the one that fits your pipeline: markdown for RAG and prompts, json for structured data, html when you need the raw document.
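One way to encode that guidance in client code is a small lookup from use case to format, sketched below. The use-case labels are hypothetical names invented for this example, and the exact format identifiers the API accepts (for instance `html` for raw HTML) are assumed from the list above rather than confirmed.

```python
# Format identifiers as listed on the page; exact spellings are assumed.
SUPPORTED_FORMATS = ("markdown", "json", "llm", "text", "html")

# Hypothetical use-case labels; the pairings follow the guidance above.
FORMAT_FOR_USE_CASE = {
    "rag": "markdown",
    "prompt": "markdown",
    "structured_data": "json",
    "raw_document": "html",
}


def pick_format(use_case: str, default: str = "markdown") -> str:
    """Return the format for a use case, falling back to a validated default."""
    fmt = FORMAT_FOR_USE_CASE.get(use_case, default)
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    return fmt
```

For example, `pick_format("structured_data")` returns `"json"`, and an unrecognized use case falls back to `"markdown"`.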
No. Credits are only consumed on successful responses. A standard page is 1 credit; heavier work like JS rendering or protected-site access costs a few extra credits.
One credit pool, every endpoint. Cancel anytime, or self-host the open-source core for free.