The hidden cost of agent web fetch.
When an AI agent fetches a web page with a built-in tool, it reads every token of raw HTML internally — then hands you back a lossy summary. You pay full price, get partial data. webclaw inverts this.
What actually happens
I fetched webclaw.io/docs three ways and counted the tokens. Here is what each method costs.
WebFetch is a black box. It consumes 3,054 tokens internally, then returns a ~300-token summary: your agent spends ten times the tokens it hands back, and you see a third of the information. webclaw returns the same page as 950 tokens of complete, structured content. Nothing hidden, nothing dropped.
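The ratios quoted above fall straight out of the measured counts; a quick sanity check (token counts copied from the measurement above, nothing new assumed):

```python
# Token counts from the measurement above (cl100k_base, webclaw.io/docs).
raw_html_tokens = 3054    # consumed internally by WebFetch
webfetch_summary = 300    # what WebFetch actually returns
webclaw_llm = 950         # webclaw's complete structured output

print(f"WebFetch internal overhead: {raw_html_tokens / webfetch_summary:.1f}x")
print(f"webclaw content vs summary: {webclaw_llm / webfetch_summary:.1f}x")
print(f"webclaw reduction vs raw HTML: {1 - webclaw_llm / raw_html_tokens:.0%}")
```

That last line is where the 69% figure further down comes from.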
What your agent actually sees
Same page. Same question. Different reality.
Built-in WebFetch
~300 tokens
The page is the official webclaw documentation. webclaw is a web extraction tool built in Rust designed for LLM pipelines. It provides a CLI, MCP server, and cloud API. The documentation covers installation, getting started guides, API reference, and CLI reference. The tool supports multiple output formats and is open source under AGPL-3.0.
webclaw LLM format
~950 tokens
output formats
CLI
webclaw scrape <url> --format llm
webclaw extract <url> --prompt "..."
webclaw crawl <url> --limit 50
API endpoints
MCP tools
scrape, extract, crawl, search, map, batch, research, diff
WebFetch returned the vibe of the page. webclaw returned the actual endpoints, formats, CLI commands, and MCP tool names. For anything that requires real information — code generation, API integration, data extraction — the difference is whether your agent can do the task at all.
Why raw HTML costs so much
This is what 12,215 characters of a single documentation page looks like before extraction:
<div class="nx-mt-6 nx-leading-7 first:nx-mt-0"><h1 class="nx-mt-2 nx-text-4xl nx-font-bold nx-tracking-tight nx-text-slate-900 dark:nx-text-slate-100">Getting Started</h1><div class="nx-mt-6 nx-leading-7 first:nx-mt-0"><p>webclaw is a web extraction platform built for LLM pipelines. Install the CLI with cargo or use the cloud API directly.</p><div class="nx-mt-6 nx-leading-7 first:nx-mt-0"> <div class="nextra-code-block nx-relative nx-mt-6 first:nx-mt-0"> <pre class="bg-primary-700/5 nx-mb-4 nx-overflow-x-auto nx-rounded-xl nx-subpixel-antialiased dark:nx-bg-primary-300/10 nx-text-[.9em] contrast-more:nx-border contrast-more:nx-border-primary-900/20 contrast-more:dark:nx-border-primary-100/40"><code class="nx-border-black nx-border-opacity-[0.04] nx-bg-opacity-[0.03] nx-bg-black nx-break-words nx-rounded-md nx-border nx-py-0.5 nx-px-[.25em] nx-text-[.9em] dark:nx-border-white/10 dark:nx-bg-white/10">cargo install webclaw</code> </pre></div></div></div></div> ... [11,800 more characters of class names, aria labels, nav markup, script tags, and repeated boilerplate]
Class names, wrapper divs, aria attributes, script tags, repeated nav elements. None of it is information. All of it counts as tokens when a built-in fetch tool processes the raw page.
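To make the overhead concrete, here is a minimal sketch that pulls just the visible text out of a fragment like the one above, using only Python's stdlib `html.parser`. The fragment is a shortened stand-in for the sample (class names taken from it); this is an illustration of the markup-to-content ratio, not webclaw's actual extraction pipeline:

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Keep visible text; drop tags, attributes, and script/style bodies."""
    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())

# Shortened fragment in the spirit of the sample above.
html = ('<div class="nx-mt-6"><h1 class="nx-text-4xl">Getting Started</h1>'
        '<p>Install the CLI with cargo.</p>'
        '<pre><code class="nx-border">cargo install webclaw</code></pre></div>')

parser = TextExtractor()
parser.feed(html)
text = " ".join(parser.parts)
print(f"{len(html)} chars of markup -> {len(text)} chars of text")
```

Even on this tiny, lightly decorated fragment, well over half the bytes are class names and wrapper tags; on a full documentation page the ratio is far worse, and every one of those bytes becomes tokens when a built-in fetch tool reads the raw page.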
How to fix it
Use webclaw instead of your agent's built-in fetch — via CLI, MCP server, or API.
CLI
webclaw scrape https://docs.example.com --format llm

MCP (Claude Desktop / any MCP client)
// claude_desktop_config.json
{
  "mcpServers": {
    "webclaw": {
      "command": "webclaw-mcp"
    }
  }
}

API
curl -X POST https://api.webclaw.io/v1/scrape \
-H "Authorization: Bearer $WEBCLAW_KEY" \
-d '{"url": "https://docs.example.com", "format": "llm"}'

69% token reduction vs raw HTML
3× more content than WebFetch returns
118ms avg latency for static pages
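For scripts that prefer Python over curl, the same API call can be prepared with the standard library alone. The endpoint, auth header, and payload fields mirror the curl example above; the helper name is ours, and the response schema is not shown here (a sketch, not an official client):

```python
import json
import urllib.request

API_URL = "https://api.webclaw.io/v1/scrape"  # endpoint from the curl example

def build_scrape_request(url: str, api_key: str, fmt: str = "llm"):
    """Prepare the POST request for /v1/scrape without sending it."""
    payload = json.dumps({"url": url, "format": fmt}).encode()
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_scrape_request("https://docs.example.com", "sk-test")
# urllib.request.urlopen(req) would perform the call; consult the
# API reference for the response shape.
```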
Token counts measured on webclaw.io/docs using cl100k_base tokenizer. WebFetch behavior observed in Claude Code session. Results vary by page but the pattern is consistent: raw HTML is expensive, built-in fetch tools summarize lossily, webclaw extracts precisely.