POST /v1/brand + /v1/extract

Lead enrichment from any domain.

Turn a company domain into structured firmographic data.

Given a company website, webclaw extracts brand identity, tech stack signals, team information, pricing tiers, and product features. Perfect for sales intelligence, ABM targeting, and enrichment pipelines without paying Clearbit or ZoomInfo prices.

Enrich a company

View API docs

How it works

Build it step by step.

The real flow, one step at a time. Switch between TypeScript, Python, and cURL on any snippet.

Start from a domain

Take a company website and use /v1/map to locate its About, Pricing, and Team pages.

const domain = "example.com";// Discover the pages worth enriching fromconst { links } = await webclaw.map({ url: `https://${domain}` });const pages = links.filter((url) =>  /\/(about|pricing|team)/i.test(url),);

Define a firmographic schema

Describe the fields to capture: company name, industry, tagline, product categories, and target audience.

// One schema, reused across every domainconst schema = {  type: "object",  properties: {    company_name: { type: "string" },    industry: { type: "string" },    tagline: { type: "string" },    product_categories: { type: "array", items: { type: "string" } },    target_audience: { type: "string" },  },};

Extract structured data

Call /v1/extract with that schema so every site returns the same consistently shaped JSON.

// Same schema in, same JSON shape out — every timeconst firmographic = await webclaw.extract({  url: `https://${domain}`,  schema,});console.log(firmographic.data);

Merge into your CRM

Combine the extracted firmographics with brand identity and push the enriched record to your pipeline.

// Add brand identity, then push the enriched recordconst brand = await webclaw.brand({ url: `https://${domain}` });const lead = { domain, ...firmographic.data, ...brand.data };await crm.upsert(lead);

Enrich a company

Why webclaw

Built for lead enrichment.

Brand identity extraction (logo, colors, fonts, tagline)

LLM structured extraction with custom schemas

Site mapping to discover About, Pricing, Team pages

Fresh data: scrape on demand, not stale databases

Predictable per-page pricing at scale

What you get

Everything this use case needs.

Brand extraction (logo, colors, fonts)
LLM structured extraction with schemas
Sitemap and page discovery
Fresh data on every call
Volume-friendly pricing

Where it fits

Built for the messy parts.

Enrichment providers charge $0.20-$2 per contact lookup and provide stale data scraped months ago. Building your own scraping pipeline requires handling bot protection, parsing diverse site structures, and extracting fields consistently across thousands of sites.

webclaw /v1/brand extracts design identity and positioning. /v1/extract with a firmographic schema pulls structured company data from About, Pricing, and Team pages. Combine with /v1/crawl for full-site enrichment and /v1/map for sitemap discovery.

Common questions

Frequently asked questions

Can webclaw replace Clearbit or ZoomInfo for lead enrichment?

For firmographic data extracted from public websites, yes. webclaw gives you structured company data from home, about, pricing, and team pages at a fraction of the cost. It does not provide B2B contact emails, which are datasets Clearbit builds separately.

How do I extract the same fields across 10,000 different company sites?

Define a JSON schema once and use /v1/extract with that schema against every domain. The LLM extraction handles the variation in page structures and returns consistently shaped output you can load into your CRM.

What about privacy and GDPR compliance?

webclaw only scrapes publicly accessible content, respects robots.txt by default, and does not store customer data. You are responsible for complying with GDPR, CCPA, and other regulations when processing scraped data.

For AI agents

Or hand it to your agent.

Add the webclaw MCP server to Claude, Cursor, or any MCP client, then paste this prompt. The agent calls the webclaw tools and hands the result back to your model — no code to write.

PROMPT FOR YOUR AGENT

Using the webclaw tools, enrich this company into a structured lead record: [the company domain]. First call map on the domain to locate its About, Pricing, and Team pages, then call brand on the domain to pull its logo, colors, fonts, and tagline. Next call extract on the homepage (and the About/Pricing pages you found) with a firmographic schema capturing company_name, industry, tagline, product_categories, target_audience, and pricing_tiers, so every field comes back consistently shaped. Merge the brand identity and the extracted firmographics into a single enriched JSON record keyed by the domain. Return that JSON plus a short summary table of the key firmographic fields so I can drop it straight into my CRM.

Set up the MCP server

Ready to build? Start extracting.

Cancel anytime. Clean, structured data on every call.

View API docs

API documentation

Other use cases

Travel price monitoring RAG pipelines AI agents Deep research Price monitoring Competitive intelligence Web search Brand extraction Content change monitoring Content summarization Website to Markdown YouTube transcripts