RAW HTTP: NO HEADLESS BROWSER OVERHEAD
MARKDOWN · JSON · HTML · LLM-READY FORMATS
MCP SERVER FOR AI AGENTS
TLS FINGERPRINT IMPERSONATION
EXTRACT · SUMMARIZE · DIFF · BRAND
SITEMAP DISCOVERY & DEEP CRAWLING
SELF-HOST OR USE OUR CLOUD API
BUILT IN RUST: FAST BY DEFAULT
DEEP RESEARCH: AI SYNTHESIZES REPORTS FROM 50+ SOURCES
WEB SEARCH: QUERY AND SCRAPE SEARCH RESULTS IN ONE CALL
AGENT SCRAPE: GIVE A GOAL, AI EXTRACTS WHAT YOU NEED
URL MONITORING: WATCH PAGES FOR CHANGES WITH WEBHOOKS
BONUS CREDITS: EARN FREE CREDITS BY STARRING AND REFERRING

About.

I'm Massi, the engineer behind webclaw. I build developer tools in Rust and work on web extraction, systems programming, and infrastructure for AI applications.

webclaw started because I needed to get clean content out of web pages for LLM pipelines, and every existing tool was too slow, got blocked, or returned garbage HTML that wasted 95% of tokens. So I wrote my own.

The core extraction engine is written in Rust and is open source under the AGPL-3.0 license. It handles TLS fingerprint impersonation at the transport layer (no headless browsers), runs a 9-step content optimization pipeline, and averages 118 ms per page for static content. The cloud API adds JavaScript rendering, anti-bot bypass, LLM-powered extraction, and a crawling pipeline on top.
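To give a rough sense of why stripping markup matters for token budgets, here is a minimal sketch. It is not webclaw's pipeline: the function names `strip_markup` and `reduction_percent` are hypothetical, the tag handling is deliberately naive (no entity decoding, no real HTML parsing), and byte length is only an assumed stand-in for token count.

```rust
/// Naive markup stripper: removes tags and skips the bodies of
/// <script> and <style> elements. Illustrative only; a real extractor
/// would use a proper HTML parser.
fn strip_markup(html: &str) -> String {
    let mut text = String::new();
    let mut rest = html;

    while let Some(open) = rest.find('<') {
        // Keep everything before the tag.
        text.push_str(&rest[..open]);
        let after = &rest[open..];

        let close = match after.find('>') {
            Some(i) => i,
            None => {
                // Unterminated tag: drop the remainder.
                rest = "";
                break;
            }
        };

        let tag = after[1..close].to_ascii_lowercase();
        rest = &after[close + 1..];

        // Skip the body of script/style elements up to their closing tag.
        // The closing-tag search is case-sensitive to keep the sketch short.
        for name in ["script", "style"] {
            if tag == name || tag.starts_with(&format!("{} ", name)) {
                let end = format!("</{}", name);
                rest = match rest.find(&end) {
                    Some(i) => &rest[i..],
                    None => "",
                };
            }
        }

        // Tags act as word separators.
        text.push(' ');
    }
    text.push_str(rest);

    // Collapse runs of whitespace into single spaces.
    text.split_whitespace().collect::<Vec<_>>().join(" ")
}

/// Size reduction in percent, using byte length as a crude proxy for tokens.
fn reduction_percent(html: &str, text: &str) -> f64 {
    if html.is_empty() {
        return 0.0;
    }
    100.0 * (1.0 - text.len() as f64 / html.len() as f64)
}

fn main() {
    let html = "<html><head><style>p{color:red}</style>\
                <script>var x = 1;</script></head>\
                <body><nav><a href=\"/\">Home</a></nav>\
                <p>Hello,   world.</p></body></html>";
    let text = strip_markup(html);
    println!("text: {}", text);
    println!("reduction: {:.1}%", reduction_percent(html, &text));
}
```

Even on a toy page, most of the bytes are markup, styles, and scripts rather than content. The exact ratio depends entirely on the page, so the figure this prints should not be read as a benchmark.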

I write about the engineering behind webclaw on the blog. Topics range from token optimization and RAG pipelines to TLS fingerprinting and MCP integration. Everything I publish comes from building and shipping this in production.

Contact

For questions, feedback, or anything else: admin@webclaw.io. You can also open an issue on GitHub or join the Discord.
