About.
I'm Massi, the engineer behind webclaw. I build developer tools in Rust and work on web extraction, systems programming, and infrastructure for AI applications.
webclaw started because I needed to get clean content out of web pages for LLM pipelines and every existing tool was either too slow, got blocked, or returned garbage HTML that wasted 95% of tokens. So I wrote my own.
The core extraction engine is Rust, open source, and AGPL-3.0 licensed. It handles TLS fingerprinting at the transport layer (no headless browsers), runs a 9-step content optimization pipeline, and averages 118ms per page for static content. The cloud API adds JavaScript rendering, anti-bot bypass, LLM-powered extraction, and a crawling pipeline on top.
I write about the engineering behind webclaw on the blog. Topics range from token optimization and RAG pipelines to TLS fingerprinting and MCP integration. Everything I publish comes from building and shipping this in production.
Links
- 0xmassi.dev — personal site
- GitHub — open source work and webclaw repos
- X / Twitter — updates on webclaw and what I'm building
- webclaw on GitHub — the extraction engine source code
- Blog — technical deep dives on extraction and LLMs
Contact
For questions, feedback, or anything else: admin@webclaw.io. You can also open an issue on GitHub or join the Discord.