Every page as markdown
Each crawled page comes back as clean markdown with title and word count, stripped of nav, ads, and boilerplate.
POST /v1/crawl
Point it at one URL and get clean markdown for every page on the site.
Feed your LLM or agent a whole knowledge base in one job. Start an async crawl from a single URL, follow same-origin links with depth and page limits you set, and poll for clean markdown on every page. Built in Rust, handles JS rendering and bot protection automatically.
Each crawled page comes back as clean markdown with title and word count, stripped of nav, ads, and boilerplate.
Set max_depth and max_pages to control exactly how far the crawler walks and how many pages it pulls.
Turn on use_sitemap to seed the queue with sitemap URLs and reach pages that links alone never expose.
Start a job, get a UUID back, and poll until it completes without holding a request open.
01
Send one URL with optional max_depth, max_pages, and use_sitemap, and get a crawl job ID in return.
02
The crawler walks links breadth-first, staying on the same origin until it hits your depth or page limit.
03
Every page is rendered if needed and converted to markdown, around 90% fewer tokens than the raw HTML.
04
Poll GET /v1/crawl/{id} until status is completed, then read the full pages array of markdown.
4 pages crawled
Send one POST to /v1/crawl with a start URL. The crawler follows same-origin links breadth-first and returns clean markdown for every page it reaches. Set max_depth and max_pages to bound how far it goes.
It is async. Starting a crawl returns a job UUID and a status, then you poll GET /v1/crawl/{id} until the status flips to completed and the full pages array is ready.
Pass max_depth and max_pages in the request body. They default to 2 and 50, so the crawler stays bounded unless you raise them.
Pages only reachable through forms or scripts may not show up via link traversal. Set use_sitemap to true to seed the queue with sitemap URLs and reach pages that links alone never expose.
No. Credits are only consumed on successful responses. A standard page is 1 credit; heavier work like JS rendering or protected-site access costs a few extra credits.
One credit pool, every endpoint. Cancel anytime, or self-host the open-source core for free.
Cookies & analytics
We'd like to use analytics to understand how this site is used. Nothing loads or fires until you agree. See our privacy policy for the full list of processors.