Clean web data for AI agents.
One API for crawling, scraping, search, and a real browser. Built for agents that need fresh web data at runtime.
Free balance on signup. No card required.
Fast, reliable, low cost. Get all three.
Spider was benchmarked against the leading alternatives on speed, reliability, and price.
- 100K+
- URLs per request10K req/min, no degradation
- $0.03
- Avg per 1,000 pagespay-as-you-go, no subscription
- MIT
- Open-source coreRust engine, self-host option
Native integrations with the leading AI frameworks
Teams shipping with Spider.
What engineers said on X, Medium, and Dify.
Common questions.
Billing, rate limits, and crawl failures.
What is Spider?
Spider is a fast web scraping and crawling API designed for AI agents, RAG pipelines, and LLMs. It supports structured data extraction and multiple output formats including markdown, HTML, JSON, and plain text.
How can I try Spider?
Sign up for a free balance to test, or explore the open-source Spider engine at https://github.com/spider-rs/spider.
What formats can Spider convert web data into?
Spider outputs HTML, raw, text, and various markdown formats. It supports JSON, JSONL, CSV, and XML for API responses.
Can you crawl all pages?
Yes. Spider crawls all necessary content without needing a sitemap. We rate-limit individual URLs per minute to balance the load on a target server.
Does it respect robots.txt?
Yes. robots.txt compliance is on by default. You can disable it on a per-request basis when needed.
What if a crawl fails?
Failed requests are billed at $0. You only pay for responses that return data.
What if I get blocked?
Spider includes an Unblocker with stealth, rotating proxies, and automatic retries. Heavily protected sites route to the Browser Cloud, which runs full browser sessions with anti-detection built in.
How does billing work?
Each request is billed for bandwidth ($1/GB) plus compute ($0.001/min). Most pages cost a fraction of a cent. You can estimate your spend with the pricing calculator at /compare.