Skip to main content New gottem — one API for every web scraping vendor.
Web data API · Built for agents

Clean web data for AI agents.

One API for crawling, scraping, search, and a real browser. Built for agents that need fresh web data at runtime.

Free balance on signup. No card required.

200 OK·https://example.com204ms·1.2 KB
# Example Domain

This domain is for use in illustrative examples in documents.
You may use this domain in literature without prior coordination
or asking for permission.

[More information…](https://www.iana.org/domains/example)
Sample for scrape·Run live to stream the real one
POST·api.spider.cloud/scrapeGet a free API key
Browser Use stealth benchmark · 80 anti-bot sites

85% pass rate. Highest in the field.

Cloudflare, Akamai, PerimeterX, DataDome. Spider Browser scored highest overall and led in four of six anti-bot categories.

Spider85%
Kernel68%
Browserbase41%
100%
999-URL pass 254 domains, 18 categories
2.5s
Median e2e connect → render → close
705ms
Session create no cold start
Community

Teams shipping with Spider.

What engineers said on X, Medium, and Dify.

Why Spider

Built for the live web.

Rust core, open-source under MIT, and the same engine in every request.

100K+ pages per second

01

Written in Rust with async concurrency from the ground up. The same open-source engine powers every request.

Open source core

02

The crawler powering this API is on GitHub with 2K+ stars. Audit the code, self-host, or use the managed cloud. No lock-in.

AI-native output

03

Clean markdown, structured JSON, or screenshots, straight from the rendered DOM. No HTML cleanup, no wasted LLM tokens on navigation chrome.

Bot detection, handled

04

Stealth headers, residential proxies, and fingerprint rotation on by default. Tune per request when a target demands more.

FAQ

Common questions.

Billing, rate limits, and crawl failures.

What is Spider?

Spider is a fast web scraping and crawling API designed for AI agents, RAG pipelines, and LLMs. It supports structured data extraction and multiple output formats including markdown, HTML, JSON, and plain text.

How can I try Spider?

Sign up for a free balance to test, or explore the open-source Spider engine at https://github.com/spider-rs/spider.

What formats can Spider convert web data into?

Spider outputs HTML, raw, text, and various markdown formats. It supports JSON, JSONL, CSV, and XML for API responses.

Can you crawl all pages?

Yes. Spider crawls all necessary content without needing a sitemap. We rate-limit individual URLs per minute to balance the load on a target server.

Does it respect robots.txt?

Yes. robots.txt compliance is on by default. You can disable it on a per-request basis when needed.

What if a crawl fails?

Failed requests are billed at $0. You only pay for responses that return data.

What if I get blocked?

Spider includes an Unblocker with stealth, rotating proxies, and automatic retries. Heavily protected sites route to the Browser Cloud, which runs full browser sessions with anti-detection built in.

How does billing work?

Each request is billed for bandwidth ($1/GB) plus compute ($0.001/min). Most pages cost a fraction of a cent. You can estimate your spend with the pricing calculator at /compare.