Clean Web Data for AI Agents and LLMs

Turn any URL into clean markdown, structured JSON, or a screenshot. One API for crawling, scraping, search, and a headless browser. Built for agents that need fresh web data at runtime.

Free credits on signup. No card required.

200 OK · https://example.com · 204ms · 1.2 KB
# Example Domain

This domain is for use in illustrative examples in documents.
You may use this domain in literature without prior coordination
or asking for permission.

[More information…](https://www.iana.org/domains/example)
Sample response for scrape.
POST api.spider.cloud/scrape
100K+ pages / second
199 proxy countries
99.9% success rate
10K req / min
Features

Give your agents the live web.

Your agents need fresh data. Spider delivers it, even from sites that fight back.

PAY PER USE

Pay for bandwidth and compute, nothing else. Most pages settle for a fraction of a cent. Volume credits unlock at $500+.

Bandwidth: $1 / GB
Compute: $0.001 / min
Community

Teams that ship with Spider.

Engineers building AI agents, RAG pipelines, and research tools on top of Spider.

Why Spider

Built for production.

Rust core. Open source. Engineered for the live web.

100K+ pages per second

Written in Rust with async concurrency from the ground up. The same open-source engine powers every request.

Open source core

The crawler powering this API is on GitHub with 2K+ stars. Audit the code, self-host, or use the managed cloud. No lock-in.

AI-native output

Clean markdown, structured JSON, or screenshots, straight from the rendered DOM. No HTML cleanup, no wasted LLM tokens on navigation chrome.

Bot detection, handled

Stealth headers, residential proxies, and fingerprint rotation on by default. Tune per request when a target demands more.

Get started

Start crawling in 30 seconds.

One API key. Immediate results. No servers to manage.
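
As a rough illustration of what a first call might look like, the sketch below builds a request for the scrape endpoint shown on this page (POST api.spider.cloud/scrape) using only the Python standard library. The `https` scheme, the bearer-token header, and the `url` and `return_format` body fields are assumptions made for this example, and `build_scrape_request` is a hypothetical helper; check the docs for the exact parameters.

```python
import json
import urllib.request

# Endpoint shown on this page; the https scheme is assumed.
API_URL = "https://api.spider.cloud/scrape"


def build_scrape_request(target_url: str, api_key: str,
                         return_format: str = "markdown") -> urllib.request.Request:
    """Build (but do not send) a POST request for the scrape endpoint.

    The body fields and the bearer-token header are assumptions for
    illustration, not a confirmed description of the API.
    """
    payload = {"url": target_url, "return_format": return_format}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. Building the request separately keeps the example easy to inspect without a live API key.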

Free credits on signup. No card required.

FAQ

Common questions.

Billing, rate limits, and what happens when a crawl fails.

What is Spider?

Spider is a fast web scraping and crawling API designed for AI agents, RAG pipelines, and LLMs. It supports structured data extraction and multiple output formats including markdown, HTML, JSON, and plain text.

How can I try Spider?

Sign up to get free credits for testing, or explore the open-source Spider engine at https://github.com/spider-rs/spider.

What formats can Spider convert web data into?

Spider can return page content as HTML, raw, text, or one of several markdown formats. API responses are available as JSON, JSONL, CSV, or XML.

Can you crawl all pages?

Yes. Spider crawls all necessary content without needing a sitemap, and it does so ethically: requests to individual URLs are rate-limited per minute to balance the load on the target web server.

Does it respect robots.txt?

Yes. Spider complies with robots.txt by default, but you can disable this if necessary.
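
To make concrete what a robots.txt check involves, here is a minimal sketch using Python's standard `urllib.robotparser`. It is not Spider's implementation; the `is_allowed` helper, the sample rules, and the `my-agent` user agent are made up for illustration.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines; nothing is fetched over the network.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules: every agent is barred from /private/.
SAMPLE_RULES = """\
User-agent: *
Disallow: /private/
"""

print(is_allowed(SAMPLE_RULES, "my-agent", "https://example.com/"))
print(is_allowed(SAMPLE_RULES, "my-agent", "https://example.com/private/page"))
```

With these sample rules the first URL is permitted and the second is not, which is the kind of decision a compliant crawler makes before each request.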

What if a crawl fails?

Failed requests cost nothing. You only pay for successful responses that return data.

What if I get blocked?

Spider includes an unblocker with stealth mode, rotating proxies, and automatic retries. For heavily protected sites, the browser cloud provides full browser sessions with anti-detection built in.

How does billing work?

Each request is billed for bandwidth ($1/GB) plus compute ($0.001/min). Most pages cost a fraction of a cent. You can estimate your costs with the pricing calculator above.
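
The billing rule can be worked through with a short sketch. It uses the two rates quoted on this page and assumes that 1 GB means 10^9 bytes and that compute is metered per second of request time; neither assumption is confirmed here, and `estimate_cost_usd` is a hypothetical helper.

```python
BANDWIDTH_USD_PER_GB = 1.0    # rate quoted on this page
COMPUTE_USD_PER_MIN = 0.001   # rate quoted on this page


def estimate_cost_usd(bytes_transferred: int, compute_seconds: float,
                      bytes_per_gb: int = 10**9) -> float:
    """Estimate one request's cost as bandwidth plus compute.

    Assumes a decimal gigabyte (10**9 bytes) and per-second compute
    metering; both are illustrative assumptions.
    """
    bandwidth_cost = bytes_transferred / bytes_per_gb * BANDWIDTH_USD_PER_GB
    compute_cost = compute_seconds / 60 * COMPUTE_USD_PER_MIN
    return bandwidth_cost + compute_cost


# The sample response on this page: about 1.2 KB in 204 ms.
sample_cost = estimate_cost_usd(bytes_transferred=1200, compute_seconds=0.204)
print(f"{sample_cost:.7f} USD")
```

Under these assumptions the sample response comes to roughly 0.0000046 USD, well within "a fraction of a cent".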