Skip to main content gottem  · one API for every scraper.

Collect · Search · Automate

Realtime web data for AI agents

One fast API for crawling, scraping, search, and a real browser. Clean, structured data the moment your agents ask for it.

Free credits on signup. No card required.

200 OK·https://example.comsample
# Example Domain

This domain is for use in illustrative examples in documents.
You may use this domain in literature without prior coordination
or asking for permission.

[More information…](https://www.iana.org/domains/example)
Scrape API benchmark · 1,000 URLs · open methodology

Fast, reliable, and low cost.

Spider was benchmarked against the leading alternatives on speed, reliability, and price.

Spider99.9%best
only 0.1% of pages missed
Firecrawl95.3%47× more misses
Bright Dataest.93.0%70× more misses
Higher is betterusable content returned
100K+
URLs per request10K req/min, no degradation
$0.03
Avg per 1,000 pagespay-as-you-go, no subscription
MIT
Open-source coreRust engine, self-host option
Measured + published · reproduce itRead the benchmark

Extraction

From the messy web to clean data.

Point Spider at any URL. We strip the nav, ads, and boilerplate and hand back exactly what your model needs: clean markdown or structured JSON.

example.com/article
Raw page · nav, ads, boilerplate
markdown json 2.1 KB
# Example Domain
This domain is for use in illustrative examples in
documents. You may use this domain in literature
without prior coordination or asking for permission.

[More information...](https://iana.org/domains/example)
Clean output · markdown & JSON

Browser automation

A real browser, driven by your agent.

Not a screenshot service. A real browser your agent steers. Navigate, click, type, and extract structured data from anything behind JavaScript or a login.

https://news.ycombinator.com live
session
  1. navigate news.ycombinator.com
  2. act click “new”
  3. act scroll to end
  4. extract top 30 stories → JSON
30 rows extracted
stealth pass · e2e 2.4s · cost $0.004

Community

Teams shipping with Spider.

What engineers said on the internet.

FAQ

Common questions.

Billing, rate limits, and crawl failures.

What is Spider?

Spider is a fast web scraping and crawling API designed for AI agents, RAG pipelines, and LLMs. It supports structured data extraction and multiple output formats including markdown, HTML, JSON, and plain text.

How can I try Spider?

Sign up for a free balance to test, or explore the open-source Spider engine at https://github.com/spider-rs/spider.

What formats can Spider convert web data into?

Spider outputs HTML, raw, text, and various markdown formats. It supports JSON, JSONL, CSV, and XML for API responses.

Can you crawl all pages?

Yes. Spider crawls all necessary content without needing a sitemap. We rate-limit individual URLs per minute to balance the load on a target server.

Does it respect robots.txt?

Yes. robots.txt compliance is on by default. You can disable it on a per-request basis when needed.

What if a crawl fails?

Failed requests are billed at $0. You only pay for responses that return data.

What if I get blocked?

Spider includes an Unblocker with stealth, rotating proxies, and automatic retries. Heavily protected sites route to the Browser Cloud, which runs full browser sessions with anti-detection built in.

How does billing work?

Each request is billed for bandwidth ($1/GB) plus compute ($0.001/min). Most pages cost a fraction of a cent. You can estimate your spend with the pricing calculator at /compare.