
API Features

A complete toolkit for web data collection. Crawl, scrape, search, screenshot, transform, and extract, all from a single API.

9 core endpoints
50K+ pages per second
4 output formats
REST: JSON & streaming

Crawl

POST /crawl

Recursively crawl entire websites and collect every page. Set depth limits, respect robots.txt, and get structured output in markdown, HTML, or plain text.

Recursive crawling, Depth control, Markdown output, Batch URLs
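
A minimal sketch of what a crawl request might look like, using only the Python standard library. The base URL `https://api.spider.cloud`, the field names `url`, `depth`, and `return_format`, and the helper `build_crawl_request` are assumptions for illustration and are not confirmed by this page; check the API reference for the actual request schema. The request is built but not sent.

```python
import json
import urllib.request

# Assumed base URL; it is not stated on this page.
API_BASE = "https://api.spider.cloud"


def build_crawl_request(api_key: str, url: str, depth: int = 2,
                        return_format: str = "markdown") -> urllib.request.Request:
    """Build (but do not send) a POST /crawl request with Bearer auth."""
    # Field names are hypothetical; consult the API reference.
    payload = {"url": url, "depth": depth, "return_format": return_format}
    return urllib.request.Request(
        f"{API_BASE}/crawl",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_crawl_request("YOUR_API_KEY", "https://example.com")
# To send it: urllib.request.urlopen(req)
```

Building the request separately from sending it makes it easy to inspect the headers and body before spending any credits.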

Scrape

POST /scrape

Extract content from individual pages with precision. Optimized for single-page extraction with CSS selectors, metadata, and multiple output formats.

Single-page focus, CSS selectors, Fast response, Structured data

Search

POST /search

Perform search engine queries and automatically crawl the results. Combine search discovery with content extraction in a single step.

Search + crawl, Result limits, Query control, Content extraction

Screenshot

POST /screenshot

Capture high-quality screenshots of any web page. Full-page or viewport captures returned as base64 or binary with configurable format and quality.

Full-page capture, Base64 & binary, Quality control, CDP params
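
When a screenshot comes back as base64, it has to be decoded before it can be written to disk. The sketch below shows only that decoding step; which response field carries the image is not stated on this page, so `decode_screenshot` takes the base64 string directly and the sample data is a stand-in, not real API output.

```python
import base64


def decode_screenshot(b64_content: str) -> bytes:
    """Decode a base64-encoded screenshot into raw image bytes."""
    # validate=True rejects characters outside the base64 alphabet
    # instead of silently discarding them.
    return base64.b64decode(b64_content, validate=True)


# Stand-in for a response field: the 8-byte PNG signature, base64-encoded.
sample = base64.b64encode(b"\x89PNG\r\n\x1a\n").decode("ascii")
image_bytes = decode_screenshot(sample)

# To save a real capture:
# with open("page.png", "wb") as f:
#     f.write(image_bytes)
```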

Transform

POST /transform

Convert raw HTML into clean markdown, plain text, or sanitized HTML. Process content offline without re-fetching pages from the web.

HTML to markdown, Content cleaning, Readability mode, Batch transform

Unblocker

POST /unblocker

Access content behind anti-bot protections and challenging security measures. Advanced fingerprinting and session management for protected sites.

Anti-bot bypass, Fingerprinting, Session handling, High success rate

AI Extraction

POST /pipeline/*

Extract structured data using AI-powered pipelines. Pull contacts, generate Q&A pairs, label websites, and filter links with built-in intelligence.

Contact extraction, Q&A generation, Website labeling, Link filtering

Links

POST /links

Collect all links from a website without extracting page content. Optimized for sitemap generation, link analysis, and URL discovery at lower cost.

URL discovery, Low latency, Cost efficient, Sitemap building
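
To illustrate the sitemap use case, here is a rough sketch that turns a list of discovered URLs into sitemap XML. It assumes the links have already been collected into a plain Python list; the shape of the actual `/links` response is not described on this page, and `build_sitemap` is a hypothetical helper, not part of any SDK.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls: list[str]) -> str:
    """Return sitemap XML for the given URLs, deduplicated and sorted."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for u in sorted(set(urls)):
        # Each entry is <url><loc>...</loc></url>.
        loc = ET.SubElement(ET.SubElement(urlset, "url"), "loc")
        loc.text = u
    return ET.tostring(urlset, encoding="unicode")


links = [
    "https://example.com/b",
    "https://example.com/a",
    "https://example.com/b",  # duplicate, dropped by set()
]
sitemap_xml = build_sitemap(links)
```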

Fetch (Alpha)

POST /fetch/{domain}/{path}

Per-website fetch APIs with AI-discovered configurations. Configs are discovered once, validated, cached, and reused. Browse available endpoints in the directory.

AI config discovery, CSS selectors, Auto-caching, Endpoint directory
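
As a sketch of how the `/fetch/{domain}/{path}` template might be filled in from a target URL: the mapping below assumes `{domain}` is the URL's host and `{path}` is its path without the leading slash. That mapping is a guess based on the template alone, and `fetch_endpoint` is a hypothetical helper; the endpoint directory is the place to confirm which paths actually exist.

```python
from urllib.parse import quote, urlsplit


def fetch_endpoint(target_url: str) -> str:
    """Map a target URL onto the /fetch/{domain}/{path} template."""
    parts = urlsplit(target_url)
    # Assumption: {domain} is the host, {path} is the URL path
    # with its leading slash removed.
    path = parts.path.lstrip("/")
    # quote() percent-encodes unsafe characters but keeps "/" intact.
    return f"/fetch/{quote(parts.netloc)}/{quote(path)}"


endpoint = fetch_endpoint("https://example.com/products/42")
```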

Shared Across All Endpoints

Proxy Support

Residential, mobile, and ISP proxies with geo-routing to 100+ countries

Streaming

JSONL streaming responses so you can process results as they arrive
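
JSONL puts one JSON object on each line, so a client can parse results line by line instead of waiting for the full body. The sketch below reads from an in-memory stream standing in for a response; the `url` and `status` fields are hypothetical and only illustrate the shape of a record.

```python
import io
import json
from typing import IO, Any, Iterator


def iter_jsonl(stream: IO[str]) -> Iterator[dict[str, Any]]:
    """Yield one parsed object per non-empty line of a JSONL stream."""
    for line in stream:
        line = line.strip()
        if line:  # skip blank lines between records
            yield json.loads(line)


# Simulated response body; field names are hypothetical.
body = io.StringIO(
    '{"url": "https://example.com/", "status": 200}\n'
    "\n"
    '{"url": "https://example.com/about", "status": 200}\n'
)
pages = [page["url"] for page in iter_jsonl(body)]
```

Because `iter_jsonl` is a generator, each record can be handled as soon as its line arrives, which keeps memory use flat on large crawls.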

Caching

Built-in HTTP caching with configurable TTL to reduce redundant requests

Webhooks

Async delivery of results to your endpoint when crawls complete

Rate Limiting

Automatic concurrency management to respect target site limits

Auth Header

Simple Bearer token authentication on every request

Multi-Format

Responses as JSON, XML, CSV, or JSONL, depending on your needs

Client Libraries

Official SDKs for Python, JavaScript, and Rust, plus a CLI

Start building with Spider

Get your API key and make your first request now.