Discover every URL on a site.
Same depth and limit controls as /crawl, but Spider skips content extraction — cheaper and faster when URLs are all you need.
Discover. Filter. Extract.
Use Links to discover URLs first, then selectively scrape only what you need.
Links API
Get all URLs from a domain. Fast and cheap — no content extraction overhead.
→ 1,024 URLs foundYour code
Filter URLs programmatically. Keep only blog posts, product pages, or whatever you need.
→ 47 blog posts matchedScrape API
Scrape only the filtered URLs. Pay for content extraction only where it matters.
→ 47 pages as markdownWhen to pick which.
Links endpoint
- URL discovery
- Faster response times
- Lower cost per page
- No page content
- No metadata
Crawl endpoint
- URL discovery
- Full page content
- Metadata extraction
- MD / HTML / text formats
- Higher cost per page
What you can tune.
Lower cost per URL
By skipping content extraction and format conversion, the Links endpoint uses fewer credits per page. Ideal for large-scale discovery.
Faster response
Without rendering and extracting content, Links returns results faster. Lower latency means quicker iteration on discovery workflows.
Full crawl parameters
Use the same depth, limit, subdomains, and TLD controls as crawl. All the same knobs for scoping discovery.
Subdomain discovery
Enable subdomains to discover URLs across all subdomains. Map the full structure of organizations with complex web presence.
External domain linking
Track outbound links to external domains using external_domains. Analyze a site's link profile and relationships.
Streaming output
Stream discovered URLs as they're found using JSONL content type. Build real-time pipelines that process URLs as the crawl progresses.
cURL, Python, Node.
from spider import Spider
client = Spider()
# Get all URLs from a website
links = client.links(
"https://example.com",
params={
"limit": 0, # No limit - discover everything
"subdomains": True,
},
)
for link in links:
print(link["url"])
print(f"Total: {len(links)} URLs found")Where teams reach for it.
Sitemap generation
Build comprehensive sitemaps by discovering every URL on a website. Find pages missing from the existing sitemap.xml.
SEO link auditing
Map internal link structure to identify orphan pages, broken links, and opportunities to improve site architecture.
Pre-crawl discovery
Discover URLs first, filter programmatically, then scrape only pages you need. More efficient than crawling everything.
Change detection
Periodically collect links to detect new pages, removed pages, or URL structure changes across a domain.
More from the API.
Map any website's URL structure.
Discover every page on a website, fast and cost-efficient.