Skip to main content gottem  — one API for every scraper.
International
Verified

Zhihu Scraper

Extract content, articles, and data from Zhihu. Built on spider-browser .

Get started Docs
target
zhihu.com
success rate
99.9%
latency
~4ms
Quick start

Extract data in minutes.

zhihu-com-scraper.ts
import { Spider } from "@spider-cloud/spider-client";

const spider = new Spider({ apiKey: process.env.SPIDER_API_KEY! });

const result = await spider.scrapeUrl("https://www.zhihu.com", {
  return_format: "markdown",
});

console.log(result);
ready to run · spider-browser · TypeScript
Fetch API

One endpoint for zhihu.com.

Structured JSON from zhihu.com with a single POST. AI-resolved selectors, cached on the first call.

POST /fetch/zhihu.com/
TitleContentDateSource
cURL
curl -X POST https://api.spider.cloud/fetch/zhihu.com/ \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"return_format": "json"}'
Python
import requests

resp = requests.post(
    "https://api.spider.cloud/fetch/zhihu.com/",
    headers={
        "Authorization": "Bearer YOUR_API_KEY",
        "Content-Type": "application/json",
    },
    json={"return_format": "json"},
)
print(resp.json())
Node.js
const resp = await fetch("https://api.spider.cloud/fetch/zhihu.com/", {
  method: "POST",
  headers: {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json",
  },
  body: JSON.stringify({ return_format: "json" }),
});
const data = await resp.json();
console.log(data);
Extraction

Fields you can pull.

TitleContentDateSource
Geo-Proxy

Regional access

Access geo-restricted content on zhihu.com via local proxies.

Rendering

Multi-language SPA

Handle internationalized React/Vue storefronts with dynamic content.

Scale

Global coverage

Scrape marketplace listings across multiple countries and currencies.

Related

More International scrapers.

Start

Start scraping zhihu.com.

Grab an API key and call the endpoint above. The first request resolves the config; every request after hits cache.