NEW AI Studio is now available Try it now
Health

NIH Scraper

Extract research publications, health topics, clinical guidelines, and funding data from the National Institutes of Health. Powered by spider-browser .

Get Started Documentation
nih.gov target
99.5% success rate
~4ms latency
Quick Start

Extract data in minutes

nih-scraper.ts
import { SpiderBrowser } from "spider-browser";

const spider = new SpiderBrowser({
  apiKey: process.env.SPIDER_API_KEY!,
});

await spider.connect();
const page = spider.page!;
await page.goto("https://www.nih.gov/health-information");

const data = await page.evaluate(`(() => {
  const topics = [];
  document.querySelectorAll(".health-topic-card, .teaser").forEach(el => {
    const title = el.querySelector("h2, h3, a")?.textContent?.trim();
    const summary = el.querySelector("p")?.textContent?.trim();
    const link = el.querySelector("a")?.href;
    if (title) topics.push({ title, summary, link });
  });
  return JSON.stringify({ total: topics.length, topics: topics.slice(0, 15) });
})()`);

console.log(JSON.parse(data));
await spider.close();
✓ ready to run | spider-browser | TypeScript
Fetch API

Structured data endpoint

Extract structured JSON from nih.gov with a single POST request. AI-configured selectors, cached for fast repeat calls.

POST /fetch/nih.gov/
Topic titleDescriptionRelated conditionsResearch linksInstituteLast reviewed
curl
curl -X POST https://api.spider.cloud/fetch/nih.gov/ \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"return_format": "json"}'
Python
import requests

resp = requests.post(
    "https://api.spider.cloud/fetch/nih.gov/",
    headers={
        "Authorization": "Bearer YOUR_API_KEY",
        "Content-Type": "application/json",
    },
    json={"return_format": "json"},
)
print(resp.json())
Node.js
const resp = await fetch("https://api.spider.cloud/fetch/nih.gov/", {
  method: "POST",
  headers: {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json",
  },
  body: JSON.stringify({ return_format: "json" }),
});
const data = await resp.json();
console.log(data);
Extraction

Data you can extract

Topic titleDescriptionRelated conditionsResearch linksInstituteLast reviewed
Content

Medical data extraction

Extract drug info, conditions, and health articles from nih.gov.

Parsing

Structured health data

Clean extraction of dosage, interactions, and clinical information.

Scale

Bulk research

Process thousands of medical pages for research and comparison datasets.

Related

More Health scrapers

Start scraping nih.gov

Get your API key and start extracting data in minutes.