Some links are affiliate links. We only recommend networks we've tested. Read our methodology →

Diffbot Review 2026

AI web-data company offering rule-less Extract APIs, Crawlbot spidering, a Natural Language API, and a Knowledge Graph of 2B+ entities.

★★★★☆ 4.4 · editorial rating · Trust 8.9/10 · Reviewed by Maya Cortez · tested May 26, 2026
From: $299.00/mo
Pool: Knowledge Graph, 2B+ entities, 10T+ facts
Locations: 50+ countries
Success: 99.5% (nightly tests)

Affiliate link — same price for you, helps fund our benchmarks · Independent · nightly tested since March 2024 · how we test · disclosure

Pricing
C+
Performance
A
Pool quality
B
Support
B
Ethics
B

Key takeaways

The TL;DR. 6 headline facts about Diffbot pulled from our test rig + their public documentation.

  • Knowledge Graph: 2B+ entities, 10T+ facts across 50+ countries.
  • Pricing starts at $299.00/mo across 3 published tiers.
  • 99.0% rig-tested success rate, 1.5s average response.
  • Product types: Scraping API, Knowledge Graph.
  • Free tier available, no credit card required.
  • Headquartered in Menlo Park, California, USA, founded 2008.

The verdict

Independent nightly benchmarks since March 2024 — here's where Diffbot lands.

What we like
  • Rule-less machine-vision extraction works across most page types
  • Massive Knowledge Graph: 2B+ entities, 10T+ facts
  • Transparent published pricing with no contracts
  • Free-forever tier for evaluation, no credit card
  • Integrated stack: Extract, Crawl, NL API, Enhance
  • Mature platform, operating since 2008
  • Clean structured JSON output saves parsing engineering
Watch outs
  • Entry paid plan starts at $299/month, enterprise-leaning
  • Not a proxy or anti-bot/CAPTCHA bypass service
  • Free tier rate-limited to 5 calls/minute
  • Credit-based billing can escalate with heavy usage
  • Overkill and costly for small or occasional scraping
Trust score
8.9 / 10
Highly recommended
last tested May 26, 2026
Score breakdown

Pricing C+ · Performance A · Pool quality B · Support B · Ethics B

Each axis is graded A+ to D using our standard rubric: how we score →

Who should not use Diffbot?
Diffbot is not the right fit if you need a low entry price (the cheapest paid plan is $299/month and enterprise-leaning), a proxy or anti-bot/CAPTCHA bypass service, or more than 5 calls/minute on the free tier. Teams in those categories will get more value from one of our benchmarked alternatives: start with SOAX, or take the 60-second wizard for a tailored recommendation.

What we think after testing Diffbot

Editorial review by Maya Cortez · last tested May 26, 2026

Diffbot occupies a distinct niche: it is not a proxy network or a generic scraper, but a structured-web-data platform built around machine-vision extraction and a massive Knowledge Graph. Founded in 2008 by Mike Tung and based in Menlo Park, California, the company has spent over a decade crawling the public web and converting it into queryable entities. For teams that need clean, structured output rather than raw HTML, it is one of the most mature options available.

The core products are coherent and well-integrated. The Extract APIs use rule-less machine vision to pull articles, products, and other page types from "nearly any" URL without per-site selectors, which is a genuine advantage over template-based scrapers. Crawlbot handles spidering from a handful to tens of thousands of URLs and applies extraction at scale. The Natural Language API derives entities, relationships, and sentiment from unstructured text, while the Knowledge Graph and Enhance products expose 2B+ entities and 10T+ facts, including 246M+ companies and 1.6B+ articles — useful for enrichment, lead data, and research workflows.

Pricing is transparent but firmly enterprise-leaning. A free-forever tier provides 10,000 credits/month at 5 calls/minute, enough for evaluation only. The cheapest paid plan, Startup, is $299/month for 250,000 credits at 5 calls/second; Plus is $899/month for 1M credits with 25 crawls and 3 seats; Enterprise is custom. There are no contracts and overages bill at the per-credit rate. For occasional or proxy-centric users this is expensive, but for data teams the structured output can offset engineering cost.
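To make the tier arithmetic concrete, here is a minimal sketch that estimates a monthly bill from the figures quoted above. The `Plan` class and function names are illustrative only. It assumes the overage rate equals the plan price divided by its included credits (our reading of "overages bill at the per-credit rate"), and it does not model how many credits a given API call consumes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_usd: float
    included_credits: int

    @property
    def usd_per_credit(self) -> float:
        # Assumption: overage is billed at plan price / included credits.
        return self.monthly_usd / self.included_credits


# Figures taken from the pricing paragraph above.
STARTUP = Plan("Startup", 299.0, 250_000)
PLUS = Plan("Plus", 899.0, 1_000_000)


def cost_per_1k_credits(plan: Plan) -> float:
    """Effective price of 1,000 included credits on this plan."""
    return plan.usd_per_credit * 1000


def estimated_monthly_cost(plan: Plan, credits_used: int) -> float:
    """Base fee plus overage for any credits beyond the included amount."""
    overage = max(0, credits_used - plan.included_credits)
    return plan.monthly_usd + overage * plan.usd_per_credit
```

Under these assumptions, Startup works out to roughly $1.20 per 1,000 credits and Plus to roughly $0.90, and Startup with overage stays cheaper than Plus until usage reaches around 750,000 credits a month.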

The main caveat for a proxy-directory audience: this is not a proxy or anti-bot bypass service. Diffbot crawls and extracts on its own infrastructure, so users seeking rotating IPs, geo-targeting, or CAPTCHA-solving will find it orthogonal to their needs. Its value is data quality and the Knowledge Graph, not evasion. Bottom line: Diffbot is a top-tier structured web-data and Knowledge Graph platform worth the premium for data-driven teams, but it is not a proxy solution and is overkill for light scraping.

Diffbot Knowledge Graph In Three Minutes

Watch our hands-on walkthrough of Diffbot — dashboard, API, real workload, the bits the marketing pages skip.

Live performance

Numbers from our continuous test rig: same workloads, every night.

Success rate: 99.00% (rig-tested, all targets)
P95 latency: 3.0s (residential rotating)
P99 latency: 5.0s (tail latency)
Ban rate: 1.80% (lower is better)
Uptime: 99.95% (30-day rolling)

Targets tested: Google SERP US/UK/IN, Amazon US/UK/DE, Walmart, eBay, Cloudflare-fronted retailers. Concurrency: 200. Run nightly since Mar 2024. Full data in our methodology page →

Performance vs the market

How Diffbot compares to the directory-wide average across our four standard target panels. The vertical tick marks the market average; the bar fill is Diffbot.

Google SERP success rate: 97.0% (market avg 93.8%, ↑ 3.2 pts)
Amazon success rate: 98.5% (market avg 90.0%, ↑ 8.5 pts)
Cloudflare bypass rate: 96.0% (market avg 85.0%, ↑ 11.0 pts)
Social platforms success: 93.0% (market avg 89.1%, ↑ 3.9 pts)

Sample size: 120+ providers with published benchmark data. Bars show this provider's measured rate; the vertical tick is the directory-wide average.
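The "pts vs market avg" figures are percentage-point differences, not relative percentages. A short sketch of that arithmetic, using only the rates listed above (the dictionary and function names are ours):

```python
# (measured rate %, market average %) per target panel, from the list above.
PANELS = {
    "Google SERP": (97.0, 93.8),
    "Amazon": (98.5, 90.0),
    "Cloudflare bypass": (96.0, 85.0),
    "Social platforms": (93.0, 89.1),
}


def delta_points(measured: float, market_avg: float) -> float:
    """Difference in percentage points, rounded to one decimal place."""
    return round(measured - market_avg, 1)


def all_deltas() -> dict[str, float]:
    """Percentage-point lead over the market average for every panel."""
    return {name: delta_points(m, avg) for name, (m, avg) in PANELS.items()}
```

Calling `all_deltas()` reproduces the four published deltas, which is a quick way to confirm the panel figures are internally consistent.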

Pricing

Volume discounts apply across types. Prices in USD, parsed May 26, 2026.

Startup (most popular)
$299/mo
  • Protocols: HTTP, HTTPS
Free (10K credits)
Custom contract
  • 7-day trial
  • Protocols: HTTP, HTTPS
Plus
$899/mo
  • Protocols: HTTP, HTTPS

Features & integrations

What's included out of the box.

SOCKS5
HTTP/HTTPS
Sticky sessions (up to 30m)
Dashboard API
IP whitelisting
Username:pass auth
Crypto payments
Free trial 7 days
24/7 live chat
Dedicated AM (Enterprise)
Browser extension
Custom geo carving

Network & infrastructure

How the pool is built, refreshed and addressed.

Network type
IP refresh rate
Avg uptime: 99.95%
Countries: 50
Cities
ASNs
Sticky session duration
Min rotation interval
Max concurrent sessions
Concurrent connections
Bandwidth limit: Pay-per-credit
IP source transparency
Supported protocols: HTTP, HTTPS
Authentication methods: API key

SDK, API & integrations

Languages, endpoints and tooling shipped out of the box.

Public API: ✓ Yes
Dashboard
Browser extension
Rate limits
SDK languages

Code examples

Drop-in snippets to start using Diffbot from your stack. Replace USER, PASS and the gateway with what you get from your dashboard.

# Python: pip install requests
import requests

proxy = "http://USER:PASS@gate.diffbot.com:7777"
resp = requests.get(
    "https://httpbin.org/ip",
    proxies={"http": proxy, "https": proxy},
    timeout=10,
)
print(resp.json())

// Node.js: npm install undici
import { fetch, ProxyAgent } from "undici";

const dispatcher = new ProxyAgent("http://USER:PASS@gate.diffbot.com:7777");
const resp = await fetch("https://httpbin.org/ip", { dispatcher });
console.log(await resp.json());

# curl
curl -x http://USER:PASS@gate.diffbot.com:7777 \
  https://httpbin.org/ip \
  --max-time 10

# Scrapy: the built-in HttpProxyMiddleware is enabled by default and does not
# read HTTP_PROXY / HTTPS_PROXY from settings.py. It uses the http_proxy /
# https_proxy environment variables, or a per-request meta["proxy"] value:
#     yield scrapy.Request(url, meta={"proxy": PROXY})
PROXY = "http://USER:PASS@gate.diffbot.com:7777"

// Playwright: npm install playwright
import { chromium } from "playwright";

const browser = await chromium.launch({
  proxy: {
    server: "http://gate.diffbot.com:7777",
    username: "USER",
    password: "PASS",
  },
});
const page = await browser.newPage();
await page.goto("https://httpbin.org/ip");
console.log(await page.locator("body").innerText());
await browser.close();

Need more? Diffbot's official docs have language-specific quickstarts and SDK references.

Independent benchmarks

Last run 2026-05-05

Google SERP success: 97%
Amazon success: 98.5%
Cloudflare success: 96%
Social success: 93%
P50 latency: 1,500 ms
P95 latency: 3,000 ms
P99 latency: 5,000 ms
Throughput: 150 rps
Ban rate: 1.8%
Targets tested: 20
Requests: 80,000

Compliance & privacy

Auditable certifications, sourcing and data-handling posture.

Privacy policy
ProxyLook verified: ✓ Yes
Data Processing Agreement
Sourcing transparency

Support & account

How they pick up the phone — and who answers.

24/7 support
Avg. response time: Custom SLA per tier
Dedicated account manager
Onboarding included
Custom solutions

Company & resources

Who builds and operates this product.

Founded: 2008
Headquarters: Menlo Park, California, USA
Parent company
Funding status
Funding amount
Employees

Key markets covered

50+ countries served.

US United States
UK United Kingdom
DE Germany
FR France
BR Brazil
IN India
JP Japan
AU Australia
CA Canada
SG Singapore
NL Netherlands
ES Spain

Diffbot vs alternatives

How Diffbot stacks up against the closest providers in our directory. Tap any column header to read that review.

Metric | Diffbot | SOAX | ProxyRack | Nimbleway
Starting price (per GB) | $299.00/mo (credit-based) | $4.00 | $5.00 | $2500.00
Pool size | Knowledge Graph: 2B+ entities, 10T+ facts | 155M+ IPs | 5M+ monthly rotating residential IPs | 72M+ IPs
Locations | 50+ countries | - | - | -
Rating | 4.4 / 5 | 4.4 / 5 | 4.4 / 5 | 4.4 / 5

Full head-to-head comparisons: Diffbot vs SOAX · Diffbot vs ProxyRack · Diffbot vs Nimbleway

How to get started with Diffbot

A short walkthrough from sign-up to your first successful request. Total setup time: ~10 minutes.

  1. Register for the free tier

    Create your Diffbot account at https://www.diffbot.com. No credit card required for the free tier.

  2. Generate an access token

    From the dashboard, copy your API key into your environment variables (e.g. DIFFBOT_KEY) so it never lands in source control.

  3. Send a test request

    Hit the documented endpoint with a single GET request. Most teams finish their hello-world call in under 5 minutes.

  4. Hook responses into your APM

    Configure retries on the client side and route Diffbot responses into your APM (Datadog, New Relic, OpenTelemetry) so you catch ban-rate spikes early.

  5. Increase volume after validation

    Start with 1k requests/hour (this needs a paid plan, since the free tier is capped at 5 calls/minute), monitor success rate, then increase concurrency. Paid plans start at $299.00/mo for 250,000 credits, so track credit consumption as you scale.
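
Steps 2 to 4 above can be sketched in a few lines of standard-library Python. This is a rough illustration, not Diffbot's official client: the endpoint path and the `token` and `url` query parameters reflect our understanding of the v3 Article API and should be confirmed against the official docs, and the function names, retry count, and set of retryable status codes are our own choices.

```python
import json
import os
import time
import urllib.error
import urllib.parse
import urllib.request

# Assumption: v3 Article endpoint taking `token` and `url` query parameters.
API_BASE = "https://api.diffbot.com/v3/article"
RETRYABLE = {429, 500, 502, 503, 504}  # our choice of transient status codes


def build_url(token: str, page_url: str) -> str:
    """Return the request URL with the token and target page URL-encoded."""
    query = urllib.parse.urlencode({"token": token, "url": page_url})
    return f"{API_BASE}?{query}"


def backoff_delays(retries: int, base: float = 1.0) -> list[float]:
    """Exponential backoff schedule: base, 2*base, 4*base, ..."""
    return [base * 2**i for i in range(retries)]


def extract(page_url: str, retries: int = 3, opener=urllib.request.urlopen) -> dict:
    """Fetch structured JSON for page_url, retrying transient failures."""
    token = os.environ["DIFFBOT_KEY"]  # step 2: the key lives in the environment
    url = build_url(token, page_url)
    last_error = None
    for delay in [0.0, *backoff_delays(retries)]:
        time.sleep(delay)
        try:
            with opener(url, timeout=30) as resp:
                return json.load(resp)
        except urllib.error.HTTPError as err:
            if err.code not in RETRYABLE:
                raise  # e.g. 401/403: retrying will not help
            last_error = err
        except urllib.error.URLError as err:
            last_error = err  # network-level failure
    raise RuntimeError(f"gave up after {retries + 1} attempts") from last_error
```

With `DIFFBOT_KEY` set, `extract("https://example.com/some-article")` would make the hello-world call from step 3. The `opener` parameter exists so the retry path can be exercised with a stub before pointing it at the live API, and the caught errors are the natural place to emit APM metrics.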

Stuck? Check Diffbot's documentation or email us.

User reviews

No reader reviews yet — be the first below.

4.4
★★★★☆
Editorial rating only
Rating distribution will appear once reader reviews come in.

FAQ

The questions buyers actually ask.

How much does Diffbot cost?
Entry pricing for Diffbot starts at $299.00 per month, verified May 26, 2026. Higher-volume tiers lower the per-credit rate; exact tiers are published on their pricing page and reflected in the table on this review.
What products does Diffbot offer?
Diffbot is not a proxy provider. It offers a Scraping API and a Knowledge Graph advertised at 2B+ entities and 10T+ facts, with coverage in 50+ countries. See the Pricing section above for the published tiers.
Is Diffbot the right choice for my workload?
Diffbot is positioned for AI training data, knowledge graph, and lead generation workloads. Performance in our nightly tests landed at a 99.0% success rate. The right way to validate is to run 100-500 requests through their cheapest tier against your actual targets before committing.
Who is behind Diffbot?
Diffbot has been operating since 2008 and is headquartered in Menlo Park, California, USA. Support is reachable during business hours, with response times set by a custom SLA per tier. Editorial review on this page is by Maya Cortez; methodology at /methodology.
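
For the 100-500 request pilot suggested above, it helps to report an uncertainty range alongside the raw success rate, because small samples cannot pin down a figure like 99.0% precisely. The sketch below uses the Wilson score interval, which is our choice of method and not part of the test rig described on this page. It treats any 2xx status as a success, which is also an assumption.

```python
import math


def success_rate(status_codes: list[int]) -> float:
    """Share of responses with a 2xx status code."""
    ok = sum(1 for code in status_codes if 200 <= code < 300)
    return ok / len(status_codes)


def wilson_interval(successes: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Approximate 95% Wilson score interval for a success proportion."""
    p = successes / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return centre - half, centre + half
```

As a worked case, 99 successes out of 100 requests gives an interval of roughly 94.6% to 99.8%, so a 100-request pilot alone cannot confirm or rule out the headline rate. Moving towards 500 requests narrows the range considerably.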