Open Beta · 17 algorithms · 12 MCP tools · 1,000 free requests/monthOpen Beta · Free to start

The search API
that thinks.

One API call returns search + extracted content + citations — drop it into your RAG pipeline or AI agent. No scrapers, no plumbing.

Get API Key — Free View Docs

Novel algorithms

17+

Search backends

1,100+

Tests passing

MCP tools

Why developers choose Fetchium

17 novel algorithms

not available anywhere else

12 MCP tools

search, fetch, research, YouTube, and social workflows

17+ search backends

federated in a single call

1,100+ unit tests

production-grade reliability

Built in Rust

zero GC pauses, safe concurrency

Zero telemetry

your queries stay private

A search API with federated backends, neural ranking, and cross-session learning built into a single retrieval pipeline.

See comparison

Built for every AI workflow

From real-time search to complex research agents, Fetchium provides the foundation for reliable AI output.

Explore use cases

RAG Pipelines

Feed your vector DB with up-to-the-minute web data.

Deep Research

Multi-step autonomous browsing for complex queries.

Social Monitoring

Track trends and discussion across Reddit and Hacker News.

Video Intel

Search and analyze YouTube videos and transcripts.

17 Novel Algorithms

Capabilities no other search API has

Fetchium combines federation, ranking, extraction, and research primitives into one API surface for AI search workloads.

Unique

11-Backend Search Federation

Fan out a single query across SearXNG, Brave, GitHub, Reddit, HackerNews, StackOverflow, YouTube, Wikipedia, ArXiv, Bing, and DuckDuckGo in parallel. Adaptive Backend Selector picks the right mix per query intent.

Novel

HyperFusion Neural Ranking

8-signal ranking engine: BM25 lexical match, semantic similarity, temporal decay, domain authority, evidence density, source diversity, content depth, and cross-source consensus.

Novel

5-Layer CEP Content Extraction

Content Extraction Protocol: CSS selectors, Readability, headless JS rendering, PDF parsing, and screenshot OCR. Designed to recover structured text from a wide range of pages, including SPAs and PDFs.

Novel

QATBE Token Budget Control

Query-Aware Token-Budgeted Extraction scores every content segment with BM25 then solves a greedy knapsack to pack maximum relevance into your exact LLM context window. You always get the most useful content — not just the first N characters.

Novel

Deep Research Pipeline

AMRS multi-agent research swarm: 4 specialist agent types communicate over async channels, synthesize findings, and generate evidence graphs with claims traced to sources. Citation export supports APA, IEEE, BibTeX, and Chicago.

Production

Production Resilience

Circuit breakers, bulkhead isolation, adaptive rate limiting, and backend-aware timeouts. Automatic failover and retry-and-refine validation help stabilize retrieval across heterogeneous sources.

Novel

YouTube & Social Intelligence

VideoFusion ranking for YouTube with transcript extraction and channel/video analysis. Native Reddit, HackerNews, and StackOverflow backends add community-signal retrieval alongside web search.

Novel

PIE Cross-Session Learning

Persistent Intelligence Engine tracks source trust, failure patterns, and query predictions across sessions via SQLite for deployments that enable persistence.

Unique

MCP Protocol Native

First-class Model Context Protocol support. Fetchium exposes 12 MCP tools spanning search, fetch, estimate, research, YouTube, and social workflows.

17+

Search backends

Novel algorithms

1,100+

Unit tests

MCP tools

Built for real workloads

What developers build with Fetchium

From quick RAG prototypes to production research pipelines — one API handles every retrieval workload.

RAG Pipelines & AI Agents

Drop Fetchium into custom RAG stacks and AI agents over HTTP or MCP. One call returns search results, extracted content, and citations ready for downstream retrieval or prompting.

HTTP APIMCPRAGAgents

Learn more

Deep Research Reports

The AMRS pipeline spawns 4 agent types in parallel, searches different angles, cross-validates findings, and assembles an evidence graph.

AMRSEvidence graphsMulti-agentCitations

Learn more

Content Monitoring & Diffs

Track pages, domains, or topics over time and compare fetched content across runs for change detection workflows.

DiffsMonitoringExtractionAnalysis

Learn more

YouTube Intelligence

Search, extract, and analyze YouTube content at scale. Get transcripts, metadata, engagement signals, and semantic summaries — all through the same unified API.

TranscriptsMetadataChannelsSentiment

Learn more

Social Intelligence

Pull structured data from Reddit and Hacker News alongside broader web retrieval to understand community signals around a topic.

RedditHackerNewsCommunityTrends

Learn more

Pipeline Architecture

How it works

Six stages. Search, extraction, ranking, and evidence tracing in one pipeline with an evidence graph.

Swipe to see pipeline

Query

Federation

Ranking

Extraction

Token Budget

Response

Query Analysis

Stage 1

QFD + QCE + QXE

Your query is fingerprinted, classified by intent, scored for complexity, and expanded with semantic variants. The system chooses the optimal backend mix before a single network call is made.

Multi-Backend Federation

Stage 2

ABS + Resilience Layer

The Adaptive Backend Selector fans your query across up to 17 sources in parallel — SearXNG, Brave, GitHub, Reddit, StackOverflow, YouTube, and more. Circuit breakers handle backend failures invisibly.

HyperFusion Ranking

Stage 3

8-Signal Neural Ranking

Results are scored on 8 signals: BM25 lexical match, semantic similarity, temporal freshness, domain authority, evidence density, source diversity, content depth, and cross-source consensus.

CEP Content Extraction

Stage 4

5-Layer Cascade

Top-ranked URLs are deep-extracted via the Content Extraction Protocol: CSS selectors, Readability, headless JS rendering, PDF parsing, and screenshot OCR.

Token Budget Control

Stage 5

QATBE Algorithm

Extracted content is segmented, BM25-scored for query relevance, then packed into your token budget via greedy knapsack. You always get the most relevant content that fits your LLM context window.

AI-Ready Response

Stage 6

Evidence Graph + Citations

The final response includes ranked results, extracted content within your budget, an evidence graph tracing every claim to a source, and auto-generated citations in APA, IEEE, BibTeX, or Chicago format.

Start building in 2 minutes

Get an API key, make your first search call, get back ranked results with extracted content and evidence graphs. Free plan — no credit card required.

Get API Key Free

Simple API

First result in 60 seconds

Install the SDK, paste your key, ship. Real multi-source search with zero boilerplate.

1import { Fetchium } from "@fetchium/sdk";
2
3const fetchium = new Fetchium({
4  apiKey: process.env.FETCHIUM_API_KEY!,
5  baseUrl: "https://api.fetchium.com",
6});
7
8// Multi-source federated search
9const results = await fetchium.search("rust async programming", {
10  backends: ["searxng", "brave", "github", "stackoverflow"],
11  maxResults: 10,
12  tier: "summary",       // key_facts | summary | detailed | complete
13  tokenBudget: 2000,     // QATBE greedy-knapsack packing
14  ranking: "hyperfusion" // 8-signal neural ranking
15});
16
17console.log(results.items[0].title);
18console.log(results.meta.tokensUsed);    // always within budget
19console.log(results.evidenceGraph);      // citations + trust scores
20
21// Deep-extract any URL
22const page = await fetchium.extract("https://docs.rs/tokio", {
23  format: "markdown",
24  tokenBudget: 4096,
25  layer: "readability",  // css | readability | headless | pdf | ocr
26});

$npm install @fetchium/sdk

API Response

200 OK187ms

Tokens

1984/2000

Backends

Ranked

Token budget usage99%

1Asynchronous Programming in Rust

brave

The async book covers futures, async/await syntax, Tokio runtime, and concurrent task management...

score

0.97

trust0.94

2tokio-rs/tokio — GitHub

github

A runtime for writing reliable, asynchronous, and slim applications. Stars: 28k...

score

0.93

trust0.98

3How does async/await work in Rust?

stackoverflow

Accepted answer (1.2k votes): Rust's async/await desugars into state machines at compile time...

score

0.88

trust0.91

Evidence Graph

10 nodes23 edgesconsensus 0.89

Capability Comparison

Fetchium capability shape at a glance.

This view focuses on first-party Fetchium capabilities and broad product-shape differences. It intentionally avoids hard benchmark and pricing claims for third-party services that can change independently of this repo.

Scroll horizontally to compare

Feature	Best Value Fetchium Full pipeline	Tavily AI agent search vs Fetchium	Exa Neural search vs Fetchium	SerpAPI SERP scraper vs Fetchium	Firecrawl Web extraction vs Fetchium
Multi-source federation 17+ simultaneous backends
Token budget control (QATBE)
5-layer content extraction (CEP)
8-signal neural ranking
Evidence graphs + citations
Cross-session learning (PIE)
Deep research pipeline (AMRS)
YouTube & social search
Real-time monitoring + diffs
MCP protocol support
Independent backend mix
Free tier (renewing)
Plan-based rate limits From current API auth configuration	60-2000/min	varies	varies	varies	varies
Free tier available 1,000 requests/month in current API auth configuration

Full support

Partial support

Not available

Fetchium values in this table are tied to the current codebase and auth configuration. Non-Fetchium entries are shown as broad capability comparisons only.

17+

federated backends

single-query fanout

algorithms

ranking, extraction, validation

Full pipeline means search, extraction, ranking, citations, and research workflows in one product surface.

vs Tavily vs Exa vs SerpAPI

Current API Tiers

Plans synced to the current auth configuration

Free tier limits are sourced from the API auth layer. Paid plan names and request ceilings below reflect the current codebase; contact sales for commercial pricing details.

Free

Explore the API, build a prototype, or run personal projects.

1,000 req / month

60 req / min

1,000 API requests per month
All 17 search backends
5-layer CEP content extraction
HyperFusion 8-signal ranking
Token budget management
Evidence graphs + citations
MCP protocol tools
Community support (Discord)

Start for Free

Starter

Contact

First paid tier in the current API configuration.

25,000 req / month

200 req / min

25,000 API requests per month
Everything in Free
YouTube intelligence API
Social media research
Async jobs + usage tracking
Admin key management
Usage dashboard

source: Current auth limits: 200 req/min and 25,000 req/month

Contact Sales

What is verified here

This section is intentionally conservative. Request quotas, rate limits, and free-tier availability are taken from the current API auth code. If you need a signed commercial quote, use the contact flow.

Contact sales

1,000/mo

Free

60 req/min

25,000/mo

Starter

200 req/min

250,000/mo

Pro

500 req/min

Unlimited

Enterprise

2,000 req/min

Source: current API auth configuration in the Fetchium codebase.

Questions about pricing? Talk to us or see the full pricing FAQ. All plans include all 17 algorithms, all backends, and evidence graphs — only scale differs.

Common Questions

Frequently asked questions

Everything you need to decide if Fetchium is right for your project.

Still have questions? Contact us or join our Discord community.

Free forever · No credit card required

Start building with Fetchium today

Join the open beta. 1,000 free API requests per month, all features included. Upgrade when you need more with higher-volume Starter, Pro, and Enterprise tiers.

Get API Key — Free Read the Quickstart Talk to Sales →

1,000

free req/mo

no expiry

25K

Starter quota

req / month

algorithms

all included

MCP tools

included

The search APIthat thinks.

Built for every AI workflow

RAG Pipelines

Deep Research

Social Monitoring

Video Intel

Capabilities no other search API has

11-Backend Search Federation

HyperFusion Neural Ranking

5-Layer CEP Content Extraction

QATBE Token Budget Control

Deep Research Pipeline

Production Resilience

YouTube & Social Intelligence

PIE Cross-Session Learning

MCP Protocol Native

What developers build with Fetchium

RAG Pipelines & AI Agents

Deep Research Reports

Content Monitoring & Diffs

YouTube Intelligence

Social Intelligence

How it works

Query Analysis

Multi-Backend Federation

HyperFusion Ranking

CEP Content Extraction

Token Budget Control

AI-Ready Response

First result in 60 seconds

Fetchium capability shape at a glance.

Plans synced to the current auth configuration

What is verified here

Frequently asked questions

Start building with Fetchium today

The search API
that thinks.