The Research Endpoint With Flat Pricing

One API call. Finished research. Flat rate.
Never again guess your bill.

FlatSearch is the search infrastructure behind VCP Scanner, a financial equity research platform. You're accessing production capacity at wholesale rates.

Save up to 92%
No Card Required
Made for AI Agents
The Problem

Building web research into your agent is a full-time job.

Your agent hits a search API, gets 10 blue links, scrapes each one, strips the HTML, stuffs it into a prompt, runs the LLM, parses the output. You maintain the scraping logic, handle rate limits, deal with blocked pages, and pay for every token of that chain.

At scale, that's not an integration. It's an entire microservice you didn't budget for.

The Solution

We do the research. You get the answer.

FlatSearch is a managed research sub-agent. Send a question, and we scrape live sources, strip the noise, cross-reference findings, and return a structured JSON answer with cited sources. One API call replaces your scraping loop, content cleaning, and synthesis prompt.

No scraping logic. No token math. Just $1.00 per 1,000 queries.

Two endpoints. One decision: speed or depth.

Your chatbot needs a quick answer, or your agent needs a vetted report. Pick the one that fits.

Fast Search

/v1/search/fast
$1.00per 1,000 queries

Your chatbot asks a question, we return a cited answer in ~1 second. Built for customer support bots, autocomplete layers, and any agent that needs web context without building a scraping pipeline.

  • ~1 second response time
  • 3 parallel source scrapes per query
  • High-speed synthesis engine
Reasoning Engine

Deep Search

/v1/search/deep
$2.50per 1,000 queries

Hand your agent a complex question — we research it across multiple sources, cross-reference the findings, and return a vetted report. Built for agents where being wrong costs more than being slow.

  • 15–25 second reasoning loop
  • 5 sources + targeted follow-up scrapes
  • Cross-referenced, validated output

Start free. Scale when you're ready.

No lock-in on PAYG. Overages on Pro are billed at standard flat rates — never blocked.

Hacker
Free

Prototyping and side projects.

  • 100 free queries/mo (Fast Search)
  • Shared proxy pool access
  • Rate limit: 2 req/s
  • Discord support channel
Start Free — 100 queries/mo
Most Popular
PAYG
$15min top-up

Pay only for what you use. Credits never expire.

  • Billed at standard flat rates
  • Credits persist indefinitely
  • Rate limit: 25 req/s
  • Standard residential proxy pools
Get Started
Best Value
Startup Pro
$49/ month

Teams in production. Includes $40 in credits.

  • Includes $40 in built-in credits
  • Equiv. to 40k Fast or 16k Deep queries
  • Overages billed at standard flat rates
  • Prioritized proxy routing subnets
  • High rate limits (50 req/s)
Subscribe Now
Enterprise
Custom

Dedicated nodes, SLAs, 24/7 support.

  • Unlimited custom query capacity
  • Dedicated Search Engine nodes
  • Custom IP routing & blocklists
  • 99.9% service uptime guarantee
  • 24/7 dedicated support pager
Contact Sales

Developers running 61,000 queries/month save ~$671.

At that volume, traditional variable token APIs cost an estimated $732/month. FlatSearch costs $61. That's not a rounding error — that's a different product category.

Estimated Monthly Queries:61,000
Select Endpoint Mode:
FlatSearch Cost (Flat):$61.00
Variable Token API (Estimated):$732.00
Saves you approx. 92% ($671/mo)

Tiers Comparison Matrix

Feature SpecificationsTier 1: Fast SearchTier 2: Deep Search
Retail Pricing$1.00 per 1,000 queries$2.50 per 1,000 queries
Synthesis LLM EngineHigh-speed synthesis engineAdvanced reasoning engine
Execution Loop PipelineSingle-pass context summarizationMulti-stage retry reasoning loop
Query FormulationGenerates 1 to 3 concurrent engine checksGenerates 5 initial + targeted secondary refetches
Proxy Routing NetworkRotating household residential subnetsPrioritized high-reputation subnets
Ingress Content StrippingBoilerplates, navigation nodes, URL BlocklistDeep index cross-checking & content gate
Standard Rate Limits10 requests per second25 requests per second

Every other search API charges you for the tokens we eat.

We got burned by Tavily and Perplexity's token multipliers running our own platform. Variable pricing at our query volume was destroying margin. We built flat pricing because we needed it ourselves.

API ProviderCore Billing StrategyTrue Cost Per 1k Queries
FlatSearch (Fast Search)Flat rate wholesale pricing. Zero input/output token tracking.$1.00 (Flat)
FlatSearch (Deep Search)Flat rate agentic research. Self-critique iterations included.$2.50 (Flat)
Perplexity SonarFlat query fee ($6-$14/1k) + variable token usage (input/output/citations).~$13.50+ (Variable)
Tavily APIAdvanced extractions consume up to 250 credits per query.~$6.40 (Starter Plan)
Exa AICredits charged per search link + supplementary fee for summaries.$7.00 - $15.00
FirecrawlCapped monthly limits. Targeted AI scrapes cost 5x credits.$3.20 - $8.30

Frequently Asked Questions

Why not just use Google Custom Search + my own LLM?

You can. But then you're maintaining a scraping pipeline, handling rate limits, dealing with blocked pages, cleaning HTML, managing context windows, and paying per-token for the LLM synthesis. We bundle all of that into one API call at a flat rate. Most developers find the engineering time alone isn't worth it.

Why residential proxies? Why does that matter?

Search engines immediately block cloud server IPs with CAPTCHAs. Residential IPs look like real users browsing from home — which they are. This is what lets us achieve 99.9% successful ingestion on queries instead of hitting walls.

What happens if I go over my plan's credits?

We don't cut you off. Overages continue at the standard flat rate and come out of your account balance. No service interruption, no penalty pricing.

Do I only get a summary, or do I get the raw links too?

You get both. Every response includes the synthesized markdown answer AND the raw sources array containing the top 10-25 URLs, titles, and snippets. Use the finished answer for speed, or parse the raw sources if your agent needs total control.