Buying Recall in AI Search

A research blog from the builder’s desk

Preface: this gets exponentially nerdy below. Bring coffee.

How I ended up here

This started in my headphones. I was listening to a deep dive on Google’s history and the economics of search. The reminder landed hard: search is the best software business ever built because a single query fans out to millions of advertisers competing for intent. The auction converts that competition into money without wrecking usefulness.

Now the interface is changing. People ask natural questions and expect a composed answer or a next step, not a page of links. That isn’t just a UI swing. Under the hood, modern systems assemble answers from a small working set of sources and tools. If something never enters that set, it cannot shape the answer or the follow-up. That observation is what kicked off this post.

I wanted to write down a clean, technical way to finance answer-first search that keeps the answer trustworthy. The idea is simple: sell bounded admission probability into the model’s working set and sell one clearly labeled follow-up. Nothing touches the model’s voice.


Why AI search matters right now

User behavior shifted. When questions are narrow (“is this adapter compatible”, “what size fits under this airline seat”), people prefer an answer that cites sources and lets them finish the task. They stop scanning ten tabs.

Publishers noticed. If the answer is composed from your content but doesn’t send traffic, the old bargain breaks. Source attribution and which sources are allowed into the working set suddenly determine who gets paid.

Advertisers noticed. Keyword targeting is less visible in an answer box. The predictable lever is no longer “bid on this exact phrase” but “be allowed into the set the model uses” and “be the one follow-up that launches the tool.”

New players smell opportunity. Answer engines and AI modes are shipping fast. Incumbents are threading a needle: keep answers useful, show citations, let ads appear without quietly editing the response. Startups are experimenting with sponsored follow-ups that feel like product steps, not banners.

All roads lead to the same bottleneck: what gets into the grounding set and what the user is asked to do next.


A quick mental model

The product goal is to reduce uncertainty and move the task forward:

maxG,F 𝔼[U(A(q;G),Next(q;F))]

subject to latency and trust. U(·) is a learned proxy for user value.

Two places are fair to charge for value:

  1. Admission to G - a bounded nudge that lets high-quality items get considered.
  2. One sponsored follow-up - a labeled, optional steer that launches a useful action.

We’ll also allow a single sponsored citation block in the answer, clearly marked and only alongside organic citations. That’s visibility, not authorship.

Goal: let strong items buy a bounded boost in recall probability into G(q). Money should never be able to force a weak or off-intent item into the answer.

Define:

Eligibility and cap

An item may enter the sponsored pool only if

s(q,d)+βg(bd)h(ι(q),d)  θandu^(dq)  τ,   ...(1)

with a strict cap |Cs(q)|ρK. In parallel, the organic pool Co(q) contains the top-K items by s(q,d) that also pass u^τ.

How sponsored items compete

Inside Cs, rank by a quality-adjusted score

score(d)=bd·u^(dq)γ,0<γ1,   ...(2)

and merge the top of Cs into G under the cap ρ. No item, paid or organic, enters G if it fails the floor u^τ.

Plain-English read of the math

What triggers a charge

For retrieval, the right unit is CPC-ground: charge only when d is in G and the composer actually uses it (detected by citation or span attribution). You pay for impact, not exposure.

How the price is set

Use generalized second price adapted to the quality-weighted order. If advertiser i sits above j, the per-event price is

Pricei=bju^jγu^iγ.   ...(3)

Add a reserve to protect utility when a sponsored item displaces the K-th organic candidate o:

Reserve(q,i)=α·(u^(oq)u^(iq))+·p^use(iq),   ...(4)

where p^use estimates the chance i genuinely influences the answer. Final price is max{GSP,Reserve}. If i is as useful as o, the reserve adds nothing.

One sponsored follow-up: selling a steer, not speech

The system proposes a few follow-ups F(q). We allow one sponsored option, picked by

score(f)=bf·p^engage(fq)γ,   ...(5)

and priced via the same GSP logic as (3) with p^engage in place of u^, charged on CPC-engage. If the follow-up launches a tool with a measurable outcome, support CPA-tool. The answer remains editorial; the steer is transparent and optional.


Flowcharts

Figure 1 - Query to answer with the two places we charge

flowchart LR
  Q[Query & context] --> I{Predict intent}
  I --> R[Retrieve candidates]
  R --> Co[Organic pool (top-K, û ≥ τ)]
  ADB[Targets & bids] --> ELG[Eligibility gate]
  I --> ELG
  R --> ELG
  ELG --> Cs[Sponsored pool]
  Co --> M[Merge under cap ρ]
  Cs --> M
  M --> G[Grounding set G]
  G --> LLM[Compose answer]
  LLM --> ANS[Answer + citations]
  I --> FGEN[Generate follow-ups]
  ADB --> FGEN
  FGEN --> SFF[Pick 1 sponsored follow-up]
  ANS --> OUT[User view]
  SFF --> OUT

Figure 2 — Sponsored retrieval: ranking and pricing

flowchart LR
  subgraph Ads
    A1[Doc d1, bid b1]; A2[Doc d2, bid b2]; A3[Doc d3, bid b3]
  end
  A1 --> U[Predict û(d|q)]
  A2 --> U
  A3 --> U
  U --> SCORE[score = b * û^γ]
  SCORE --> RANK[Order within Cs]
  RANK --> GSP[Price vs next score]
  RANK --> RSV[Reserve from Δû vs organic Kth]
  GSP --> PRICE[max(GSP, Reserve)]
  RSV --> PRICE
  PRICE --> PICK[Admit ≤ ρK if û ≥ τ]

Figure 3 — What fires a CPC-ground charge

flowchart LR
  G[Grounding set] --> TRACE[Attribution: spans/citations]
  TRACE --> USED{Used in answer?}
  USED -->|Yes| BILL[CPC-ground]
  USED -->|No| NOCHG[No charge]

Worked examples

A. Enterprise comparison

Query: “compare enterprise LLM gateways with per-request policy and cost controls.” Parameters: K=12, cap ρ=0.2 so at most 2 sponsored items, γ=0.5, floor τ=0.50. Three vendors bid with u^={0.62, 0.55, 0.44} and b = {$4.00, $5.50, $8.00}

Scores from (2):

Eligibility by (1): suppose θ=2.5. All three clear the gate, but vendor 3 fails the floor u^<τ (0.44 < 0.50) and drops out. Two eligible remain: vendor 2 ahead of vendor 1.

Price for the winner (2) via (3):

Price2=s10.55=3.14960.7416$4.25,

charged on CPC-ground only when the evidence is actually used. Vendor 1 pays against the next competitor or the reserve in (4).

What to notice The quality floor vetoed the biggest bid. Money helps good candidates get considered; it cannot smuggle in weak ones.


B. Local service steer

Intent: book a technician. Two follow-ups compete with bids b={$1.10, $1.60} and predicted engagement p^={0.23, 0.18}. Take γ=0.6.

Scores from (5):

Second wins. GSP price:

Price=0.45540.180.6=0.45540.3574$1.27,

charged on CPC-engage when clicked.


C. Shopping with a sponsored citation

Query: “ultralight daypacks under 20L that fit under an airline seat.” The retriever proposes K=12. Two brands pass eligibility and the floor, so with ρ=0.2 both may be in G. The answer summarizes volumes, weights, and seat-fit rules, cites three organic sources and one clearly labeled sponsored source, and offers a single sponsored follow-up that opens a size-guide tool. Charges: CPC-ground for admitted brand docs when used, CPC-cite for the sponsored citation click, and CPA-tool for running the guide.


What’s happening in the market, mapped to this model

None of this requires exotic theory. It’s the classic sponsored-search playbook, moved from “slot on a page” to “slot in the working set.”


What each side gets


Parameters worth starting with

K              = 12                       # organic candidates
ρ (rho)        = 0.20                     # cap: ≤ 20% of G may be sponsored
τ (tau)        = 0.50                     # minimum predicted utility
θ (theta)      = per-intent gate
β (beta)       = per-intent bid weight
γ (gamma)      = 0.5                      # squashing in ranking
g(bid)         = log(1 + bid)             # diminishing returns
h(intent,d)    = [0..1]                   # intent match
Units          = CPC-ground, CPC-engage, optional CPA-tool

Closing

The old game sold position on a page. The new game sells bounded presence in the model’s working set and one transparent steer. Keep the floor high, keep the cap small, keep the labels obvious. If we hold that line, answer-first search can stay useful and still fund itself.

Thanks for reading. If you made it this far, you earned the right to dunk on my choice of γ.