ChatGPT Search
Quick facts
- Operator
- OpenAI
- Founded
- 2024
- Docs
- https://help.openai.com/en/articles/9237897-chatgpt-search
- Engine class
- Retrieval-augmented chat — the model retrieves the live web only when it judges retrieval is needed, not on every turn
- Retrieval default
- Conditional / on-demand — opposite of Perplexity's always-on retrieval
- Crawlers
- OAI-SearchBot (Search visibility) · ChatGPT-User (user-triggered fetch) · GPTBot (training only — not Search)
- Citation behavior
- Inline, hoverable, clickable source links — present but sparser and less prominent than Perplexity
- GEO significance
- Largest-distribution AI search surface; the load-bearing lever is OAI-SearchBot access, not anti-training blocking
Crawler user-agents
- OAI-SearchBot
- ChatGPT-User
- GPTBot
1. What ChatGPT Search is
ChatGPT Search is the web-retrieval capability inside ChatGPT — the path where ChatGPT fetches the live web and answers with inline, clickable source links instead of answering from model memory alone (see Introducing ChatGPT search).
In the generative engine taxonomy, ChatGPT Search is the retrieval-augmented chat class — where retrieval is conditional (the model decides per query whether to search) rather than the default path. That single property makes it the deliberate counterpart to Perplexity AI, the answer-engine-native class where retrieval is always on. The two entries are paired calibration samples for the same axis.
Three different things get conflated — keep them apart:
| Name | What it is |
|---|---|
| ChatGPT Search | The web-retrieval engine capability inside ChatGPT (incl. former “SearchGPT mode”) |
| OAI-SearchBot / ChatGPT-User / GPTBot | Three distinct user-agents — see AI crawlers and (training side) GPTBot |
| OpenAI (the company) | The corporate entity, models, API business — see OpenAI |
SearchGPT was a July 2024 prototype, folded into ChatGPT Search at GA — it is not a separate product (see §6). The company, model lineage, and API commercial terms are routed out to OpenAI so the entries do not overlap.
Why it is a P0 platform: ChatGPT Search is the largest-distribution AI search surface — it rides ChatGPT’s user base, so being cited here has the widest reach of any engine in this collection.
2. How it works
ChatGPT Search is an instance of the general answer loop — query understanding → retrieve-or-not decision → live web retrieval → grounding/selection → LLM synthesis → citation backfill. This section gives only the platform-specific deltas.
| Platform-specific trait | What it changes for GEO |
|---|---|
| Retrieval is conditional, not default | The model decides per query whether to search — being eligible is necessary but the query must also trigger a fetch |
| Retrieval lands in the chat context | Sources enter a multi-turn conversation — coverage compounds across follow-ups, not one isolated answer |
| Conversational synthesis | Answers are prose-first; citations are attached, not the spine — fewer, less prominent than Perplexity |
| Query rewriting | The user’s natural-language turn is reformulated before retrieval — match the rewritten intent, not the literal phrasing |
| ”SearchGPT mode” | A retrieval-first entry point into the same capability, not a separate engine |
The selection step prefers passages that are retrievable, structurally clean, and directly quotable. Because retrieval is conditional, query coverage is itself a lever here — you must be a strong candidate for the timely, specific questions that make the model choose to search. That is why this engine pushes writing for AI citation to the front: the liftable, on-topic chunk wins the conditional fetch.
3. Crawlers and user-agents
OpenAI operates three documented user-agents with completely different jobs. Conflating them is the single most common GEO mistake on this engine, so this entry resolves the distinction here rather than routing it away.
| User-agent | Official purpose | robots.txt | Triggers |
|---|---|---|---|
OAI-SearchBot | Surfaces and links sites in ChatGPT’s search features | Respects robots.txt — disallow it and you are not surfaced in Search | Background search crawl |
ChatGPT-User | Visits a page for a specific user action in ChatGPT / Custom GPTs | User-initiated, so robots.txt rules may not apply; not used to decide Search inclusion | A live user’s question needs that page |
GPTBot | Crawls content that may be used to train foundation models | Respects robots.txt — purely a training opt-out, unrelated to Search | Background training crawl |
The load-bearing GEO fact. OpenAI documents each setting as independent of the others. Blocking
GPTBot— the default “stop AI training” move — does not remove you from ChatGPT Search. Only disallowingOAI-SearchBotdoes that. Many sites opt out of training and unintentionally believe they have left Search; they have not, and the reverse mistake (blockingOAI-SearchBotwhile meaning to block training) silently removes them from Search.
All three publish IP-range JSON endpoints for allow-list verification (searchbot.json,
gptbot.json, chatgpt-user.json; see Overview of OpenAI Crawlers
and the Publishers and Developers FAQ).
Admission, verification, audit — see AI crawlers. Training-crawler
ethics and opt-out — see GPTBot.
4. Citation preferences
This is the load-bearing GEO section. Because retrieval is conditional and citations are sparser, what gets cited when a search does fire is high-leverage and scarce.
| Frequently cited | Frequently skipped | The signal it implies |
|---|---|---|
| Structurally clean pages with clear headings | JavaScript-dependent content the fetch can’t render | Server-side render; be retrievable — see AI crawlers |
| Concrete facts, numbers, dates | Vague marketing prose with no liftable claim | Fact density — see GEO |
| Self-contained, directly quotable passages | Content that only makes sense in full-page context | Chunk independence — see writing for AI citation |
| Recent, dated material on timely questions | Stale or undated pages | Freshness — and being the source a search-triggering query needs |
| Authoritative domains for the topic | Login-walled or paywalled bodies | Source authority and open access |
The contrast with other classes is one line: ChatGPT Search ships fewer and less prominent citations per answer than answer-engine-native Perplexity AI, and behaves differently again from SERP-embedded Google AI Overviews or Bing Copilot. Lower, conditional citation density means each cited slot is scarcer here — so structural liftability plus covering the queries that trigger a search both matter.
5. API and integration
ChatGPT Search has no Sonar-style consumer search API. The programmatic approximation is the
OpenAI API web search tool (web_search / web_search_preview, in the Responses API), which
gives a model live web access and returns the sources behind the answer.
| Returned field | Contents |
|---|---|
| message content | The synthesized answer (with inline citations by default) |
url_citation annotations | Per-citation objects: url, title, start_index, end_index, type |
sources | The full list of URLs the model consulted — usually larger than the cited set |
OpenAI requires that “inline citations must be made clearly visible and clickable” when displaying
results (see Web search guide). One
caveat for GEO: the API web-search tool is not a 1:1 mirror of consumer ChatGPT Search ranking —
it is the closest measurable proxy, not the product surface itself. The point is that
url_citation / sources make “is my content being cited?” an automatable query, which is why
this anchors AI citation tracking.
6. History and timeline
Only GEO-relevant milestones — retrieval, citation, or visibility mechanics — are recorded here. Model-version history and the company’s commercial story are in OpenAI; the training-crawler debate is in GPTBot.
| Date | Milestone | Why it matters for GEO |
|---|---|---|
| Jul 2024 | SearchGPT prototype | First public OpenAI search surface — established “search ≠ training” doctrine |
| Oct 2024 | ChatGPT Search GA | SearchGPT folded into ChatGPT; inline clickable citations become standard |
| Dec 2024 | Free logged-in rollout | Distribution widens — Search reach is no longer Plus-gated |
| Feb 2025 | Logged-out, no sign-up | Maximum-distribution surface — anyone on chatgpt.com gets cited answers |
| Apr 2025 | Shopping in Search | Product/commerce queries enter the cited surface — structured product data matters |
| Oct 2025 | ChatGPT Atlas browser | A ChatGPT-native browser — being cited starts to replace the traditional click |
(Dates from OpenAI blog posts and Search Engine Land / Cybernews reporting; SearchGPT has no standalone retirement notice — the prototype page plus the GA post are the canonical record.)
7. Measured citation behavior
Be honest about scope here. The foundational GEO benchmark
(Aggarwal et al., KDD ‘24) used an internal harness and Perplexity.ai
as its live-engine baseline — not ChatGPT Search. There is no academic benchmark that uses this
engine as its primary live baseline, which is why this entry’s relatedPapers is intentionally
empty: we do not internal-link a paper that did not test this engine.
What that leaves:
- Read the cross-engine evidence at its source. The benchmarked live-engine numbers live in the Perplexity AI entry; the same content-substance rewrite behaves differently here and should not be extrapolated across engines.
- Treat citation behavior as a variable, not a constant. Conditional retrieval means the trigger rate is itself an unknown — whether a query searches at all varies — on top of which sources get cited if it does.
- Use the direction, not a number. There is no defensible “ChatGPT Search lifts visibility by X%” claim; resist inventing one. The discipline is continuous measurement, not assumption.
That measurement discipline — query a sample, extract the cited set, track your share over time —
is exactly AI citation tracking, and the url_citation
proxy in §5 is what makes it automatable on this engine.
8. Optimizing for ChatGPT Search
These are ChatGPT-Search-specific priorities — not the full GEO workflow, which lives in GEO and the playbooks.
| Tactic | Why it bites harder on ChatGPT Search | Governing entry |
|---|---|---|
Do not mis-block OAI-SearchBot (audit robots.txt / WAF) | A wrong anti-training rule silently removes you from Search | AI crawlers |
| Server-side rendering, crawlable HTML | The conditional fetch can’t cite what it can’t render | AI crawlers |
| Self-contained, quotable chunks | Fewer citation slots — only the most liftable passages win | writing for AI citation |
| High fact / number / date density | Conditional retrieval favors concrete, attributable claims | GEO |
| Cover the timely, specific queries that trigger a search | If the query never searches, citability is moot | GEO |
| Track your cited share via the API web-search proxy | Citations are extractable — measure, don’t guess | AI citation tracking |
The crawler-audit row is first because it is the highest-frequency, highest-cost mistake unique to this engine.
9. Why ChatGPT Search matters for GEO
ChatGPT Search is the engine with the widest reach but the sparsest, conditional crediting. Its GEO value is scale × difficulty: the largest audience to be cited in front of, and the easiest place to score zero exposure through a single mis-configured crawler rule. That is why it is the deliberate counterpart to Perplexity AI — same axis, opposite end.
| Engine trait | The GEO lever it amplifies (or suppresses) | Governing entry |
|---|---|---|
| Largest-distribution surface | Reach — citation here has the widest audience | GEO |
| Conditional retrieval | Query coverage — you must trigger the search, not just be eligible | GEO |
| Three-bot split (OAI-SearchBot ≠ GPTBot) | Crawl access — the highest-cost, easiest-to-get-wrong control | AI crawlers · GPTBot |
| Sparser, conditional citations | Structural liftability — scarce slots raise the bar | writing for AI citation |
ChatGPT Search is the retrieval-augmented chat instance in its highest-distribution form. Model the engine correctly — conditional retrieval, the right crawler, scarce citation slots — and you have the widest reach in GEO; model it wrong, and the largest surface returns nothing.
References
Official OpenAI documentation (as of 2026-05):
- Introducing ChatGPT search (2024-10-31) · ChatGPT search — Help Center
- SearchGPT prototype (2024-07-25)
- Overview of OpenAI Crawlers · Publishers and Developers FAQ
- Web search — OpenAI API guide
- Shopping research in ChatGPT (2025-04-28) · Introducing ChatGPT Atlas (2025-10-21)
Industry:
- Search Engine Land — OpenAI’s SearchGPT to be integrated into ChatGPT
- Cybernews — OpenAI ChatGPT search available to all logged-out users (2025-02-05)
Frequently asked questions
Is ChatGPT Search the same as ChatGPT?
If I block GPTBot, will I disappear from ChatGPT Search?
Does ChatGPT Search cite sources like Perplexity?
How do I get cited in ChatGPT Search?
Is SearchGPT still a thing?
Related
Sources
Primary
- Introducing ChatGPT search · OpenAI · 2024-10-31
- ChatGPT search (Help Center) · OpenAI
- SearchGPT prototype · OpenAI · 2024-07-25
- Overview of OpenAI Crawlers · OpenAI
- Publishers and Developers FAQ · OpenAI
- Web search (OpenAI API guide) · OpenAI
- Shopping research in ChatGPT · OpenAI · 2025-04-28
- Introducing ChatGPT Atlas · OpenAI · 2025-10-21
Secondary
- OpenAI SearchGPT to be integrated into ChatGPT · Search Engine Land
- OpenAI ChatGPT search available to all logged-out users · Cybernews