Why ChatGPT Is Not Recommending My Website

If your competitors appear in ChatGPT answers but your website does not, the gap is rarely random. ChatGPT relies on a mix of training data, live web browsing, and citation signals to decide which sources to mention. The tool above analyses your domain against the main factors that influence ChatGPT source selection and returns a prioritised diagnosis so you know exactly where to act.

Enter your domain in the field above to run the check.

How ChatGPT selects sources

ChatGPT does not search the web for every query. Its base model was trained on a large corpus of text and answers from that snapshot by default. When ChatGPT search (first previewed as the SearchGPT prototype) is active, it also fetches live pages. In both cases, the signals that push a brand into the answer are similar:

Authority and trust signals. Backlinks from well-known publications, Wikipedia mentions, presence in structured knowledge bases like Wikidata, and brand search volume all indicate that a domain is a reliable source worth citing.
Topical clarity. A site that consistently covers a specific subject is more likely to surface than a generalist domain. Thin or ambiguous content confuses language models and reduces citation probability.
Answer-ready structure. Pages that use clear headings, concise factual statements, and structured data (FAQ, Article, Organization schema) are easier for a language model to parse and quote.
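Crawl access. OpenAI documents separate crawlers for separate purposes: GPTBot collects pages that may be used for model training, and OAI-SearchBot indexes pages for ChatGPT search. If your robots.txt blocks OAI-SearchBot, your pages cannot be surfaced as cited sources in live search answers. If it blocks GPTBot, your content stays out of future training data.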
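The crawl-access check can be sketched in a few lines of Python using the standard library's robots.txt parser. This is a minimal illustration, not the logic the tool above runs: the function name, the sample robots.txt, and the example.com URL are assumptions, and Python's parser may not match OpenAI's crawlers on every edge case.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens named in the text above.
OPENAI_AGENTS = ("GPTBot", "OAI-SearchBot")


def blocked_agents(robots_txt: str, url: str = "https://example.com/") -> list[str]:
    """Return the OpenAI user-agents that this robots.txt text blocks for `url`.

    `url` is a placeholder; pass a real page from the site being checked.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in OPENAI_AGENTS if not parser.can_fetch(agent, url)]


# Hypothetical robots.txt: the wildcard rule blocks every crawler,
# and only OAI-SearchBot is given an explicit exception.
SAMPLE = """\
User-agent: OAI-SearchBot
Allow: /

User-agent: *
Disallow: /
"""

if __name__ == "__main__":
    # GPTBot has no group of its own, so it falls through to the wildcard rule.
    print(blocked_agents(SAMPLE))
```

The sample shows how a wildcard Disallow can block a crawler that is never mentioned by name, which is the most common accidental cause of a crawl-access gap.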

How to read the diagnosis and act

The tool above returns a list of signals ordered by impact. For each signal that shows a gap, here is the recommended action:

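Crawl access blocked. Update your robots.txt to allow OAI-SearchBot, which feeds ChatGPT search, and GPTBot if you also want your pages available for model training. Verify with the robots.txt checker that no wildcard rule accidentally covers OpenAI user-agents.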
Low external authority. Publish data-driven studies, secure guest posts on industry media, and aim for a Wikipedia or Wikidata entry. Mentions in sources like these are the ones most likely to register as authority signals.
Thin topical coverage. Build a content cluster around your main keyword category. Each page should answer one specific question thoroughly rather than covering many topics superficially.
Missing structured data. Add Organization schema with your official name, logo URL, and social profiles. Add FAQ schema to pages that answer common questions.
No social proof in public text. Testimonials, case studies, and press mentions published on indexable pages create trust signals the model can encounter during training refreshes.
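For the structured-data action, a short Python sketch can generate the two JSON-LD objects mentioned above. The property names follow the schema.org Organization and FAQPage types; the helper names and every value in the example are placeholders, not details of any real site.

```python
import json


def organization_jsonld(name: str, url: str, logo: str, same_as: list[str]) -> str:
    """Build Organization JSON-LD with official name, logo URL, and social profiles."""
    data = {
        "@context": "https://schema.org",
        "@type": "Organization",
        "name": name,
        "url": url,
        "logo": logo,
        "sameAs": same_as,  # social profile URLs
    }
    return json.dumps(data, indent=2)


def faq_jsonld(pairs: list[tuple[str, str]]) -> str:
    """Build FAQPage JSON-LD from (question, answer) pairs."""
    data = {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }
    return json.dumps(data, indent=2)


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(organization_jsonld(
        name="Example Co",
        url="https://example.com",
        logo="https://example.com/logo.png",
        same_as=["https://www.linkedin.com/company/example-co"],
    ))
    print(faq_jsonld([("What does Example Co do?", "It sells example widgets.")]))
```

Each printed string goes inside a script tag of type application/ld+json on the relevant page.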

A benchmark to calibrate urgency

Traffic from AI engines like ChatGPT and Perplexity converts at roughly 7% according to 2025 studies, compared with 2.6% for average organic traffic. The intent behind an AI-sourced visit is higher: the user already received a curated recommendation before clicking. Every month a brand is absent from ChatGPT answers is a month of high-intent traffic given to competitors. The tool above gives you the fastest path to closing that gap.
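To put the two rates side by side, here is a small worked calculation in Python. The 7% and 2.6% figures are the ones quoted above; the volume of 1,000 visits is an arbitrary assumption chosen only to make the comparison concrete.

```python
AI_RATE = 0.07        # conversion rate quoted above for AI-engine traffic
ORGANIC_RATE = 0.026  # conversion rate quoted above for average organic traffic


def expected_conversions(visits: int, rate: float) -> float:
    """Expected number of conversions for a given visit count and conversion rate."""
    return visits * rate


if __name__ == "__main__":
    visits = 1000  # hypothetical monthly visits per channel
    ai = expected_conversions(visits, AI_RATE)
    organic = expected_conversions(visits, ORGANIC_RATE)
    print(f"AI-sourced: about {ai:.0f} conversions per {visits} visits")
    print(f"Organic:    about {organic:.0f} conversions per {visits} visits")
    print(f"Ratio:      about {AI_RATE / ORGANIC_RATE:.1f}x")
```

At equal volume, the quoted rates imply roughly 70 conversions against 26, or about 2.7 times as many from AI-sourced visits.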

For ongoing monitoring of your brand's presence across ChatGPT, Gemini, and Perplexity, Sorank tracks AI citations automatically so you never miss a shift in your AI visibility.

Frequently asked questions

Does ChatGPT cite websites in real time or only from training data?

It depends on the mode. Without web search, ChatGPT replies from training data, which stops at a knowledge cutoff. When ChatGPT search (first previewed as SearchGPT) is active, it fetches live pages and can cite them. Optimising for both requires clean crawl access and strong authority signals.

Will blocking GPTBot in robots.txt prevent my site from appearing in ChatGPT?

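Not on its own. GPTBot controls whether your pages are collected for model training, while OAI-SearchBot controls whether they can be surfaced in ChatGPT search. If OAI-SearchBot is blocked, OpenAI's search systems cannot read your current pages, which greatly reduces the chance of a citation in real-time answers. If GPTBot is blocked, your content is left out of future training data instead.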

How long does it take to see results after fixing the issues?

For live browsing citations, improvements can appear within days of fixing crawl access. For training-data-based citations, changes propagate only when OpenAI refreshes its model, which can take months. Prioritise live-browsing optimisations first for the quickest impact.