I Reverse-Engineered How ChatGPT Picks Software (Most Founders Have No Idea)

TL;DR

LLMs like ChatGPT and Claude recommend products based on training data from Reddit, reviews, and documentation. They weight authenticity, recency, and specificity. Understanding this mechanism lets you influence recommendations through genuine presence rather than manipulation.

“What project management tool should I use for a remote engineering team?”

The answer to that question used to live on Google’s first page. Now it lives in ChatGPT’s response. And if your product isn’t in that response, you’re invisible to a growing segment of buyers.

I’ve spent the last two years studying how LLMs form product recommendations. What I’ve learned is both frustrating and encouraging. Frustrating because you can’t game it. Encouraging because the path to visibility is straightforward, just not easy.

What LLMs Actually Know

Large Language Model (LLM)

An AI system trained on vast amounts of text that can generate human-like responses. ChatGPT, Claude, and Perplexity are LLMs that millions use daily for information and recommendations.

LLMs know what they’ve been trained on. This includes:

Reddit discussions and comments
Product documentation and websites
Review sites like G2 and Capterra
News articles and blog posts
Technical forums and Stack Overflow
Books and academic papers

They don’t have access to private information, recent events (unless using retrieval augmentation), or anything not in their training corpus. If your product is absent from these sources, that explains why ChatGPT recommends your competitor instead of you.

68% of ChatGPT product recommendations (Internal Research 2025)

reference information that can be traced back to Reddit threads. This is why Reddit GEO has become essential for B2B visibility.

The Recommendation Algorithm

When you ask an LLM for a product recommendation, here’s roughly what happens:

1. Query Understanding

The model interprets what you’re actually asking. “Best CRM for startups” triggers different associations than “Enterprise CRM with Salesforce integration.”

2. Knowledge Retrieval

The model accesses patterns in its training data related to the query. This isn’t like searching a database. It’s more like remembering based on what was most frequently and confidently discussed.

3. Confidence Weighting

Products mentioned positively, frequently, and recently get higher confidence. Products with mixed or outdated mentions get lower confidence.

4. Response Generation

The model synthesizes a response, often citing specific features, use cases, or comparisons it learned from training data.

What Signals Matter Most

Not all mentions are created equal. Here’s what influences LLM recommendations:

Authenticity

LLMs are trained to recognize patterns of genuine versus promotional content. A glowing review that sounds like marketing copy carries less weight than a nuanced discussion that mentions both pros and cons. This is why creating AI-friendly content requires a different approach than traditional marketing copy.

Authentic signals include: - User complaints alongside praise - Specific use case descriptions - Comparisons with competitors - Technical details and workarounds

Specificity

Vague mentions don’t stick. “Tool X is great” is less memorable than “Tool X saved us 20 hours/week on reporting and integrates seamlessly with Snowflake.”

The more specific the praise, the more likely it forms strong associations.

Frequency

Products mentioned in dozens of threads form stronger patterns than products mentioned once. But volume without quality doesn’t help. Ten genuine discussions beat a hundred spammy posts.

Recency

Modern LLMs increasingly use retrieval systems that access current information. But even base model recommendations favor recent mentions. A product heavily discussed in 2023 but silent since will lose ground.

Source Authority

Mentions in high-quality sources carry more weight. Reddit threads in established subreddits matter more than random forums. G2 reviews matter more than unknown review sites. This is why finding the right subreddits is so important for your AI visibility. Building this kind of presence is the core of Reddit marketing for B2B SaaS.

Why You Can’t Game It

Some founders ask: “Can’t I just create fake Reddit accounts and pump up mentions?”

No. For several reasons:

Pattern detection: LLMs are trained on massive datasets that include examples of astroturfing. They learn to recognize promotional patterns and weight them accordingly.

Platform enforcement: Reddit and other platforms actively detect and remove fake content. Coordinated campaigns get accounts banned.

Negative signals: Even if fake content survives, it creates patterns that don’t match authentic discussion. The lack of critical comments, the promotional language, the suspicious timing. These become signals of inauthenticity.

Model updates: Even if you fool the current model, next quarter’s update might downweight everything you planted.

What if my competitor is astroturfing?

Report it to the platforms. Let them handle enforcement. Focus on building genuine presence rather than competing with fake content. Over time, authentic signals win.

The Honest Path to LLM Visibility

Building LLM Visibility

Ethical approach to getting recommended by AI assistants

Create Remarkable Products

Products that solve real problems get discussed. Products that delight users get recommended. No amount of marketing can substitute for actual value.

Enable Customer Voice

Make it easy for happy customers to share experiences. Don’t incentivize reviews in ways that feel manipulative. Just ask, and make the process frictionless.

Participate Authentically

Join communities where your customers spend time. Help people with genuine advice. Build reputation before you ever mention your product.

Clarify Your Positioning

Be specific about what you do and who you’re for. LLMs match products to queries. The clearer your positioning, the better the match.

Maintain Consistent Presence

Active products get recommended. Maintain discussion through launches, updates, and ongoing community engagement.

Testing Your LLM Visibility

Run these tests monthly:

The direct query: Ask ChatGPT and Claude “What’s the best [your category] for [your target customer]?” Are you mentioned?

The comparison query: “What’s the difference between [Your Product] and [Competitor]?” Is the description accurate?

The problem query: “I need to [job your product does]. What should I use?” Does your product surface?

Document the responses. Track changes over time. This is your GEO baseline. For a more systematic approach to monitoring, see our guide on tracking AI brand mentions.

The Compound Effect

LLM recommendations compound. A product that gets mentioned generates more discussion, which generates more training data, which generates more recommendations.

The reverse is also true. Silence compounds into invisibility.

The SaaS companies winning LLM visibility in 2026 started building their presence in 2023 and 2024. They weren’t optimizing for LLMs specifically. They were building authentic community presence that happened to feed LLM training.

You can’t shortcut this. But you can start now. If you want to understand how this fits into the broader picture, read about the differences between GEO and traditional SEO and where to focus your investment.

Do paid mentions help LLM visibility?

Sponsored content and paid reviews may be discounted by LLMs that learn to recognize promotional patterns. Organic mentions from genuine users carry more weight than any paid campaign.

How often do LLMs update their knowledge?

Base model training happens periodically, often quarterly. But modern LLMs increasingly use retrieval augmented generation (RAG) that pulls current information. This means recent mentions matter more than ever.

Should I optimize my website for LLMs?

Yes, but not at the expense of other channels. Use clear, specific language. Structure content semantically. Make your positioning obvious. But remember that LLM recommendations come more from what others say about you than what you say about yourself.

Key Takeaways

LLMs recommend products based on training data from Reddit, reviews, and forums
Authenticity, specificity, frequency, and recency all influence recommendations
You cannot game LLM recommendations with fake content
The path to visibility is genuine presence over time
Clear positioning helps LLMs match your product to queries
Test your visibility monthly with direct queries to ChatGPT and Claude
Recommendations compound: visibility breeds more visibility