Skip to content

Tracking LLM Traffic in GA4: A Quick Guide

Many of my clients have recently noticed a rise in direct traffic and an increase in leads from AI tools and large language models (LLMs) like ChatGPT, Bard, and Perplexity. While the source of this traffic often appears as “direct” in analytics, it’s increasingly likely that some of it originates from LLM-based user behavior.

To get better visibility, I set up a custom way to track LLM traffic in Google Analytics 4 (GA4).

3 Ways to Track AI / LLM Traffic

You can try any of these approaches depending on your workflow:

  1. Use regex filters in the User Acquisition report by source/medium
  2. Build a Looker Studio dashboard
  3. Create a custom GA4 report in the Explore section (this is what I recommend)

How to Set Up a GA4 Explore Report for LLM Traffic

Here’s a quick step-by-step using GA4’s Exploration tool:

  1. Go to Explore > Free Form Exploration
  2. Remove all existing segments
  3. Create a new session segment with this condition:
    Session source/medium matches regex Suggested Regex:
    ^.*(ai|\.openai|copilot|chatgpt|gemini|gpt|neeva|writesonic|chatsonic|nimble|outrider|perplexity|google.*bard|bard.*google|bard|edgeservices|astastic|copy\.ai|bnngpt|claude|anthropic|mistral|mixtral|llama[23]?|huggingface|huggingchat|cohere|you\.com|youchat|pi\.ai|inflection|poe\.com|poe|replika|jasper|forefront|quora|deepmind).*

    Feel free to tweak this based on the LLM tools your audience uses.
  4. Name your segment: “AI/LLM Traffic”
  5. Remove all existing dimensions and add:
    • Dimension: Session source/medium
    • Row: Session source/medium

You’ll now be able to see a clearer view of LLM-originating sessions within GA4.

Why This Matters

With LLMs becoming the layer between users and traditional search, attribution is evolving. Monitoring this emerging traffic source helps identify what content gets surfaced by LLMs — even when it’s not directly ranking on Google.

Got a Better Method?

If you’re tracking LLM visibility in a more reliable or scalable way, let me know. This space is changing fast — and we’re all still figuring it out.

Leave a Reply

Your email address will not be published. Required fields are marked *