How to Track Your AI Visibility Score & Monitor Competitors
You can't improve what you don't measure. This guide shows you exactly what AI visibility metrics to track, how to set up monitoring, how to benchmark against competitors, and how to use alerts to catch problems before they cost you traffic.
You wouldn't run Google Ads without tracking conversions. You wouldn't post on Instagram without checking engagement. So why are you doing GEO without monitoring your AI visibility?
Most Shopify merchants who've started optimizing for AI search are doing it blind. They make changes, cross their fingers, and hope for the best. They have no idea whether their AI visibility is going up, going down, or standing still. They don't know which competitors are gaining ground. They don't know which AI engines mention them and which ignore them completely.
That's like driving with your dashboard lights off. You might be headed in the right direction, but you won't know until you either arrive or crash.
This guide gives you the complete monitoring framework. You'll learn exactly what to track, which metrics matter most, how to set up monitoring (both manual and automated), how to benchmark your performance against competitors, and how to configure alerts that catch problems before they cost you meaningful traffic and revenue.
What to Track: The AI Visibility Metrics That Matter
There are dozens of metrics you could track for AI visibility. Most of them are noise. Here are the metrics that actually matter for Shopify stores—the ones that correlate with revenue impact and give you actionable insights.
Core Metrics Glossary
| Metric | What It Measures | Why It Matters | How to Track |
|---|---|---|---|
| AI Visibility Score | Overall score (0-100) of how visible your brand is in AI search | Your north star metric—summarizes everything into one number you can track over time | Naridon dashboard or manual calculation |
| Mention Rate | % of tracked prompts where your brand is mentioned in AI responses | Shows how often AI recommends you—the most direct measure of AI visibility | Manual prompt testing or Naridon Mentions tab |
| Position | Where your brand appears in AI recommendation lists (1st, 2nd, 3rd, etc.) | 1st position gets 3-5x more clicks than 4th position; position is as important as appearing at all | Manual testing or Naridon Position tab |
| Sentiment | Whether AI describes your brand positively, neutrally, or negatively | Negative sentiment can destroy conversions even when you're mentioned—being recommended negatively is worse than not being mentioned | Naridon Sentiment tab or manual reading |
| Citation Share | Your % of AI recommendations vs. total recommendations in your category | Your market share of AI search—the competitive metric that matters most | Naridon Citations tab |
| Source Diversity | How many different AI engines mention you (out of 8 tracked) | Diversification protects against single-engine algorithm changes or de-ranking | Naridon or manual cross-engine testing |
| AI Referral Traffic | Visitors coming from AI engines (ChatGPT, Perplexity, etc.) | Direct revenue impact—this is the bottom-line metric | Google Analytics 4 referrer analysis + Naridon visitor stats |
Supporting Metrics (Track Monthly)
- Schema validation score: Percentage of product pages with valid, complete Product and FAQ schema
- Content completeness: Percentage of products with 150+ word descriptions, FAQ sections, and complete attributes
- Query coverage: The range of query types (category, comparison, price, use-case) where your brand appears
- Trend direction: Whether each metric is improving, stable, or declining over the past 4 weeks
How to Set Up Monitoring: Manual Method
If you want to start tracking AI visibility today without any tools, here's the complete manual process. It takes about 1-2 hours per week but gives you a solid baseline.
Step 1: Build Your Prompt List
Create a spreadsheet with 15-20 prompts real customers would type into ChatGPT to find products like yours. This is your monitoring prompt list—you'll test these same prompts every week.
- List your prompts with a mix of query types:
- Category prompts (5-7): "Best [your product category]" — e.g., "Best organic skincare brands" or "Best minimalist hoodies"
- Comparison prompts (3-4): "[Your brand] vs [competitor]" — e.g., "Acme vs Essentials hoodies" or "Is Acme Skincare better than CeraVe?"
- Use-case prompts (3-4): "Best [product] for [specific use case]" — e.g., "Best hoodie for cold weather commuting" or "Best moisturizer for dry office air"
- Price prompts (2-3): "Best [product] under $[price]" — e.g., "Best hoodies under $100" or "Affordable organic skincare"
- Brand lookup prompts (2-3): "Is [your brand] good?" or "What does [your brand] sell?" — Tests whether AI knows your brand at all
Step 2: Test Across AI Engines Weekly
- Open fresh chat sessions in ChatGPT, Perplexity, and Google (for AI Overviews)
- Test each of your 15-20 prompts in all three engines (that's 45-60 individual tests)
- For each prompt, record in your spreadsheet:
- Mentioned: Yes or No
- Position: 1st, 2nd, 3rd, or N/A
- Sentiment: Positive, Neutral, or Negative
- Competitors present: Which other brands appeared
- AI engine: Which engine this result was from
- Use fresh/incognito sessions to avoid personalized results affecting your data
Step 3: Calculate Your Baseline Metrics
From your spreadsheet data, calculate:
- Mention Rate: (Prompts where you appeared / Total prompts tested) × 100. If you appeared in 8 out of 60 tests, your mention rate is 13.3%.
- Average Position: Sum of all positions where you appeared / Number of appearances. If you appeared in positions 1, 3, 2, and 4, your average position is 2.5.
- Source Diversity: Number of different AI engines where you appeared at least once. Out of 3 engines tested, how many mention you?
- Sentiment Ratio: Positive mentions / Total mentions. If you appeared 8 times and 6 were positive, your sentiment ratio is 75%.
Step 4: Repeat Weekly and Track Trends
Consistency is critical. Test the same prompts every week, preferably at the same time of day. AI results change frequently—ChatGPT can give different recommendations on Tuesday than Thursday. Weekly snapshots give you trend data that's much more useful than any single test.
After 4 weeks, you'll have enough data to see meaningful trends: is your mention rate climbing, declining, or flat? Which prompts have improved? Which are stuck? This trend data tells you what's working and what isn't.
How to Set Up Monitoring: Automated Method with Naridon
Manual monitoring works but it doesn't scale. If you have 20 prompts across 3 AI engines, that's 60 individual tests per week. Add competitor tracking (3-5 competitors at 20 prompts each), and you're at 300+ tests per week. Add multiple languages, and it becomes a full-time job. Automation isn't a luxury—it's a necessity for serious GEO.
Naridon's Monitor dashboard automates all of this across 8 AI engines continuously. Here's what you get:
7 Monitor Tabs Explained
- Visibility tab: Your overall AI visibility score over time, broken down by AI engine (ChatGPT, Perplexity, Google AI Overview, Claude, Bing Copilot, DeepSeek, Grok, Brave Search). See your score trend over days, weeks, and months. Identify which engines you're strong on and which need work.
- Position tab: Where your brand ranks in AI recommendations for each tracked prompt. See your average position over time, identify prompts where your position is improving or declining, and discover new prompts where you've started appearing.
- Sentiment tab: How AI describes your brand across all engines. See specific quotes from AI recommendations, categorized as positive, neutral, or negative. Identify negative sentiment early before it spreads across engines.
- Citations tab: Which AI engines cite your store as a source, how often they link to your pages, and which specific pages get cited most. Citations are the AI equivalent of backlinks—they signal authority.
- Mentions tab: Every instance where your brand is mentioned in AI results across all tracked prompts and engines. See the full context of each mention, including what the AI said about you.
- Brands tab: Competitor tracking. See how every competitor in your category performs across the same metrics: visibility score, mention rate, position, sentiment. Identify who's gaining ground and who's losing it.
- Share tab: Your share of AI recommendations vs. total recommendations in your category. This is the competitive market share metric—the one that tells you whether you're winning or losing the AI search battle in your niche.
Setting Up Naridon Monitor
- Install Naridon from the Shopify App Store (one-click install, no code)
- Naridon automatically discovers relevant prompts for your product category by analyzing your catalog, your niche, and common AI search patterns
- Add competitors you want to track—Naridon suggests competitors based on your category and price tier, or you can add any brand manually
- Monitoring begins immediately after setup—first data appears within 24 hours
- Configure notification preferences in Settings to receive alerts via email or in-app
How to Benchmark Against Competitors
Your AI visibility score only means something in context. A 40% mention rate might be excellent if your best competitor is at 25%, or terrible if they're at 65%. Competitive benchmarking gives you that context and shows you where the opportunities are.
Competitor Benchmarking Process
- Identify 3-5 direct competitors: Choose brands that sell similar products at similar price points to similar audiences. Don't compare yourself to brands in a completely different tier or category. If you sell premium organic skincare at $40-80, compare against other premium organic skincare brands, not drugstore brands or ultra-luxury brands.
- Track the same prompts for all competitors: When you test "Best organic skincare brands," record every brand that appears in the AI response—not just whether you appear. Build a complete picture of who dominates your category in AI search.
- Calculate relative metrics for each competitor:
- Their mention rate vs. yours (if they're at 45% and you're at 15%, you have a 30-point gap to close)
- Their average position vs. yours (are they consistently #1 while you're #4?)
- Their sentiment vs. yours (are they described more positively?)
- Their source diversity vs. yours (do they appear in more AI engines?)
- Identify their content strengths: When a competitor consistently outranks you for certain query types, visit their product pages and analyze what content they have that you don't. Do they have FAQ schema you're missing? Better descriptions? More reviews? This reverse engineering tells you exactly what to fix.
- Look for gaps they're not covering: Some query types may have no dominant brand. These are your lowest-hanging fruit—optimize for uncovered queries first.
Competitive Benchmarking Table Template
Fill this table out monthly and track changes over time:
| Metric | Your Brand | Competitor A | Competitor B | Competitor C |
|---|---|---|---|---|
| AI Visibility Score | [Your score] | [Score] | [Score] | [Score] |
| Mention Rate | [%] | [%] | [%] | [%] |
| Avg Position | [#] | [#] | [#] | [#] |
| Sentiment | [+/0/-] | [+/0/-] | [+/0/-] | [+/0/-] |
| # Engines Mentioned In | [#/8] | [#/8] | [#/8] | [#/8] |
| Citation Share | [%] | [%] | [%] | [%] |
| Trend (vs. last month) | [up/flat/down] | [up/flat/down] | [up/flat/down] | [up/flat/down] |
Setting Up Alerts
You don't want to discover a visibility drop three weeks after it happens. By then, you've lost hundreds or thousands of potential AI-referred visitors. Alerts catch problems early so you can respond quickly.
The 5 Essential Alerts
- Visibility drop alert: Trigger when your AI visibility score drops more than 10 points in a single week. This usually indicates a technical issue (broken schema, blocked crawlers) or a major content change that hurt your rankings. Response: investigate immediately, check schema validation, verify robots.txt.
- Competitor gain alert: Trigger when a competitor's mention rate exceeds yours for the first time, or when a competitor gains more than 15 points in a month. This signals they've made significant GEO optimizations. Response: analyze their recent product page changes and content updates.
- Negative sentiment alert: Trigger when AI describes your brand negatively in any response. Even one negative mention can spread across engines if the underlying data source persists. Response: identify the source (bad review, negative press, product issue) and address it directly.
- New competitor alert: Trigger when a brand you haven't seen before starts appearing in your tracked prompts. New entrants can disrupt established positions quickly. Response: add them to your competitive tracking and analyze their content strategy.
- Schema error alert: Trigger when structured data validation fails on any product page. Schema errors can happen after theme updates, app installs, or Shopify platform changes. Response: fix immediately—broken schema is an instant visibility killer.
How to Set Up Alerts
Manual method: Create a calendar event for a weekly 30-minute block dedicated to prompt testing and comparison against the previous week. Flag any drops of 10%+ or notable competitor changes. This is low-tech but works for stores just starting with GEO monitoring.
With Naridon: Naridon sends automatic notifications when significant changes occur across any monitored metric. You configure alert thresholds (e.g., "notify me when visibility drops more than 10%") and notification channels (email, in-app) in the Settings panel. The platform checks continuously, not just weekly, so you catch issues within hours rather than days.
Interpreting Your Data: What to Do When Metrics Change
Data is only useful if you know how to act on it. Here are the most common scenarios and what to do about each one.
Scenario 1: Your Visibility Score Drops
- Check for schema errors first: Did a recent theme update, app install, or Shopify platform change break your Product or FAQ schema? Validate 5-10 product pages with Google Rich Results Test.
- Verify robots.txt: Make sure AI crawlers (GPTBot, PerplexityBot, ClaudeBot) are still allowed. Theme updates occasionally reset robots.txt to defaults that block crawlers.
- Check for competitor moves: Did a competitor launch significant new content or optimizations? If they improved, your relative position drops even if your absolute content didn't change.
- Review negative reviews or press: New negative content about your brand can affect AI sentiment and reduce recommendation frequency.
- Check recent product page changes: Did someone on your team edit product descriptions and accidentally remove AI-relevant content? This happens more often than you'd think.
Scenario 2: A Competitor Surges Past You
- Analyze their product pages: View source on their top product pages. Did they add schema, FAQ sections, or richer descriptions recently?
- Check for new content: Did they launch blog posts, guides, comparison pages, or educational content that AI is citing?
- Look for new backlinks or PR: Third-party mentions (press coverage, blog features, roundup articles) are citation sources that AI trusts.
- Identify specific prompts where they gained ground: Don't try to match them on everything—focus on the highest-value prompts they took from you and optimize specifically for those queries.
Scenario 3: Sentiment Turns Negative
- Identify the source: What is AI citing when it describes you negatively? Is it a specific bad review, a negative blog post, a product complaint, or outdated information?
- Address the root cause: If it's a product quality issue, fix the product. If it's a customer service complaint, resolve it publicly. If it's outdated information, update your content.
- Create counterbalancing positive content: Publish content that directly addresses the negative point. If AI says "some users report quality issues," create content showcasing your quality standards, certifications, and positive reviews.
- Monitor weekly until recovery: Sentiment usually recovers within 2-4 weeks of addressing the root cause and publishing positive content.
Frequently Asked Questions
How often should I check my AI visibility metrics?
Minimum weekly for core metrics (visibility score, mention rate, position). If you're actively implementing GEO optimizations, check 2-3 times per week to correlate specific changes with results. Monthly for supporting metrics (content completeness, schema validation) and competitive benchmarks. Naridon's dashboard updates continuously so you can check anytime without re-running manual tests.
What's a good AI visibility score for a Shopify store?
Context matters more than absolute numbers. For most product categories, 40+ is good and 60+ is excellent. But a 35 in a highly competitive category like skincare might represent stronger performance than a 55 in a niche category with only 3 competitors. Always compare against your direct competitors first, then use absolute benchmarks as secondary guidance. The most important number is your trend: are you improving week over week?
Can I track AI visibility for free?
Yes, using the manual method described above. It provides real data and real insights. The limitation is scale: it requires 1-2 hours per week and doesn't practically scale beyond 20 prompts, 3 AI engines, and maybe 2 competitors. For comprehensive monitoring, Naridon's $49/mo Starter plan includes basic monitoring across all 8 engines, and the $249/mo Growth plan includes full competitor benchmarking and multi-language tracking.
Which AI engines should I prioritize monitoring?
Start with ChatGPT and Perplexity—they have the highest traffic volume for product recommendations and are the most common sources of AI referral traffic for ecommerce stores. Add Google AI Overviews as your third priority since Google still dominates overall search. Then expand to Claude, Bing Copilot, DeepSeek, Grok, and Brave Search for comprehensive coverage. Naridon monitors all 8 simultaneously at no extra effort.
How do I know if a visibility change is significant or just noise?
AI results have natural variance—the same prompt can produce different results on different days. Look for sustained trends over 2-3 weeks rather than single-week fluctuations. A 5% mention rate change in one week could be noise. A 10%+ change sustained over two weeks is almost certainly a signal. A 15%+ change in one week is likely significant even as a single data point. Always correlate changes with known events (you pushed content updates, a competitor launched something, an AI engine updated its model).
What's citation share and why does it matter?
Citation share is your percentage of total AI recommendations in your product category. If AI engines make 100 total brand recommendations across all your tracked prompts, and your brand appears 15 times, your citation share is 15%. This is the closest equivalent to "market share" in AI search. It tells you how much of the AI recommendation pie you own vs. your competitors. Growing your citation share means you're winning a larger portion of AI-driven product discovery in your niche.
Should I track different metrics for different product categories?
If you sell across multiple product categories (e.g., skincare AND haircare), track visibility per category separately. Your skincare products might have 40% mention rate while haircare has 10%. Blending them into one number hides the fact that haircare needs urgent attention. Naridon automatically segments metrics by product category for this reason.
Monitoring is not optional—it's the feedback loop that makes every other GEO optimization work. Without tracking, you're optimizing blindly. With tracking, you know exactly what's working, what's not, where competitors are gaining, and where the biggest opportunities lie.
Install Naridon to start monitoring your AI visibility across 8 engines, 7 metric tabs, and unlimited competitors today. One-click Shopify install, first data within 24 hours, competitor benchmarks included. Plans start at $49/mo.
What gets measured gets improved. What gets ignored gets left behind. Start measuring today.
Gotowy, by rankować dla tych rozmów?
Dołącz do wczesnych użytkowników, którzy już przechwytują ruch z wyszukiwania AI.