The Crawler Logs
- Total LLM Bot Requests: 22,679.
- Citation Events: 9,250.
- Days of Data: 94.
- Log Entries: 10.
The Crawler Logs document real AI crawler behavior on production AI sites built by ROZZ. The data comes from CloudFront access logs, and bots are identified by classifying User-Agent strings. Tracked bots include GPTBot, BingBot, ClaudeBot, ChatGPT-User, PerplexityBot, and other LLM bots that discover, crawl, and cite GEO-optimized content. Across 94 days of observation on rozz.genymotion.com, the logs recorded 22,679 total LLM bot requests and 9,250 citation events. Each citation event is a real user receiving Genymotion content in an AI conversation.
Three bots, three structural triggers
- GPTBot: January 7 — 547 requests.
- GPTBot discovered the sitemap.
- GPTBot mass-crawled content.
- PerplexityBot: March 10 — 511 requests.
- Trigger: the index page was revamped.
- Change: a product description and topic directory were added.
- ClaudeBot: March 20 — 577 requests.
- Trigger: a per-topic sitemapindex was deployed.
- Change: sitemaps were organized by topic and robots.txt was simplified.
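A per-topic sitemap index can be sketched as follows. This is a minimal illustration, not ROZZ's actual generator: the base URL, the /sitemaps/ path layout, and the topic names are hypothetical placeholders. Only the sitemapindex format itself comes from the sitemaps.org protocol.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap_index(base_url, topics, lastmod):
    """Return a sitemapindex XML string with one child sitemap per topic."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for topic in topics:
        entry = ET.SubElement(root, "sitemap")
        # Hypothetical layout: one sitemap file per topic under /sitemaps/.
        ET.SubElement(entry, "loc").text = f"{base_url}/sitemaps/{topic}.xml"
        ET.SubElement(entry, "lastmod").text = lastmod
    return ET.tostring(root, encoding="unicode")


# Placeholder topics and domain, for illustration only.
index_xml = build_sitemap_index(
    "https://example.com", ["pricing", "installation"], "2026-03-20"
)
print(index_xml)
```

Each child sitemap would then list only the pages for its topic, so a crawler can see the site's topic structure from the index alone.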
Log Entries
- Ten entries, newest first, detail each bot's activity and milestones by date.
Citation Bots
Entry #10 — Apr 8, 2026
- Three Citation Bots, Ten Articles: What We Learned Building an AI Site.
- Three AI platforms—ChatGPT-User, Claude-User, Perplexity-User—now retrieve content from the same AI site during live user sessions.
- 9,250 citation requests across 90 days.
- Three months ago there were zero.
- rozz.genymotion.com — Jan 8 – Apr 8, 2026 — 22,679 total LLM bot requests (90-day cumulative).
- Perplexity-User first appeared on Apr 5; all 3 major citation pipelines are now active.
- 376 commits across 90 days; structural fixes (sitemaps, robots.txt, topic taxonomy) drove every breakthrough.
- Q&A pages drive 66–75% of citations.
- CLI runbooks open a developer-tool sales channel.
Entry #9 — Mar 31, 2026
- Selling in Claude Code: From Pricing to Implementation in 10 Seconds with an AI Site.
- A developer asked Claude Code how much Genymotion costs.
- Ten seconds later, it was showing them how to set it up.
- Both answers came from our AI site.
- 14 Claude-User requests in 6 days, 12 from Claude Code terminal sessions.
- 10 seconds from pricing Q&A to CLI runbook—evaluation and implementation in one session.
- Claude pipeline complete: ClaudeBot crawl → 5 days → Claude-User live retrieval.
Entry #8 — Mar 24, 2026
- ClaudeBot made 958 requests in one week.
- 958 requests in one week, up from 123 the week before.
- 503 GEO pages and 162 Q&A pages—the largest ClaudeBot crawl since December.
- Six hours after we deployed per-topic sitemaps, ClaudeBot came back.
- rozz.genymotion.com — Mar 17–24, 2026 — 2,446 total LLM bot requests.
- ClaudeBot 8x increase (123 → 958) triggered the day per-topic sitemapindex was deployed.
- March 20: 577 ClaudeBot requests in a single day—largest since December.
- Every major AI crawler has now completed a deep indexing event on the AI site.
Entry #7 — Mar 17, 2026
- We changed one page.
- PerplexityBot went from 42 requests to 511.
- PerplexityBot made 511 requests in one week, up from 42 the week before.
- It crawled 172 Q&A pages and 256 GEO pages—more content in 7 days than in its entire prior history on the site combined.
- rozz.genymotion.com — Mar 10–17, 2026 — 2,532 total LLM bot requests.
- PerplexityBot 12x increase (42 → 511) triggered the day after index page redesign.
- 84% of PerplexityBot requests hit content pages—Q&A and GEO pages, no homepage.
- ClaudeBot: 0 Q&A pages, 0 Claude-SearchBot traffic—still in discovery mode after 3 weeks.
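The week-over-week multiples quoted in these entries are simple ratios of weekly request counts. A minimal sketch, using the PerplexityBot figures above and the ClaudeBot figures from Entry #8 (the helper name is hypothetical):

```python
def growth_multiple(previous, current):
    """Return current/previous as a week-over-week growth multiple."""
    if previous <= 0:
        raise ValueError("previous week must have at least one request")
    return current / previous


# (previous week, current week) request counts taken from the log entries.
weekly_requests = {
    "PerplexityBot": (42, 511),
    "ClaudeBot": (123, 958),
}

for bot, (previous, current) in weekly_requests.items():
    multiple = growth_multiple(previous, current)
    print(f"{bot}: {previous} -> {current} = {multiple:.1f}x")
# PerplexityBot: 42 -> 511 = 12.2x
# ClaudeBot: 123 -> 958 = 7.8x
```

Rounded to whole numbers, these match the 12x and 8x headline figures.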
Entry #6 — Mar 10, 2026
- What the AI Site reveals about AI-mediated discovery.
- ChatGPT-User made 681 visits in one week.
- By grouping visits into sessions using IP hashes and timing, we reconstructed 168 sessions showing how users navigate AI-mediated discovery.
- rozz.genymotion.com — Mar 3–10, 2026 — 681 ChatGPT-User visits.
- 4.6 pages fetched per turn—ChatGPT-User verifies across multiple sources, not just one.
- 28% of sessions hit only the index page and stopped—led to index redesign.
- 30% of sessions are multi-turn: we can reconstruct actual ChatGPT conversations.
- 83% citation rate.
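Grouping visits into sessions by IP hash and timing can be sketched as below. The entry does not state the inactivity threshold or the hashing scheme, so the 30-minute gap, the SHA-256 hash, and the sample requests are all assumptions made for illustration.

```python
import hashlib
from datetime import datetime, timedelta

# Assumed inactivity threshold; the source does not state the value used.
SESSION_GAP = timedelta(minutes=30)


def hash_ip(ip):
    """Replace a raw IP with a short, stable hash (assumed scheme)."""
    return hashlib.sha256(ip.encode("utf-8")).hexdigest()[:16]


def sessionize(requests, gap=SESSION_GAP):
    """Group (ip, timestamp, path) tuples into per-client sessions.

    A new session starts whenever the same client is idle for longer
    than `gap`. Returns a list of sessions, each a list of paths.
    """
    by_client = {}
    for ip, ts, path in sorted(requests, key=lambda r: r[1]):
        by_client.setdefault(hash_ip(ip), []).append((ts, path))

    sessions = []
    for hits in by_client.values():
        current = [hits[0][1]]
        for (prev_ts, _), (ts, path) in zip(hits, hits[1:]):
            if ts - prev_ts > gap:
                sessions.append(current)
                current = []
            current.append(path)
        sessions.append(current)
    return sessions


# Hypothetical requests using documentation-range IPs and placeholder paths.
sample = [
    ("192.0.2.1", datetime(2026, 3, 3, 10, 0), "/"),
    ("192.0.2.1", datetime(2026, 3, 3, 10, 1), "/qa/example"),
    ("192.0.2.1", datetime(2026, 3, 3, 12, 0), "/"),
    ("192.0.2.2", datetime(2026, 3, 3, 10, 5), "/"),
]
sessions = sessionize(sample)
index_only = sum(1 for s in sessions if s == ["/"])
print(len(sessions), index_only)  # 3 2
```

Counting sessions that contain only the index page, as in the last lines, is the kind of measurement behind the "hit only the index page and stopped" figure.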
Entry #5 — Mar 3, 2026
- The platforms that crawl your AI site the most cite you the most (except one).
- We tested 24 queries across four AI platforms.
- ChatGPT cites Genymotion 83% of the time.
- Claude 21%.
- Perplexity 17%.
- Gemini 4%.
- The first three track with crawl volume.
- Gemini doesn’t crawl the AI site at all.
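The citation rates above are the share of the 24 test queries in which a platform cited Genymotion. The entry does not publish raw counts; the counts in this sketch are assumptions, chosen as the only whole numbers out of 24 that round to the reported percentages.

```python
TOTAL_QUERIES = 24  # from the entry: 24 queries per platform


def citation_rate(cited, total=TOTAL_QUERIES):
    """Fraction of test queries in which the platform cited the site."""
    return cited / total


# Assumed raw counts, inferred from the reported percentages.
cited_counts = {"ChatGPT": 20, "Claude": 5, "Perplexity": 4, "Gemini": 1}

for platform, cited in cited_counts.items():
    print(f"{platform}: {cited}/{TOTAL_QUERIES} = {citation_rate(cited):.0%}")
# ChatGPT: 20/24 = 83%
# Claude: 5/24 = 21%
# Perplexity: 4/24 = 17%
# Gemini: 1/24 = 4%
```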
Entry #4 — Feb 24, 2026
- Bing just found Genymotion: 1,556 BingBot requests and what it means.
- BingBot made 1,556 requests this week—more than any other bot, including ChatGPT-User.
- What started as a ChatGPT story is now happening across six platforms.
- rozz.genymotion.com — Feb 17–24, 2026 — 3,188 total requests.
- BingBot: 1,556 requests—largest single bot category, surpassing ChatGPT-User.
- ChatGPT-User citations: 1,329, up from 1,077 the prior week (+23%).
- Six platforms now crawling: OpenAI, Microsoft, Anthropic, Meta, ByteDance, Perplexity.
Entry #3 — Feb 17, 2026
- 3x Week-Over-Week: What Sustained ChatGPT Citation Growth Looks Like.
- ChatGPT citations hit 1,077 this week—3x the 345 reported last week.
- Six-week trajectory: 42 → 345 → 1,077.
- Q&A pages drive 66% of all citations.
- rozz.genymotion.com — Feb 10–17, 2026 — 2,606 total requests.
- 1,077 ChatGPT citations in 7 days—3x the previous week's 345.
- 75% of ChatGPT requests landed on Q&A pages, not traditional content.
- Q&A pages cited 10x more than traditional GEO content pages.
Entry #2 — Feb 10, 2026
- 16x Citation Growth in 7 Days: What ChatGPT Users Are Actually Asking.
- ChatGPT citations grew from 7 to 116 in one week.
- 345 citation events, 161 unique sessions, and 75% of requests hitting Q&A pages, not traditional content.
- rozz.genymotion.com — Feb 2–9, 2026 — 2,195 total requests.
- 16x daily citation growth: from 7 on Feb 2 to 116 on Feb 9.
- 75% of ChatGPT requests landed on Q&A pages, not traditional content.
- Q&A pages cited 10x more than traditional GEO content pages.
Entry #1 — Feb 3, 2026
- 547 Requests / Day.
- 547 Requests in One Day: What Happens When GPTBot Discovers Your Mirror Site.
- GPTBot made 547 requests in a single day—47% of all training bot activity in 30 days.
- rozz.genymotion.com — Jan 3–Feb 2, 2026 — 1,280 total requests.
- GPTBot made 547 requests on January 7—47% of 30-day training activity in one day.
- 42 citation events recorded.
- Concentrated on high-intent pages (requirements, compatibility).
- ~3 weeks from major crawl to first ChatGPT citations.
Methodology
- Data source: CloudFront access logs for rozz.genymotion.com.
- Bot classification is based on User-Agent strings.
- Training bots include GPTBot and ClaudeBot.
- Index bots include OAI-SearchBot.
- Citation events represent real user conversations where ChatGPT retrieved and cited mirror site content.
- All data is from a single production mirror site.
- Results may vary by domain, content volume, and vertical.
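The classification step can be sketched as a substring match on the User-Agent field of each log line. This is an illustration under stated assumptions, not the classifier ROZZ runs: the token-to-category table covers only the bots this page names with a category, the sample log uses an abbreviated field list, and the paths and User-Agent strings are made up. CloudFront standard logs are tab-separated with a `#Fields:` header and a URL-encoded User-Agent, which is why the sketch reads the header and decodes the value.

```python
from collections import Counter
from urllib.parse import unquote

# Token -> category, limited to bots this page assigns to a category.
BOT_CATEGORIES = {
    "GPTBot": "training",
    "ClaudeBot": "training",
    "OAI-SearchBot": "index",
    "ChatGPT-User": "citation",
    "Claude-User": "citation",
    "Perplexity-User": "citation",
}


def classify_user_agent(user_agent):
    """Return (bot, category) for a known bot token, else None."""
    ua = user_agent.lower()
    for token, category in BOT_CATEGORIES.items():
        if token.lower() in ua:
            return token, category
    return None


def count_bot_requests(log_lines):
    """Count requests per (bot, category) in CloudFront-style log lines."""
    fields = []
    counts = Counter()
    for line in log_lines:
        line = line.rstrip("\n")
        if line.startswith("#Fields:"):
            fields = line[len("#Fields:"):].split()
            continue
        if not line or line.startswith("#") or not fields:
            continue
        row = dict(zip(fields, line.split("\t")))
        # CloudFront URL-encodes the User-Agent value, so decode it first.
        match = classify_user_agent(unquote(row.get("cs(User-Agent)", "")))
        if match:
            counts[match] += 1
    return counts


# Hypothetical log excerpt with an abbreviated field list.
sample_log = [
    "#Version: 1.0",
    "#Fields: date time c-ip cs-uri-stem cs(User-Agent)",
    "2026-01-07\t10:00:00\t192.0.2.1\t/sitemap.xml\tMozilla/5.0%20(compatible;%20GPTBot/1.2)",
    "2026-01-07\t10:00:05\t192.0.2.2\t/qa/example\tMozilla/5.0%20(compatible;%20ChatGPT-User/1.0)",
    "2026-01-07\t10:00:09\t192.0.2.3\t/\tMozilla/5.0%20(Windows%20NT%2010.0)",
]
counts = count_bot_requests(sample_log)
print(counts)
```

Because User-Agent strings can be spoofed, counts produced this way are best read as an upper bound unless requests are also verified against each vendor's published IP ranges.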
Get Your Own Crawler Logs
- ROZZ builds mirror sites with full CloudFront logging.
- ROZZ lets you see which AI bots crawl your content, when they crawl it, and when citations begin.
The ROZZ Architecture
- Two integrated products.
- One virtuous cycle.
The Technical Foundation That Enables Great Content
- Great content plus machine-readable structure equals AI citations.
- ANSWER DIRECTNESS: AI systems weight the first 100 words heavily.
- MACHINE READABILITY: Clean HTML; schema.org markup; AI can parse with confidence.
- SOURCE AUTHORITY: E-E-A-T signals; citations to authoritative sources.
- FRESHNESS SIGNALS: Perplexity heavily weights recency; an "updated December 2025" stamp matters more than you think.
ROZZ: Technical Infrastructure for AI Discovery
- Your content marketing team creates valuable insights.
- Your technical infrastructure makes them discoverable by AI systems.
WHAT WE INSTALL
- AUTOMATED Q&A ARCHITECTURE: Extract questions from conversations; generate AI-optimized answer pages that give the answer in the first 100 words.
- COMPLETE STRUCTURED DATA: JSON-LD schema markup across pages.
- TOPIC-BASED CONTENT ARCHITECTURE: Semantic internal linking; topic clusters; API endpoints for programmatic discovery.
- AI CRAWLER MANAGEMENT: llms.txt configuration; optimized structure for content extraction; platform-specific optimization rules.
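As an illustration of the structured-data item above, the sketch below builds JSON-LD for a single Q&A page. The page does not say which schema.org types ROZZ emits, so the use of FAQPage is an assumption, and the question and answer text are placeholders.

```python
import json


def qa_page_jsonld(question, answer):
    """Build a schema.org FAQPage object holding one question and answer."""
    return {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
        ],
    }


def script_tag(data):
    """Serialize JSON-LD into a script tag for embedding in a page."""
    payload = json.dumps(data, ensure_ascii=False)
    # Escape "</" so answer text cannot close the script element early.
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">{payload}</script>'


# Placeholder content, for illustration only.
data = qa_page_jsonld("What is an example question?", "An example answer.")
tag = script_tag(data)
print(tag)
```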
WHAT THIS DOESN'T DO
- ROZZ is technical infrastructure, not content creation.
- ROZZ does not write your marketing content.
- ROZZ does not distribute to Reddit, YouTube, or other social platforms.
- ROZZ does not replace your content strategy.
- ROZZ does not hurt your SEO.
BUILT ON RESEARCH, PROVEN THROUGH SCALE
- AUTHOR: Adrien Schmidt, Co-Founder & CEO.
- The approach is grounded in peer-reviewed research on Retrieval-Augmented Generation.
- The results generalize across search questions, not just one category.
- Content is backed by research on how AI systems work.
- The approach is tested across 200+ queries and four platforms.
- The approach is validated through measurable citation improvements.
- The approach is updated continuously as AI systems evolve.
- WE APPLY SCIENCE. NOT GUESSWORK.
WHAT WE MEASURE
- Citation rate across Claude, ChatGPT, Perplexity, Gemini, Google AI Overviews.
- Position when cited (1st, 2nd, or 3rd recommendation).
- Coverage across your priority queries.
- Week-over-week improvement trends.
WHAT THIS DOESN'T DO
- The page content does not imply guarantees.
- The page content does not include pricing or commitments beyond the described features.
FOR B2B SAAS CEOS WHO CAN'T AFFORD INVISIBILITY
- Your sales team can't sell to prospects who don't know you exist.
- Your best content is useless if AI systems can't cite it.
- Your competitors are getting cited right now—while you're excluded.