What kind of content does Rozz index from my website, and how does it ensure accuracy when answering questions?

High Confidence (76%)
What kind of content does Rozz index from my website, and how does it ensure accuracy when answering questions?

Brief answer

  • Rozz indexes only public website content (pages, docs, help articles, FAQs, developer docs and any on-site text you expose). It converts that content into semantic vectors (Pinecone) and retrieves the best-matching passages to generate answers. Accuracy is enforced by using only site content for generation, automated filtering (PII redaction, quality thresholds, deduplication), re‑ranking of retrieved passages, and tools to review/edit cached answers in the dashboard.

Detailed explanation

  • What it indexes
  • Public-facing pages and content visible to a normal site visitor: documentation, help center articles, marketing pages, FAQs, Q&A pages and similar content (no backend/private data or integrations are required or accessed).
  • Rozz is used to pull together content from multiple on-site sources so users can ask natural-language questions without knowing where the original content lives.
  • Privacy / scope constraint
  • Rozz only crawls content a normal visitor can see; it does not access private backend data or hidden endpoints. That constraint both protects privacy and bounds the evidence base for answers.

Sources

One quick question to help me tailor advice: Do you want Rozz to prioritize certain parts of your site (e.g., product docs vs blog vs support tickets) or to explicitly exclude any sections?