AI visibility audits for crawled pages
This page explains how GEO (Generative Engine Optimization) evaluates crawled pages for AI visibility and citation readiness.
What is GEO?
GEO evaluates your crawled pages with a ranked hierarchy. Content quality matters most, then crawler access, then indexing and snippet eligibility, then structured data and entity context, then anti-patterns that suppress visibility.
Strong GEO signals help AI systems identify pages that are eligible, useful, and trustworthy enough to ground an answer. The goal is not special AI-only markup. The goal is original, crawlable, technically eligible content.
How the scorecard works
After a crawl, GEO scores five layers from 0 to 3 for a total score out of 15.
| Layer | Meaning |
|---|---|
| Content quality | Checks whether pages contain original evidence, first-hand experience, expert attribution, proprietary examples, or unique point of view. |
| Crawler access | Checks whether robots.txt allows search and AI retrieval crawlers needed for citation visibility. |
| Indexing and snippets | Checks canonical tags, noindex/nosnippet directives, titles, descriptions, headings, mobile viewport, and crawlable HTML. |
| Structured data | Checks JSON-LD entity types and Open Graph metadata where they support page understanding. |
| Anti-patterns | Flags duplicated pages, thin templated sections, keyword-variation patterns, and other low-value tactics. |
The remediation list is ranked by leverage. For example, blocked crawlers can outrank content edits because one robots.txt rule can prevent otherwise strong pages from being retrieved.
What do findings mean?
Findings are page-level evidence behind the scorecard. They are organized by severity:
| Severity | Meaning |
|---|---|
| Critical | A blocker that can disqualify pages from indexing, snippets, or crawler access. |
| Warning | A material weakness that limits AI citation readiness. |
| Info | A lower-risk improvement opportunity. |
Treat critical findings first, but do not mistake technical cleanup for strategy. If the scorecard says content quality is weak, the highest-leverage fix is usually more non-commodity content: original examples, real data, named expertise, and first-hand perspective.