Public Audit Report

www.harvard.edu

https://www.harvard.edu/
Scanned on April 12, 2026
69
Overall
C
71
SEO
72/101 passed
63
AEO
33/52 passed
69
GEO
40/58 passed
145
Passed
66
Failed
211
Total Checks
Needs Work
Rating

Quick Wins — Top 5 Issues to Fix

SEO — 71/100 (29 issues)

Failed Checks (29)

  • Title length: 18 characters (too short, aim for 30-60)meta
  • Readability: 22 (difficult — simplify language)content
  • 79% long sentences (>20 words) — break up for readabilitycontent
  • 24 images missing width/height — causes layout shift (CLS)content
  • 21 of 24 images use legacy formats (JPG/PNG/GIF) — convert to WebP/AVIF for 25-50% smaller filescontent
  • Missing twitter:imagesocial
  • No hreflang tags — add if targeting multiple languagestechnical
  • Text-to-HTML ratio: 8.1% — too code-heavy, add more contenttechnical
  • 8 render-blocking scripts — add defer or async attributeperformance
  • No font preload/preconnect — add for faster renderingperformance
  • No Strict-Transport-Security headersecurity
  • Missing X-Content-Type-Options headersecurity
  • No Content-Security-Policy header — XSS risksecurity
  • No Terms of Service link — add for legal compliance and trusttrust
  • 14 HTTP resource(s) on HTTPS page — security risktechnical
  • No Web App Manifest — add for installability and modern web signalstechnical
  • 4 stock image(s) detected — original images rank bettercontent
  • 3% sentences use transition words — aim for 20%+content
  • Missing X-Frame-Options — clickjacking risksecurity
  • No Referrer-Policy header — may leak URL datasecurity
  • No Permissions-Policy — restrict camera/mic/geo accesssecurity
  • H1 length: 18 characters — too short, aim for 20-70content
  • Title and meta description are identical — each should be uniquemeta
  • 172 links on page — exceeds 100 limit, may dilute link equitycontent
  • 13 duplicate ID(s) found: "6683-6682-1" (2x), "6683-1644-1" (2x), "0-6734-0" (2x) — must be unique per HTML specaccessibility
  • 1 iframe(s) without sandbox attribute — security risksecurity
  • No RSS/Atom feed in <head> — add for content discovery by aggregators and AItechnical
  • No IndexNow integration — add for instant Bing/Yandex indexing on content changestechnical
  • 5 of 13 H2 sections have < 50 words — add more content or merge thin sectionscontent

Passed Checks (72)

  • Title tag found: "Harvard University"meta
  • Meta description found (138 chars)meta
  • Description length: 138 chars (optimal)meta
  • Viewport meta tag found (mobile-friendly)meta
  • UTF-8 charset declaredmeta
  • Language attribute set on HTML tagmeta
  • Theme color set: #a51c30meta
  • No meta refresh redirect (good)meta
  • Robots: "index, follow, max-image-preview:large, max-snippet:-1, max-video-preview:-1"meta
  • Single H1 tag foundcontent
  • Title and H1 are identical — consider differentiating slightly for broader keyword coveragecontent
  • 13 H2 tags foundcontent
  • Heading hierarchy is correctcontent
  • 1987 words on page (good content depth)content
  • All 24 images have ALT textcontent
  • JS-based lazy loading detectedcontent
  • 52 internal links foundcontent
  • 119 external links foundcontent
  • No empty href attributes foundcontent
  • All link texts are descriptive (no "click here" or "read more")content
  • Open Graph title foundsocial
  • Open Graph description foundsocial
  • Open Graph image foundsocial
  • Open Graph URL foundsocial
  • og:type set to "website"social
  • og:site_name: "Harvard University"social
  • Twitter Card: "summary_large_image"social
  • Twitter title foundsocial
  • Twitter description foundsocial
  • HTTPS enabledtechnical
  • Canonical tag foundtechnical
  • Canonical tag is self-referencing (correct)technical
  • Favicon foundtechnical
  • Apple touch icon foundtechnical
  • Compression: brtechnical
  • URL length: 24 chars (good)technical
  • URL uses hyphens or no separators (good)technical
  • No deprecated HTML tags foundtechnical
  • Inline CSS: 13.8 KB (acceptable)performance
  • Inline JS: 6.5 KB (acceptable)performance
  • Cache-Control: max-age=300, must-revalidateperformance
  • About page link found — builds trust and E-E-A-Ttrust
  • Contact page/email link foundtrust
  • Privacy Policy link foundtrust
  • Author byline found + author in schematrust
  • 1 JSON-LD schema block(s) foundschema
  • Schema types: WebPageschema
  • <!DOCTYPE html> declaredtechnical
  • All 1 JSON-LD block(s) have valid syntaxschema
  • Viewport includes width=device-width (responsive)meta
  • robots.txt found and allows crawlingtechnical
  • robots.txt includes Sitemap referencetechnical
  • No AI crawlers blocked in robots.txt — AI search and training can access contenttechnical
  • sitemap.xml found with valid formattechnical
  • Sitemap contains 5 URL(s)technical
  • All 5 checked internal links are workingcontent
  • 0% nofollow links (0/171) — good link equity flowcontent
  • OG image uses absolute URL (correct)social
  • 2 resource hint(s) (dns-prefetch/preconnect) — faster loadingperformance
  • All schema types have required propertiesschema
  • 23 first-person pronouns — strong personal experiencecontent
  • Only 33% responsive images — add srcset for mobilecontent
  • max-image-preview:large — eligible for Google Discover large imagesmeta
  • OG URL matches canonical URL (consistent)social
  • Viewport allows zoom (good for accessibility)accessibility
  • DOM size: ~1284 elements (good)performance
  • No snippet blocking detected (good — content is fully accessible)technical
  • Schema dates consistent: published 2021-01-29T17:09:10Z, modified 2026-04-03T19:50:37Zschema
  • No render-blocking scripts in head (good)performance
  • 2 third-party script domain(s) (acceptable)performance
  • ARIA landmarks present: main, nav, skip-linkaccessibility
  • All 0 form input(s) have labels or aria-labelsaccessibility
AEO — 63/100 (19 issues)

Failed Checks (19)

  • No definition lists (<dl>) — great for term/value pairsdiscoverability
  • No code blocks — add <code>/<pre> for technical contentdiscoverability
  • No FAQ section found — add Q&A content for AI citationstructure
  • No data tables — tables help AI compare and cite datastructure
  • 21% short sentences — shorten sentences for better AI parsingstructure
  • Few key-value patterns — use "**Label:** value" format for AIstructure
  • Only 1 concise answer paragraph(s) after headings — add 2+ short summaries (40-200 chars) right after H2/H3 tagscitation
  • No Speakable schema — add for voice assistant optimizationvoice
  • No SearchAction schema — add for enhanced search presencevoice
  • Avg 33 words/sentence — simplify for AI and voicevoice
  • Pronoun density: 0.3% — add more "you/your" for engagementvoice
  • No summary section — add Key Takeaways for AI extractionvoice
  • Few Q&A blocks — add question headings followed by answer paragraphsvoice
  • No /llms.txt found — add one to guide AI crawlers (emerging standard)discoverability
  • No sameAs links in schema — add social profiles for brand recognitioncitation
  • No Table of Contents — add jump links for AI content navigationstructure
  • No video content — multimedia increases AI engagement and citation ratediscoverability
  • No <details>/<summary> elements — use for expandable Q&A contentstructure
  • Brand name used inconsistently (73 partial vs 7 full) — standardizecitation

Passed Checks (33)

  • Structured data found — AI can understand your contentdiscoverability
  • 1987 words in HTML — good content for AI parsingdiscoverability
  • Clean URL structurediscoverability
  • 5 semantic HTML5 elements founddiscoverability
  • 18 content sections — well-structured for AIdiscoverability
  • Description + OG description both present — clear topic signalsdiscoverability
  • 100 named entities found — helps AI understand contextdiscoverability
  • 52 internal links — good topic cluster signaldiscoverability
  • Question-style headings found — ideal for AI snippetsstructure
  • 23 lists found — structured content for AIstructure
  • Concise answer paragraphs after headings — great for AI citationstructure
  • Step-by-step content detected — AI loves procedural answersstructure
  • 100% concise paragraphs — scannable for AIstructure
  • Avg 103 words/section — sufficient depthstructure
  • Author information found — builds E-E-A-Tcitation
  • Publication/modification date foundcitation
  • Statistics and data found in content — highly citablecitation
  • Organization schema found — AI can identify your brandcitation
  • 4/5 E-E-A-T signals presentcitation
  • Source citations found — increases AI trustcitation
  • 119 external links — references authoritative sourcescitation
  • Content references 2026/2025 — fresh contentcitation
  • Topic focus: 100% — title keywords found in contentcitation
  • 30 conversational words — good for AI summariesvoice
  • First paragraph is snippet-ready (20-60 words)voice
  • Active voice dominant (2% passive) — AI-friendlyvoice
  • 43 H2/H3 headings — good content structure for AIvoice
  • No AI-blocking robots directives — AI crawlers can access contentdiscoverability
  • Freshness signal found (Last-Modified header or dateModified)citation
  • AI crawlers allowed in robots.txt (0/4 explicitly listed)discoverability
  • Brand name consistent across 2/3 signals (title, OG, schema)citation
  • Content updated within 90 days — fresh content gets 3.2x more AI citationscitation
  • Statistics with source attribution — highly credible for AI citationcitation
GEO — 69/100 (18 issues)

Failed Checks (18)

  • Few question headings — add Q&A headings for AI Overviewoverview
  • 54% optimal paragraphs — aim for 20-80 words per paragraphoverview
  • No author schema — add for AI trust signalsauthority
  • Only 1/5 trust signals — add reviews, certificationsauthority
  • No original data signals — "our research shows..." boosts AI citationauthority
  • No examples or case studies — add real-world examplesauthority
  • No links to authoritative sources — cite Google, Schema.org, etc.authority
  • No FAQ Schema — add for Google rich results + AI Overviewformat
  • No HowTo Schema — add for step-by-step rich resultsformat
  • No summary section — add Key Takeaways or TL;DR for AIformat
  • No BreadcrumbList schema — add for better AI contextformat
  • Only 2/6 formatting types — use bold, lists, tables, quotes, codeformat
  • Single-perspective content — add "on the other hand..." for balanceformat
  • No <time> elements — wrap dates in <time datetime="...">semantic
  • Only 1 Schema type(s) — add more for AI contexttechnical
  • Only 3 data points — add 5+ statistics for +30-40% AI visibility (Princeton study)authority
  • No <figure>/<figcaption> — wrap images with descriptive captions for AIformat
  • No authoritative sameAs links — add Wikipedia/Wikidata/LinkedIn URIs to schema for Knowledge Graph alignmentauthority

Passed Checks (40)

  • Definition-style content found — featured snippet candidateoverview
  • Content freshness signals detectedoverview
  • Comprehensive content (1987 words, 13 sections)overview
  • Title and H1 are aligned — clear topic focusoverview
  • Step-by-step content detected — AI Overview friendlyoverview
  • 2-level heading hierarchy — deep content structureoverview
  • Source citations found — increases AI trustauthority
  • Brand identity consistent (OG + Schema)authority
  • Expert quotes or testimonials found — authority signalauthority
  • 3 industry terms used — domain expertise signalauthority
  • 64 unique external domains — diverse sourcesauthority
  • Comparison content detected — AI loves comparingformat
  • 24 images all with ALT text — AI can understand visualsformat
  • Data visualization elements found (tables/charts)format
  • Both ordered and unordered lists used — content varietyformat
  • Actionable advice found — AI prefers practical contentformat
  • Conclusion/recommendation found — helps AI provide answersformat
  • 2 entity types in Schema — AI can classify contentsemantic
  • 6/14 semantic HTML5 elements — deep semantic structuresemantic
  • 3 data points with units — highly citablesemantic
  • 100% title keywords in first 200 words — good prominencesemantic
  • 41 contextual links — good cross-referencingsemantic
  • Content categorization found — clear topic classificationsemantic
  • 6 sections have subtopics (H3) — deep coveragesemantic
  • Mobile-friendly (viewport set)technical
  • HTTPS secure — trusted by AI systemstechnical
  • Page size: 163 KB — reasonabletechnical
  • Open Graph fully configured (title, description, image)technical
  • Canonical URL set — prevents AI from citing duplicatestechnical
  • Page allows AI indexing (no noindex/nosnippet)technical
  • Language + OG locale set — clear language signaltechnical
  • 4 ARIA landmarks found — good accessibility for AItechnical
  • All image ALT texts are descriptive (>5 chars)format
  • 1987 words — sufficient depth for AI analysisoverview
  • First 200 words contain topic keywords — AI extracts primarily from the startoverview
  • Definition + list combo — strong featured snippet candidateoverview
  • Author "Harvard University" has credentials — strong E-E-A-T signalauthority
  • max-image-preview:large — optimal AI rich result displaytechnical
  • 5/6 question types (5W1H) in headings — comprehensiveoverview
  • Link anchor texts are descriptive — good AI context signalsemantic

Fix These Issues Automatically

Using WordPress? Install SEO Autopilot for one-click auto-fixes on 65+ issues — with full undo support.

Get SEO Autopilot Plugin
Scan This Site Again Request Report Removal