Public Audit Report

www.nih.gov

https://www.nih.gov/health-information
Scanned on May 2, 2026
59
Overall
D
64
SEO
64/100 passed
50
AEO
26/52 passed
57
GEO
33/58 passed
123
Passed
87
Failed
210
Total Checks
Poor
Rating

Quick Wins — Top 5 Issues to Fix

SEO — 64/100 (36 issues)

Failed Checks (36)

  • 3 title tags found — should be exactly 1meta
  • No theme-color meta tag — improves mobile browser appearancemeta
  • Heading hierarchy has gaps (e.g. H1 to H3 without H2)content
  • 42% long sentences (>20 words) — break up for readabilitycontent
  • 14 images missing width/height — causes layout shift (CLS)content
  • 12 of 24 images use legacy formats (JPG/PNG/GIF) — convert to WebP/AVIF for 25-50% smaller filescontent
  • 2 non-descriptive link text(s) found ("learn more"...) — use descriptive anchor text for SEOcontent
  • Missing og:descriptionsocial
  • Missing og:image — social shares will lack visualssocial
  • Missing twitter:descriptionsocial
  • Missing twitter:imagesocial
  • No apple-touch-icon — iOS home screen will lack custom icontechnical
  • No hreflang tags — add if targeting multiple languagestechnical
  • Text-to-HTML ratio: 7.5% — too code-heavy, add more contenttechnical
  • 4 render-blocking scripts — add defer or async attributeperformance
  • No font preload/preconnect — add for faster renderingperformance
  • No author byline — add "By [Name]" for E-E-A-T trusttrust
  • No JSON-LD structured data foundschema
  • 2 HTTP resource(s) on HTTPS page — security risktechnical
  • robots.txt does not reference Sitemap — add Sitemap: URLtechnical
  • 2 target="_blank" links missing rel="noopener" — security risksecurity
  • No Web App Manifest — add for installability and modern web signalstechnical
  • No dns-prefetch or preconnect hints — add for faster third-party loadingperformance
  • Only 1 first-person pronouns — add personal experience (E-E-A-T)content
  • Only 13% responsive images — add srcset for mobilecontent
  • No max-image-preview:large — add to robots meta for Google Discovermeta
  • No Referrer-Policy header — may leak URL datasecurity
  • No Permissions-Policy — restrict camera/mic/geo accesssecurity
  • H1 length: 18 characters — too short, aim for 20-70content
  • 114 links on page — exceeds 100 limit, may dilute link equitycontent
  • 3 render-blocking script(s) in head — add defer or asyncperformance
  • 1 duplicate ID(s) found: "affiliate" (2x) — must be unique per HTML specaccessibility
  • 1 iframe(s) without sandbox attribute — security risksecurity
  • No RSS/Atom feed in <head> — add for content discovery by aggregators and AItechnical
  • No IndexNow integration — add for instant Bing/Yandex indexing on content changestechnical
  • 1 of 4 H2 sections have < 50 words — add more content or merge thin sectionscontent

Passed Checks (64)

  • Title tag found: "Health Information | National Institutes of Health (NIH)"meta
  • Title length: 56 characters (optimal)meta
  • Meta description found (156 chars)meta
  • Description length: 156 chars (optimal)meta
  • Viewport meta tag found (mobile-friendly)meta
  • UTF-8 charset declaredmeta
  • Language attribute set on HTML tagmeta
  • No meta refresh redirect (good)meta
  • Robots: "index, follow"meta
  • meta keywords tag found — deprecated since 2009, Google ignores it. Remove to keep HTML cleanmeta
  • Single H1 tag foundcontent
  • Title-H1 alignment: 50% word overlap (good)content
  • 4 H2 tags foundcontent
  • 771 words on page (good content depth)content
  • Readability: 40 (moderate — aim for 60+)content
  • All 24 images have ALT textcontent
  • JS-based lazy loading detectedcontent
  • 62 internal links foundcontent
  • 49 external links foundcontent
  • No empty href attributes foundcontent
  • Open Graph title foundsocial
  • Open Graph URL foundsocial
  • og:type set to "Article"social
  • og:site_name: "National Institutes of Health (NIH)"social
  • Twitter Card: "summary"social
  • Twitter title foundsocial
  • HTTPS enabledtechnical
  • Canonical tag foundtechnical
  • Canonical tag is self-referencing (correct)technical
  • Favicon foundtechnical
  • Compression: gziptechnical
  • URL length: 38 chars (good)technical
  • URL uses hyphens or no separators (good)technical
  • URL is lowercase (good)technical
  • No deprecated HTML tags foundtechnical
  • Inline CSS: 0.0 KB (acceptable)performance
  • Inline JS: 5.4 KB (acceptable)performance
  • Cache-Control: max-age=900, publicperformance
  • HSTS header present — enforces HTTPSsecurity
  • X-Content-Type-Options: nosniffsecurity
  • Content-Security-Policy header presentsecurity
  • About page link found — builds trust and E-E-A-Ttrust
  • Contact page/email link foundtrust
  • Privacy Policy link foundtrust
  • Terms of Service link foundtrust
  • <!DOCTYPE html> declaredtechnical
  • Viewport includes width=device-width (responsive)meta
  • robots.txt found and allows crawlingtechnical
  • No AI crawlers blocked in robots.txt — AI search and training can access contenttechnical
  • sitemap.xml found with valid formattechnical
  • Sitemap contains 2599 URL(s)technical
  • All 5 checked internal links are workingcontent
  • 0% nofollow links (0/112) — good link equity flowcontent
  • URL contains title keywords (50% match)technical
  • Title uses "|" separator — Google may rewrite it; consider using "—" insteadmeta
  • Title and meta description are different (good)meta
  • OG URL matches canonical URL (consistent)social
  • Viewport allows zoom (good for accessibility)accessibility
  • DOM size: ~643 elements (good)performance
  • No snippet blocking detected (good — content is fully accessible)technical
  • Trailing slash consistent between URL and canonicaltechnical
  • 2 third-party script domain(s) (acceptable)performance
  • ARIA landmarks present: main, nav, skip-linkaccessibility
  • All 2 form input(s) have labels or aria-labelsaccessibility
AEO — 50/100 (26 issues)

Failed Checks (26)

  • No structured data — AI systems struggle to categorize your pagediscoverability
  • Missing meta description or og:description — unclear topic for AIdiscoverability
  • No definition lists (<dl>) — great for term/value pairsdiscoverability
  • No code blocks — add <code>/<pre> for technical contentdiscoverability
  • No FAQ section found — add Q&A content for AI citationstructure
  • No question-style headings — try "How to...", "What is..."structure
  • No data tables — tables help AI compare and cite datastructure
  • No numbered steps or ordered lists — add for "how to" AI answersstructure
  • Few key-value patterns — use "**Label:** value" format for AIstructure
  • No author information — AI values attributed contentcitation
  • No Organization schema — AI may not recognize your brandcitation
  • Only 2/5 E-E-A-T signals — add about, contact, author infocitation
  • No Speakable schema — add for voice assistant optimizationvoice
  • First paragraph too short for AI snippetsvoice
  • No SearchAction schema — add for enhanced search presencevoice
  • Avg 26 words/sentence — simplify for AI and voicevoice
  • No summary section — add Key Takeaways for AI extractionvoice
  • Few Q&A blocks — add question headings followed by answer paragraphsvoice
  • No /llms.txt found — add one to guide AI crawlers (emerging standard)discoverability
  • No sameAs links in schema — add social profiles for brand recognitioncitation
  • Content not updated in 90+ days — stale content loses AI visibilitycitation
  • No Table of Contents — add jump links for AI content navigationstructure
  • No video content — multimedia increases AI engagement and citation ratediscoverability
  • Statistics lack source references — attribute data to boost AI trustcitation
  • No <details>/<summary> elements — use for expandable Q&A contentstructure
  • Brand "National Institutes of Health (NIH)" mentioned only 1 time(s) — repeat for entity recognitioncitation

Passed Checks (26)

  • 771 words in HTML — good content for AI parsingdiscoverability
  • Clean URL structurediscoverability
  • 6 semantic HTML5 elements founddiscoverability
  • 7 content sections — well-structured for AIdiscoverability
  • 64 named entities found — helps AI understand contextdiscoverability
  • 62 internal links — good topic cluster signaldiscoverability
  • 15 lists found — structured content for AIstructure
  • Concise answer paragraphs after headings — great for AI citationstructure
  • 58% short sentences — easy for AI to extractstructure
  • 100% concise paragraphs — scannable for AIstructure
  • Avg 128 words/section — sufficient depthstructure
  • Publication/modification date foundcitation
  • Statistics and data found in content — highly citablecitation
  • Source citations found — increases AI trustcitation
  • 49 external links — references authoritative sourcescitation
  • Content references 2026/2025 — fresh contentcitation
  • Topic focus: 100% — title keywords found in contentcitation
  • 6 answer-box-ready paragraphs found after headingscitation
  • 30 conversational words — good for AI summariesvoice
  • Active voice dominant (0% passive) — AI-friendlyvoice
  • Pronoun density: 3.8% — engaging, user-focusedvoice
  • 18 H2/H3 headings — good content structure for AIvoice
  • No AI-blocking robots directives — AI crawlers can access contentdiscoverability
  • Freshness signal found (Last-Modified header or dateModified)citation
  • AI crawlers allowed in robots.txt (0/4 explicitly listed)discoverability
  • Brand name consistent across 2/3 signals (title, OG, schema)citation
GEO — 57/100 (25 issues)

Failed Checks (25)

  • Few question headings — add Q&A headings for AI Overviewoverview
  • Content may be too thin for AI Overview (771 words, 4 sections)overview
  • No step-by-step content — add numbered guides for AIoverview
  • 40% optimal paragraphs — aim for 20-80 words per paragraphoverview
  • No author schema — add for AI trust signalsauthority
  • Incomplete brand identity — add og:site_name + Organization schemaauthority
  • No original data signals — "our research shows..." boosts AI citationauthority
  • No examples or case studies — add real-world examplesauthority
  • No links to authoritative sources — cite Google, Schema.org, etc.authority
  • No FAQ Schema — add for Google rich results + AI Overviewformat
  • No HowTo Schema — add for step-by-step rich resultsformat
  • No comparison content — add vs. sections for AI recommendationsformat
  • No summary section — add Key Takeaways or TL;DR for AIformat
  • No BreadcrumbList schema — add for better AI contextformat
  • Only one list type — use both <ul> and <ol> for content varietyformat
  • No conclusion — add recommendations or a verdict sectionformat
  • Few data points with units — add specific numbers (e.g., "300+ users")semantic
  • Incomplete Open Graph tags — AI uses these for contexttechnical
  • Only 0 Schema type(s) — add more for AI contexttechnical
  • 5 image(s) with generic/short ALT text — use descriptive textformat
  • Only 0 data points — add 5+ statistics for +30-40% AI visibility (Princeton study)authority
  • Author lacks credentials — anonymous content is a GEO penaltyauthority
  • Only 2/6 question types — add What, Why, How headingsoverview
  • No <figure>/<figcaption> — wrap images with descriptive captions for AIformat
  • No authoritative sameAs links — add Wikipedia/Wikidata/LinkedIn URIs to schema for Knowledge Graph alignmentauthority

Passed Checks (33)

  • Definition-style content found — featured snippet candidateoverview
  • Content freshness signals detectedoverview
  • Title and H1 are aligned — clear topic focusoverview
  • 2-level heading hierarchy — deep content structureoverview
  • Source citations found — increases AI trustauthority
  • 2/5 trust signals foundauthority
  • Expert quotes or testimonials found — authority signalauthority
  • 3 industry terms used — domain expertise signalauthority
  • 22 unique external domains — diverse sourcesauthority
  • 24 images all with ALT text — AI can understand visualsformat
  • Data visualization elements found (tables/charts)format
  • 3/6 formatting types used — rich contentformat
  • Multi-perspective content found — balanced analysisformat
  • Actionable advice found — AI prefers practical contentformat
  • 1 entity types in Schema — AI can classify contentsemantic
  • 7/14 semantic HTML5 elements — deep semantic structuresemantic
  • <time> element found — machine-readable datessemantic
  • 100% title keywords in first 200 words — good prominencesemantic
  • 52 contextual links — good cross-referencingsemantic
  • Content categorization found — clear topic classificationsemantic
  • 3 sections have subtopics (H3) — deep coveragesemantic
  • Mobile-friendly (viewport set)technical
  • HTTPS secure — trusted by AI systemstechnical
  • Page size: 66 KB — reasonabletechnical
  • Canonical URL set — prevents AI from citing duplicatestechnical
  • Page allows AI indexing (no noindex/nosnippet)technical
  • HTML lang="en" set (add og:locale for completeness)technical
  • 4 ARIA landmarks found — good accessibility for AItechnical
  • 771 words — sufficient depth for AI analysisoverview
  • First 200 words contain topic keywords — AI extracts primarily from the startoverview
  • Definition + list combo — strong featured snippet candidateoverview
  • No snippet restrictions — AI can freely extract content (good)technical
  • Link anchor texts are descriptive — good AI context signalsemantic

Fix These Issues Automatically

Using WordPress? Install SEO Autopilot for one-click auto-fixes on 65+ issues — with full undo support.

Get SEO Autopilot Plugin
Scan This Site Again Request Report Removal