How Typo Works

A quick guide to scanning websites, understanding reports, and managing results.

How to Run a Scan
  1. Paste a website URL into the scan bar on the dashboard.
  2. Typo discovers pages automatically via sitemap.xml. If no sitemap is found, it falls back to a discovery crawl.
  3. Select individual pages from the folder tree, or hit Scan All to check everything.
  4. Typo renders each page with a real browser (Playwright), then runs it through the spellcheck pipeline.
What Typo Checks
  • Spelling errors — powered by LanguageTool, limited to typo and spelling categories only. Grammar, style, and punctuation rules are disabled.
  • Proper nouns — city names, state names, geographic locations, and brand names are recognized using NLP and a geographic database so they aren't flagged as misspellings.
  • Image alt text — alt attributes on images are extracted and spellchecked separately.
  • Meta tags — Open Graph titles/descriptions and SEO meta descriptions are checked for spelling errors.
What Typo Skips

To reduce noise, Typo automatically filters out:

  • Code blocks and inline code
  • URLs, email addresses, and file paths
  • HTML tags and markup-heavy segments
  • CMS artifacts — WordPress shortcodes, Payload CMS markup, and plugin-generated content
Understanding Your Report
  • Results are organized page by page. Each page shows its finding count and word count.
  • Summary cards at the top show total findings, pages with errors, and clean pages.
  • Each finding has a category: spelling, proper_noun, meta_tag, or alt_text.
  • Context highlighting shows the error in its surrounding text so you can quickly judge if it's a real issue.
  • Findings that appear in repeated content (like headers and footers) are deduplicated and shown once with a “found on N pages” count.
Scan Diffs

When you re-scan the same site, Typo compares results against the previous scan and tags each finding:

  • New — appeared since the last scan.
  • Fixed — was present before but is now gone.
  • Recurring — still present from the previous scan.
Scheduled Scans

Set up recurring scans so sites are checked automatically without manual intervention.

  • Creating a schedule — go to Schedules → New Schedule, enter a URL, verify pages are discoverable, then pick a frequency, day, and time.
  • Frequency — choose Weekly (pick a day of the week) or Monthly (pick a day of the month).
  • Time — times are displayed and configured in Eastern time (ET). They are converted to UTC behind the scenes.
  • What happens each run — Typo re-discovers pages via sitemap, runs the full spellcheck pipeline, and diffs results against the previous scan so findings are tagged as new, fixed, or recurring.
  • Run Now — trigger a scheduled scan on-demand at any time without affecting the next scheduled run.
  • Pause & Resume — temporarily pause a schedule to stop automatic runs. Resume it when you're ready.
  • Auto-pause — if a schedule fails 3 times in a row (e.g., the site is down), it is automatically paused to prevent repeated broken runs. You'll see a warning on the schedule detail page and can resume it manually.
  • Editing — click into any schedule to change its frequency, day, or time. Past scan history is preserved.
Managing False Positives

Two ways to handle findings that aren't real errors:

  • Dismiss — hides the finding for this scan only. It may reappear on the next scan.
  • Add to Dictionary — permanently adds the word to your team's custom dictionary. It won't be flagged on any future scan.
Sharing & Exporting
  • Share link — generates a read-only link that expires after 30 days. No login required for viewers.
  • Export — download scan results as CSV or Excel for offline review or client handoff.
Tips
  • Re-check a single page — on the page detail view, use the Re-check button to re-scan just that page without running the full scan again.
  • Low content pages — pages with very little text are flagged with a “low content” badge. These may be image-heavy pages or redirects.
  • Cross-page deduplication — identical errors found in shared content (headers, footers, navigation) are grouped and reported once to keep your report clean.