Sitemap Validator

Validate your XML sitemap instantly

Enter your site and we'll find your sitemap, check its structure, count its URLs, and confirm a sample are live — no signup. Then audit every URL in it.

Instant, free, no account needed — we locate and check your sitemap.

What we check in your sitemap

The instant check validates the essentials. A full audit then checks every single URL in your sitemap.

Sitemap is reachable

Located via robots.txt or /sitemap.xml and successfully fetched.

Valid XML structure

A proper <urlset> or sitemap-index root that search engines can parse.

Contains URLs

How many URLs the sitemap declares — an empty sitemap helps nobody.

HTTPS consistency

All listed URLs should be https on an https site — no http leftovers.

Sampled URLs are live

We fetch a sample to confirm they return 200, not redirects or 404s.

Full-site validation

Sign up to check every URL for canonical, noindex and status issues.

How to keep your sitemap healthy

A clean sitemap helps Google crawl efficiently and index the right pages. Here's how to maintain one.

1. List only canonical, indexable, 200 URLs

Your sitemap is a list of "please index these." Don't include redirected URLs, 404s, noindex pages, or non-canonical variants — they send mixed signals and waste crawl budget. Every entry should be the canonical version, return 200, and be indexable.

2. Declare it in robots.txt

Add a Sitemap: https://yourdomain.com/sitemap.xml line to robots.txt so crawlers find it automatically, and submit it in Google Search Console to monitor coverage and errors.

3. Use a sitemap index for large sites

Each sitemap caps at 50,000 URLs / 50MB uncompressed. Past that, use a sitemap index that points to multiple child sitemaps. Keeping them logically split (by section or type) also makes coverage problems easier to diagnose.

4. Keep it in sync with your site

Generate the sitemap dynamically so new pages appear and deleted ones disappear automatically. A stale sitemap full of removed URLs erodes trust in the whole file. Re-validate after any major content or URL-structure change.

5. Watch for "indexable but not in sitemap"

The reverse problem matters too: indexable pages missing from the sitemap may be under-crawled. A full crawl compares your live indexable pages against the sitemap and flags the gaps in both directions.

Sitemap validator FAQ

What is an XML sitemap?

A file (usually /sitemap.xml) listing the URLs you want search engines to crawl and index. It helps Google discover pages, especially on large sites, and signals your canonical URLs.

What makes a sitemap unhealthy?

Malformed XML, non-canonical or redirected or 404 URLs, mixed http/https, noindex pages, or not being declared in robots.txt. A sitemap should contain only canonical, indexable, 200-status URLs.

Where should my sitemap live?

At the site root (e.g. /sitemap.xml), declared in robots.txt, and submitted in Search Console. For big sites, use a sitemap index pointing to child sitemaps (max 50,000 URLs each).

Should every page be in the sitemap?

Only the indexable, canonical, 200-status pages you want ranked. Leave out redirects, noindex pages and duplicates. Run a full audit to check every URL in both directions.

More free SEO tools

Validate every URL in your sitemap

Free to start. Crawl your site and catch every bad sitemap entry.

Validate my sitemap