Sitemap
Finder Tool
Free
The fastest free sitemap finder online. Enter any domain to instantly locate all XML sitemaps via robots.txt parsing, 19 common path probes, and HTML link tag discovery — with type detection, URL counts, and child sitemap listing. No login required.
Enter any domain or URL — the tool automatically extracts the root and runs all three discovery methods.
How the Sitemap Finder Works
Three discovery methods run in sequence to ensure no sitemap is missed — from the authoritative robots.txt declaration to non-standard HTML link tags.
Enter Domain
Type or paste any domain or page URL. The tool extracts the root domain automatically — no https:// needed.
Check robots.txt
Fetches domain.com/robots.txt and extracts every Sitemap: directive — the most reliable source of sitemap URLs.
Probe 19 Paths
Probes /sitemap.xml, /sitemap_index.xml, /wp-sitemap.xml, /yoast-sitemap.xml, and 15 more common locations.
Inspect & Export
Each sitemap shows its type, URL count, source, and child sitemaps. Export CSV or copy all URLs in one click.
Everything the Sitemap Finder Identifies
Beyond just finding the URL — each result is validated and typed so you know exactly what you're dealing with.
Standard XML Sitemap
The classic /sitemap.xml format with <urlset> containing a list of <url> and <loc> entries. Used by most CMS platforms including Shopify, Squarespace, and basic WordPress setups.
Sitemap Index
A master <sitemapindex> file that references multiple child sitemaps. Large sites split by content type — posts, pages, products, categories. This tool detects indexes and lists every child sitemap URL.
News Sitemap
The Google News sitemap format listing articles published within 48 hours with publication date metadata. Required to appear in Google News. Identified by <news:news> namespace elements.
Video Sitemap
Extended sitemap format using <video:video> elements to help Google index video content. Includes thumbnail, title, description, and duration metadata for each video page.
Image Sitemap
Extended sitemap using <image:image> elements to help Google discover images embedded in pages that may not be crawlable through regular HTML parsing.
HTML Discovery
Scans the homepage for <link rel="sitemap"> and application/xml link elements in the HTML head — the third-most-common way CMSs declare their sitemap location.
Everything You Need to Know About XML Sitemaps
Sitemaps are foundational technical SEO. Here is what every site owner and SEO practitioner needs to understand.
What Is a Sitemap & What Is a Sitemap URL?
An XML sitemap is a file that lists every URL on your website you want search engines to discover and index. Think of it as a directory you hand directly to Googlebot. A sitemap URL is the web address where this file lives — most commonly https://yourdomain.com/sitemap.xml or /sitemap_index.xml.
Google uses sitemaps to prioritise crawl budget, understand site structure, and find pages that might be hard to discover through internal links alone. Submitting your sitemap URL to Google Search Console is one of the first tasks for any new website.
When does a sitemap matter most?
Sitemaps matter most for new sites with few backlinks (Googlebot won't discover pages through links), large sites with thousands of pages, sites with thin internal linking where important pages are buried, and sites with rich media like video, images, and news articles that need extended sitemap formats to get indexed correctly.
How to Find the Sitemap of a Website — Manual Methods
If you prefer to find a sitemap manually, try these four approaches in order:
1. Check robots.txt (most reliable)
Navigate to domain.com/robots.txt. Look for a line starting with Sitemap: — site owners explicitly declare sitemaps here, making it the most authoritative source. Some sites list multiple sitemaps this way.
2. Try common paths
Try /sitemap.xml and /sitemap_index.xml first. WordPress with Yoast uses /sitemap_index.xml; WordPress core sitemaps since 5.5 live at /wp-sitemap.xml. Shopify stores serve from /sitemap.xml.
3. Google search
Search site:domain.com filetype:xml in Google. This surfaces any XML files Google has indexed, often including sitemaps. Useful as a quick verification step.
4. View page source
View source on the homepage and search for sitemap. CMSs sometimes include a <link rel="sitemap" href="..."> tag in the HTML head.
Sitemap Best Practices for 2025
A sitemap does not directly boost rankings, but it significantly influences how fast and completely Google indexes your site. Follow these rules to maximise its SEO value:
- Only include indexable URLs — never list pages with noindex tags, canonical redirects, or 4xx/5xx status codes. Use our noindex checker to audit sitemap pages.
- Keep lastmod accurate — update it when content genuinely changes. Google ignores inflated or permanently static lastmod values over time.
- Split large sitemaps — keep each sitemap under 50,000 URLs and 50 MB. Use a sitemap index to tie them together.
- Declare in robots.txt — the fastest way to ensure all crawlers (not just Google) discover your sitemap automatically.
- Submit to Google Search Console and monitor the coverage report for the ratio of submitted vs indexed URLs — a gap here indicates serious crawl issues.
- Check sitemap URLs return 200 — any URL in your sitemap returning a redirect or error wastes crawl budget. Run a redirect check on your key sitemap pages.
Complete Your Technical SEO Audit
After finding your sitemap, use these tools to audit the URLs inside it and fix indexing issues across your site.
Frequently Asked Questions
Common questions about finding, reading, and submitting XML sitemaps for SEO.
How do I find the sitemap of a website?
The fastest way is to use this free sitemap finder — enter any domain and it automatically checks robots.txt for Sitemap: directives, probes 19 common paths like /sitemap.xml and /sitemap_index.xml, and scans the homepage HTML. Manually, navigate to domain.com/robots.txt and look for a Sitemap: line, then try /sitemap.xml directly in the browser if none is declared.
What is a sitemap URL?
A sitemap URL is the web address where a website's XML sitemap file is hosted — for example, https://example.com/sitemap.xml. It is an XML file that lists URLs the site owner wants search engines to crawl and index. You submit this URL to Google Search Console under Indexing → Sitemaps so Google can discover and prioritise your content.
How to see the sitemap of any website?
Find the sitemap URL using this tool, then open it in your browser. XML sitemaps display as structured markup — <urlset> wraps a list of <url> entries each with a <loc> (page URL) and optional <lastmod>. For sitemap indexes (<sitemapindex>), this tool automatically lists all child sitemap URLs so you can inspect each one. You can also use browser extensions like XML Viewer to format the raw XML nicely.
Why can't I find a sitemap for a website?
Not all websites have a sitemap. WordPress without Yoast SEO, Rank Math, or the core sitemap feature enabled generates no sitemap by default. Small or hand-coded HTML sites rarely have one. Sitemaps may also be at non-standard paths not covered by common probes, behind authentication, or blocked from crawlers in robots.txt. If no sitemap is found, generating one is strongly recommended — WordPress users should install an SEO plugin, Shopify has a built-in sitemap at /sitemap.xml, and static site generators like Jekyll and Hugo have plugins for it.
What is a sitemap index?
A sitemap index is a master XML file (typically at /sitemap_index.xml) that references multiple individual sitemap files. Large websites split their URLs across several files — posts-sitemap.xml, pages-sitemap.xml, products-sitemap.xml — and list them all in the index. Google's limit is 50,000 URLs per sitemap file and 50 MB per file, so sites with more URLs must use a sitemap index. This tool detects sitemap indexes and expands all child sitemap URLs automatically.
How does this sitemap checker work technically?
The tool runs three discovery methods in sequence. First it fetches domain.com/robots.txt and extracts any Sitemap: declarations. Then it probes 19 common sitemap paths (including CMS-specific ones like /wp-sitemap.xml and /yoast-sitemap.xml) and validates each response as a genuine XML sitemap by checking for <urlset> or <sitemapindex> elements. Finally it scans the homepage HTML for <link rel="sitemap"> tags. Every found sitemap is validated, typed, and its URL count or child sitemaps extracted.
Is this sitemap finder free?
Yes — completely free, no account required, no usage limits, no watermarks on exports. All tools on Behind the Search are free for SEO professionals. Run it as often as you need, on any domain.
Built by SEOs, for SEOs
Behind the Search is a free suite of technical SEO tools built to give you the same data as expensive enterprise platforms — without the subscription. Every tool is engineered from the ground up for accuracy, speed, and real-world utility.
From noindex checkers and robots.txt validators to this sitemap finder — each tool solves a specific, recurring problem that SEO professionals and site owners face every week.