How to Find a Website's Sitemap: 5 Reliable Methods

Learn 5 methods to find any website's sitemap, including checking common URLs, robots.txt, Google cache, and crawling tools.

Last updated: 2026-02-17

Why Would You Need to Find a Sitemap?

There are several good reasons to look for a website's sitemap:

  • Competitive analysis. A competitor's sitemap reveals their full site structure, content strategy, and how many pages they have indexed. You can see which product categories they prioritize, how their blog is organized, and whether they are scaling content.
  • Site auditing. When auditing a new client's site or your own property, the sitemap is the fastest way to get a complete inventory of pages.
  • Migration planning. Before migrating a site, you need a full list of URLs. The sitemap is often the most complete source.
  • Link building. Finding sitemaps of resource-heavy sites can reveal linkable pages and content gaps.

Not every website publishes a sitemap in an obvious location. Here are five methods to find one, from simplest to most thorough.

Method 1: Check the Default URL

The most common sitemap location is the site's root directory at /sitemap.xml. Start here.

1

Try the standard path

Open your browser and navigate to https://example.com/sitemap.xml. If the site has a sitemap at the default location, it will load as an XML document.

2

Try common variations

If /sitemap.xml returns a 404, try these alternate paths. Different CMS platforms and SEO tools use different conventions.

Common sitemap URL patterns:

/sitemap.xml
/sitemap_index.xml
/sitemap/sitemap.xml
/wp-sitemap.xml
/sitemap.xml.gz
/sitemap1.xml
/post-sitemap.xml
/page-sitemap.xml
/sitemapindex.xml

WordPress sites using Yoast SEO typically serve their sitemap index at /sitemap_index.xml. The built-in WordPress sitemap is at /wp-sitemap.xml. Shopify sites always use /sitemap.xml.

If you find a sitemap index file, click through to the individual sitemaps it references. These contain the actual URLs. The index file itself just points to other sitemap files.

Method 2: Check robots.txt

The robots.txt file often declares the sitemap location with a Sitemap: directive. This is one of the most reliable methods because the sitemap protocol recommends this practice.

1

Open the robots.txt file

Navigate to https://example.com/robots.txt in your browser. Every well-configured site has a robots.txt file at this exact location.

2

Look for the Sitemap directive

Search the file for lines starting with Sitemap:. The value after the colon is the full URL to the sitemap.

A typical robots.txt with sitemap declarations:

User-agent: *
Disallow: /admin/
Disallow: /cart/

Sitemap: https://example.com/sitemap.xml
Sitemap: https://example.com/sitemap-news.xml

Some sites list multiple sitemaps. This is common for large sites that split their sitemap by content type (products, blog posts, pages).

The robots.txt method works even when the sitemap is not at a standard URL. Some sites store sitemaps at unusual paths like /data/sitemap.xml or /seo/sitemap.xml. The robots.txt declaration reveals these non-standard locations.

Method 3: Use Google Search Operators

If the first two methods fail, Google itself may have cached or indexed the sitemap.

Try these search queries:

site:example.com filetype:xml
site:example.com inurl:sitemap
site:example.com "sitemap" filetype:xml

The filetype:xml operator filters results to XML files. Combined with site: to limit results to the target domain, this can surface sitemaps that are not at standard locations.

This method has limitations. Google does not always index sitemap files, and the results may be outdated. But it is worth trying when other methods come up empty.

Method 4: Check Google Search Console (If You Have Access)

If you have Google Search Console access for the site, the Sitemaps report shows all submitted sitemaps.

Navigate to Indexing > Sitemaps in the GSC sidebar. This page lists every sitemap that has been submitted, along with its status, URL count, and last read date.

This method is only available if you are a verified owner or user of the property. For competitor analysis, this is not an option -- use the other methods instead.

Monitor Any Sitemap Automatically

Site Watcher tracks sitemap availability, URL counts, and errors for every site you manage -- all from one dashboard.

Method 5: Use a Crawling Tool

When no sitemap is publicly accessible, a crawling tool can discover URLs by following links from the homepage, effectively building a sitemap for you.

Screaming Frog SEO Spider

Screaming Frog crawls a site by following links and can export the discovered URLs as a sitemap. It handles JavaScript rendering, follows redirects, and reports on status codes.

  1. Enter the site's URL and start a crawl.
  2. Once complete, go to Sitemaps > XML Sitemap to export the discovered URLs in sitemap format.

Free for up to 500 URLs. The paid version has no URL limit.

Browser-Based Crawlers

Several online tools will crawl a site and generate a sitemap without installing software:

  • XML-Sitemaps.com crawls up to 500 URLs for free and generates a downloadable XML sitemap.
  • Sitemap generators built into SEO platforms like Ahrefs, Semrush, and Moz can discover URLs through their own web crawlers.

Command-Line Tools

For technical users, tools like wget can spider a site and produce a URL list:

wget --spider --recursive --no-parent https://example.com 2>&1 \
  | grep '^--' | awk '{ print $3 }' | sort -u > urls.txt

This produces a raw URL list that you can convert to sitemap format.

What if No Sitemap Exists?

Not every website has a sitemap. Smaller sites with good internal linking may not need one. If you cannot find a sitemap after trying all five methods, here are your options:

For your own site: Create one. Every modern CMS and framework supports sitemap generation. Follow our guide on how to create a sitemap.

For a competitor's site: Use a crawling tool to build your own URL inventory. The crawl results will give you a similar picture of the site's structure and content scope.

For a client's site: Flag the missing sitemap as a technical SEO issue. A sitemap is recommended for any site with more than a few hundred pages, and there is no good reason for an established site not to have one.

Using Competitor Sitemaps for Research

A competitor's sitemap is one of the most underused research tools in SEO. It gives you a complete picture of their content strategy without any guesswork.

What You Can Learn

Content scope

How many pages do they have? How many blog posts? How many product pages? The raw URL count tells you the scale of their content investment.

Site architecture

The URL structure reveals how they organize content. Are products grouped by category? Do they have location-specific pages? Is their blog organized by topic?

Content gaps

Compare their sitemap URLs to yours. Pages they have that you do not represent potential content opportunities. Topics they cover that you skip are gaps worth evaluating.

Update frequency

The lastmod dates (if present and accurate) show which sections get updated most frequently. This reveals their content priorities.

Scale indicators

A sitemap index with multiple sitemaps indicates a large, actively managed site. The number and organization of sub-sitemaps reveals their operational sophistication.

How to Analyze a Competitor's Sitemap

  1. Download the sitemap (or sitemap index and all referenced sitemaps).
  2. Extract the URLs into a spreadsheet or analysis tool.
  3. Categorize by URL pattern -- group URLs by path structure to understand their content types.
  4. Count pages per category -- identify where they invest the most content.
  5. Check lastmod distribution -- see which areas are actively maintained.
  6. Compare with your own sitemap -- identify overlap and gaps.

Competitor sitemaps show what they want indexed, not necessarily what performs well. A page existing in a sitemap does not mean it ranks or drives traffic. Use the sitemap as a structural guide, not a ranking indicator.

Quick Reference: Finding a Sitemap

MethodWhen to UseReliability
Check /sitemap.xmlAlways -- start hereHigh (most sites use default path)
Check robots.txtIf default URL failsHigh (standard practice to declare here)
Google search operatorsIf robots.txt has no Sitemap directiveMedium (depends on Google indexing the file)
Google Search ConsoleOnly if you have accessVery high (shows all submitted sitemaps)
Crawling toolsIf no sitemap is publishedHigh (but only finds linked pages)

Finding a sitemap takes less than a minute when you know where to look -- start with the obvious places before reaching for advanced tools.

Track Sitemaps Across All Your Sites

Site Watcher monitors sitemaps, SSL certificates, DNS records, uptime, and vendor dependencies from one dashboard. Free for 3 targets. $39/mo for unlimited monitoring.