Phase 1: Exact Matches (Do First — Quickest Fix)
1. Check www vs. non-www versions. Visit both www.yoursite.com and yoursite.com on your main pages. If both load with identical content, you have duplication. Fix: Set a canonical redirect in .htaccess or your server config, or use domain settings in Google Search Console.
2. Audit HTTPS vs. HTTP versions. Similar to www/non-www — both http:// and https:// URLs may coexist. Redirect all traffic to HTTPS. Verify in GSC that you're claiming the HTTPS version as your primary domain.
3. Check trailing slashes. yoursite.com/services and yoursite.com/services/ may both load. Set a consistent redirect policy (usually remove trailing slash on root paths, keep on directories) and test with Screaming Frog.
4. Look for session IDs and tracking parameters in URLs. Search Console and server logs will show URLs like /page?sessionid=xyz or /product?utm_source=email. These generate duplicate pages. Use URL parameter handling in GSC to tell Google to ignore these, or implement canonical tags.
5. Audit login/checkout duplicate URLs. Pages behind authentication (cart, account dashboard) sometimes get crawled as separate URLs. Check your site's URL structure — if logged-in and logged-out versions coexist, block the logged-out version from crawling with robots.txt or noindex tags.
Phase 2: Parameter and Pagination (Medium Priority)
6. Check e-commerce filters and sorting options. On sites with product filters (color, size, price), each combination creates a new URL with identical content. Use GSC's URL parameter tool or canonical tags to consolidate. For pagination, use rel=next/prev tags or canonical tags pointing to the view-all page.
7. Audit faceted navigation. If your site generates pages like /shoes?color=red, /shoes?color=blue, etc., and the main category page (/shoes) shows the same products, you have duplication. Implement canonical tags pointing to the base category page, or use noindex on filtered variations.
8. Check cross-domain duplication (if you own multiple domains). If you operate site-a.com and site-b.com (or subdomains), verify you're not serving identical content on both. Google will see this as duplication even across domains. Decide which domain is canonical and redirect the other, or use canonical tags.
Phase 3: Template and Syndication Duplicates (Harder to Spot)
9. Review syndicated content attribution. If you republish articles from wire services, blogs, or partner sites, Google may see both versions as duplicate. Add noindex tags to syndicated versions, or use canonical tags pointing to the original source. Check your CMS for autopublished content.
10. Check boilerplate section duplication. Many pages share large blocks of identical copy (e.g., footer disclaimers, service descriptions, testimonials). This is usually not a major ranking issue, but excessive boilerplate (>30% of page content) can make pages look thin. Audit pages with highest traffic and highest boilerplate ratios first.
11. Audit auto-generated category pages. CMS systems sometimes auto-generate category/archive pages with identical meta descriptions or preview text. Use Screaming Frog to find pages with identical title tags or descriptions, then add unique copy to each.
12. Look for mirror or backup versions of pages. Old staging environments, cached versions, or mirrored subdomains (/backup, /staging, /v2) sometimes stay live. Search GSC for subdomains or file structures that shouldn't be indexed, then block them with robots.txt or add noindex tags.
Phase 4: Technical Detection (Requires Tools)
13. Run Screaming Frog to find duplicate title tags and meta descriptions. Download Screaming Frog's free version, crawl your site, and filter by Duplicate Page Titles and Duplicate Meta Descriptions. Prioritize high-traffic pages (check against GSC) and add unique titles/descriptions to each.
14. Check Google Search Console for URL variations reporting. GSC will flag some parameter-driven duplicates in its coverage reports or linked pages. Review and use the URL parameter tool to consolidate or exclude duplicates.
15. Audit structured data duplication. If your site uses schema markup, ensure each page's markup is unique (product names, prices, review ratings). Identical schema across multiple pages signals duplication to Google. Verify with Google's Rich Results Test.