Home » SEO for Ecommerce » Duplicate Content

Fixing Duplicate Content Issues in Online Stores

Duplicate content is the single most common SEO problem for ecommerce stores because online stores generate it at massive scale through URL parameters, faceted navigation, product variations, pagination, and copied manufacturer descriptions. When Google finds multiple pages with identical or nearly identical content, it has to choose which version to rank, and it often chooses wrong or ranks none of them well. Fixing duplicate content consolidates your ranking signals onto the correct pages and can produce immediate traffic improvements.

How Duplicate Content Hurts Your Rankings

Google does not penalize duplicate content with a manual action (the way it might penalize spam), but it does filter and consolidate duplicates in a way that weakens your rankings. When Google finds two or more URLs with the same content, it picks one as the canonical version and drops the others from search results. If Google picks the wrong URL, the URL with your optimized title tag and meta description might be excluded in favor of a parameterized version with no optimization.

More importantly, duplicate content dilutes your ranking signals. External links, internal links, and authority signals that should concentrate on one strong page instead get split across multiple duplicate URLs. If your product page exists at five different URLs and earns backlinks to three of them, none of the five gets the full authority benefit. Consolidating those duplicates through canonical tags channels all those signals to one URL, making it significantly more competitive.

Common Ecommerce Duplicate Content Sources

URL Parameters

Sorting, filtering, tracking, and session parameters create additional URLs that serve the same content. The same product page might be accessible at /products/blue-jacket, /products/blue-jacket?ref=homepage, /products/blue-jacket?utm_source=email, and /products/blue-jacket?variant=large. Each URL serves identical or nearly identical content, but Google may treat each as a separate page.

Faceted Navigation

Category page filters for size, color, brand, price range, and other attributes create thousands of URL combinations. A category with 5 sizes, 8 colors, and 4 brands generates 160+ filtered URLs, each showing a subset of the same product grid. Google sees these as separate pages competing for the same keywords.

Product Variations

Some platforms create separate URLs for product variations: /products/t-shirt-blue, /products/t-shirt-red, /products/t-shirt-green. If these pages share the same description, images (except the color swatch), and specifications, they are near-duplicates that compete with each other.

Pagination

Category pages with multiple pages (/category?page=1, /category?page=2, etc.) each show different products but share the same category title, description, and navigation. Google can struggle to determine which paginated page is most relevant.

HTTP vs HTTPS and WWW vs Non-WWW

If your site is accessible at both http:// and https://, or both www and non-www, every page has two (or four) duplicate URLs. This should be fixed at the server level with 301 redirects forcing all traffic to one version.

Manufacturer Descriptions

Using product descriptions provided by manufacturers means your product pages have identical content to every other retailer selling the same products. Google filters these duplicates and typically ranks the site with the most authority, leaving smaller stores invisible.

Fix 1: Implement Canonical Tags

Add a self-referencing canonical tag to every page.
Every page on your store should include a canonical tag in the HTML head that points to its own URL: <link rel="canonical" href="https://yourdomain.com/products/blue-jacket" />. This tells Google that this URL is the primary version and any other URLs with similar content should be treated as duplicates of this one. For pages with URL parameters, the canonical should point to the clean URL without parameters. All filtered, sorted, and tracked versions of a page should have their canonical pointing to the base, unfiltered URL.

Canonical tag rules for ecommerce:

  • Product pages: canonical points to the clean product URL without any parameters
  • Category pages: canonical points to the unfiltered, unsorted category URL
  • Paginated pages: each paginated page canonicalizes to itself (page 2 points to page 2, not page 1)
  • Product variations on separate URLs: canonical all color and size variants to the main product page, unless each variation has unique content and targets a different keyword
  • HTTPS and WWW: handled by 301 redirects, but adding canonical tags provides a secondary signal

Fix 2: Handle Faceted Navigation

Choose one approach for managing filter URLs.
The right solution depends on your platform and whether any filtered pages have standalone keyword value:

Option A: Canonical tags on all filtered pages. Every filtered URL includes a canonical tag pointing to the base category URL. Google consolidates ranking signals on the main category page. This is the simplest approach and works for most stores. The downside is that Google still crawls the filtered pages, using up crawl budget.

Option B: Noindex on filtered pages. Add a meta robots noindex tag to all filtered URLs. Google may still crawl them but will not index them, preventing duplicate content in search results. Combine with canonical tags for a belt-and-suspenders approach.

Option C: Block filter parameters in robots.txt. Add Disallow rules for filter parameter patterns. This prevents Google from crawling filtered pages at all, saving crawl budget. The downside is that any link equity pointing to filtered URLs is lost because Google cannot access them to follow the canonical.

Option D: JavaScript-based filtering. Implement filters using JavaScript that changes the displayed products without changing the URL. The URL stays the same as users apply and remove filters, so no duplicate URLs are created. This is the cleanest solution but requires more development effort and testing to ensure a smooth user experience.

Option E: Selective indexing. Some filtered combinations target valuable keywords. "Nike running shoes" or "waterproof hiking boots" might deserve their own indexed pages. Allow these high-value filtered pages to be indexed with unique titles and descriptions, while blocking all other filter combinations. This requires careful management but captures the most traffic.

Fix 3: Resolve Pagination Issues

Configure paginated pages correctly.
Each paginated page should have a self-referencing canonical tag (page 2 canonicalizes to page 2). Google no longer uses rel="next" and rel="prev" hints, so do not rely on those. Ensure Google can discover all paginated pages through navigation links (previous/next page links at the bottom of the product grid). Consider showing more products per page (48 or 60 instead of 24) to reduce total paginated pages. If you use a "Load More" button with JavaScript, also provide traditional pagination links that Google can follow.

Fix 4: Write Unique Product Content

Replace copied descriptions with original content.
Start with your highest-traffic and highest-margin products. Write unique descriptions of at least 300 words that cover features, benefits, use cases, and answers to common questions. See the on-page SEO guide for detailed product description guidelines. For stores with thousands of products, prioritize writing unique content for your top 100 revenue-generating products first, then expand coverage over time.

Fix 5: Consolidate HTTP, HTTPS, WWW, and Non-WWW

Set up 301 redirects at the server level to force all traffic to one version of your site: HTTPS with or without WWW (pick one and stick with it). In Apache, this is handled in .htaccess. In Nginx, it is handled in the server configuration. Most ecommerce platforms handle this automatically, but verify by trying all four combinations (http://yourdomain.com, https://yourdomain.com, http://www.yourdomain.com, https://www.yourdomain.com) and confirming they all redirect to your chosen canonical version.

Finding Duplicate Content on Your Store

Run Screaming Frog on your site and check the Duplicate tab to find pages with identical titles, descriptions, and H1 tags. Check Google Search Console's Pages report for pages excluded as duplicates. Search Google for site:yourdomain.com "exact phrase from a product description" to see if multiple pages on your site contain the same text. Use Ahrefs Site Audit or Semrush Site Audit for automated duplicate content detection with specific fix recommendations.

Fix the highest-impact duplicates first: pages with the most search traffic potential, pages that already rank but could rank higher with consolidated signals, and duplicates affecting your most important product categories.