✨ Get 25% OFFon any plan. Use the coupon:

Canonical URL: what it is and how to solve duplicate content

By Tiago CostaUpdated on July 2, 2026

Illustration of three nearly identical browser pages, with arrows pointing to the preferred version marked as canonical.
Definition

A canonical URL is the preferred version of a page that the search engine should index when duplicate addresses exist. In practice, the canonical URL:

  • is signaled by the rel=canonical tag in the page's HTML;
  • tells Google which address to treat as the original;
  • concentrates the authority of the links on that version;
  • solves duplicate content without removing the other pages.

What a canonical URL is

A canonical URL is the address you choose as the official version of a page when the same content, or very similar content, can be reached through more than one URL. Instead of letting the search engine decide on its own which one to show, you point to the preferred one, and the rest are treated as copies that refer back to the original.

The signal is given by a tag called rel="canonical", placed in the HTML head of the duplicate pages. It works as a note to the search engine: the version that counts for indexing and ranking is this one, not the others. The term comes from canonical, in the sense of official, the one that serves as reference.

A word of caution about vocabulary: you will find canonical, canonical tag and canonical URL all talking about the same thing. They all refer to the same mechanism of choosing and signaling the main version of a piece of content.

Why duplicate content is a problem

Duplicate content happens when the same text appears at different addresses, whether through versions with and without www, tracking parameters, product pages that repeat or print versions. For the search engine, this creates a doubt: which of these URLs should appear in the results?

The problem is more common than it seems. A study by Semrush, which analyzed 100,000 sites and 450 million pages, pointed to duplicate content as the most frequent SEO issue, present on about half of the sites reviewed. In other words, it is a trap that catches most projects at some point.

When Google finds several copies with no guidance, it may split the strength of the links between them, choose the wrong version to display or simply spend crawl budget visiting redundant pages. The canonical URL solves this by concentrating the signals on a single address.

Infographic of the canonical tag steps: duplicate pages, rel canonical tag, preferred version and concentrated authority.
How the canonical tag consolidates duplicate pages into a single preferred version.

How the canonical tag works

The canonical tag is a simple line in the HTML, but its effect is large. The format is this: <link rel="canonical" href="https://yoursite.com/official-page">, inserted inside the <head> section of each duplicate version, pointing to the preferred URL.

When it reads this markup, the search engine consolidates the duplicate pages onto the canonical one. In practice, this means the strength passed by the links (the so-called link juice) and the relevance measured by algorithms like PageRank concentrate on the official address, instead of splitting between copies.

Two points deserve attention. First, the canonical tag is a strong suggestion, not an absolute order: Google may choose another version if the signals point there. Second, a page can point to itself as canonical, a recommended practice to make clear which is the main version.

Canonical URL, redirect and noindex: when to use each

Canonical is not the only way to deal with similar pages, and picking the wrong tool costs traffic. The general rule separates three situations:

SituationIdeal tool
Duplicate content that should stay accessibleCanonical tag pointing to the preferred version
Page that changed address permanently301 redirect
Page that should not appear in search at allNoindex directive

The difference matters: canonical keeps both pages live and simply points to the preferred one; the 301 redirect takes the user and the search engine to the new address; and noindex removes the page from results without consolidating authority. Mixing canonical with noindex on the same page, by the way, sends contradictory signals and should be avoided.

Illustration of three paths comparing canonical, 301 redirect and noindex for handling similar pages.

Common mistakes with the canonical tag

The canonical tag is powerful precisely because it is easy to get wrong. The most frequent slips are:

  • Pointing to the wrong URL: canonicals that refer to missing, broken or irrelevant pages confuse the search engine.
  • Canonical chains: page A points to B, which points to C, creating a loop that weakens the signal.
  • Blocking the canonical in robots.txt: if the search engine cannot crawl the preferred URL, it does not read the instruction.
  • Ignoring URL parameters: filters and sorting generate many addresses that need a canonical, as the entry on URL parameters explains.
  • Confusing it with hreflang: versions in different languages are not duplicates and should use hreflang, not a canonical from one to the other.

A famous Google Search Console message, the one saying the search engine chose a canonical different from the one declared, usually comes from exactly these mistakes: the page's signals contradict the tag.

How to check the canonical URL of a page

Checking whether the canonical is correct is quick and prevents ranking surprises. A simple routine:

  • View the source code: look in the HTML for the rel="canonical" line and confirm which URL it points to.
  • Use URL inspection: the URL inspection tool shows the canonical declared by the page and the one Google actually chose.
  • Follow the indexing report: in Google Search Console, you can see which pages were consolidated onto another canonical.
  • Check consistency: make sure the canonical, the sitemap and the internal links all point to the same preferred version.

When the declared canonical and the one Google chose match, it is a sign the page is sending clear signals. When they diverge, it is worth investigating which signal is pulling the search engine toward another URL.

FAQ

Frequently asked questions

What is a canonical URL?

A canonical URL is the preferred version of a page, signaled to the search engine when the same content can be reached through several addresses. It is marked by the rel=canonical tag and tells Google which URL to treat as the original, concentrating indexing and link authority on it.

What is a canonical address?

A canonical address is another name for a canonical URL: the official address that represents a piece of content against possible duplicates. It is the version you want to see in the search results, the one the other copies point to through the canonical tag.

What is the canonical tag for?

The canonical tag is used to solve duplicate content without deleting pages. It tells the search engine which version to consolidate, preventing link strength from splitting between copies and stopping Google from choosing or showing the wrong URL in the results.

What is the difference between a canonical URL and a 301 redirect?

Canonical keeps the pages live and only points the search engine to the preferred one, useful when both versions need to exist. The 301 redirect is permanent and takes the user and search engine to the new address, right when the old page should stop being accessed.

Why did Google choose a canonical page different from mine?

This usually happens when the page's signals contradict the tag: a canonical pointing to a weak URL, internal links indicating another version or nearly identical content on different addresses. Google treats the canonical as a suggestion and may prefer the version it considers more relevant.

Your blog's technical SEO handled automatically

Automarticles writes, optimizes and publishes your blog articles on its own, handling canonicals, indexing and structure so your content ranks.

Start free trial
Keep learning

Related concepts

301 redirectThe 301 redirect is the type of permanent redirect that sends both the visitor and the search engine from an old URL to a new address, signaling that the change is definitive. Besides keeping people from landing on a nonexistent page, it passes most of the authority accumulated by the original URL to the new one, which makes it the correct way to change address, domain or site structure without losing positions on Google.NoindexNoindex is a directive that tells search engines not to include a page in the search results. It is applied through a robots meta tag in the HTML or through an HTTP header, and it makes Google drop the page from the index even when other sites link to it. Unlike robots.txt, which blocks crawling, noindex requires the page to stay crawlable so the search engine can read the instruction.URL parametersURL parameters are snippets added to the end of an address, after a question mark, that send extra information to the server, such as product filters, internal search terms, session identifiers or campaign tags (UTMs). They are useful for tracking traffic and organizing dynamic pages, but when poorly controlled they can create duplicate content, waste the search engine's crawling and dilute the ranking signals of a page.HreflangHreflang is an HTML attribute that tells search engines the language and, optionally, the region of a page, so they show the right version to each user. On sites with content in several languages or aimed at different countries, hreflang tags connect the equivalent versions of the same page and prevent them from competing with each other or appearing to the wrong audience. It is a central piece of international SEO.