What is Canonical URL Tag?

- Tuesday, December 22, 2009

Canonicalization is the process of picking the best URL when there are several choices, and it usually refers to home pages. Canonical URL tag is part of the HTML header on any webpage, inside <head>...</head> section.

<link rel="canonical" href="http://www.example.com/" />

What is Canonical URL Tag?

This tag tells all the major search engine bots (Google, Bing & Yahoo!) that the page in question should be treated as though it were a copy of the URL http://www.example.com/ and that all of the link & content metrics the engines apply should technically flow back to that URL.

Canonicalization helps to solve issues like www version vs. non-www version, re-directs, duplicate URLs, hijacking, etc... Avoiding duplicates in the search engine index has consistently been a key concern to all SEOs, Web Masters and Site Owners. When you use the <link> tag, you can indicate the canonical URL form for crawlers to use for each page of content, no matter how it was retrieved.

This puts the preferred URL form with the content so that it is always available to the crawler, no matter which session id, link parameter, sort parameter, parameter order, or other source of variance is present in the URL form used to access the page. When you have completely identical content, but with different URLs due to things such as a tracking parameters or a session ID:

Example:

  • http://example.com/
  • http://www.example.com/
  • http://www.example.com/#joydeep
  • http://www.example.com/?id=12345
  • http://www.example.com/?group=abcd
  • http://www.example.com/?src=banner&act=seo

This tag allows you to publicly specify your preferred version of a URL. This format provides you with more control over the URL returned in search results. It also helps to make sure that properties such as link popularity are consolidated to your preferred version.

So you can simply add this <link> tag to specify your preferred version (inside the <head> section):

<link rel="canonical" href="http://www.example.com/" />
And all the major search engine bots (Google, Bing & Yahoo!) will understand that the duplicates all refer to the canonical URL: http://www.example.com/.

Additional URL properties, like PageRank and related signals, are transferred as well (this is the best part I liked about it).

Best Practices for Canonical URL Tag:

  • The URL paths in the <link> tag should not returns a 404 Error
  • Avoid that the Canonical URL is not a re-direct URL
  • The URL paths in the <link> tag can be absolute or relative, I recommend using absolute paths to avoid any chance of errors.
  • A <link> tag should only point to a canonical URL form within the same domain or sub-domain and not across domains. For example, a tag on http://test.example.com/privacy.html can point to a URL on http://www.example.com/ but not on http://www.different-domain.com/ or any other domain.
  • Don’t make half of your links go to http://example.com/ and the other half go to http://www.example.com/. Instead, pick the URL you prefer and always use that format for your internal links.

Duplicate content on a site is not grounds for action on that site unless it appears that the intent of the duplicate content is to be deceptive and manipulate search engine results. If search engines can't crawl pages with duplicate content, they can't automatically detect that these URLs point to the same content and will therefore effectively have to treat them as separate, unique pages.

A better solution is to allow search engines to crawl these URLs, but mark them as duplicates by using the <link rel="canonical" href="http://www.example.com/" /> link element, the URL parameter handling tool, or 301 redirects.





About the Author

Author Joydeep Deb

has 9+ years of strong experience in Online Marketing, Web Technologies, Search Engine Optimization, Search Engine Marketing, Social Media Marketing, eMail Marketing, Web Master, Web Analytics, MIS Reporting, Analysis, Web Development and Design.

Get social with Joydeep at Twitter | Facebook | Google+ | LinkedIn

Comments


Blog List

Tools

MISC


Newsletter

Loading...

Thank you!!!

Keep in touch other than eMail!