SHARE
Facebook X Pinterest WhatsApp

Search vendors get canonical on results

Feb 13, 2009
google.logo.jpg
yahoo.search.small.gif

From the

‘to WWW or not to WWW

files:

It’s always a bit of a mystery to figure out if it matters whether or not you need to use ‘www’ in front of a domain name or not. That is www.example.com or just example.com.

Sometimes one will refer to the other and in some cases both will exist which can end up confusing search engines with duplication. Google, Yahoo and Microsoft have now teamed up for a new Search Engine standard that will provide a solution for the problem, properly referred to as a canonical domain (that is what section of the URL before the example.com). It’s the new link rel=”canonical” tag that can help to specify what should be indexed and how.

“When you use the tag, you can indicate the canonical URL form for crawlers to use for each page of content, no matter how it was retrieved,” Priyank Garg Director Product Management
Yahoo! Search blogged. “This puts the preferred URL form with the content so that it is always available to the crawler, no matter which session id, link parameter, sort parameter, parameter order, or other source of variance is present in the URL form used to access the page.”

Canonical links can also be extremely useful for sessionID tagged pages that are dynamically generated. Those types of pages tend to be difficult to index and often get a mod_rewrite (that is the webserver rewrites the address to something human readable) but it still leaves two (or more) potential addresses for the same content that a search engine could find.

Google in its discussion of the new tag gives an example that is yet another potential implementation of the link rel=canonical tag. Google’s exampls uses the wikia page http://starwars.wikia.com/wiki/Nelvana_Limited which specifies its rel=”canonical” as: http://starwars.wikia.com/wiki/Nelvana.
According to Google’s blog post on this issue:

The two URLs are nearly identical to each other, except that
Nelvana_Limited, the first URL, contains a brief message near its
heading. It’s a good example of using this feature. With
rel=”canonical”, properties of the two URLs are consolidated in our
index and search results display wikia.com’s intended version.

This is a really interesting development from my point of view that will both add complexity and simplicity to web developers’ lives.

On the one hand, we’ve now got greater control than ever for search engine optimization of pages. On the other hand, this is yet another way to re-write URLs which makes overall site management even more complex than before. Instead of just having URLs and then maybe a few rewritten ones, now you’ve got to worry about natural URLs, rewritten URLs and then canonical ones. Then again a good Sitemap could really help out there too, keeping it all straight.

Recommended for you...

Facebook Becomes Meta, But Did It Move Too Soon?
Rob Enderle
Oct 29, 2021
Microsoft Gets Rid Of Passwords: I Can Almost Hear Angels Singing
Rob Enderle
Sep 17, 2021
Why AMD Has Been So Successful: Mark Papermaster
Rob Enderle
Sep 9, 2021
Another Crazy Week in Cybersecurity
Paul Shread
Jul 2, 2021
Internet News Logo

InternetNews is a source of industry news and intelligence for IT professionals from all branches of the technology world. InternetNews focuses on helping professionals grow their knowledge base and authority in their field with the top news and trends in Software, IT Management, Networking & Communications, and Small Business.

Property of TechnologyAdvice. © 2025 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.