Also:"Search engines are not trying to penalize content," Mukherjee said. "We're trying to find the right content to promote. Independent of how large our indexes get, there will always be capacity constraints."
"Honest site owners often worry about duplicate content when they don't really have to," Google's Cutts said. "There are also people that are a little less conscientious." He also noted that different top level domains, like x.com, x.ca, are not a concern.
Notice mention of archive is not listed anywhere.Jake Baillie, TrueLocal's president, described the top six duplicate content mistakes:
1. Circular navigation - having different paths through a site should be avoided. Publishers should define a consistent way of addressing page content no matter what navigation path a user takes through a site.
2. Printer friendly pages - if these are html pages, robots.txt should be used to block search engines from indexing them.
3. Inconsistent linking - calling directory pages in an inconsistent manner, like /directory and /directory/, should be avoided.
4. Product-only pages - it is not good for a site to have product pages and SKU pages; they should be consolidated if possible.
5. Transparent serving domains - use 301 redirection instead of DNS aliasing to get users to a canonical site from multiple domains.
6. Bad cloaking - Don't use cloaking scripts you didn't write. Make sure your cloaking script is returning separate content for each URL being cloaked.
Whole article: http://www.webpronews.com/topnews/to...sYourSite.html



LinkBack URL
About LinkBacks







Reply With Quote
