Articles About | Links
I see that Matt Cutts of Google is calling for link spam reports.
I’m still very troubled by this paid links issue after all these years!
I agree it’s Google’s right to penalise or promote any page/site in its natural listings, which represent Google’s subjective opinion of relevancy.
However, the idea that all paid links are bad/”evil” is wrong in so many ways:
- Paid links pre-date Google.
- There is no machine-readable standard for labelling a paid link. I’ll repeat that – there is no machine-readable standard for labelling a paid link.
- Labelling paid links fails the “Does this makes sense in the absence of search engines?” ethical test. The answer may well be “Yes”. (Where the answer is “No”, I agree paid links are spam).
- Labelling paid links fails the “Would I do this if search engines did not exist?” test. In fact, you have to know that Google exists, and that they mind about paid links, in order to label those paid links in the non-standard way that Google asks you to label them. This is perhaps my biggest beef with Google’s approach to paid links – they actually violate one of Google’s published Webmaster principles.
- What does “paid” mean anyway? An actual exchange of cash? If you look at the top results for any hugely commercial field, say “car insurance”, it’s hard to believe that there is no commercial influence in the results! When all that a company does is commercial, then every link (positive or negative) to that company’s site is commercial in nature.
I understand that a market in paid links arose because of Google’s algorithm.
However, the irony is that in responding to that market by asking all publishers to label paid links in a non-standard way, Google violated its own principles. It started to ask publishers to adapt what they published to suit Google (because Google existed), and called them spammers if they didn’t. That’s the wrong way around. It’s the spammers that do stuff purely because Google exists!
14
How much PageRank does a page that is not in the index have?
0 Comments | Posted by Alan Perkins in Crawling and Indexing, Google, Links, Search Engines
If a page has a NOINDEX tag on it, how much PageRank does it have? The intuitive answer would be “None”. How can a page that is not indexed have PageRank? Wouldn’t it be treated like a dangling link and disregarded during PageRank calculations?
Apparently not. Matt Cutts states on seomoz:
Does a link from a page with meta robots=”noindex, follow” carry less weight? no weight?
For Google, I believe such links would carry the same weight as normal links on regular pages.
Hmmm. Does he mean that the unindexed page actually has a PageRank? Or does he mean that the zero Pagerank that the unindexed page has would be divided out among the links on the page, giving nothing to each? I wonder …
One thing’s for sure … if “NOINDEX, FOLLOW” works as implied, it’s a great way to inject spammy content and links.
Thanks to Dan Thies for drawing my attention to the latest “mayhem” surrounding Google, rel=nofollow and the FTC. This is an area close to my heart, as my article from 2005, Search Marketing & The Law, made clear:
It would be foolish to expect to be operating in a multi-billion dollar global marketing industry and not expect to comply with marketing laws and regulations in the countries in which you are marketing.
The current confusion stems from Matt Cutts’ blog post on paid links back in April, which called for both human readable and machine readable disclosure of paid links – machine readable first:
If you want to sell a link, you should at least provide machine-readable disclosure for paid links by making your link in a way that doesn’t affect search engines. There’s a ton of ways to do that. For example, you could make a paid link go through a redirect where the redirect url is robot’ed out using robots.txt. You could also use the rel=nofollow attribute.
The problem here is that there is no machine-readable disclosure for paid links. Matt suggests that there a “ton” of ways, but none of these ways mean “this link is paid”, let alone the means, method and motive for payment. This is where the confusion starts.
Matt then goes on to discuss human-readable disclosure:
The other best practice I’d advise is to provide human readable disclosure that a link/review/article is paid.
Here I fully agree with Matt – it’s important not to mislead your visitors. No confusion here.
The real confusion seems to come from the next thing Matt says:
Google’s quality guidelines are more concerned with the machine-readable aspect of disclosing paid links/posts, but the Federal Trade Commission has said that human-readable disclosure is important too:
The petition to us did raise a question about compliance with the FTC act,” said Mary K. Engle, FTC associate director for advertising practices. “We wanted to make clear . . . if you’re being paid, you should disclose that.”
To make sure that you’re in good shape, go with both human-readable disclosure and machine-readable disclosure, using any of the methods I mentioned above.
Some people have inferred that Matt is saying that paid links that aren’t labelled in a machine-readable way are contravening the FTC guidelines. He isn’t saying this at all. Read carefully. The FTC is concerned with human-readable disclosure, not machine-readable disclosure. There is no machine-readable disclosure for paid links.
It is possible to place deceptive advertising in search results using various means. But failing to label a link as paid in a machine-readable way is not one of them. There is no machine-readable disclosure for paid links.
I see Matt Cutts has updated his “How to report paid links” post. I missed this one the first time around as he posted it on my 40th birthday.
Matt is head of Google’s Webspam team. We talked together quite a lot in 2001 when Matt was working on the Google Webmaster Guidelines. Matt had seen my talk on spam and cloaking at the 2001 Search Engine Strategies Conference in San Francisco, and had read my White Paper on The Classification of Search Engine Spam. He was particularly keen on my ideas of “Doing it for humans”. These ideas eventually found their way into Google’s Quality Guidelines as
Make pages for users, not for search engines. Don’t deceive your users or present different content to search engines than you display to users, which is commonly referred to as “cloaking.”
Since 2001, Matt and I have communicated regularly and have always seen eye-to-eye on issues of spam. That is, until now. I can’t help but feel that Matt’s stance on paid links and the application of the rel=nofollow attribute is the start of a slippery slope for Google.
Back in 2001, when we agreed on “Doing it for humans”, the principle behind this was that as a Web publisher you shouldn’t need to do anything specifically for search engines in order for them to crawl, index or rank your content. If you found yourself doing something specifically for search engines, that was the time to ask yourself whether you were spamming.
Fast forward to 2005 and Google introduced the rel=nofollow attribute; a contributor to that post was Matt Cutts himself. The best places to use this tag, according to the post, are
the actual links that other people can create [...] for instance, only the links within comments and the link immediately after “Posted by:” would get the rel=”nofollow” attribute.
This seemed like a great idea at the time. With hindsight, it marked the introduction of a tag that was specifically designed to affect search engine ranking algorithms:
At the heart of our [Google's] technology is PageRank™, a system for ranking web pages … [that] relies on the uniquely democratic nature of the web by using its vast link structure as an indicator of an individual page’s value.
By asking publishers to label links with rel=nofollow, Google was giving those publishers the power to affect PageRank. Nothing more and nothing less.
Search engines offer tags and other techniques to prevent crawling or indexing of Web pages – notably robots.txt files and the robots meta tag. But not since the ill-conceived and now much deprecated meta keywords tag has a search engine offered the ability to control rankings, in the way that the rel=nofollow attribute does.
So, the problem with rel=nofollow is that it’s “Doing it for search engines”, not the “Doing it for humans” that Matt Cutts and I agreed on back in 2001. Matt covered this in his post:
That same philosophy would mean that you wouldn’t create a robots.txt file (users don’t check those), never make any meta tags (users don’t see meta tags), never create an XML sitemap file (users wouldn’t know about them), and wouldn’t create web pages that validate (users wouldn’t notice). Yet these are all great practices to do.
However, there are two big differences between rel=nofollow and these examples quoted by Matt:
- rel=nofollow is not designed to affect crawling, or indexing, which are naturally of interest to Webmasters (as it’s their site and their content); but ranking, which is the preserve of the search engine (as it’s their algorithm).
- Failure to know about or deploy robots.txt, meta tags or sitemaps is not search engine spam. However, Matt is saying that failing to use rel=nofollow could be treated as spam; so a Webmaster who is buying links needs to know of the existence of rel=nofollow in order to avoid spamming.
I can see why rel=nofollow was felt necessary, and I agree it’s a good idea in certain circumstances (particularly when built into software such as Wordpress, rather than expecting individual publishers to know about it and apply it). What I can’t understand is why Google is now asking publishers to label paid links with rel=nofollow. The labelling of paid links was never mentioned as an application when rel=nofollow was introduced. There are some big problems with this approach:
- Even when rel=nofollow was introduced the Web was over 10 years old. What about all the old links, that were made before rel=nofollow existed? Are publishers supposed to go back and change them? All of them?
- Since rel=nofollow was introduced, it has become relatively well known in the dedicated search marketing and blogging community. But what efforts have been made to make it known to everyday Web designers and publishers?
- What exactly constitutes a paid link? A link to a parent company? A link to a partner company who has supplied you work in the past? What about a link to somebody who bought you a beer at a conference once? Or is it only a link for which money specifically was exchanged? In some commercial areas of the Web, depending on your point of view, all links could be considered paid. Although I don’t work in the pills/porn/casino industries, even categories such as financial services are so highly commercialised that almost all links could be considered as paid. Within such a category, if all links were labelled with rel=nofollow (as Google appears to want), what actual use would rel=nofollow be?
Given the above problems, I can’t quite work out whether Google is being humble or arrogant in asking publishers to label their links…
- Humble: It’s like Google is admitting that it can no longer detect and properly compensate for paid links algorithmically. The heart of their ranking technology is links, and link spam is hurting so badly that they are asking for help.
- Arrogant: It’s like Google is starting to lay down how the Web will be built. Rather than reacting to how people build Web sites, they are telling people how to build Web sites.
I am not enamoured with either of these possibilities.
