Using BlackBerry Bold as a Tethered Cellular Modem on Orange-Israel Network

June 21, 2009 – 12:32 am

My new company got me a snazzy BlackBerry Bold to help me keep up to date with all the emails and tasks i need to do, since I am working the majority of the time from home. Since I live in Jerusalem and the new office is in Tel Aviv, on those days that i do commute, I have about 3 daily hours of train time to kill. Needless to say, the optimal way to utilize that time is to have an internet connection through the whole ride, but without the wireless modem, there is nothing to do but watch How I Met Your Mother :)

BlackBerry Bold has changed all that and one of the first things i did when i got it is to set out and find instructions how to tether its internet connection to my laptop. After a long search and a lot of dot connecting, I managed to find a solution:

  1. Connect your Blackberry Bold through the USB and start the Desktop Manager app (came with your phone). Without the Desktop Manager running, you will not be able to connect to the internet.
  2. Go to Control Panel -> Phone and Modem Options -> "Modems" tab.

  3. You should be able to see a list of all the modems you ever installed on your computer. Blackberry modem should be marked as a Standard Modem on COM14 or COM16. Notice how all the other standard modems are Bluetooth modems. I am guessing that Blackberry can serve as a modem even over Bluetooth, but have not checked this.

  4. Click Properties (if on Vista, now click on Change Settings in the new window) and go to Advanced tab. Note: If you are on Vista and did not click on the "Change Settings" button, you will not be able to perform the next step

  5. Under the Extra Settings in the Advanced Tab, enter the following line:

    at+cgdcont=1,"IP","internet"

    Make sure that there are no spaces before and after. Click on OK to close the window

  6. Now go to your Network Connections and create a new connection (Setup a New Connection on Vista). Choose Setup a Dialup Connection and pick the  Standard Modem (not the bluetooth one).

  7. Next step you are going to be asked for a dialup phone number. Enter the following number:

    *99***1#

    No username or password are needed.

  8. Go to the list of your network connections, right click the new connection you created and click on Properties. Go to Networking tab, pick the TCP/IP protocol (on Vista this is the TCP/IPv4 protocol) and click on Properties. Click on Advanced. Uncheck the "Use IP Header Compression" checkbox (marked in red in the below picture).

 

That is about it. Before connecting, make sure that you have 3G network enabled on your Blackberry. I personally disable 3G since it devours my battery within like 6 hours. The connection speed that I measured was something under 2Mbps download, which is not bad at all. I also managed to use my VPN client and have yet to find a significant difference between this connection and the regular WiFi connection.

 

I’m sure that not everyone will manage to connect smoothly with just following the above instructions, but I am hoping that at least some will get some benefit from them. in case you hit some bumps, let me know in the comments.

Back to SEO Basics – from Site Review Session at Google I/O 2009

June 20, 2009 – 12:30 pm

In between the whole nofollow-gate affair, SMX Advanced, Google indexing links in Java and a velvet revolution in Iran, there was a Google I/O conference. The conference sounded like a pretty cool place to be, especially with the Google Wave announcement and the Android phones being given out to all attendees, there was also a Site Review Session done by Matt and two guys from his team, Brian White and Greg Grothaus (who mostly  sat and looked at their laptops while Matt was stealing all the limelight :) ).

So while Matt was talking mostly about pretty basic stuff (which was understandable since this was not the usual SEO crowd he caters to), I managed to fish out a few interesting tidbits from his talk. While these are not necessarily new for everybody, it seemed to me that some of the stuff he said was not always clear enough or was just guessed, so it is nice to have them out in the open and confirmed. I would recommend for anyone interested to go and watch the video and for those that do not have time/mental strength to do so, here are a few gems that I  picked out of the talk. Again, it is possible that none of these are new for you, but I am sure that there will be people that will learn some things from this:

  1. Google doesn’t read the text in the images.

    While this is clearly stated in Google Webmaster Help section, every now and then there are speculations popping up about how Google definitely reads text inside images. So Matt clearly said that Google will not OCR the whole web and will not read the text inside images any time soon, even if they do have OCR capabilities, as can be seen from the Google Books example.

  2. Don’t worry about keyword density, make sure Google knows there is a word on your page, after 2-3 repeats, it doesn’t matter any more.

    While it was clear for a long time that keyword density does not play a role in the ranking algorithm, I have never seen a clear statement of “2-3 repeats” and then no impact. Matt did review a site that was stuffing keywords and made it clear that Google knows how to discover these cases and punish them.

  3. If you want some links to get more PR, put them higher up in the page.

    We all knew that the links showing higher up on the page seem to be more important, but this is the first mention of the fact (that I know of) that there is different PR flow related to the position of the link inside the page. This may explain several phenomena that people were seeing affecting the importance of the link (such as supposed decreased importance of links in footer)

  4. Google doesn’t care if you link to www.site.com/dir/dir/dir/page.html as opposed to www.site.com/page.html. All that matters is the linking distance from the homepage. Other search engines, however, may deem that important

    So while it is not exactly news that Google does not care how deep your URL structure is, as long as the pages are linked from homepage, it is interesting that he bothered to mention the situation with other search engines. So when building your website hierarchy, try to make it as shallow as possible

  5. Pagerank can go to images, not only pages

    Again while this shouldn’t come as a surprise to anyone, it is interesting to hear it directly from Matt himself. So when thinking about those linking images, make sure that you are using an optimal image that you would want to appear in the Image SERPs

  6. Typically, Google revisits the whole index on a weekly basis. In the worst case, it will be monthly.

    This seems like a good benchmark to start checking whether your pages are in the main index or the supplemental  or whether you are being punished or neglected for some other reason.

None of these are news to people that have done optimization for a while, however i have a feeling that a few newcomers may find this information valuable, especially when one sees the questions that are being asked on forums, blogs and Google’s Webmaster Help groups. So there it is.

Affilicon 2009 Presentation – Behavioral Metrics in Search Engine Algorithms

June 2, 2009 – 10:14 pm

Just came back from the Affilicon conference where I participated on the Future of SEO panel and gave a talk about behavioral parameters that (maybe are and) will possibly be used by search engines in their ranking algos. The response to the presentation was great (especially from the SEO crowd at the conference). All in all, had a great time, met a few new people, started rolling the ball on some SEO meetups and cooperations we may organize here in the near future and enjoyed the panels. Food was also great. Kudos to Itay and the crew for putting everything together and big Thank You for providing the opportunity to participate on the panel. Here is the presentation I gave:

Google News Listing Troubleshooting

May 24, 2009 – 4:33 pm

We usually focus on optimization for the organic SERPs, but there is a lot of traffic to be found in other Google Services as well. For example, if we look at the levels of traffic other Google subdomains are receiving, we will find that even though the numbers drop sharply after google.com, there are still substantial numbers in other services:

(taken from Wolfram Alpha, originally from Alexa)

Specifically, Google News can be a great source of traffic not only due to the volume of searches performed, but also due to perceived authority that listing in the News section can carry. I don’t have any hard data on this, but it would be interesting to compare the conversion rate of News visitors vs. regular search visitors.

In any case, I wanted to share a problem one of our clients had with his Google News listings recently and the way how we managed to solve this problem, hoping that this solution will help other people with similar problems.

We’ve noticed that the number of visitors coming from Google News has dropped sharply in mid-April. Performing a site: search on Google News, showed that the most recent article indexed was from April 18th. Since this specific client publishes several news articles a day, all of which usually get featured on Google News immediately, it was obvious there was a problem.

A quick check of the Google Webmaster Tools of the site discovered a list of news-related errors, the most frequent one being “Title not Found”

Searching for that specific error on Google, led me to this thread on Google News Help forum, which in turn pointed to this page on Google News Help for Publishers section. There are two suggested ways to fix this particular problem:

  1. Make sure that the title is repeated both in the <title> tag and again in the body of the article in <h1> tags. The suggestion is that these two are as similar as possible, although we found that they do not have to be identical
  2. Ensure that the <h1> title in the body of the article is not hyperlinked

Prior to the first delisting of this client’s articles from Google News, they performed a number of changes on the website, including hyperlinking the <h1>Title to point back to the original article page. This practice is commonly used to both ensure the link back to the original content from the websites that pick up the articles feed from the website and to tell Google which of the articles is the original one and thus prevent duplicate content issues.

It turns out that this hyperlink was preventing Google News bot to correctly identify the Title of each article and therefore to stop lisitng them in Google News. Removal of this hyperlink got the client’s articles back into the Google News in a matter of minutes.

Quick glance at other Google News listed sources shows that the majority of them do not have hyperlinked Titles. There are those that do (like LA Times for example), but they may be doing something else to ensure the correct recognition of the Title. In any case, if you are planning on getting your site listed in Google News, linking from the <h1> title of each article is not something I would recommend.

Tags: , , ,

False Alarm

April 9, 2009 – 9:40 pm

I am pulling down the last post about the IE8 compatibility tag. It looks like the quotation marks got screwed somewhere in the process of copying the tag and that was what was screwing the title tags.

My bad, will validate in the future, lesson learned.

Thanks to Thomas Scholz and Christian Schmidt for spotting the (rather stupid) mistake.

After 8 years

March 23, 2009 – 10:57 pm

Yesterday I gave notice at Tens Web Marketing. It feels weird to even write this, let alone do it.

I have joined Tens at 2001. Fresh out of the army, I moved to Jerusalem, searched for [internet companies in Jerusalem] on Walla and sent out my CV to all of them. Emanuel was one of the few that responded and it all  went downhill from there. There was lot of discussing, plenty of learning, lot of good times and some bad times too. In the course of those 8 years. Internet has completely transformed (think no social media), SEO has evolved from quick fix of the metatags to what it is today (don’t get me started there) and Tens has evolved with it all. From being a two man (and a cat) operation working from a private apartment in the beautiful Rehavia neighborhood of Jerusalem to the 10 people company in the offices in the hi-tech area of Har Hotzvim, it has certainly grown since then. Together we went through the aftermath of the bubble bursting (not this bubble, that bubble), demasking of GoogleGuy from WMW, Florida update, participation on (late) Jimworld forums, iHelpYou forum and now Sphinn and many other landmarks in the SEO timeline (bit weird to call it history).  In those 8 years we have (usually successfully)  tackled some of the most bizarre search engine niches (hip replacement anyone?) as well as some of the most competitive ones. With time, the company exposure has grown as well and from giving a talk to 20 marketing executives with question marks above their heads (SE-what?) at the Israel Export Institute, cooperating with other SEOs at SEO Circle, talking to entrepreneurs at the MIT Forum, helping organize the first Sphinncon here in Jerusalem (oh man, that was fun) and talking on the SEO panel at the coming Affilicon and Tens has definitely become an integral part of the Israeli Internet marketing landscape.

So where am I headed next ?

I have joined a very interesting marketing company from Tel Aviv named WhiteWeb. Beside the impressive customer portfolio and all the benefits, the things that was a deal breaker for me was the job description – I am to be responsible for acquiring and maintaining the know-how of the company. What that means is that I will be doing a lot of experimenting, problem solving, A/B/C/D/E… testing and blogging – one of the things I asked for was to be able to blog about all the interesting findings and discoveries and you can definitely expect more of that coming. In addition I will be attending more conferences both here and abroad so will finally get to meet a whole lot of people I was interacting with through different (fill the blank)spheres. It tells a lot about the company that is willing to invest in a full time research position even in this economy and I am very excited to be joining the WhiteWeb team.

I will not be neglecting my MSc. either and until I finish the lab work (beginning of October), I will be working partly at the office and partly from home.

So since I am staying in the field, I will definitely be bumping into all of the Israeli SEOs in general and the Tens crowd in particular. I am really thankful for everything I’ve learned and am wishing all the best to my friends at Tens.

My Thoughts on Google’s Webmaster Tools 404 Report

October 15, 2008 – 9:31 am

Right. It has been busy few months and it seems like there is some blogging due. Since the results of the latest batch of experiments I have been busy with are inconclusive, I thought to do a non-experimental post in the meantime, just to keep my ~500 (whoha-just-went-down-to-380-overnight-was-it-something-i-said. The numbers went back, it was some kind of a glitch) RSS subscribers (whoohooo) happy.

So as many of you have probably heard, Google Webmaster Tools have started reporting the referrals of the 404 errors they have encountered as they are crawling the site. Matt was very surprised no one from the SEOsphere was writing about it (although since then, there was a number of posts about this issue), and rightly so – this report can be very useful for several purposes:

  1. As a webmaster you want to know which pages on your site are being requested and from where and act upon that information. One way of acting is by fixing the possible URL mistakes you may have on your site. Another way is creating a customized 404 pages that will channel your incoming traffic to more useful places on your site. Just imagine that you are getting tons of requests from a bad link from a forum on a certain topic and being able to offer those visitors a customized 404 pages which casually offers links to the same-topic page on your site. Or 301ing the visitors to a landing page offering related products. The possibilities are numerous.
  2. As an SEO you do not want those links to go to waste by sending their link juice to your default/customized 404 page. You can redirect those links through .htaccess (or any other form of redirection) to your landing page

Sounds great. The question that started bugging me when I read about this service offered by Google was whether the information I am getting from my Webmaster Tools (WMT) is correct / up to date / comprehensive?  So I dug into my log files covering the same time period as the WMT for this blog and lo and behold, Google was showing me only a fraction of the referrals to the 404 page.
WMTsnap-small

In Google WMT I could see only two bad referrals, while digging through log files discovered 22 different bad referrals. That is a 9% fraction, which is pretty bad.

To be fair, not all of the referrals found in WMT report were included in the log file report, for the simple reason that for a referral to appear in your log files, it needs to be clicked on. Google’s spider, however, does not pass referrals nor does it click on the link immediately upon discovery, therefore you will see in WMT report some links that will not be found in log files.

There are a number of reasons for the discrepancy between WMT and the log files and not all of them require putting a tinfoil hat on:

  1. Google is showing only those bad referrals that their spider found. It is possible that the page with the bad link was removed so as far as Google is concerned, the linking page does not exist so there is no need to report it.
  2. The page that contains the bad link has not been crawled yet.
  3. The page that contains the bad link is not being crawled due to password requirement, robots.txt or metatag blocking (although we all know how well that works), duplicate content issues found on the linking page or any other possible issue.
  4. As with the incoming link data, Google is withholding some of the information from the webmasters (yes, this is the tinfoil one, which doesn’t make it less of a possibility, however it sounds farfetched to me).

In any case, for all the reasons I have mentioned earlier, you want to know about all the links causing a 404 on your site, even if they are not counted by Google presently. One possible scenario that comes to mind is a page that is not being crawled by Google for whatever reason, and thus a bad link from that page not reported in WMT. If that page gets scraped and the scraped page is counted by Google, you have a 404 problem which you could use to your benefit.

Now, I know that the number of bad referrals I am showing here is pretty low, however it must not be forgotten that the audience that reads this blog and links to it is very web savvy and there are not many mistakes done by them (even though Search Engine Roundtable is one of the bad referrals both in log files and the WMT report <looks in Barry’s direction>). However, when it comes to websites from other niches, I would assume that the percentages of reported vs. unreported bad referrals could mount up to significant numbers and due diligence should be applied by exploring your log files.

And for end, here are several tips that can assist your digging through the log files:

  1. First and foremost, use a good log file analysis tool. I prefer Nihuo, which gives great visualization of your log file data, while giving you a great deal of flexibility in defining the analysis parameters, such as tracking single files, tracking advertising campaigns, setting up filters for a plethora of parameters, etc. Of course, a custom made log analysis tool is better, if you have the skills/resources to get one.
  2. Filter out No Referral 404s. They are good for fixing your missing pages on the site, not so useful for redirecting your link juice.
  3. A majority of requests resulting in 404 on my site were requests for favicon.ico from the time I did not have a favicon. Another very popular file whose request result in 404 is robots.txt. Filter those out since they are of no
    interest to you for this purpose.
  4. Filter out all of the referrals coming from your URL.

This should leave you predominantly with the 404 referrals coming form broken links from outside of your site.

As I said earlier, WMT 404 report is a step in great direction, however, do not rely completely on it and complement your 404 research with your log file info.

Tags: , , , ,

Google counting only the first link to a domain – rebunked

July 3, 2008 – 10:06 pm

OK, so not so long ago Rand put out a post saying that Google will count only one link to a URL from any given page. This is a phenomenon that was originally noticed by Michael VanDeMar but many an eyebrow was risen at both posts. That particular facial hair elevation trend has seemingly culminated by David Eaves’ post which supposedly debunks the theory. However, some problems were found in the way David has conducted his research (starting with the fact that it was done in google.co.uk which seems to behave like Yahoo did 10 years ago). Since we love to solve dilemmas around here and since argument from personal incredulity is not terribly popular in my neighborhood, I decided to conduct a separate test which will cover most of the questions raised in and around the above mentioned posts. (BTW the initial discussion is covered at SERoundtable, as usual)

So, in order to test the theory, i picked two sites that sometimes double as my furry lab animals and set them up so that site A links to site B with two links using different anchor texts. The phrases appeared only on a site A, they were not to be found anywhere on site B and if that site was to rank for any of those two phrases, it would be only due to the anchor text of the links pointing from the site A.

The testing process went like this:

  1. Setup the sites like described above and wait for Google indexing. After the links are indexed, check the rankings of the site B for the two phrases.Result: Site B ranked for the first phrase and not for the second phrase. The below image describes the situation:
  2. Switch between the position of links to site B. Now the previously second phrase appears above the previously first phrase on site A. Wait for Google to index the change and check the locations of B for the new phrases.Result: Site B dissapeared from the SERPs for the new second phrase (previously first) and appears for the new first phrase (previously second), like in the image below:
  3. Revert the situation to the initial state: Switch back between the phrases, wait for indexing and check the rankingsResult: Site B reappeared for the initially first phrase and dissapeared again for the initially second phrase, like in the image below:

OK, so to me this is a pretty solid data supporting Michael’s and Rand’s claims. The fact that the SERPs reacted to my changes in the order of links back and forth is kind of hard to debunk.

However, one question that kept popping up is whether nofollowing the first link will cause Google to index the second link even though it is placed lower in the code. So i performed an identical test to the one described above, just with the first link nofollowed throughout the test. The results are pretty interesting – Google did not change its behavior due to the nofollow attribute applied on the top link.

Even when i switched between the first and the second link (preserving the state where the top link is nofollowed), Google still counted only the top link and not the bottom one.

Besides the fact that nofollowing the top link will not get you around the Google ignoring the second link, it is interesting to see that Google will still count the top link even though it is nofollowed. Maybe it was due to the fact that the nofollowed link was the only link on the web using that phrase as an anchor text ? Who knows… sounds like a theory worth testing.

As for the general settings of the experiment, the keywords I have used belong to the “semi-promiscuous” kind, meaning that they are made up of words that mean something but as a combination are not commonly used. The phrase #1 had about 40 results and the phrase #2 had about 270 results in Google SERPS. It took about two to three weeks for Google to index each of the changes, meaning that each set of results persisted for that time. It also means that the testing for this post took about a month and a half. Hence the posting frequency :(

As i write in every post describing an experiment, the conclusions should be taken with a grain of salt, Google is a black box that cannot be 100% scientifically tested (I am not talking about you Matt), i was not wearing my lucky underwear when performing the tests, etc.

PS. In the course of the described experiments, i did come up with a method that can help circumvent the described situation and force Google count all the links from a single page to a target domain. It was even independently confirmed by an SEO who knows what he is talking about (unlike me) and is being successfully used on some of our sites. However, I was asked not to blog about it, so if you want to know more and I know who you are, either personally or from Sphinn/Twitter/Plurk/etc., contact me through the form and I’ll drop you a hint ;)

Tags: , , , , , ,

Anatomy of a Google Filter/Penalty (or how not to do nofollow PR sculpting)

May 15, 2008 – 10:42 pm

OK, so after a long break, I’m trying to get back on track with posting. Had a great time in South Africa (owe an apology to Viperchill for not giving him a call, family business was crazy mate, but I get there about once a year so maybe next time?), had a great time in Eilat on my Marine Microbiology course and now it is time to get back to business.

It seems like Google are applying their penalties all over the place. Instead of manually punishing sites, they came up with all these red flags that can send your site into the depths of -60 / -300 / -950 penalty/filter. The penalties are site specific and it is very interesting to find out what triggers them and when they are lifted.

So, I know it sounds weird, but when one of our customers recently got hit with a severe loss of positions in Google SERPs, a part of me was obviously upset with the disturbance, but another part of me was excited due to the fact that I can analyze the change in locations and try and figure out what would cause this particular filter to be applied. At the beginning I was really flabbergasted by the weird nature of the filter, so I turned to several of my SEO friends for advice and opinion, but nothing could really replace the bricks and mortar approach of doing one change at the time while monitoring the locations. So to cut the long story short, I’m going to give a very concise list of the actions that led to the penalty and the reaction we got from SERPS:

  1. Customer decides to change the focus of his site to a different, but similar product. That move involves change in on-page optimization, so I am thinking, since there are changes about to be done on the site, why not throw in nofollowing of the majority of navigation links, according to the recipe – link love flows from homepage to all the inner pages and from inner pages only to homepage and pages I want ranked.
  2. Locations for the new keyphrase begin to change – on www.google.com, as seen from Israel, the site climbs from #53 to #26 and then to #8. On www.google.com as seen from the US (by using the gl=US argument) the site plunges to the neighborhood of #300, fluctuating from #283 to #345. The same is true for every other keyword that was introduced to the on-page optimization and is not represented in the anchor text of the existing links. The old keyword holds first page position on Google seen from any country.
  3. I remove the nofollow from the site navigation. After about a week, the site climbs to the area of #100 for all the new keywords on US Google. The Israel Google location is moving between #6 and #8. The old keywords are slowly slipping down but are still holding first page locations on both Googles.
  4. I tweak a few links from our sites + get a link from a relevant PR4 site using the new keyword as the anchor text. After about two weeks, the site is at first page locations for all the new keywords, while slipping to the second page for the old keyword.

The whole incident is summarized in a graph representing change in locations over time:

It is important to notice that when i refer to Israel Google, I do not mean www.google.co.il. It is the good old www.google.com but seen from the Israeli IP address. Similarly, US Google is www.google.com with the ?gl=US argument in the URL or www.google.com seen through a US proxy.

So what happened ? There are several things here that require attention:

  • This was definitely a filter and not an overall site punishment. Furthermore, it appears to be automatic and not manual.
  • The filter seems to be triggered by a combination of the implementation of nofollow and a significant difference between the on-page and off-page optimization.
  • The filter is keyword sensitive
  • The filter is country specific! This observations is in accordance to what quite a few other webmasters/SEOs were seeing in other niches. The big question is whether these filters are applied only on Google US as a rule or there is some kind of other input which will define the country on whose results it is implemented. I can think of Webmaster Tools localization, Analytics data or even Google toolbar data as examples of such possible input.

This is not the first time that nofollow has been nominated as a red flag for Google. Eric Lander has written about it on SEJ and there is a discussion on the Sphinn thread. I know that there is a big question about whether nofollowing your navigational links could serve as a signal for Google to punish you, however I don’t think that the nofollowing by itself was what has caused the penalty in this case. I think it was a combination of signals that caused the infamous red flag to be risen above the customer’s site

This is all very interesting as an observation, however without actual implications to my everyday SEO work, it would not be worth much:

1. If you are changing on-page optimization, do it gradually and hand-in-hand with addition of the incoming links targeting the new keyphrases.

2. Do not implement nofollow PageRank sculpting abruptly or at the same time when you are doing other big changes on your website (for example, I wouldn’t try doing it close to a hosting move)

3. Before you optimize your site, try and define where is your target audience going to search from. Then monitor locations on www.google.com as it is seen from each of your target countries. This task can be somewhat automatized by using Google Global Firefox extension.

Anyone else seen similar thing happen ?

PS. on unrelated note, does anyone know how to remove the Sphinn button code from Feedburner RSS ? It is driving me crazy!

Tags: , , ,

Choosing Your SEO Testing Grounds

March 24, 2008 – 1:19 am

OK, so after a long time and a lot of testing and data crunching, it was about time I sat and wrote about my latest SEO adventures.

It seems like the SEO testing is all over the place lately. Firstly there is the excellent post by our old favorite XMCP about all the things to keep in mind when setting up an SEO test (and my follow up post). Then there is an interesting test about Google preference for different TLDs at GoogleCache blog. Top it up with the “Google indexes only the first link” test from SEOmoz (or the same test done earlier by Michael VanDeMar) and you have a rage. That is without mentioning Michael Martinez’s SEO Theory which consistently provides thought provoking material. Hell, there is so much writing about SEO testing that maybe there should be a session with similar topic at the next search conference ? SMX Advanced maybe ? (hint, hint)

So, you read all the excellent articles that describe different SEO tests and you want to test an idea that has been running circles in your head for months now. The question is how do you start ?

Image courtesy of cyranthu

You want your test to have a clear-cut, reliable results that can be translated into actions applicable on your money/client’s websites. So obviously you will want to put a website of your own out there, tweak a thing or two and see how that affects your rankings. Now this is where the problems start. What keyphrase to optimize for ? How much to optimize ? …. The main question here is what am I looking for in a testing ground. Well there are two things that come to mind:

  1. Low noise level – we want to perform our experiments in a surroundings that will not drown out our signal. This means that if I see my site drop 5 positions in SERPS, this is due to the action I performed and not due to the fact that the 5 sites below me have increased their rankings.
  2. Low level of competition – this is needed for two reasons:
    1. Competitive phrases have a high level of noise due to the constant promotion work being done on competing websites. Links added, text changed, metatags improved, etc.
    2. It will be harder for you to actually bring your site to a position where changes effecting it can be analyzed.

Looking at the above mentioned articles, SEO testing ground choices that people have made can be summarized into three models:

  1. A virgin testing ground – this is a nonsense keyword that has no results in Google SERPs prior to the test. This is basically what was done in SEOCache Google TLD preference test. They created new websites and promoted them as much as they wanted for the non-existing keyword, thus controlling all the 10 results. This approach provides a great level of control over all the parameters of the competing sites, thus enabling the tester to accurately attach every change in rankings to the action done on the websites. That said, this is as far from a real life situation as you can get. Forget about semantic relevance, forget about the rate of incoming links, forget about the search history. The problems with this model are further stressed by the actual SEOCache experiment – they show that Google prefers .org TLDs since they ranked above all the other TLDs consistently. However, when I checked the results from Israel, .net TLD’s came up above the .org’s.
  2. Semi-virgin testing ground – optimizing for nonsense keyphrases that have websites in the SERPs but are not used. Good examples I can think of are old SEO contests like [nigritude ultramarine] or [seraphim proudleduck]. These are actually quite good testing grounds, but again, no semantic relevance and other algo parameters that characterize a real SERP.
  3. Semi-promiscuous testing ground – these are SERPs for keyphrases that are made up of words that actually mean something separately but together are not in common use, like [small red cantaloupe chair] or [arrogant tennis epiphany]. This will provide you with a number of real websites to compete against which will provide the search history and link addition rate parameters. The real keywords used in the query will cover the semantic relevancy issue. There are, of course, problems with this ground, which I will elaborate on in a moment.

So, my somewhat biased description of different testing grounds should tell you which one I personally prefer. Yep, the majority of my tests are done on the Semi-Promiscuous Testing Grounds (SPTG), due to the closest possible resemblance to the real life SERPs. There are some problems with SPTGs as well as is shown on the actual examples below.

Throwing theories up in the air is all well and good, however some examples need to be shown. So yours truly took 10 queries that define SPTGs and monitored the rankings for 20 days. Well, actually I did not monitor the rankings, SERP Archive did and I just gathered and analyzed the results.

So, I will provide just the chosen few phrases that, IMHO, represent the typical SPTGs and discuss the potential problems with these niches. I already apologize for not giving the complete details about the queries, I am running additional tests on some of them and do not want them spoiled just yet :) .

So let’s look at the graph over time for the phrase #1:

As can be seen, the top 10 for this phrase is pretty stable. Notice how things go a bit haywire between the 7th and the 11th of February ? Keep that in mind. Also see how the Site 8 dropped out of top 20 for a few days and then returned ? Remember this too, it will come useful when we get to the numerical analysis. Let’s take a look at the phrase #5:

This one looks even more stable. The turmoil begins only at the position 7 and lower and even then it is not significant for the sites #7, #8 and #9. Let’s take a shot at another niche defined by Phrase 3:

See the upheaval between the 7th and the 11th of February ? Just like in Phrase 1. Since these are two completely unrelated phrases, it makes sense that this is some kind of Google link recalculation/PR update/algo change. This is further emphasized by a similar pattern seen with other phrases not shown here.

Now, as a comparison, let’s take a look at a competitive phrase [personal loans]:

All hell breaks loose. No one is safe here, since there are constantly links being added so the position is never constant.

Looking at colorful charts is nice and important, however it is not enough. If you want to automate the process of choosing the testing grounds, you need to have some numbers for your scripts to crunch, so some calculations need to be made. Here is the point where I am warning any casual reader that what I did when doing the calculations is based only on my common sense. I am aware of the fact that much more robust and logical statistical tests exist that should be applied to the data, but I am just not swimming well in that field. I am actually trying to set a meeting with a statistics whiz that will guide me in these kinds of analysis, but that has not happened yet and I did not want to delay this post any longer. So take anything from this point onwards with more grains of salt than you are usually recommended when reading this blog (which is a lot).

So what I did is calculate an absolute value of change between locations on every two adjacent days and then averaged these changes over the testing time period. This gave me an average change of locations for each site. Then I averaged these values in order to get a value I called a Niche Stability Value (NSV). I put those in a bar chart and here is what I got:

So even though I am not sure (to say the least) about the reliability of my calculations, the above chart matches what I saw in the phrases charts. Phrases 6-10 were considered as non-competitive, however they did include competitive words like “investment” or “outfit” albeit in non-conventional context. Since the queries were not done in quotes, it makes sense that some of the sites in the top 10 were being promoted which would add to the level of noise.

One of the weaknesses of my calculations is the fact that they should minimize the effects of the temporary reversible drops in SERPs (like we saw with site 8 for phrase #1). These drops do not represent the real devaluation in the site’s score. Actually, if i took those few days out of calculations, the NSC for phrase #1 would drop to 0.2, which would make more sense. So any statisticians out there, I would love to get some input and further improve my calculations.

So, what is the take home message ? How do we choose a niche to perform experiments in ?

  • From what I saw in the results of the experiment, the phrases that defined the most stabile niches were scientific phrases. Any query that brings up a lot of PDF files from scientific magazines should fit this category. Furthermore, non-exact sciences are better suited niches for SEO testing than exact science related. So go for comparative religion, literature, sociology etc.
  • It is important to do both a visual inspection of the location charts and the statistical analysis. The advantage of visual check is that you can spot the algo changes/PR updates/reversible drops that should be taken out of equation. The advantage of the statistical analysis is that it can provide you with the quick estimate of low-competitive niches. It may have some false negatives, however the chances of a false positive are rather small. If the statistical analysis singles out a niche as a non-competitive one, the chances are that it really is a good testing ground, while the niches marked as competitive could still be good testing grounds with non-significant quirks that skewed the calculations.
  • Do not rely on your estimation of what is a non-competitive phrase without doing the above analysis. As you can see, I thought that phrase #8 was non-competitive and it came out to be a terrible possible testing ground. As they say, assumption is the mother of all f*ckups. Yes, yes I know there is a nicer way of saying that. It sounds dorky though.

To summarize this g-i-g-a-n-t-i-c post: chosing pristine niches for your testing is a good tactics, but it takes out a lot of real-life parameters out of equation. On the other hand, performing tests in real competitive SERPs will probably tell you nothing and will waste a lot of your time. Therefore, I do my tests in SERPs made out of illogical phrases consisting of real words. This however demands location monitoring for all the sites around my testing pages, which gives it an additional level of reliability ruling out temporary hickups in Google’s algo and other unrelated changes.

I don’t have a disclaimer to this blog (although I should put it up sometimes), but this post maybe brings it out most significantly: the setup of the tests, the results and the interpretations (mostly the interpretations) are all a product of my experience and current knowledge. They can be spot on and they can be complete crap. For me, the most important thing is to put the material and the ideas out there for the public to judge, add and shoot down. Only time and additional tests will tell whether there is some value to my ramblings here. So, if you have a different idea, find a significant logical failure or just strongly disagree with everything written here, please leave a comment, I don’t consider this a popularity contest.

Positive responses are also welcome. :) )

PS. After re-reading the post and before publishing it, I noticed a possible mix-up that can happen: the stability of a niche should not be confused with the competitiveness, ie. the difficulty of promoting a page to the top 10 for that phrase. It only shows the levels of change in the locations of the top 10 websites.

Tags: , , , , , ,