Home Organic
Growth
Master
Planning
SEOMike
Portfolio
Professional
Background
Personal
Background
SEOMike
Blog
SEO
Services
 

Google Trends

May 14th, 2010
Google Trends offers a good look at year-over-year data for search volume.

Google Trends offers a good look at year-over-year data for search volume.

Google Trends can be a very effective tool for planning your SEO campaign if you know how to use it.  The tool is a great way to look for seasonality and “burstiness” of topics online, which is especially important to review if your client has season-related products.  Using Google Trends you can establish a target date where your campaign needs to come to fruition, which allows you to allocate your resources appropriately in order to reach your target.

As an example, I’ve used the terms “lawn mower” and “snow blower” because they obviously have very season-related sales.  You can see in the image below the year-over-year number of searches for the two terms.  You can see that “lawn mower” peaks around May of every year and then drops off as the season wears on.  If a client sells several seasonal items, you can see the importance of using the time leading up to May to prepare for the peak lawn mower season.  You can also see in the image below, that searches for “snow blower” peak in December – January of a given year and is subject to bursts of traffic related to major snowfall.

Google Trends can be a very effective tool for planning your SEO campaign if you know how to use it.  The tool is a great way to look for seasonality and “burstiness” of topics online, which is especially important to review if your client has season-related products.  Using Google Trends you can establish a target date where your campaign needs to come to fruition, which allows you to allocate your resources appropriately in order to reach your target.

As an example, I’ve used the terms “lawn mower” and “snow blower” because they obviously have very season-related sales.  You can see in the image below the year-over-year number of searches for the two terms.  You can see that “lawn mower” peaks around May of every year and then drops off as the season wears on.  If a client sells several seasonal items, you can see the importance of using the time leading up to May to prepare for the peak lawn mower season.  You can also see in the image below, that searches for “snow blower” peak in December – January of a given year and is subject to bursts of traffic related to major snowfall.  Another good use of this tool, beyond seasonality is illustrated below; the ability to measure one phrase’s popularity against another.  This is a good way to evaluate some of your long-tail phrases.

Google Trends offers a good look at year-over-year data for search volume.

Google Trends offers a good look at year-over-year data for search volume.

In the image above you will also notice a couple of other really important things; rankings of states and cities.  This information can help you specifically target your PPC campaigns to cities and states where the most searches for your items occur.  Another important thing you can see is the ability to export the results to a CSV file which allows you to present your own graphics and charts outside of the Google tool.  This is great if you are doing research for a client and want to brand your results as your own.  Don’t worry too much about the alphabetized article results displayed to the left when you are looking at a huge period of time as in the image above.  These links point to news stories that will give you a very small glimpse into what items and on this scale, much of the content is dead.

In the image below we’ll look at 2009’s trends related to these categories.  You can get a little deeper look at a single year’s trends like this as well as looking at which cities and states performed the best for a given year.  You can get a solid peak month for your campaign by using the single year view.

The Google Trends yearly view of results show seasonality in detail.

The Google Trends yearly view of results show seasonality in detail.

The single-year results can also be exported to an CSV file to allow you to play with them in Excel.  Earlier I said you should ignore the alphabetized article results on the left, but in this case these articles can be of use.  If you click the “more news results” link you’ll be taken to a page where you can see the articles that trended throughout the year.  This will help you determine market-factors that drove the number of searches and find out what was going on in the industry to help explain the search volume.  You’ll be able to find seasonal trends like the effect of huge snow storms on snow blower searches, and the effect of drought in regions on lawn mower sales.  When the grass doesn’t grow, sales go down.

One last thing I’ll mention about the “more results” is that there is more data here than meets the eye.  Since Google has been indexing offline content such as newspapers and books, you have a deep look into information going all the way back to the 1800s in some cases.  Though data from that far back is probably not relevant to today’s market, it is very interesting to see.

Google Trends shows mentions of items in offline material.

Google Trends shows mentions of items in offline material.

To sum up, there is a TON of data available to the SEO in Google Trends if you know what to look for.  You can offer a wealth of information to your clients to help them see why you want to focus on certain campaigns leading up to the busy season.  They will better understand why you want to focus so much effort on organic SEO for “snow blowers” in the summer instead of when they are selling.  The more the client understands the data behind your online strategy, the more likely they will value it as a part of their company’s strategy, instead of just a “bolt on” curiosity.

Diaspora – Harbinger of Doom for Facebook?

May 12th, 2010
I recently learned of a project being developed by four NYU students with money raised through Kickestarter.org.  The whole point of the project is to replace the giant personal-information-hoarding networks like Facebook and MySpace with a social network that puts you in control of your personal data.
Diaspora is currently under development and the team hopes to have it up and running for a small user base this summer with the main launch towards September.  Diaspora is an open-source, decentralized social network allowing individual members to connect directly instead of through a main hub.  That means that you would host your own copy of the social network software on your computer and your computer would communicate directly with each of your friend’s computer.  Decentralizing the network allows a user to keep their private information on their own computer, instead of passing it through a third-party server where their data is collected and sold to advertisers.  As users become more aware of the amount of data they are leaking from the Facebook profiles, they will likely make the move to Diaspora.
I think these guys are really on to something here, especially considering Facebook’s latest move to get their fingers into your web experience through Facebook Connect. (as discussed here)  Facebook Connect is a major intrusion into a user’s privacy and does not set well with the user base.  It could end up being Facebook’s demise.
Though the prognosis might sound bleak for online advertisers wishing to reach the millions of people who use social networks, creative advertisers will find a way to get through.  The key will likely be a model similar to the iPhone and iPad; apps and plugins supported by advertisers.  Since the software is based on an open source platform, anyone can develop plugins and apps to augment the experience.
Here are a few challenges that Diaspora needs to overcome:
It might be difficult to reach a large audience and convince them that their solution superior enough to Facebook to make the user switch.  That might be a bit challenging considering a lot of Facebook users are still trying to figure out all the functionality of their current system. Terms like “seed” and “decentralized network” and “open-source” might be enough to scare people off.
Another challenge is the potential barrier-to-entry associated with setting up the software.  A user will either have to setup their own installation on their computer, or setting it up on a third-party service much like the majority of Wordpress blogs.  (Wordpress blogs can either be setup on your own server, or you can setup a blog at whatever.wordpress.com.)
One of the biggest challenges might be updates. If a user doesn’t install the latest software updates, they could put their computer at risk.
I’m going to watch the development of Diaspora closely.  If these guys pull it off, they will reshape the web as we know it.

I recently learned of a project being developed by four NYU students with money raised through Kickestarter.org.  The whole point of the project is to replace the giant personal-information-hoarding networks like Facebook and MySpace with a social network that puts you in control of your personal data.

Diaspora is currently under development and the team hopes to have it up and running for a small user base this summer with the main launch towards September.  Diaspora is an open-source, decentralized social network allowing individual members to connect directly instead of through a main hub.  That means that you would host your own copy of the social network software on your computer and your computer would communicate directly with each of your friend’s computers.  Decentralizing the network allows a user to keep their private information on their own computer, instead of passing it through a third-party server where their data is collected and sold to advertisers.  As users become more aware of the amount of data they are leaking from their Facebook profiles, services like Diaspora will sound enticing.

I think these guys are really onto something here, especially considering Facebook’s latest move to get their fingers into your web experience through Facebook Connect. (as discussed here)  Facebook Connect is a major intrusion into a user’s privacy and does not set well with the user base.  It could end up being Facebook’s demise.

Though the prognosis might sound bleak for online advertisers wishing to reach the millions of people who use social networks, creative advertisers will find a way to get through.  The key will likely be a model similar to the iPhone and iPad; apps and plugins supported by advertisers.  Since the software is based on an open source platform, anyone can develop plugins and apps to augment the experience.

Here are a few challenges that Diaspora needs to overcome:

  • It might be difficult to reach a large audience and convince them that their solution is superior enough to Facebook to make the user switch.  That might be a bit challenging considering a lot of Facebook users are still trying to figure out all the functionality of their current system. Terms like “seed” and “decentralized network” and “open-source” might be enough to scare people off.
  • Another challenge is the potential barrier-to-entry associated with setting up the software.  A user will either have to setup their own installation on their computer, or setting it up on a third-party service much like the majority of Wordpress blogs.  (Wordpress blogs can either be setup on your own server, or you can setup a blog at whatever.wordpress.com.)
  • One of the biggest challenges might be updates. If a user doesn’t install the latest software updates, they could put their computer at risk.

I’m going to watch the development of Diaspora closely.  If these guys pull it off, they will reshape the web as we know it.

Teoma Resurrected

May 11th, 2010
Meaning “expert” in Gaelic, Teoma was launched in 2000 by a team of scientists at Rutgers.  You may have never heard of Teoma before because it was not a very popular search engine, even at it’s prime.  Though it wasn’t popular, Teoma had a big impact on the web today by utilizing an algorithm that ranked websites based on communities similar to Google’s, but Teoma took it a step further by using subject-specific popularity.  Subject-specific popularity is the number of related sites that link to a specific site, making that site an authority.  This is now a major part of modern search engine algorithms.  The site was acquired by Ask Jeeves in 2001 as a technology grab.  Ask Jeeves needed the technology to analyze links better and Teoma had the solution.  Ask pretty much let the brand languish and eventually the site died in 2006.
Teoma was re-launched in April of this year as with no fanfare at all, similar to it’s demise in 2006.  Teoma just doesn’t make much noise about it’s technology or existence.  Are we going to see that change?
According to a couple responses from Ask about Teoma, they re-launched Teoma to “provide a simplified interface for everyday keyword web search.”  I find this response interesting considering the recent addition to the Google search results – the everything bar.  Is Teoma hoping to capitalize on a small percentage of Google users that are a bit peeved about the new Google features?  Perhaps.
Though I enjoy the Teoma experience, much like the older Google, the results just don’t seem to be as good as Google’s.  Teoma seems more susceptible to web spam.  For example, a search for “home loan” in Ask, Google and Bing list LendingTree as the number one result – a pretty expected result considering the brand’s power.  Yahoo offers LendingTree as the number two result, second to a government resource page, which is somewhat understandable.  Where is LendingTree in Teoma’s results?  Number seven.  All six results that precede LendingTree in this search are spam lead-gen sites, one of which has several broken images on the first page.  Would these results make you happy?  I expect not.
If the results are so crummy, why would Ask re-launch Teoma?  I think they are planning on using the site as a test-bed for future search features and algorithm changes before they push those changes to Ask.  This is the most likely course of action for resurrecting a site such as Teoma, especially with such a soft re-launch.  Ask has barely made a peep about it.  Maybe Ask has some really cool new feature they are looking to test, or a significant change to their algorithm and they don’t want a lot of attention on Ask.com about it.
Though I’m very underwhelmed by the results, I’m going to keep my eye on Teoma.  Teoma had a major impact on today’s search environment many years ago, maybe they are teeing up another paradigm-shift.  Maybe not.

Meaning “expert” in Gaelic, Teoma was launched in 2000 by a team of scientists at Rutgers.  You may have never heard of Teoma before because it was not a very popular search engine, even at it’s prime.  Though it wasn’t popular, Teoma had a big impact on the web today by utilizing an algorithm similar to Google’s that ranked websites based on communities, but Teoma took it a step further by using subject-specific popularity.  Subject-specific popularity is the number of related sites that link to a specific site, making that site an authority.  This is now a major part of modern search engine algorithms.  The site was acquired by Ask Jeeves in 2001 as a technology grab.  Ask Jeeves needed the technology to analyze links better and Teoma had the solution.  Ask pretty much let the brand languish and eventually the site was taken down in 2006.

Teoma was re-launched in April of this year as with no fanfare at all, similar to it’s demise in 2006.  Teoma just doesn’t make much noise about it’s technology or existence.  Are we going to see that change?

According to a couple responses from Ask about Teoma, they re-launched Teoma to “provide a simplified interface for everyday keyword web search.”  I find this response interesting considering the recent addition to the Google search results – the everything bar.  Is Teoma hoping to capitalize on a small percentage of Google users that are a bit peeved about the new Google features?  Perhaps.

Though I enjoy the Teoma experience, much like the older Google, the results just don’t seem to be as good as Google’s.  Teoma seems more susceptible to web spam.  For example, a search for “home loan” in Ask, Google and Bing list LendingTree as the number one result – a pretty expected result considering the brand’s power.  Yahoo offers LendingTree as the number two result, second to a government resource page, which is somewhat understandable.  Where is LendingTree in Teoma’s results?  Number seven.  All six results that precede LendingTree in this search are spam lead-gen sites, one of which has several broken images on the first page.  Would these results make you happy?  I expect not.

If the results are so bad, why would Ask re-launch Teoma?  I think they are planning on using the site as a test-bed for future search features and algorithm changes before they push those changes to Ask.  This is the most likely course of action for resurrecting a site such as Teoma, especially with such a soft re-launch.  Ask has barely made a peep about it.  Maybe Ask has some really cool new feature they are looking to test, or a significant change to their algorithm and they don’t want a lot of attention on Ask.com about it.

Though I’m very underwhelmed by the results, I’m going to keep my eye on Teoma.  Teoma had a major impact on today’s search environment many years ago, maybe they are teeing up another paradigm-shift.  Maybe not.

Using Subdomains to Speed Your Site

April 27th, 2010

Effectively using subodomains on your website will decrease your page load time.  If your site is image-heavy your pages probably load slowly.  Web browsers load all elements of a page from one TCP connection stream and each TCP stream only gets so much bandwidth.  For example, if a page has thirty images to load, HTML content, java scripts, and flash elements, a web browser could have 40+ simultaneous TCP connections to a single URL.  This method might stall the loading of a page waiting for a large flash element. One major benefit of using multiple subdomains it that it allows content to load out of sequence.  You might have seen sites that stall until a flash piece is loaded before displaying the rest of the page content.  If the site would have called the flash piece from a subdomain, it would be loaded separately from the HTML and it wouldn’t stall the page load.

A good way to set this up is to write your homepage to call items from subdomains.  Upon the initial load of the website, a DNS query is made to get the IP address of the URL and then it’s cached.  If your homepage loads elements from multiple subdomains, all of those are cached and no further DNS queries need to be made.  If page load time is an issue for your site you could setup a subdomain for all javascript, images, flash, etc.  In this example queries for HTML are loaded from the main URL, javascript from js.example.com, images from i.example.com, flash from f.example.com.  If your site requires 30 TCP connections from one domain, the server handles those pretty much sequentially.  Splitting up the calls between subdomains divides the number of connections in the queue.  In this case you might have 2 connections to js.example.com, 1 connection to example.com (for content), 15 connections to i.example.com, and 2 connections to f.example.com.  Each set of connections is loaded simultaneously thus letting each connection take as long as needed without delaying loading of the elements farther down the list.  This is a bit complex so I’ve created an image to illustrate the point.

single-urlmulti-sd

I have setup sites in this manner before to help overcome slow page-load times and saw a great increase.  The site was very image-heavy and it was nice to be able to call them from a separate subdomain so loading the images didn’t slow down the site.  If you have a page with tons of images, like thumbnails of products, you can split it up further by defining section-related subdomains on a page.  This would allow you to load all the thumbnails from separate subdomains and speed the page further.

Too Many Webmasters Are Webamateurs!

April 22nd, 2010
It’s happened to all of us.  We’ve been to a website that we expected to work correctly, but of course, it didn’t.  Who should we blame for these functionality problems?  The webmaster.  A webmaster is defined as “A technician who designs or maintains a website.”  Notice in that definition the mention of “designs.”  Almost all web designers forget about an extremely important part of their job: browser ubiquity testing.  I discussed this topic before, but I see so many silly mistakes on the web that the topic is always at the front of my mind.
A great example of a major web faux paux can be seen below.  Can you believe that Merrill Lynch calls Google Chrome an “incompatible browser?”  They say their compatible browsers are IE 5+, Netscape 7+ and Firefox 0.8+!  How long has it been since their webmaster actually thought about their site’s compatibility?  The Netscape browser isn’t available anymore, IE 5 is history, and Firefox has been through ELEVEN version updates since this message was implemented.  Where is the webmaster?!

It’s happened to all of us.  We’ve been to a website that we expected to work correctly, but of course, it didn’t.  Who should we blame for these functionality problems?  The webmaster.  A webmaster is defined as “A technician who designs or maintains a website.”  Notice in that definition the mention of “designs.”  Almost all web designers forget about an extremely important part of their job: browser ubiquity testing.  I discussed this topic before, but I see so many silly mistakes on the web that the topic is always at the front of my mind.

A great example of a major web faux pas can be seen below.  Can you believe that Merrill Lynch calls Google Chrome an “incompatible browser?”  They say their compatible browsers are IE 5+, Netscape 7+ and Firefox 0.8+!  How long has it been since their webmaster actually thought about their site’s compatibility?  The Netscape browser isn’t available anymore, IE 5 is history, and Firefox has been through ELEVEN version updates since this message was implemented.  Where is the webmaster?!

Merrill Lynch has a webamateur!

Merrill Lynch has a webamateur!

This is just one example of a lazy or inept webmaster.  I’d estimate that 80% or more of the websites on the internet have some browser compatibility issues.  Website troubles are not limited to the small-budget businesses that throw together sites, as you can see in the example.  I wonder how much money Merrill Lynch spent on their website.  I wonder if they know their site still does browser detection for browsers that were released in early 2004 – SIX years ago.

Google’s April Fool’s Fun

April 1st, 2010
April Fools!
Google’s April Fool’s day joke this year was pretty amusing to this Kansan.
Last month the city of Topeka, the capitol of Kansas, officially changed it’s name to Google for a month in an effort to attract attention and win the bid to be the first city where Google tests their ultra-high speed internet service.  In an April Fool’s Day response, Google changed their name to Topeka.
x; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;”>Google Employees lined the street in front of the Mountain View CA headquarters.  Check out the screenshots I took with streetview:
Another amusing aspect of Google’s April Fool’s Day shanagans, is the fact that all of their search response times have been changed to other units of measure like (2.00 shakes of a lamb’s tail).  Here’s a list of the response times I recorded:
60.35 jiffies (interesting because this appears to be calibrated correctly.  A jiffy about 1 millisecond, and Google’s usual response times are 6-8 milliseconds.  So, the response time of 60.35 jiffies is equal  to about  6 milliseconds.  Nice Google)
0.74 times the velocity of a unladen swallow
0.23 centons
0.07 nanocenturies
1.22e-15 epochs
23.00 skidoo
0.22 microfortnights
0.19 centibeats
0.26 microweeks
2.00 shakes of a lamb’s tail
0.32e+43 Planck times (interestingly a jiffy used to mean Planck time)
at 5.21 hertz
1.21 gigawatts
11.90 parsecs
0.02 femtogalactic years
Another funny thing Google did on April Fool’s 2010 is post a promo saying that you can store anything on Google Documents, even your car keys.  ”Ever wish you could CTRL+F your keys?”  (Link)
Other Google April Fool’s pranks that get an honorable mention:
Google Maps in 3D
Google Books in 3D
Google standard voicemail: http://www.google.com/googlevoice/standard_voicemail.html
Google Translate for Animals (offered on google.co.uk)

Last month the city of Topeka, the capitol of Kansas, officially changed it’s name to Google for a month in an effort to attract attention and win the bid to be the first city where Google tests their ultra-high speed internet service.  In an April Fool’s Day response, Google changed their name to Topeka.

The April Fool’s idea that made me smile most was that the Google Employees lined the street in front of the Mountain View CA headquarters.  Check out the screenshots I took with streetview:

I also spotted a guy in the crowd with a marriage proposal sign:

Another amusing aspect of Google’s April Fool’s Day shanagans, is the fact that all of their search response times have been changed to other units of measure like (2.00 shakes of a lamb’s tail).  Here’s a list of the response times I recorded:

  • 60.35 jiffies (interesting because this appears to be calibrated correctly.  A jiffy about 1 millisecond, and Google’s usual response times are 6-8 milliseconds.  So, the response time of 60.35 jiffies is equal  to about  6 milliseconds.  Nice Google)
  • 0.74 times the velocity of a unladen swallow
  • 0.23 centons
  • 0.07 nanocenturies
  • 1.22e-15 epochs
  • 23.00 skidoo
  • 0.22 microfortnights
  • 0.19 centibeats
  • 0.26 microweeks
  • 2.00 shakes of a lamb’s tail
  • 0.32e+43 Planck times (interestingly a jiffy used to mean Planck time)
  • at 5.21 hertz
  • 1.21 gigawatts
  • 11.90 parsecs
  • 0.02 femtogalactic years

Another funny thing Google did on April Fool’s 2010 is post a promo saying that you can store anything on Google Documents, even your car keys.  “Ever wish you could CTRL+F your keys?”

Other Google April Fool’s pranks that get an honorable mention:

The Importance of Valid HTML

March 30th, 2010
Valid HTML is the best practice when it comes to websites.  Though it can be difficult, it is worth the time.   If your code is valid, it won’t have to be auto-corrected by browsers and it won’t cause problems with search engine robots.
I’ve talked about browser ubiquity link testing before and the importance of working out problems with code to make sure a site renders correctly in browsers.  To sum up, when code is written correctly a web browser doesn’t have to interpret and correct it.
If web browsers can do their own code correction, why is validating the code important?  Browsers can correct code, search engine robots cannot.  They are left to fend for themselves without that advanced feature.  This can cause problems with them parsing your site in full and finding all the content on a page.  The spiders may get stuck on a piece of messed up code and stall which will eat up the time they have to spend on your site.
There are several tools available to help validate your code and the best one comes from the W3C (World Wide Web Consortium) site link .   The W3C is in charge of writing standards for web code and has played a key role in the development of the internet as we know it.  They know what they are talking about.
Correcting the code can be a real headache if you’re not really good at coding so I recommend contracting a web developer to help.  A good web developer is an expert in this kind of thing and the process will take a lot less time than a novice trying to work it out.

Valid HTML is the best practice when it comes to websites.  Though it can be difficult, it is worth the time.   If your code is valid, it won’t have to be auto-corrected by browsers and it won’t cause problems with search engine robots.

I’ve talked about browser ubiquity testing before and the importance of working out problems with code to make sure a site renders correctly in browsers.  To sum up, when code is written correctly a web browser doesn’t have to interpret and correct it.

If web browsers can do their own code correction, why is validating the code important?  Browsers can correct code, search engine robots cannot.  They are left to fend for themselves without that advanced feature.  This can cause problems with them parsing your site in full and finding all the content on a page.  The spiders may get stuck on a piece of messed up code and stall which will eat up the time they have to spend on your site.

There are several tools available to help validate your code and the best one comes from the W3C (World Wide Web Consortium).   The W3C is in charge of writing standards for web code and has played a key role in the development of the internet as we know it.  They know what they are talking about.

Correcting the code can be a real headache if you’re not really good at coding so I recommend contracting a web developer to help.  A good web developer is an expert in this kind of thing and the process will take a lot less time than a novice trying to work it out.

Broken Links = Broken Site

March 26th, 2010
Another new client comes on and reminds me of the importance of the basics.  The site I’m reviewing for the first time has almost 500 broken links.  That’s right, five HUNDRED.  The client came on reporting virtually no representation in the search engines.  I’m starting to see why.  Code that doesn’t validate, content in iframes, and a ton of broken links.  Though there are a lot of factors that I think are keeping them down, I’m just going to focus on broken links in this post though.
So, what are broken links and how do you find them?  Broken links are any kind of link that doesn’t deliver the expected page.  The link could fail because of a typo in the or because the target page has moved.  There are two major problems with having a lot of broken links on a website;  1. The users can’t get to content you want them to find and,  2.  The search engines get stuck in these “spider traps” and burn all their time waiting for a 404, 403, or 500a response.  Both problems affect a site’s positioning in the index because the search engines want to present their users with sites that work.  Also, the search engine bots will never discover all of your site’s content if they are stuck waiting for an error from a broken link.
In the case of my new client, there’s no way to go through it by hand to find every broken link.  It would take weeks.  The best way to do find them is to run a spider against the site, like Xenu, and let it do all the checking for you.  (I’ll make another post about Xenu at some point with tips on it’s operation, but for now I’ll say that you shouldn’t run it full-throttle against a site because your IP might get blocked for a DOS attack. link to wiki on dos)  Xenu will check every link to see if it works and will report the response.  After the report runs, you’ll have a nice list of broken links and the pages where they are listed.  Keep in mind though that Xenu doesn’t click on every link.  Some older versions won’t work with java onclick attributes and Xenu certainly won’t click in flash elements.
Now that you’ve got your broken link report, what should you do?  Fix them!  You have to go through each link and try to figure out what went wrong with that link.  Was the HTML fat-fingered?  Was the link copied incorrectly from a browser address bar?  Was the page moved?  What error does the server deliver?  Once you’ve figured out what is wrong with the broken link, you can try to fix it.  If the HTML was fat-fingered, just correct the problem and you should be good.  If something else happened to the link, try to find the correct target and replace it.  If you were deep-linking and the target no longer exists, try to find it on the site in an archive or something.  If you were citing some other site’s article and that article doesn’t exist, it’s still good practice to give a link to their homepage as credit for the citation.
If you’re an SEO correcting links for a client, it’s important to have a good report for them including recommendations for what they should change in their site editing / page creation policies to prevent it from happening again.  Oh, and don’t forget to run Xenu against the site again when the corrections have been made.  If they got messed up one time, they could get messed up again.

Another new client comes on and reminds me of the importance of the basics.  The site I’m reviewing for the first time has almost 500 broken links.  That’s right, five HUNDRED.  The client came on reporting virtually no representation in the search engines.  I’m starting to see why.  Code that doesn’t validate, content in iframes, and a ton of broken links.  Though there are a lot of factors that I think are keeping them down, I’m just going to focus on broken links in this post though.

What are broken links and how do you find them?  Broken links are any kind of link that doesn’t deliver the expected page.  The link could fail because of a typo or because the target page has moved.  There are two major problems with having a lot of broken links on a website;  1. The users can’t get to content you want them to find and,  2.  The search engines get stuck in these “spider traps” and burn all their time waiting for a 404, 403, or 500a response.  Both problems affect a site’s positioning in the index because the search engines want to present their users with sites that work.  Also, the search engine bots will never discover all of your site’s content if they are stuck waiting for an error from a broken link.

In the case of my new client, there’s no way to go through it by hand to find every broken link.  It would take weeks.  The best way to do find them is to run a spider against the site, like Xenu, and let it do all the checking for you.  (I’ll write another post about Xenu at some point with tips on it’s operation, but for now I’ll say that you shouldn’t run it full-throttle against a site because your IP might get blocked for a DOS attack.)  Xenu will check every link to see if it works and will report the response.  After the report runs, you’ll have a nice list of broken links and the pages where they are listed.  Keep in mind though that Xenu doesn’t click on every link.  Some older versions won’t work with java onclick attributes and Xenu certainly won’t click in flash elements.

Now that you’ve got your broken link report, what should you do?  Fix them!  You have to go through each link and try to figure out what went wrong with that link.  Was the HTML fat-fingered?  Was the link copied incorrectly from a browser address bar?  Was the page moved?  What error does the server deliver?  Once you’ve figured out what is wrong with the broken link, you can try to fix it.  If the HTML was fat-fingered, just correct the problem and you should be good.  If something else happened to the link, try to find the correct target and replace it.  If you were deep-linking and the target no longer exists, try to find it on the site in an archive.  If you were citing another site’s article and that article no longer exists, it’s still good practice to give a link to their homepage as credit for the citation.

If you’re an SEO correcting links for a client, it’s important to have a good report for them including recommendations for what they should change in their site editing / page creation policies to prevent it from happening again.  Oh, and don’t forget to run Xenu against the site after the corrections have been made.  If they got messed up one time, they could get messed up again.

Link Campaign Sustainability

March 18th, 2010

When planning a link campaign it’s important to set some goals for the campaign so you can accurately track progress.  You can’t just wing it.  You need to be precise in how you target your campaign or else it will look artificial, or worse yet, spammy.

The best method for tracking your link campaign, as far as I’m concerned, is a good excel spreadsheet.  You should track where you found the link partner, what date you requested a link, when the link went live, the URL of the actual linking page, link type, the advertiser network (if relevant), and the last date you checked the link.  I found this to be quite useful when I had a text link campaign from a third party.  We would periodically review the campaign to make sure the links were live and found that we were paying a lot of money every month for links that were no on sites even though their system told them they were.

As the link campaign builds steam it’s important that you maintain the same pace throughout the campaign.  The number of link acquisitions in a period is tracked by Google and it’s best to keep the linking campaign on a steady pace with no spikes or lags in activity.  This is why I keep a separate spreadsheet containing the number of links for a site on a given day and update that sheet daily.  Using the data I’ve collected I have excel create a line chart of the number of links over time, then apply a trend-line to it to set a goal for the campaign.  The trend-line will show if you are making the progress needed or not.

Another important thing to look at is the number and types of links that your competitors have.  Once you’ve established what they have you can try to build a similar campaign.  You should evaluate the number of links from each different PR, ie., 340 links from PR 2 pages, 400 from PR 0 pages, 65 from PR 5 pages, etc.  The goal is to make your link base look as natural as possible, yet still competitive, to avoid unwanted attention from Google.  Remember:  All sites will have a number of “nofollow” links pointing to them.  Best to keep that number on par also.

The last factor I’ll mention here is link decay.  A Google patent application I read mentioned the fact that Google evaluates the decay rate of a site’s link base.  You need to make sure that your decay rate isn’t too high.  Link decay is when Google can’t find a link on a page anymore.  This can happen when a blog entry that links to you moves off of the homepage and Google doesn’t see it anymore.

A sustainable link campaign is critical to long-term success of your SEO efforts.

Content Management Systems are supposed to make life easy. . .

March 4th, 2010

Content Management Systems (CMS) can make life easier for businesses that have no webmaster, or have thousands of pages that they need to organize.  However, a CMS comes at a steep price, both monetarily, and in restrictions.  There are so many inherent problems that choosing the right one is critical for the success of your website. I could go on and on and on about the problems I’ve had with CMSs over the years, but suffice it to say they are a total pain in the neck.

In the interest of keeping this somewhat short and not a soapbox rant, I will stick to a bullet-pointed list of things to think about when selecting a CMS.

  • Look for hidden fees for:
    • Page Editing
    • New Page Creation
    • File Uploads
    • Image Creation
    • Phone Tech Support
    • Email Tech Support
    • Yearly Licensing Fees
    • Software Update Fees
    • Bandwidth Overages
  • Do not work with a company that refuses to give you full admin rights.
  • Demand FTP access.
  • You must be able to edit the HTML of the pages and the page components. Period.  The only reason they wouldn’t want you to is so they can charge you to do it for you.
  • Do they own the artwork they create?
  • Ask for examples of current clients and have an SEO inspect their HTML.
  • Test the page-load times of the example websites.  If you don’t know how, your SEO will.
  • Don’t sign anything unless you get everything you want in writing.
  • Remember, if you design a site in a CMS, they’ve got you. There’s no leaving without a complete recoding of the site and potential loss of database info.
  • Will the CMS cost more than a good webmaster?
  • If your CMS is a hosted solution, make them give you a Service Level Agreement (SLA) in writing to guarentee uptime.
  • Beware of built-in SEO packages. They are usually designed by a programmer that has “some” SEO knowledge and not a true expert.  Bolt-on SEO packages never, ever, work well.
  • Ask for a demo of their product as you would see it and in a working environment. A CMS company will often let you test a feature-rich version of the software on their fastest server. The version and server you get may be significantly different.
  • Do you really need a CMS or are you being sold a CMS? Ask a professional SEO.
  • Have an IT guy check out their system. If you don’t have an IT guy, get one. A real one.
  • You should have contracted a professional SEO by now, get them involved before you buy.
  • Ask about guarenteed response times to support issues. If they don’t guarentee a resolution time, your issue may languish in tech support for months.
  • Ask who your main point of contact will be. It sure won’t be the salesman you’re talking to. Ask to speak with that rep and see if you can understand them. You know what I mean.
  • Ask about system backups.
  • Ask about system redundancy.
  • Ask about connection redundancy.
  • Ask about the project workflow for the site design.
  • Get a guaranteed completion date.
  • Ask to see their designer’s other work.
  • Make sure you aren’t limited to the number of design concepts you can go through. You may only get two, and if two isn’t enough, you will have to pay more.
  • Ask how their testing environment works. You need a production area to test pages before pushing them live.

There are good CMSs out there, but as a layperson they are hard to identify.  A CMS may be a very good fit for your company, but too often the decision to buy a CMS is made without consulting anyone “technical.”    If you are relying on the CMS company to be your technical person, remember, they are selling you a product.  You need an independent opinion.  When you’re talking to the CMS company you’re not talking to a technical person looking out for your company’s best interests, you’re talking to a salesman working for a commission.