Google Analytics Cross Segmentation: Something You Should Know

Google Analytics has a great feature called Cross Segmentation. Using this feature you can ‘drill down’ into your data to gain more insight. However, there is one thing that all GA users should know. Sometimes cross segmenting data does not produce the desired result.

Example

I’m a big fan of bounce rate. I think it’s a vital metric that explains a lot about the online sales channel. One things I like to do is measure the bounce rate for marketing campaigns. Using bounce rate I can tell if the marketing message that drove a visitor to the site matches the message shown to the visitor when they land on the site. Nothing revolutionary here…

I start with the Content Optimization > Navigational Analysis > Entrance Bounce Rate report. Here it is in all it’s glory:

20070307-entrance-bounce-rate.jpg

To get the data I want, the bounce rate for a specific page coming from a specific source, I need to cross segment the above report. Usually there is a special landing page (or multiple landing pages) for the campaign, but in this case we’ll look at /blog/index.php. Cross segmenting row 1 in the above report yields:

Boune Rate: Cross Segmented

See how the report columns have changed? We can no longer see the bounce rate. We only see the visits, pageviews, conversion rate and revenue per visit. Honestly, I don’t need that data, I really need to know the bounce rate for each source. Unfortunately I can’t get that data using the cross segmentation feature.

Another Example

Here’s the Marketing Optimization > Visitor Segment Performance > Referring Source report.

20070307-ref-source-cross-segmented.jpg
** Please Note: I initially posed the wrong image above. The image should contain referrals from Web Analytics Demystified. If it shows data from StumbleUpon then you may be viewing a cached image. Sorry. Now, back to our story. **
Thanks Eric for all the traffic :) Let’s cross segment row #1 by ‘Content’ and see what happens:

Referring Source: Cross Segmented

You may think that we’re segmenting by the content on my site, but we’re not. This isn’t the same content from the ‘Top Content’ report. What we see here are the pages on Eric’s site where people clicked on links to my site. How can I be sure? All the pages on my site start with ‘/blog/’.

So why is this happening? It’s just the way that GA is storing data. It’s not a bug, it’s just the way that GA works. Don’t worry, there is a work around :)

The Solution

The solution comes down to two things: planning and knowledge. Know the exact metrics you need for your analysis and make sure Google Analytics can deliver them. If you can not cross segment a report to produce the desired data, then try creating an additional profile (using filters).

Here’s how I get around the bounce rate issue above. I use a filtered profile to generate the bounce rate. I create a new profile and apply an include filter based on the campaign, medium or source, that I want to analyze. When the filter is applied to the profile then all the reports in that profile will be specific to the campaign, medium or source, specified in the filter. Obviously this is practically impossible if you are doing an analysis on the fly, or if you need to filter on a piece of data that is unknown when you set up GA.

As a rule, I always create specific profiles for major marketing campaigns. Here’s an example of the filter I might use:

Campaign Name Filter

The above filter only includes data coming from a single campaign named ‘Important-Campaign’. That means that the the data in the Entrance Bounce Rates report is only for the ‘Important-Campaign’. I’m essentially cross segmenting when Google Analytics processes the profile data.

Conclusion

I truly believe that GA can provide most of the metrics you will need for a thorough analysis. However, you must plan ahead. As the above example shows there are some anomolies, but they can be mitigated with a logical plan for analysis.

An Analysis of My Data

I’ll be honest, it has been a long time since I looked at the analytics for this blog, I just haven’t had any time. Work has been flat-out busy since January 1. That’s also the reason that my posting has been very light.

Anyway, I logged into GA today to check some stats and was surprised by some of the data. I thought you all might be interested in the basic process I used to figure out what happened.

The Big Picture: What’s happened Since January 1

Since I haven’t reviewed the traffic data for a couple of months I started by adjusting the date filter to cover January 1 to March 4. Here’s the Executive Overview report:

Analytics Talk Traffic

Notice anything? What the heck happened around February 1st? Obviously some website drove a ton of traffic to the blog. Was it Digg?

Drilling Down: When did it Happen?

Before I get ahead of myself and try to figure out where the traffic came from, let’s determine exactly when it happened. I’m going to use the date filter to drill down and isolate when the date happened. If I hover my pointer over the data above GA shows that the spike occurred on February 1.

If I adjust the date filter one more time to cover February 1 we can see there was a big jump in traffic at 8 AM:

picture-3.jpg

Now we’re getting somewhere. Now that I know when the spike happened I can start to figure out where the traffic came form?

Getting Closer: Where did they come from?

I’m trying to figure out where the traffic came from, so I’m going to look at referral information using the Marketing Optimization > Visitor Segment Performance > Referring Source report. This report segments the site traffic based on where it originated. When we look at this report we should immediately know where the traffic came from:

picture-4.jpg

A ha! I was Stumble-Uponed! For those of you unfamiliar with StumbleUpon, here’s a description from their website:

People-Driven Technology:
Using a combination of human opinions and machine learning to immediately deliver relevant content, StumbleUpon presents only web sites which have been suggested by other like-minded Stumblers. Each time the ‘Stumble’ button is clicked, the user is presented with a high quality web site based on the collective opinions of other like-minded web surfers.

Based on this description I’m going to say that StumbleUpon drives qualified traffic to my site. We like qualified traffic, it usually converts better :)

Is there anything else to know?

What else can I learn about these people? First, they didn’t convert on any of my GA goals! I can see this in the above report. There is a 0% conversion rate for Goal 1 and Goal 2. This means that they didn’t do what I wanted them to do. (The goals for my website are to sign up for the RSS feed and to use the contact form). What’s disturbing is that this was qualified traffic! Is this indicative of a problem with my website? Why didn’t these people convert?

I’m going to dig deeper by cross segmenting the data in the Referring Source report. I’m trying to find out more about these people that came to my site from StumbleUpon. Did they come to the site before (i.e. how many return visits?):

picture-5.jpg

Not too many return visits, almost all new visits. So I now know that StumbleUpon drove 136 new visits to my blog in about one hour and none of them converted. Wow, that’s a complete bummer.

Summing Up

I could dig a bit deeper, and look at how the visitors from StumbleUpon navigated the site. But, with an average number of pageviews below 2 I’m not going to discover too much. There are some lessons to be learned here:

First, I should have been paying closer attention to my reports. I probably don’t need to review them daily, but I should review them once a week. Also, I’m going to dig a bit deeper into my new visitor segment. I really think I may have a problem converting new visitors. But that’s another post for a later time :)

Google Analytics Campaign Tracking Pt. 3: Reports and Analysis

In Part 1 of this series I explained link tagging, the technology that Google Analytics uses to track on-line marketing campaigns. In Part 2 I discussed how to tag your links and posted a tool that I use to quickly tag large numbers of marketing URLs. Today, in Part 3, I’ll start to pull this whole thing together by walking through a very basic analysis.

How I Start

I like to start my marketing analysis using the Marketing Campaign Results reports. Using these reports I can immediately identify any campaigns that are under or over performing. They’re a great launching pad for further analysis. You can find them in the Marketing Optimization > Marketing Campaign Results section.

The reports segment the data based on where the visitor came from using the values from the campaign tracking variables. So, for each of the major campaign variables we discussed in Part 1 (utm_campaign, utm_source and utm_medium) we have a corresponding Google Analytics report. Here’s the mapping of campaign variable to GA report:

Campaign Variable Report Name
utm_campaign Campaign Conversion
utm_source Source Conversion
utm_medium Medium Conversion

This means that the values you used in the campaign variables will be pulled directly into the reports. Exciting stuff, huh? :)

Campaign Conversion Report

Let’s start with the Campaign Conversion report.

Campaign Conversion Report
This report segments the traffic based on campaign name. It contains information from tagged URLs (using the utm_campaign variable) and un-tagged URLs. How does it get data for the un-tagged URLs? If you’re using the auto-tagging feature in AdWords then Google Analytics will automatically pull in the Campaign names you create in AdWords. All other un-tagged URLs get put into the following buckets:

  1. (direct): visitors that entered your website address directly into the browser
  2. (organic): visitors from an un-paid search engine listing
  3. (referral) : visitors that clicked on an un-tagged link
  4. (not set) visitors from links that were tagged but were missing some information. For example, if you are looking at the Campaign Conversion report, and see that there were 10 visits from ‘(not set)’ this means that the utm_campaign variable was missing from the tagged link.

Ok, so what does this report tell us? It helps us quickly understand how well our campaign is performing using some basic metrics:

  • Visits: How much interest did the campaign generate?
  • Goal Conversion Rate (G1/Visits): Did the visitors from this campaign do what we wanted them to do?
  • Transaction Average (T/Visits): How many transactions were generated by this campaign?
  • Revenue per Visit ($/Visits): How much money did we make from each visit in the campaign?

It’s important to realize that each metric gives you a bit more insight into what is going on. For example, let’s say a campaign has a very low conversion rate. Why? Look at the number of visits. Is the campaign generating a lot of traffic? If there are a high number of visits but a low conversion rate there may be a disconnect between the marketing message you’re sending and the content the visitor sees when they land on the site. Dig deeper, Try checking the bounce rate for the landing page.

Again, this is a good starting point for a deeper analysis. And analysis means segmenting the data to gain more insight.

Segmenting Campaign Data

Notice that the first column in the report is named ‘Campaign/Source’ and not just ‘Campaign’? The reason is that this report let’s us drill down into our campaign and view the sources associated with the campaign. If we click on the plus sign for the ‘Ongoing’ campaign we can drill into the data and see the associated sources.

Campaign Source Report

This tells us is there were three sources of traffic in the ‘Ongoing’ campaign: Squidoo, Wikipedia and MySpace. This is real data from a company using social networking and viral sites to drive traffic. The value in the brackets (Social_Networking and viral) is the medium (which we’ll get to later).

Remember from Part 1 of this series, the source is the ‘who’ part of the campaign. ‘Who’ did we partner with to distribute our message? By drilling down into the data we can find out. Drilling from the campaign level to the source level revealed a lot about our campaign. It looks like this business should dump MySpace and focus more on Squidoo! Not only do they get more traffic, they get far better conversion.

But let’s take a different look. It could be that these sources are used in multiple campaigns. Maybe MySpace did really well in a different campaign. Let’s use the Source Conversion Report to get a different view.

Source Conversion Report

Source Conversion
The source report shows us how all of our various sources are doing. It does not matter which campaign the source belongs to, they are all listed in this report. This report is very helpful because it shows, historically, how well a source performs. It may be that a source just under performed for a specific campaign. We can see above that Squidoo, which performed very well in the Ongoing campaign, does not crack the top 10 sources. MySpace is no where to be found. This probably means that MySpace performs poorly across the board, not just in the Ongoing campaign.

Next to each source we can see the medium associated with that source. Again, I like to think of the medium as the mechanism that we use to push our marketing message out. Was it email, CPC, banner, print, etc. Google Analytics has pulled the medium value from the utm_medium variable and placed it in the report.

Medium Conversion Report

Looking at the medium we can evaluate how well the mechanism is working for us. Let’s see how well the Social Networking ‘mechanism’ is working.

Medium Conversion Report
Interesting. We can see that the ‘Social Networking’ medium doesn’t get a lot of traffic, but it gets attentive traffic (high number of pageviews per visit) and what appears to be an average conversion rate (for this site) for Goal 1.

The medium report is good at identifying dependencies. Are you too dependent on a particular way of getting traffic? If all your conversions come from organic, and the search engines drastically change their ranking algorithms, then you could loose a lot of traffic and a lot of money!

Other Reports

In addition to the Conversion reports, there are also three ROI reports. These reports are very similar to the Conversion reports. They segment the data in the same way (based on campaign, medium and source). The difference is the metrics reported. Rather than conversion rates, these reports show cost, revenue and ROI. If you have an e-commerce site and are collecting revenue, or have monetized the values of your goals, then the revenue generated by each campaign will be displayed.

Campaign ROI Report

A warning about this report. GA will only pull in cost data from your AdWords campaigns. Do not be alarmed if you see no other cost data in this report. GA is a closed system, you can not import cost data from other sources. This means that the ROI calculations will be incottrct for non-AdWords campaigns.

The referral conversion report is another fantastic report :) This report lists all of the un-tagged, non-organic and non-direct links that drove traffic to the site.
Referral Conversion Report

Drilling down into this report will show you where on each referral domain, the visitor originated. I find the referral report enlightening. The web is a wacky place. And people reference content in so many different ways. This report will help you hunt down all the sources of your traffic.

referral-drilldown.jpg

Some Final Thoughts

You may notice that some of the reports above have multiple lines for the same items. For example, the Medium Conversion report has two line items for social networking:

  • Social_Networking
  • social_networking

The reason the item is listed twice is that the person tagging the links specified two different values for the utm_medium variable. That’s why it’s important to use a standard naming convention when tagging your links.

Well, that wraps up this Part 3 of our Campaign Tracking series. What did you think? My ultimate goal is to make you all marketing measurement wizards. Am I doing a good job?