Splunk: Real-time (web) analytics, powerful data mining and cost effective single customer view

Splunk is a fantastic monitoring and operational intelligence tool and now we are all trained up here at Datalicious with certificates to prove it (see end of post). The most frequent use case is for systems administrators, but we set out to play around with it and see how we could use it for web analytics. We realised that we could use its powerful, expressive search language and its intuitive charting & visualisation features to do analytics work that's more difficult, more expensive, or simply not possible, in other web analytics suites.

The big philosophy of Splunk is that you just throw all your data into it and worry about how to report on it and what to do with it later. This is great for us: it means we can focus on gathering as much data as possible in the implementation stage of a project, and there's no risk of getting to the reporting & insights staging only to realise we've overlooked something.

We have a setup where all our Google Analytics data is cloned and sent into Splunk. We hacked together a simple, scaleable pixel server in node which acts as an intermediary between Google Analytics and our Splunk installation. Our server can handle any pixel request, so we can supplement the data that Google Analytics gathers with anything we want to do in our tracking code - without having to set up Custom Variables in advance, and without being limited to 5 of them.

Once the data is in Splunk, its search language lets us get right at the data and do whatever we want with it. For example, maybe we want to see how many page views our website gets on average per session, to see how our latest site design is performing. We can run this search:
eventtype=datalicious_GA earliest=-7d | stats avg(utms) AS avg | eval avg=round(avg, 2)
Broken down, it's pretty simple: we're looking at the event type called "datalicious_GA", which has been defined elsewhere. The earliest results we want are 7 days ago. We "pipe" the output of that search to the "stats" command, and we get an average of "utms", which is Google Analytics' session counter. We then round it to two so that it looks a bit nicer, and we get this:

average page views

Fairly simple. But what happens if we realise we want to break those results down by some kind of segmentation which we didn't plan for in the past? It's no problem. If at any time in the future we get some additional metadata about our visitors, we can apply that retrospectively to generate segmentations across their full history. For example, lets say some visitors eventually "convert", which for our website is simply clicking one of the links to contact us. We could run this more complex search query:
eventtype="dataliciousGA" | eval type="Non-Converter" | join type=outer datalicious [search eventtype="dataliciousGA" | join datalicious [search eventtype="datalicious_conversion"] | eval type="Converter"] | stats avg(utms) AS avg by type | eval avg=round(avg,2)
This just means we want to do a search for converters, join it to the search result for all visitors, and show the average per-session page views of each of those segments.
 segmented average page views

It's trivial to look at something like conversions by channel:

Screen_5

Of course, no one wants to look at ugly search strings all day. That's why we build visualisations:

individually segmented page views

It's important to emphasise that we can retrospectively apply a segmentation across the full history of all impressions, events and custom data at any time. In the above example, we built a little form and got people from around the office to fill in their name. We associated that with the unique cookie ID they have on our website, and suddenly we can track their individual behaviour over all time. This didn't have to be the name, it could have been any meaningful segmentation: annual household income, country, favourite musical genre, etc.

And of course, we can apply all of those segmentations across data like search keywords:

segmented search keywords

Check out our fancy certification:

(download)
Check out the Datalicious Supertag: Container tag for smarter tag management

Google Analytics keeps innovating, introduces new Flow Visualizations and Real-Time analytics

Google has added an exciting new feature to its Analytics tool to help you visualize how different classes of visitor move around your site. There have been a few attempts at this kind of visualization, none of which have been particularly useful. We think this one hits the mark rather well.

Flow_visualizations
Starting on the left you see where visitors have come from, which is the default but you can start from other segments, then the landing pages and subsequent pages. The Datalicious site is pretty simple, so there's not much, but what we see in our clients' sites is much more interesting, with the full complexity of connections laid out in a really interesting way. This new feature will be gradually rolled out to all Google Analytics users over the next few weeks.

It comes hot on the heels of Google's Real Time view, which is also being gradually rolled out and gives a brilliant live dashboard of who is on your site right now. Google are clearly taking the analytics space very seriously, with lots of very cool stuff in the pipeline that we can't talk about yet!

Real_time
Email us at insights@datalicious.com or call us on 1300 209 601 if you need help with your Google Analytics set-up or would simply like someone to analyse your data and make some campaign and website optimisation suggestions.
Check out the Datalicious Supertag: Container tag for smarter tag management

SiteCore presentation on the power of raw visitor data for advanced data mining and modelling

In case you missed my presentation on The Power of Raw Visitor Data at today's SiteCore Dreamcore event in Sydney, check out the below slides.

SiteCore's build-in visitor profiling and segmentation capabilities are awesome but you can do even more advanced data mining and modelling by connecting to the platform's raw web analytics database with tools such as Tableau and SPSS

The dataset basically contains all campaign response and visitor behaviour in a readily accessible MS SQL data warehouse which makes it easy to create custom Tableau reports and interactive dashboards as well as run more advanced statistical analysis in SPSS such as regression modelling. In addition, Tableau makes it easy to combine the web analytics data with additional data sources such as your CRM or call center data.

You might also want to have a look at our earlier blog post and video on how to use Tableau to analyse raw SiteCore web analytics data and build interactive dashboards.

Click here to download:
201110 Datalicious SiteCore Analytics V1.pdf (2.69 MB)
(download)
Check out the Datalicious Supertag: Container tag for smarter tag management

How to use Tableau to analyse raw SiteCore web analytics data and build interactive dashboards

Screen_shot_2011-10-05_at_3

Ever wondered what kind of data WCM (Web Content Management) systems collect on individual visitors? Well, usually not much (or not in a readily accessible format anyway) but SiteCore is the exception! 

SiteCore stores all campaign response and visitor behaviour in a MS SQL database that can easily be accessed using standard BI tools such as Tableau. Have a look at the below video to see just how easy it is to connect to the database in order to create custom reports and interactive dashboards from your web analytics data.

The custom dashboard you can see in the video is unfortunately too big for Tableau public so we can only show you some screen shots in this post. However, please contact us if you're a SiteCore customer or thinking about becoming one and are interested in using Tableau as a reporting solution

Just in case you are wondering, SiteCore and Tableau probably can't replace your current web analytics platform but there's a lot of other funky stuff you can do with the raw SiteCore data, especially if you use Tableau to combine it with other additional data sources such as your CRM or call center data.

(download)

Check out the Datalicious Supertag: Container tag for smarter tag management

Impact of website performance on overall conversion and cross browser/device display testing

We had a pretty interesting presentation from Gomez yesterday on the impact of site performance on overall conversion which is backed up by a Gartner report earlier in the year called E-Commerce Websites: Features That Make Consumers Buy.

Key findings of the Gartner report

  • Consumers expect to have detailed and accurate product information 
  • Consumer expectations demand good performance from retailer websites
  • The Akamai "7 Second Rule", page abandonment spikes after that
  • Consumers would like to see more single-page shopping carts
  • Consumers' product reviews are becoming a must-have element

According to the report page load speed is the 3rd most important influencing factor. "Historical Akamai research showed consumers abandoned pages if they took longer than seven seconds to load. Delay the presentation of the product information that the consumer is looking for, and you run the risk of losing not only the sale but your customer."

Apart from the performance testing features, we also really liked the cross browser testing and preview functionality as well which allows you to generate JPGs showing how your website looks across various browsers and operasting systems as well as mobile devices. Have a look at the below screen shots for the Datalicious blog to see how the site renders on an iPhone, iPad and Blackberry as well as load times across the various Posterous elements.

Download the full Gartner report on eCommerce Site Features That Make Consumers Buy.

(download)

Check out the Datalicious Supertag: Container tag for smarter tag management