How to Measure Activity Frequency in Google Analytics

Histogram grouping users by number of sessions in a month (log-scale)

Histogram grouping users by number of sessions in a month (log-scale)

After reading Li Jin and Andrew Chen’s post about power user curves I wanted to try out the concept with Google Analytics data. It turned out to be harder than expected because, surprisingly, GA doesn’t have a view or query that reveals how many users visited x days in a given date range.

How many users visited every day this week sounds like a simple question but GA can’t answer it.

The closest question we can answer with GA is how many users had x sessions in a given date range. So this doesn’t exactly give us the “L30” day-based histogram mentioned in their post, but it gives us something pretty close.

GA’s Frequency > Count of Sessions Isn’t Helpful

GA has a Count of Sessions view under the Frequency & Recency tab, but this is one of those GA things that doesn’t do what you might expect it to. It ignores the date range and shows lifetime session counts for users. It also doesn’t de-dupe between the buckets, so a user with 10 sessions is also represented in the 9, 8, 7, etc buckets.

To find out how many users had 30 sessions last month we have to use a segment. To calculate how many users had 30, 29, 28, etc sessions we need to use dynamic segments, which means we have to use the API.

I have some example code below but first here are a few examples of the sort of data this can produce.

7-day Timeframe

This is based on the L7 histogram mentioned in Jin and Chen’s post. Again, instead of showing the number of days that users were active it shows the user’s number of sessions in a 7-day period.

Percentage of users with different session counts in a given week

Percentage of users with different session counts in a given week

Comparing Segments and Cohorts

We can plot different segments, cohorts, or products to see how the user activity frequency compares.

Sessions-in-a-week usage curves for a set of related products

Sessions-in-a-week usage curves for a set of related products

Another example, using a 30-day timeframe:

Sessions-in-a-month usage curves for a set of related products

Sessions-in-a-month usage curves for a set of related products

API Code Example

The only thing of note going on here in the query is the dynamic segment. For each request we need a users::condition::ga:sessions==x segment. To prevent users from appearing in multiple groups we use the == operator instead of >.

We’ll often want to know the number of users whose session count exceeded the upper bound so the loop switches the operator on the last step.

This prints out an array that you can drop into a charting tool or Google Sheets to graph.

This was just a short post explaining how to work out the GA mechanics. To understand the purpose and why these frequency curves are interesting, make sure to read Li Jin and Andrew Chen’s post.