Freakalytics®

Practical Analytics for Better Decisions

Freakalytics™ header image
 
 

Estimating future success rates from initial experience, surveys and observation (tutorial)

A wide range of common business questions are often decided incorrectly because decision-makers overlook, forget or neglect the application of a simple concept from statistics.  In this tutorial we will walk you through several examples to avoid this potentially costly mistake.  Examples where this technique can help include:

Is my ad worth the price?
Conversion (CTR): how many customers converted to a paying customer after clicking on an Google ad and visiting a special offer web page?  Based on the revenue generated is the ad price too high?

How many of my customers have children?
Estimating customer demographics: based on a one day survey in every store, what percent of our entire customer base have children?

Who will win the election?
Survey results: what percent of likely voters will vote for Obama based on the responses from a 1,000 people in a poll?

Bringing down the house?
Winning a bet: if my friend flips a coin 10 times and it landed on heads 9 times, is this a “fair” coin?

All of these questions and many others can be answered with the technique explained and demonstrated in this article.

 

Which states have the most Miss America winners?

Here is a fun example about the Miss America pageant, it appeared on the Ask.com home page.

Notice that 27% of Ask.com users picked the correct state for the most Miss America winners, is that good? Well, we should ask how you would perform if you had no information and simply guessed at the answer. With four choices and only one correct answer, you have a 1 in 4 chance (that’s 1/4 = 25%) of guessing the answer even if you have no clue.

So, is 27% actually better than all of these people just guessing? The answer is “it depends” on a missing piece of information- how many people answered this question. If 100 people answered it and 27 answered correctly, there is a good chance that they are all simply guessing. However, if 10,000 answered this question and 2,700 answered it correctly, there is a good chance that some of them answered better than just guessing.

 

The classic illustration of success- flip a coin

You may be puzzled at this point. Don’t fear. Let me move to a simpler example, flipping a coin. Believe it or not, it is very similar to the multiple choice question above, with the main difference being the chance of “success”- guessing heads or tails correctly, which is 1 in 2 or 50%. So, if I flip it once and you are right, then 100% of flips were guessed correctly. However, this one flip being guessed correctly wouldn’t lead me to believe that you had the ability to see the future (or that the coin is an unfair coin that is always heads). How many flips guessed correctly would it take? Like I have seen followed in many business situations, what does your intuition or gut say?

Five out of five correct?
Twelve out of fourteen?
80 out of 100?

Here’s the good news, there is a simple

Read the rest of this entry »

Add Your Comment Here

How to show growth over time and conceal the baseline time period in Tableau

A client asked,

What I want to show is a YoY (year over year) or WoW (week over week) comparison – however the first data point is always missing – even when the underlying data are available.

For example:
The full set of data shows a null value for 2006 – just because there is no data available to compute a comparison.
Now I want to move the date filter to start @ 2007 – and now 2007 has a null value even though we have data available to compute that metric.
” What’s the issue and how can I display only years

Read the rest of this entry »

Add Your Comment Here

Data Driven Conference 2012 and our special discount for attendees

We are having a great time at the Data Driven Conference in Columbus! Our first session was standing room only and we are presenting the same talk a second time at 1:30 in E161.

Interesting questions include “how do you become better at asking the right questions that lead to better analysis” and “how do you communicate with IT to get better data”?

To buy The Accidental Analyst directly from us at the special attendee discount, please visit www.AccidentalAnalyst.com/ddc and place your order before this Thursday.

Here is our infographic that we created

Read the rest of this entry »

Add Your Comment Here

Important presentation? Have paper notes of your key points at the podium!

Here’s a copy of my key points from Part 3 of our recent Big Data analytics webcast, “Big Data” on your laptop, fast, informative and at your command.

If you are making an important presentation about an analysis and you haven’t written down your key points on paper, why aren’t you doing this??? Make your presentation much less stressful and ensure

Read the rest of this entry »

Add Your Comment Here

Webcast: “Big Data” on your laptop, fast, informative and at your command

NOTE: This fun review of “big data” was inspired by a recent presentation I gave on behalf of Tableau Software at the Big Data Conference in Chicago. You can find the 1st part of this 3 part webcast here, “Performance to Cost Index & my personal history with “Big Data” and Part 2 here, “Big Data” in US History, Exploring the 1790 US Census. This part of the big data series is free, just subscribe or sign in below.

In this presentation, I share an example of working with big data stored on my laptop and the entire analysis happens without any type of connection to remote servers or databases. My analysis uses two tables of interest, the first has 216 million records, over ten years of airline ticket pricing in the US while the second table has 72 million records of US airlines performance data extracted from Hadoop. In the demonstration, which uses currently available technologies, we will quickly explore and analyze this data for interesting trends and patterns.

Read the rest of this entry »

Add Your Comment Here

Free Webcast: “Big Data” in US History, Exploring the 1790 US Census

NOTE: This fun review of “big data” was inspired by a recent presentation I gave on behalf of Tableau Software at the Big Data Conference in Chicago. You can find the 1st part of this 3 part webcast here, “Performance to Cost Index & my personal history with “Big Data”. Part 3 is here, “Big Data” on your laptop, fast, informative and at your command.

In this presentation, I share a review of the original big data in US history, the 1790 US Census. Some surprises are found along the way, including data quality issues in the Census reports and a surprising

Read the rest of this entry »

Add Your Comment Here

Free Webcast: Performance to Cost Index & my personal history with “Big Data”

NOTE: This fun review of “big data” was inspired by a recent presentation I gave on behalf of Tableau Software at the Big Data Conference in Chicago. You can find the 2nd part of this 3 part webcast here, ““Big Data” in US History, Exploring the 1790 US Census”. Part 3 is here, “Big Data” on your laptop, fast, informative and at your command.

Many people ask me, what is “big data”?  For most of them, the right answer is that big data is any data that is difficult to use or understand (yes, I know the official, “correct” answers, which often vary and typically include topics like Hadoop and Cloudera.)

In this presentation, I share my experience with the Commodore 64, the PS/2, DEC Stations, VAX servers, Solaris Servers, PC’s and a MacBook Pro.  Products and languages covered include BASIC, FORTRAN, SAS, Oracle, Teradata and Tableau.

It is truly astonishing

Read the rest of this entry »

Add Your Comment Here

History of US House representation from 1910 through 2010

A few observations from this example
 
Examining this dashboard with the initial decades of 1960 and 2010, you can see that the control of the House has shifted toward the West and the South. Exceptions include Louisiana, Mississippi and Oklahoma in the South and Montana in the West.
 
If you adjust the first decade slider to 1910, an even more dramatic pattern appears! People love the sunshine and the West coast with California, Florida and Nevada growth at 300%+ and Washington, Oregon, Utah and Colorado at 67% or more growth.
 
 
Dashboard topics in this example
 
Download the workbook to peek at a few cool features of this dashboard, including:
1) Using table calculations

Read the rest of this entry »

Add Your Comment Here