As we cover Six Sigma Statistics, I want to make sure that I go over the illustrative part of the statistics. We know Six Sigma is technical but the key to making it stick, is to make it simple and understood by the non-technical people using it. So let’s talk about the Box Plot or the Whisker Plot. A key thing to remember in Six Sigma is that everyone using different terminology, so ask questions and make sure you are speaking the same language.
What is a Box Plot?
Simply put a box plot helps to put a picture to the data showing you where most of the data falls, how the data is distributed and where the outliers are. So it basically shows you what you’ve got, how it looks and what is unusual about it.
What does it measure?
Say you have a process that has multiple variables affecting it and you want to know what is what. If you have a delivery truck with 4 alternative routes a box plot can show you which ones, according to the data, are the most problematic. Additionally a box plot will tell you how symmetrical your data is. Knowing if your data is skewed or not can affect how you interpret your data. In a box plot, if the data is mostly symmetrical the median will appear in the middle of the box and the whiskers will appear to be mostly the same length. IF the data is skewed to one direction, the median will not be in the middle and the whiskers will be different sizes.
How does it work?
Box plot measurements are based on quartiles and the distributions are shown within the graphic. Think back to your SAT’s or ACT’s. Remember how they told you that you scored in the 25th percentile? Well that’s a box plot. You will have an upper limit and a lower limit and those limits will be determined by your organization’s goals. The outliers will be the extreme values, values that are so far outside of the normal distribution that it is unlikely they will be reproduced.
Interpreting your data is just as important as gathering it, so choose carefully and with purpose. Talk to your belt and use that advice to help you find the best method for your organization.
This is a micro blog this week, because next week we get into measures of variation which is a dry subject and will challenge my creative ability. As we continue our trek into statistics and how to interpret them, there is a very specific area that I want you to pay attention to, variation. The reason variation is so important is that it tells you why something is different and how that matters to the data set as a whole. It also provides you the knowledge of what the data won’t be able to tell you because of the interference the variation causes. This is important because when you interpreting data understanding the limitations is almost more important than understanding what is being told to you.
The first thing to consider is range. Range will tell you the difference between the most obvious observation and the smallest one. This is important because this is where you identify your outliers (variables that are outside the norm, think of road work on a delivery path or a maternity event as an obvious observation). A large range would be the maternity event; it’s so big there is no way to avoid noticing it. A small range would be a traffic event, it may have impact but the impact will not be evenly distributed and it may or may not impact the final result.
There is a measurement range that is good for a sample size of 2, it’s called the inter-quartile range. For a bigger sample stick to standard deviation; Standard deviation tells you the average number of times a variation occurs from the mean.
By all means as usual this is not a step by step approach to understanding variation, but it is enough of a foundation to have a conversation with your belt about the metrics and what they mean to your organization and its strategic goals. If you need help starting this conversation, give us a call and we will be happy to get you started.