Box plots are not very intuitive at first glance.The box plot is good at representing bell shaped or Gaussian distributions, but can hide certain important insights in case of bimodal or other non-Gaussian distributions.There are a few points to note about box plots: Here are the formulas that we used for our sample dataset: Step 1: Compute the 5 Summary Descriptors from the Dataįirst, we need to compute the 5 summary descriptors.įor this, create a second table based on the main data, using the formulas shown below (Make sure they are in the same order as shown below too): Descriptor
CREATE BOX AND WHISKER PLOT EXCEL HOW TO
Let us see how to perform each of these steps in more detail.
The line that runs across the middle of the box is the second quartile or the median value of the data.The bottom edge of the box marks the first quartile of the data.The top edge of the box marks the third quartile of the data.This value can help us identify if the dataset is skewed or if there are any potential outliers. The length of the top whisker represents how far the highest value is from the median of the dataset.From the above chart we can see that the highest salary paid by the company is $40,000. The tip of the top whisker (line) marks the maximum value in the data.Let us interpret this chart by looking at its 5 indicators:
The image below shows employee salaries of a given company: How to Interpret a Box Plotīefore we see how to create a box plot (also called a box and whisker chart), let us first learn to understand and interpret a box plot.
In this way, box plots also help us find outliers and can tell if the data is symmetric or skewed.īox plots are especially helpful when you want to compare different sample distributions, and since it provides a 5-point summary of the entire dataset, it is really useful when the datasets you are studying are very large. They can be quite helpful when you want to see how the data is distributed and the range to which the data extends. Why Use a Box Plot?Ī box plot is a great way to visually summarize your data using 5 descriptive indicators. The width of the box basically marks the most concentrated area of the data distribution.Ī box plot can also contain ‘whiskers’ which are simply lines that depict the minimum and maximum values that are outside the first and third quartiles. The main ‘box’ of the box plot is drawn between the first and third quartiles, with an additional line drawn to represent the second quartile, or the ‘median’. The box plot divides numerical data into ‘quartiles’ or four parts. An example of a Box Plot chart created in Excel