Statistical Comparison of Two Data Sets in Excel

Get FREE Advanced Excel Exercises with Solutions!

In this article, I will discuss the statistical comparison of two data sets in Excel. At times, while working with spreadsheets, we have to compare data statistically. Luckily, Excel has some built-in functions to do the comparison between data sets.


Statistical Comparison of Two Data Sets in Excel: Key Method

Excel Statistical Comparison of Two Data Sets Introduction

In our example, we will use two monthly sales data sets of Steel-cut oats and Rolled oats. By comparing statistically through Excel, we will find out how sales of these two types of oats change over time. Besides that, we will show the sales graphically too. Furthermore, for ease of our statistical comparison, we will find Mean, Standard Deviation, Coefficient of Variation, and Range for Steel-cut oats i.e. range (C5:C11) first.

Excel Statistical Comparison of Two Data Sets

Steps:

  • Initially, to get the Mean of Steel cut oats, type the following formula in Cell C12.
=AVERAGE(C5:C11)

Excel Statistical Comparison of Two Data Sets

Here, the AVERAGE function returns the arithmetic mean of dataset C5:C11.

  • Next, we will find out the standard deviation of dataset C5:C11. So, type the following formula in Cell C13.
=STDEV.S(C5:C11)

Excel Statistical Comparison of Two Data Sets

Here, the STDEV.S function estimates Standard Deviation based on the sample (ignores logical values and text in the sample)

  • Then, we will calculate the Coefficient of Variation of the dataset (C5:C11). The formula to calculate the CV is:

(Standard Deviation/Mean)*100

  • So, considering the above equation, type the below formula to get the Steel-cut oats’ sales:
=C13/C12

Excel Statistical Comparison of Two Data Sets

  • However, make sure you calculate the CV in percentage. To do that, select the corresponding cell (C14), go to Home > Number.

Excel Statistical Comparison of Two Data Sets

  • Now try to keep the value within 1 decimal place, and hit OK.

Excel Statistical Comparison of Two Data Sets

  • After that, we will calculate the range of the data set (C5:C11). To calculate the range of the above-mentioned data set, here is our formula:
=MAX(C5:C11)-MIN(C5:C11)

The MAX function returns the largest value of the dataset C5:C13. And, the MIN function returns the smallest value of that range. Lastly, by subtracting these minimum values from the maximum one, we will get the Range of the Steel-Cut Oats.

  • Finally, drag down the Fill Handle (+) tool to copy all the formulas to calculate the Mean, STD Deviation, CV, and Range of the Rolled oats data set.


Statistical Comparison Between Data Sets in Excel

Let’s compare the data sets depending on the result we got from the above calculation.

Mean: Mean is the arithmetic average of a dataset. And, from the above calculation, we can see that Rolled oat’s sales Mean is greater than that of the Steel cut one’s. That means, over time, the sales of Rolled oats are greater than the other one.

Standard Deviation: The standard deviation is a measure of the amount of variation of data points or values relative to their average or mean. For example, a low standard deviation tells us that the values tend to be close to the mean of the dataset. On the other hand, a high standard deviation means that the values are spread out over a wider range. Here, from our result standard deviation is greater for Rolled oats. Thus, this indicates the sales values of Rolled oats are spread out over a wider range than those of the Steel-cut oats.

CV: The coefficient of variation (CV) is a relative measure of variability that indicates the size of a standard deviation to its mean. From our above calculation, we can see that the CV of Steel cut oats is slightly higher than that of Rolled oats. Consequently, we can summarize that sales values of Rolled oats are more consistent compared to Steel-cut ones.

Range: In statistics, the range of a set of data is the difference between the largest and smallest values. It is evident from the datasets that Rolled oats have a higher range. This result indicates that, for some months, fluctuation of the sales of Rolled oats is higher than those of the Steel cut ones.


Download the Practice Workbook

You can download the practice workbook that we have used to prepare this article.


Conclusion

In the above article, I have tried to discuss the statistical comparison method elaborately. Hopefully, this method and explanation will be enough to solve your problems. Please let me know if you have any queries.


<< Go Back to Columns | Compare | Learn Excel

What is ExcelDemy?

ExcelDemy - Learn Excel & Get Excel Solutions Center provides online Excel training , Excel consultancy services , free Excel tutorials, free support , and free Excel Templates for Excel professionals and businesses. Feel free to contact us with your Excel problems.
Hosne Ara
Hosne Ara

Hosne Ara is a materials and metallurgical engineer who loves exploring Excel and VBA programming. To her, programming is like a time-saving superhero for dealing with data, files, and the internet. She's skilled in Rhino3D, Maxsurf C++, MS Office, AutoCAD, and Excel & VBA, going beyond the basics. With a B. Sc. in Materials and Metallurgical Engineering from Bangladesh University of Engineering and Technology, she's shifted gears and now works as a content developer. In this role, she... Read Full Bio

We will be happy to hear your thoughts

Leave a reply

Advanced Excel Exercises with Solutions PDF

 

 

ExcelDemy
Logo