It is easy to calculate: add up all the numbers, then divide by how many numbers there are.
The mean, commonly known as the average, is a frequently used measure of central tendency in statistics. But what exactly is the mean and how do you calculate it? This comprehensive guide will walk you through everything you need to know to find the mean of any dataset.
What is the Mean?
The mean, also called the arithmetic mean or average, is the sum of all values in a dataset divided by the total number of values. It gives you a single number that is representative of the middle of the dataset. The mean is the most commonly used measure of central tendency because it takes into account every value in your data.
Here is the formula for calculating the mean of a sample
![Sample Mean Formula][]
Where:
- x̄ = mean of sample
- Σx = sum of all values in sample
- n = total number of values in sample
And here is the formula for the population mean ![Population Mean Formula][]
Where:
- μ = mean of population
- ΣX = sum of all values in population
- N = total number of values in population
The only difference between the formulas is that we use lowercase (x) for sample attributes and uppercase (X) for population attributes,
When to Use the Mean
The mean is the best measure of central tendency to use when:
- You have continuous, quantitative data. The mean requires numerical values that can be added together.
- Your data has a normal distribution. In a normal distribution, the mean, median, and mode are all equal.
- You want to use all values in your dataset. The mean takes into account every data point.
- You have no significant outliers. Extreme outliers can skew the mean by pulling it towards them.
Steps to Calculate the Mean
Follow these two simple steps to find the mean of any dataset:
1. Add Up All the Values
First, add up all of the values in your dataset. Be sure to keep track of negative signs if you have negative numbers.
For example, if your dataset is:
{2, 7, -3, 10, 7}
You would calculate:
2 + 7 + (-3) + 10 + 7 = 23
2. Divide the Sum by Number of Values
Next, divide the sum you just calculated by the total number of values in the dataset.
Using our example, there are 5 values in the dataset.
So we would divide the sum (23) by the number of values (5):
23/5 = 4.6
The mean of this dataset is 4.6.
And that’s it! By following these two steps you can easily calculate the mean of any set of numbers.
Calculating the Mean by Hand
Let’s go through an example of calculating the mean of a dataset by hand:
The number of hours spent studying last week by 8 college students are:
{3, 5, 2, 4, 5, 8, 10, 6}
Step 1) Add up all the values:
3 + 5 + 2 + 4 + 5 + 8 + 10 + 6 = 43
Step 2) Divide the sum by the total number of values (n=8):
43/8 = 5.375
The mean number of hours spent studying is 5.375
Using a Calculator to Find the Mean
For larger datasets, it’s easier to use a calculator rather than adding up all the values by hand. Here are the steps:
-
Enter all the values from your dataset into the calculator.
-
Press the “sum” or “total” button on your calculator to add them up.
-
Divide the sum by the number of values in your dataset.
-
The result is the mean!
Many statistics calculators and spreadsheet programs like Excel have built-in functions to calculate the mean automatically. But it’s good to know how to do it manually as well.
The Effect of Outliers on the Mean
One important thing to understand about the mean is that it is sensitive to outliers. Outliers are extreme values that are much higher or lower than the other data points. Just one or two outliers can skew the mean by pulling it towards them.
Let’s see an example. Here is a dataset of daily temperatures:
{68, 65, 67, 71, 78, 62, 60}
The mean temperature is 67.4 ̊F
Now let’s add an outlier value of 95 ̊F to the dataset:
{68, 65, 67, 71, 78, 62, 60, 95}
The new mean temperature is now 71.1 ̊F
See how the outlier raised the mean substantially? The median or mode would likely better represent the central tendency of this skewed dataset.
Mean, Median, and Mode
The mean, median, and mode are the three main measures of central tendency in statistics. Here’s a quick overview:
- Mean – The average, calculated by summing all values and dividing by the number of values.
- Median – The middle value, found by ordering the dataset and selecting the value in the center.
- Mode – The most frequently occurring value in the dataset.
Choosing which one to use depends on the shape of your data distribution and whether you have outliers. The mean is best for symmetric, normal distributions without outliers. The median or mode are better for skewed distributions.
Mean For Grouped Frequency Tables
When you have data organized into groups with frequencies rather than raw scores, you need to use a modified formula to calculate the mean.
Here is the formula for mean of grouped frequencies:![Mean of Grouped Frequencies Formula][]
Where:
- x̄ = mean
- xi = midpoint of each class
- fi = frequency of each class
Let’s look at an example:![Grouped Frequency Table Example][]
To find the mean:
- Calculate the midpoint for each class by averaging the lower and upper limits
- Multiply each midpoint by the class frequency
- Add up the products
- Divide the sum by the total frequency (n)
Using this process gives us a mean of 67.7 for this data.
Weighted Mean
In some cases, you may want to give certain values in your dataset more influence over the mean. This is called a weighted mean.
The formula for weighted mean is:![Weighted Mean Formula][]
Where:
- x̄w = weighted mean
- wi = weight of each value
- xi = each individual value
Weights are applied by multiplying each value by its assigned weight. Higher weighted values have more impact on shifting the mean.
Let’s say you survey both students and faculty about how many events they attend on campus each semester. You want to give more weight to the faculty responses. You assign faculty a weight of 2 and students a weight of 1.
Here is the data:
| Group | Value | Weight |
|
The mean of the above numbers is 22
How do you handle negative numbers? Adding a negative number is the same as subtracting the number (without the negative). For example 3 + (−2) = 3−2 = 1.
Knowing this, let us try an example:
Example 2: Look at these numbers:
3, 7, 5, 13, 20, 23, 39, 23, 40, 23, 14, 12, 56, 23, 29
The sum of these numbers is 330
There are fifteen numbers.
The mean is equal to 330 / 15 = 22
How to Find the Mean | Math with Mr. J
How do you find the mean of a number?
Mean: The “average” number; found by adding all data points and dividing by the number of data points. Example: The mean of 4 , 1 , and 7 is ( 4 + 1 + 7) / 3 = 12 / 3 = 4 . Median: The middle number; found by ordering all data points and picking out the one in the middle (or if there are two middle numbers, taking the mean of those two numbers).
What is mean in statistics?
In statistics, the mean is the value used to summarize the given data set. It measures the center of the data, hence called a measure of central tendency. Mean is calculated to find the average of different scenarios in real life, such as an average number of people having a TV in a city, average marks obtained by students in a class, etc.
How do I solve for the mean of a data set?
To solve for the mean of this data set, we will need to use all the information we’ve been given and will also need to know what the mode and median are. As a reminder, the mode is the value that appears most frequently in a data set, while the median is the middle value in a data set (when all values have been arranged from lowest to highest).
How do you calculate the mean of a set?
Divide the sum of the set by the number of values. The result is the mean (a type of average) of your set. This implies that if each number in your set was the mean, they would add up to the same total. Example: . Therefore, 4 is the mean of the numbers. You can check your calculations by multiplying the mean by the number of values in the set.