
## What is Summation Notation?

Summation (Σ) just means to “add up.” For example, let’s say you had 5 items in a data set: 1,2,5,7,9; you can think of these as x-values. If you were asked to add all of the items up in summation notation, you would see:

Σ(x) which equals 1 + 2 + 5 + 7 + 9 = 24.
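As a quick check, the same "add up" instruction can be sketched in Python with the built-in `sum` function. The variable names `x` and `total` are chosen here for illustration only.

```python
# The five data items from the example above
x = [1, 2, 5, 7, 9]

# Σ(x): add up every item in the data set
total = sum(x)
print(total)  # 24
```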

When using summation notation, X_{1} means “the first x-value”, X_{2} means “the second x-value” and so on. For example, let’s say you had a list of weights: 100lb, 150lb, 153lb and 202lb. The weights and their corresponding x-values are:

X_{1}: 100lb

X_{2}: 150lb

X_{3}: 153lb

X_{4}: 202lb

In full summation notation, the instruction to add up these weights is written:

$$\sum_{i=1}^{n} X_i$$

The “i=1” at the base of Σ means “start at your first x-value”. This would be X_{1} (100lb in this example). The “n” at the top of Σ means “end at the nth x-value”. In statistics, n is the number of items in the data set. So what this summation is asking you to do is “**add up all of your x-values from the first to the last**.” For this set of data, that would be:

100 lb + 150 lb + 153 lb + 202 lb = 605 lb.
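The start and end points of the sum can be made explicit in a short Python sketch. The helper name `summation` is hypothetical, not a standard function. Python lists count from 0, so the sketch reads X_{i} from position i - 1.

```python
weights = [100, 150, 153, 202]  # X_1 ... X_4, in pounds

def summation(values, start, end):
    """Add X_start through X_end, using the 1-based indices of the notation."""
    total = 0
    for i in range(start, end + 1):
        total += values[i - 1]  # X_i sits at position i - 1 in a Python list
    return total

n = len(weights)                 # n is the number of items in the data set
print(summation(weights, 1, n))  # 605
```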

**Note**: If you see a number above Σ, instead of n, it means to add up to a certain point. For example, a “3” above the Σ means to sum up to the third item (X_{3}) in the set.
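For the list of weights, stopping at the third item gives 100 + 150 + 153. A minimal Python sketch of that partial sum uses a slice to keep only the first three values. The name `first_three` is an assumption made for this illustration.

```python
weights = [100, 150, 153, 202]  # X_1 ... X_4, in pounds

# A "3" above Σ: add X_1 through X_3 only
first_three = sum(weights[:3])
print(first_three)  # 403
```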

**Why the difficult notation?** Why not just say “add up”? There *are* cases when you might want to start at a different point in the data set. Although you probably won’t come across these in an elementary statistics class, if you go on to more advanced stats (or calculus), you’ll come across many different variations. So introducing the Σ notation gets you used to the format, much like x and y are introduced very early on in basic math.

Summation notation is also a shorthand that helps to avoid long equations. For example, take this lengthy expression, where a, b, and c are constants, and X and Y are random variables:

(aX_{1}+bY_{1}+c)+(aX_{2}+bY_{2}+c)+(aX_{3}+bY_{3}+c)+(aX_{4}+bY_{4}+c)+(aX_{5}+bY_{5}+c)

This can be written more succinctly in summation notation as:

$$\sum_{i=1}^{5} (aX_i + bY_i + c)$$
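In code, the same compression shows up as a single sum over an index in place of one written-out term per value of i. The sketch below assumes made-up values for the constants a, b, c and for five (X, Y) pairs. None of these numbers come from the text.

```python
a, b, c = 2, 3, 1        # hypothetical constants
X = [4, 8, 15, 16, 23]   # hypothetical X_1 ... X_5
Y = [1, 1, 2, 3, 5]      # hypothetical Y_1 ... Y_5

# The lengthy expression, written out one term at a time
long_form = (
    (a * X[0] + b * Y[0] + c)
    + (a * X[1] + b * Y[1] + c)
    + (a * X[2] + b * Y[2] + c)
    + (a * X[3] + b * Y[3] + c)
    + (a * X[4] + b * Y[4] + c)
)

# The same quantity as one summation over the paired values
short_form = sum(a * x_i + b * y_i + c for x_i, y_i in zip(X, Y))

print(long_form == short_form)  # True
```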

## A More Complicated Example

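One of the most challenging formulas involving summation notation that you’ll come across in elementary statistics is Pearson’s correlation coefficient, shown here in its usual computational form:

$$r = \frac{n\sum XY - \left(\sum X\right)\left(\sum Y\right)}{\sqrt{\left[n\sum X^{2} - \left(\sum X\right)^{2}\right]\left[n\sum Y^{2} - \left(\sum Y\right)^{2}\right]}}$$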

There are multiple summations in the formula and although it’s time consuming to solve, it is fairly straightforward if you break it down into steps. Note that there are two summations of X in the formula:

ΣX^{2}, which means to square the x-values and add them all up

and

(ΣX)^{2}, which means to add up all of the x-values and then square.
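The difference between the two is easy to see with numbers. The Python sketch below reuses the x-values 1, 2, 5, 7, 9 from the first example, then plugs the individual sums into the computational form of Pearson’s correlation coefficient. The y-values are hypothetical, chosen to lie exactly on a straight line so the result is easy to check by hand. The function name `pearson_r` is also an assumption for this illustration.

```python
import math

x = [1, 2, 5, 7, 9]            # x-values from the first example
y = [2 * xi + 1 for xi in x]   # hypothetical y-values lying on a straight line

sum_of_squares = sum(xi ** 2 for xi in x)  # ΣX²: square first, then add
square_of_sum = sum(x) ** 2                # (ΣX)²: add first, then square

def pearson_r(xs, ys):
    """Pearson's r from the computational formula, one summation at a time."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)               # ΣX and ΣY
    sum_xy = sum(a * b for a, b in zip(xs, ys))   # ΣXY
    sum_x2 = sum(a ** 2 for a in xs)              # ΣX²
    sum_y2 = sum(b ** 2 for b in ys)              # ΣY²
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator

print(sum_of_squares)   # 160
print(square_of_sum)    # 576
print(pearson_r(x, y))  # 1.0
```

Computing each summation as its own named value mirrors the step-by-step approach: once ΣX, ΣY, ΣXY, ΣX² and ΣY² are known, the rest is arithmetic.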
