How to average collection of different standard deviations

I have a column with separately calculated values of standard deviation for groups with different sample sizes. How do I calculate the average of these standard deviation values?


Statistics websites (eg statology) give a formula like that below, but I cannot find an equivalent Function in Numbers - AVEDEV seems not to be the same calculation, even for the simpler average standard deviation of a group with identical sample sizes.


How can I do this please?

Mac Studio (2023)

Posted on Nov 10, 2023 12:09 PM

Reply
Question marked as Top-ranking reply

Posted on Nov 10, 2023 12:39 PM

Barring the existence of a function I couldn't find, this construct will give you that formula if your table looks like this one.


=SQRT(SUMPRODUCT('Sample size -1',Std dev squared)÷(SUM(Sample size)−COUNT(Sample size)))




11 replies

Nov 11, 2023 4:32 PM in response to Basilisk2

Two other ways to implement the equation you posted:



Formula 1 requires column D. Formula 2 requires column E.

Rows 5 and 6 are footer rows so are not included in the sums and counts.


Formula 1 =SQRT(SUM(D)÷(SUM(B)−COUNT(B)))

Formula 2 =SQRT(SUMPRODUCT(E,C,C)÷(SUM(B)−COUNT(B)))


Numbers (and Excel) have some statistical functions but are a far cry from actual statistical apps like R.

Nov 10, 2023 1:02 PM in response to Recycleur

That's very helpful, thanks. I was hoping there would be a Function rather than having to write the whole thing out longhand but I guess the dumbing down of Numbers when it merged into iPad territory meant complex mathematical and statistical functions were removed or not added.


PS the Std Dev of a sample size of one is zero... but I know you just wanted to put some numbers there for the purposes of the example and I do appreciate your help doing that.

Nov 10, 2023 1:59 PM in response to Wayne Contello

Thanks for adding your comment, I appreciate your help.


I want to calculate the average of the standard deviations in Column C. Statistically this is not a simple average such as a Mean or Median value. Anytime you calculate the average Standard Deviation (SD) you use one of two formulae. One (given above) is if the sample size is different for each SD value, the other is used when the sample sizes are identical for each SD value.


Standard Deviation is a measure of volatility that can be used in many situations where values are monitored from environmental monitoring, to engineering production, to stocks and bond returns, and many others. The formulae do not change, but sometimes you need to know the average of all the standard deviations to measure the effectiveness of a whole set of different values eg from different factories or machines measured the same day, or the volatility of a given portfolio given investment in different stocks and bonds. I am surprised it is not a built in function.


Nov 11, 2023 5:15 PM in response to Badunit

Thanks - although your graphic was a bit too small to read. I know these days there are useful stats packages such as R as well as one other popular one I can't remember the name of.


Back in 1978 my first year University Statistics course made us do all the work manually, F-Tests, Student's T test, Standard Deviation calculations, Chi-Squared, even Multiple Regression Analysis, and I don't think there were any dedicated stats programs available - at least, not for us Biologists. Heck, for computing we had to write code out one line at a time and send it in to be put onto punchcards to be fed into the mainframe and we'd get a result (or not) the next day.


So with Numbers I like the Functions, but am learning which ones to use all the time.

This thread has been closed by the system or the community team. You may vote for any posts you find helpful, or search the Community for additional answers.

How to average collection of different standard deviations

Welcome to Apple Support Community
A forum where Apple customers help each other with their products. Get started with your Apple Account.