I'm trying to do the following
I have a series
0 0 0 0 0 0 0 0 0 0 0 96 24 120 120 1 120 240 258 240 278 423 77 3
I want to eliminate outliers and calculate a new mean and standard deviation.
Using the following I was able to calculate the new mean without the outlier (in this case there is only one outlier => 423)
=SUMPRODUCT((V3:AS3<CP3+1.5*CN3)*(V3:AS3>CO3-1.5*CN3)*(V3:AS3))/(24-CQ3)
Where V3:AS3 contains the range above, CN3 is the Inter-Quartile range, CP3 is the third quartile, and CO3 is the first quartile.
For some reason if I try the following array formula, it fails for distributions that contain outliers and works for distributions that don't have outliers...
{=STDEV(IF(OR(V2:AS2>CP2+1.5*CN2,V2:AS2<CO2-1.5*CN2),"",V2:AS2)) }
What I'm trying to do here is return V2:AS2 only if they fit within the ranges defined above, else return ""... I have used similar tricks for others applications without any problem. What am I doing wrong?
For this same distribution, if I just want to count the number of outliers I use:
{=+SUM(IF(OR(V2:AS2<CO2-1.5*CN2,V2:AS2>CP2+1.5*CN2),1,0))}
In this case, the outlier count is 1 (which is obviously the 423 observation)
Any ideas?
I have a series
0 0 0 0 0 0 0 0 0 0 0 96 24 120 120 1 120 240 258 240 278 423 77 3
I want to eliminate outliers and calculate a new mean and standard deviation.
Using the following I was able to calculate the new mean without the outlier (in this case there is only one outlier => 423)
=SUMPRODUCT((V3:AS3<CP3+1.5*CN3)*(V3:AS3>CO3-1.5*CN3)*(V3:AS3))/(24-CQ3)
Where V3:AS3 contains the range above, CN3 is the Inter-Quartile range, CP3 is the third quartile, and CO3 is the first quartile.
For some reason if I try the following array formula, it fails for distributions that contain outliers and works for distributions that don't have outliers...
{=STDEV(IF(OR(V2:AS2>CP2+1.5*CN2,V2:AS2<CO2-1.5*CN2),"",V2:AS2)) }
What I'm trying to do here is return V2:AS2 only if they fit within the ranges defined above, else return ""... I have used similar tricks for others applications without any problem. What am I doing wrong?
For this same distribution, if I just want to count the number of outliers I use:
{=+SUM(IF(OR(V2:AS2<CO2-1.5*CN2,V2:AS2>CP2+1.5*CN2),1,0))}
In this case, the outlier count is 1 (which is obviously the 423 observation)
Any ideas?