statistics, standard deviation

This is the process of eliminating outliers. In fact this is essential before using statistical calculation because all inferential (deduction) is false if your datas don't eliminate exceptional cases: statistics are about common cases not exception. But statistic can be used to detect outliers. There are more or less complex procedure. For most simplest procedure datas above 3 standard deviations should be considered as outliers. Also 2 consecutive datas above two standard deviations should also be suspicious. This presuppose that the mother population is normal. If it is not the case transformation like log-normal should be applied before. If it is difficult to normalise at least it is possible to make groups and whatever the law th mean of a group even as small as 4 should follow rather well the normal law.

Quote from Gordon Gekko:

i am not a statistics expert at all, but statistics interest me...

there is something i'm thinking about....

say i have 10 numbers, but 2 are way different than the rest. for example, say 8 of the numbers are less than 4, but 2 are above 8:

2
3
11
2
3
3
2
1
9
1

what if i don't want to account for the 2 extreme numbers? or what if in another set of 10 numbers, the same thing happens, but 3 numbers are above 8 instead of 2?

this is what i'm getting at....

i suppose i could just eliminate the biggest 2 or 3 numbers every time. but then i was thinking if i could use standard deviation to help me here....

anyone know what i'm saying?!?
 
Back
Top