Share/Comment | |||
Tweet | |||
Feedback |
The choice of statistical analysis techniques for a data set depends on it's scale of measurement. Therefore it is critical to develop a clear understanding of the same.
Scale is defined as a graduated range of values forming a standard system for measuring or grading something. Scale of measurement refers to the properties of value assigned to an observation.
Sounds too abstract? Let us look closely at some statements around measurements - “my laptop bag weighs 2.5 Kgs”, “today's temperature is 12 degrees Fahrenheit”, “customer is highly satisfied”, and “she is married”. Notice the values here, viz. “2.5 Kgs”, “12 degree Fahrenheit”, “highly satisfied”, and “married”. You can easily visualize that each of these value have different properties.
The value “married” is a mere classification of marital status of a person. Such a value can at best be compared for “equality”; it will be absurd to say that “married” is greater (or possibly smaller) than “separated”! This type of scale is known as the nominal scale. This is the most primitive scale of measurement. Some of us may not even feel comfortable terming it as a scale of “measurement”!
The value “highly satisfied” clearly has an additional property of rank ordering. Therefore, you can compare such values for “greater/less than” also. In other words, we know that a “highly satisfied” customer is happier (i.e. greater) than a “satisfied” customer! This type of scale is known as the ordinal scale. While you can test these values for equality or greater/less than but there is no way to exactly quantify the difference between two ordinal scale values. For example, how do you quantify the difference between satisfied and highly satisfied customer?
Values in nominal and ordinal scale can be viewed as simple classification labels. Therefore, these values can be either “text” or “numeral”.
In case of the temperature scale, you can quantify the difference (or the interval) between two values. For example, the difference between 12 degrees and 22 degrees Fahrenheit is the same as the difference between 81 degrees and 91 degrees, which is equal to 10 degrees. Another such example is time scale. In fact we normally talk of time intervals only. Here again, the distance or interval between 1800 and 1810 is 10 years, which is the same as the distance between 2000 and 2010. But in both of these scales, there is no absolute or true “0”; the choice of “0” value is rather arbitrary - more of a matter of convenience or convention. For instance, there is no beginning of time! The lack of true or absolute “0”, makes it difficult to express ratios. It is rather impossible or funny. Can time in the year 2000 be visualized as 1.11(= 2000/1800) times of what it was there in the year 1800?! This type of scale is known as the interval scale.
The weight scale has all of the above properties and also has an absolute or true “0”! That is why it is natural to comprehend ratios here - 5.0Kgs is clearly twice as heavy as 2.5Kgs. This type of scale is known as the ratio scale.
Here is a summary of some of the important statistical techniques that you can apply on datasets belonging to the different scales of measurement:
Scale | Graphs & Procedures | Central Tendency | Dispersion |
---|---|---|---|
Nominal | Pie and Bar; Frequencies | Mode | Information Entropy(?) |
Ordinal | Pie and Bar; Frequencies | Mode, Median | Range, IQR, Percentile |
Interval | Pie, Bar, Histogram and Boxplot; Frequencies, Descriptive and Exploratory | Mode, Median, Arithmetic Mean | Range, IQR, Percentile, Standard Deviation |
Ratio | Pie, Bar, Histogram and Boxplot; Frequencies, Descriptive and Exploratory | Mode, Median, Arithmetic & Geometric Mean | Range, IQR, Percentile, Standard Deviation, Coefficient of Variation |
comments powered by Disqus
Commenting Guidelines
We hope the conversations that take place on “discover6sigma.org” will be constructive in context of the topic. To ensure the quality of the discussion stays in check, our moderators will review all the comments and may edit them for clarity and relevance.
The comments that are posted using fowl language, promotional phrases and are not relevant in the said context, may be deleted as per moderators discretion.
By posting a comment here, you agree to give “discover6sigma.org” the rights to use the contents of your comments anywhere.