Labeled scales (e.g., category, Likert, VAS) and invalid across-group comparisons: what we have learned from genetic variation in taste

https://doi.org/10.1016/S0950-3293(02)00077-0Get rights and content

Abstract

Direct comparisons of sensory or hedonic perceived intensities across individuals are impossible since we cannot share experiences. However, experiences can be compared indirectly if we can identify a standard assumed to be equal, on average, to groups compared. Sensory standards (i.e., magnitude matching) have proved very useful. Intensity adjectives/adverbs have also been used as standards (e.g., This tastes very strong to me; is it very strong to you?). We argue these labels often refer to experiences of different absolute intensity to the groups of interest making the comparisons invalid. An early solution rested on the assumption that the maximum intensity perceivable was equal for all sensory domains and individuals. Comparisons were then possible because all judgments could be made on a zero to maximum scale. We present data showing this assumption is not true.

Introduction

Variation in sensory or hedonic experience is relatively easy to measure within individuals. But it is extremely hard to measure across individuals or groups because we cannot share one another's experiences. Two psychophysical traditions have led to attempts to provide comparisons of perceived sensory or hedonic intensities across groups. The older tradition depends on scales of various types labeled with adjective/adverb intensity descriptors (e.g., weak, medium, very strong). The newer tradition derives from the work of S.S. Stevens on direct scaling methods. Both depend on the same logic. In order to make comparisons across groups, we must identify some kind of standard that can be assumed to be, on average, equally intense to each group. When comparisons are made through intensity descriptors, the standards are the descriptors themselves. Unfortunately, as we will discuss below, these often fail to reflect equivalent perceived intensities; this makes these scales invalid for across-group comparisons (see also Bartoshuk, 2002). On the other hand, the logic underlying comparisons using direct scaling methods has permitted what we believe to be valid across-group comparisons in some circumstances. Although we know of no stimulus likely to produce the same perceived intensity to all individuals, we can identify stimuli likely to have the same perceived intensity, on average, to particular groups (we need only assume that the modalities to be investigated and the standard modality are not related to one another for those groups).

Given the wide reliance on labeled scales, there would be considerable value in attempting to find ways to alter these scales to make them produce valid across-group comparisons. We believe that increasing insight into our use of intensity descriptors may suggest such ways. These can then be tested against scaling that uses sensory standards. A combination of these approaches may provide the most powerful method of all.

Section snippets

Adjective/adverb-labeled scales used to compare perceived intensities across subjects/groups

In everyday language we compare sensations using intensity adjectives (e.g., This tastes strong to me. Does it taste strong to you?) along with adverbs to modify the adjectives (e.g., This tastes very strong to me. Does it taste very strong to you?). The validity of such comparisons depends on the assumption that these intensity descriptors refer to the same absoloute intensities to all.

One of the most ancient scales dates to the astronomer Hipparchus (190–120 BC) who classified stars into six

Labeled scales and erroneous across-group comparisons

Adjective/adverb labeled scales can obviously be used for within-subject comparisons. They can also be used for across-group comparisons when the members of the groups are randomly assigned; this insures that there will be no difference, on average, in the way the groups use the labels. However, these scales have come to be used to compare groups that differ in systematic ways (e.g., sex, age, clinical status). This usage rests on the implicit assumption that the intensity labels refer to the

Stimuli as standards: magnitude matching

S.S. Stevens, 1956, Stevens, 1957 revolutionized psychophysics in the 1950s by introducing direct scaling methods. The most popular of these, magnitude estimation, directed subjects to assign numbers to sensations such that one sensation twice as intense as another is assigned a number twice as large, etc. Stevens devised this method to produce scales with ratio properties, a highly desirable attribute. However, it is important to note that magnitude estimate data can only show the rate at

Genetic variation in taste

Fox (1931) discovered taste blindness to PTC (phenylthiocarbamide). About 25% of the US population cannot taste PTC (nontasters); the other 75% taste it as bitter. Family studies (Blakeslee, 1932, Snyder, 1931) suggested that nontasting was a simple Mendelian recessive. Very early in the history of PTC studies, threshold measures became the gold standard (Harris & Kalmus, 1949) [See Fernberger (1932) for a rare suprathreshold study from that era]. The PTC threshold distribution was essentially

Magnitude estimation and across-group comparisons: Borg/Teghtsoonian range theory

Stevens was interested in comparing sensory modalities; he was not interested in making comparisons across subjects or groups. Thus he was not concerned that magnitude estimates cannot be compared across subjects. Borg (1961), who was concerned and sought solutions to this problem, suggested that the range from no sensation to maximal sensation within sensory modalities was equal for all subjects. He initially cautioned that this assumption might not be “suitable for all sense modalities.”

General Labeled Magnitude Scale (gLMS)

It is important to note that the addition of adjective/adverb descriptors to ratio scales does not solve the problem of the relativity of the descriptors across context and experience. When Green and his colleagues constructed the LMS, they anchored the top of the scale to oral sensations including pain assuming that oral pain would be equivalent, on average, across groups of subjects. However, the intensities of sensations produced by irritants placed on the tongue are linked to tongue

Experiment

Data were collected from attendees (20–79 years old, inclusive) at lectures given by LMB (data collected prior to the lecture). Attendees filled out a questionnaire, tasted a piece of candy rating it for sweetness and tasted PROP paper rating it for bitterness. All ratings were made on the gLMS.

Results

Fig. 2 shows the most intense remembered sensations. ANOVA for the subjects rating all nine maximal sensations showed a highly significant effect of quality of sensation. Tukey HSD post-hoc tests (shown in Fig. 2) showed that the strongest flower smell was the least intense and sound the most intense of those sensory qualities tested for this group of subjects. Some subjects (N=62) rated two or more modalities at the same maximum. Of attendees who had only one sensation at the maximum (N=221)

Maximal perceived intensities are not the same across modalities

The results of the present study demonstrate that, contrary to the assumptions of range theory, maximum perceived intensities vary across modalities. Previous studies have shown that nontasters, medium tasters and supertasters live in different oral sensory worlds. The correlations of remembered sensations with PROP paper bitterness show that these differences can even be demonstrated with remembered sensations. Given the stability of spacing among intensity adjectives, this supports the

Acknowledgements

Portions of these data were presented at the 2001 annual meetings of the Association for Chemoreception Sciences (Fast et al., 2001) and the Society for the Study of Ingestive behavior (Bartoshuk et al., 2001). Supported by NIH (DC00283 and AG18619) and NRICGP/USDA (9603745).

References (123)

  • J.I. Nagy et al.

    The nature of the substance P-containing nerve fibers in taste papillae of the rat tongue

    Neuroscience

    (1982)
  • D.D. Price et al.

    The validation of visual analogue scales as ratio scale measures for chronic and experimental pain

    Pain

    (1983)
  • J. Prutkin et al.

    Genetic variation and inferences about perceived taste intensity in mice and men

    Physiology and Behavior

    (2000)
  • R.C.B. Aitken

    Measurement of feelings using visual analogue scales

    Proceedings of the Royal Society of Medicine

    (1969)
  • R.C.B. Aitken et al.

    Distraction from flashing lights

    Aerospace Medicine

    (1963)
  • L.M. Bartoshuk

    Bitter taste of saccharinRelated to the genetic ability to taste the bitter substance 6-n-propylthiouracil (PROP)

    Science

    (1979)
  • L.M. Bartoshuk

    Sweetnesshistory, preference, and genetic variability

    Food Technology

    (1991)
  • L.M. Bartoshuk

    Comparing sensory experiences across individualsrecent psychophysical advances illuminate genetic variation in taste perception

    Chemical Senses

    (2000)
  • L.M. Bartoshuk

    Self reports and across-group comparisonsa way out of the box

    APS Observer

    (2002)
  • L.M. Bartoshuk

    Genetic and pathological taste variation

  • L.M. Bartoshuk et al.

    Do taste-trigeminal interactions play a role in oral pain?

    Chemical Senses

    (1996)
  • L.M. Bartoshuk et al.

    PROP supertasters and the perception of ethyl alcohol

    Chemical Senses

    (1993)
  • L.M. Bartoshuk et al.

    Sensory variability, food preferences and BMI in non-, medium and supertasters

    Appetite

    (1999)
  • L.M. Bartoshuk et al.

    The General Labeled Magnitude Scale provides valid measures of genetic variation in taste and may be a universal psychophysical ruler

    Appetite

    (2001)
  • Bartoshuk, L. M., Duffy, V. B., Fast, K., Green, B. G., & Snyder, D. J. (in press). Hormones, age, genes and pathology:...
  • L.M. Bartoshuk et al.

    PROP (6-n-propylthiouracil) supertasters and the saltiness of NaCl

  • L.M. Bartoshuk et al.

    Magnitude matching and a modified LMS produce valid sensory comparisons for PROP studies

    Appetite

    (2000)
  • L.M. Bartoshuk et al.

    PROP supertasters and the perception of sweetness and bitterness

    Chemical Senses

    (1992)
  • L.M. Bartoshuk et al.

    Bitterness of KCl and benzoaterelated to genetic status for sensitivity to PTC/PROP

    Chemical Senses

    (1988)
  • H. Berry et al.

    Treatment of rheumatoid arthritis

    Clinical Trials Journal

    (1972)
  • M. Biernat et al.

    Shifting standards and stereotype-based judgements

    Journal of Personality and Social Psychology

    (1994)
  • M.H. Birnbaum

    How to show tht 9>221collect judgements in a between-subjects design

    Psychological Methods

    (1999)
  • A.F. Blakeslee

    Genetics of sensory thresholdstaste for phenyl thio carbamide

    Proceedings of the National Academy of Sciences

    (1932)
  • G. Borg

    Interindividual scaling and perception of muscular force

    Kungl. Fysiografiska Sällskapets I Lund Förhandlingar

    (1961)
  • G. Borg

    A category scale with ratio properties for intermodal and interindividual comparisons

  • G. Borg

    A “fixed star” for interprocess comparisons

  • G. Borg

    Psychophysical scaling: an overview

  • H. Champney

    The measurement of parent behavior

    Child Development

    (1941)
  • A.K. Chapo et al.

    Salt intensity and behaviors: associations with bitterness of 6-n-propylthiouracil (PROP)

    Chemical Senses

    (2001)
  • A.K. Chapo et al.

    Sodium chloride (NaCl) saltinessare older females more responsive

    The Gerontologist

    (2001)
  • P.R.F. Clarke et al.

    Reliability and sensitivity in te self-assessment of well-being

    Bulletin of the British Psychological Society

    (1964)
  • N. Cliff

    Adverbs as multipliers

    Psychological Review

    (1959)
  • V.B. Duffy et al.

    Sensory factors in feeding

  • V.B. Duffy et al.

    Supertasters of PROP (6-n-propylthiouracil) rate the highest creaminess to high-fat milk products

    Chemical Senses

    (1996)
  • V.B. Duffy et al.

    Bitterness of 6-n-propylthiouracil (PROP) associates with bitter sensations and intake of vegetables

    Appetite

    (2001)
  • V.B. Duffy et al.

    Genetic taste status associates with fat food acceptance and body mass index in adults

    Chemical Senses

    (1999)
  • V.B. Duffy et al.

    Genetic variation in tasteassociations with alcohol sensation and intake

    Chemical Senses

    (2000)
  • Fast, K., Duffy, V. B., & Bartoshuk, L. M. (in press). New psychophysical insights in evaluating genetic variation in...
  • K. Fast et al.

    Remembered intensities of taste and oral burn correlate with PROP bitterness

    Chemical Senses

    (2001)
  • S.W. Fernberger

    A preliminary study of taste deficiency

    American Journal of Psychology

    (1932)
  • Cited by (234)

    View all citing articles on Scopus
    View full text