Limitations of Norms in Psychological Testing
Tests that are norm-referenced provide a number of benefits over non-norm-referenced tests. Psychological tests enable the gathering of valuable information about individual functioning for many different areas. Most norm-referenced tests are relatively quick to administer, such that a psychologist can obtain a sampling of behavior with a small investment of time and resources. A primary advantage of psychological testing is that rich and detailed information is revealed through the testing that would otherwise be unavailable to the psychologist. However, norm-referenced tests are far from perfect and the quality, reliability, and validity of norm-referenced tests varies substantially in some very important ways.
A number of assumptions are important to the construction of norms. The characteristic being measured must accommodate the ordering of individuals from low to high along an asymmetrical continuum that should at least be ordinal (Angoff, 1984). In addition, the relation of the scores must be transitive. That is to say that the mathematical definition of transitive states that: If a condition "applies between two successive members of a sequence, it must also apply between any two members taken in order," such that, for example, "if A is larger than B, and B. is larger than C, then A is larger than C" ("Transitive," n.d.) (Angoff, 1984). The operational definition of the characteristic being measured must be reasonably clear and valid to a degree that it yields similar orderings of the characteristic in the individuals (Angoff, 1984). The range of scores for a characteristic must all evaluate that same characteristic (Angoff, 1984). There must be a good match between the group(s), the target characteristics, and the test design and purpose (Angoff, 1984). Norms are meaningful and useful only to the extent that they have carefully defined. The norms population must be appropriate to the subject being tested and to the test; the challenge is to define the concept of appropriateness without conflating it with the concept of difficulty (Angoff, 1984). This means that a test or a subject can be difficult for many of the test takers, yet the test or the subject can still be considered appropriate for that population of test takers (Angoff, 1984).
Normative data should be developed for each distinct norms population for which it is meaningful to make comparisons with individuals or the group (Angoff, 1984). The test items themselves must be subject to pilot testing in which the data about the test items is drawn from samples of the population for which the test is being developed; that is to say, for the groups for which the norms will be provided (Angoff, 1984). Populations that serve as the basis for a set of norms should evidence homogeneity (Angoff, 1984). This means that all the individual are clearly members of the group and are logical and/or actual "competitors" in the same arena (Angoff, 1984).
Overview of Norms in Psychological Testing
A variety of norms exist, including the following: National norms, local norms, age and grade equivalents, item norms, school mean norms, user-selected norms, special study norms, and norms that yield direct meaning. This discussion centers on norms that are used for psychological tests, for which the following section provides an overview of how norms are developed.
Standardization samples are generated for psychological tests so that tests can be referenced to a normal distribution that is used to compare scores on specific future tests. Standardization relies on the creation of a large sample of test takers who are representative of the larger population for which the test is being developed. This standardization sample is referred to as the norm group or norming group. The raw scores of a sample group are converted into percentiles, which can be associated with a constructed normal distribution that will be used to rank the relative standing of individuals who take the test some time in the future.
Norms function as frames of reference for the interpretation of test scores, but norms are not performance standards or clinical ideals. The size of norm groups varies widely, ranging from just a few hundred up to a hundred thousand people. As with other types of samples, the more individuals that are included in the norm group, the closer the sample is to an approximation of a normal population distribution. Moreover, normative data illustrates how the dimensions of major population subgroups differ and the extent to which test variables are associated with the population classifications.
Limitations of Norms on Psychological Test Interpretation
Norms are developed according to the assumptions that underlie psychological testing and in accordance with criteria from a variety of sources. That several different authoritative sources are involved in the determination of criteria...
Our semester plans gives you unlimited, unrestricted access to our entire library of resources —writing tools, guides, example essays, tutorials, class notes, and more.
Get Started Now