Abstract
Since the early 1900s, near the time of Binet’s first offering, intelligence tests have been scrutinized for cultural biases in a variety of forms. Indeed, the issues of bias (or its potential) in psychological testing have been a source of recurring, characteristically intense, social controversy throughout the history of mental measurement (e.g., see Reynolds & Brown, 1984, for a review of early to mid-1900s controversies). Discussions of cultural bias in tests, especially aptitude and ability measures such as are common to neuropsychological examinations, frequently are accompanied by strongly emotion-laden polemics decrying the use of mental tests with members of ethnic minorities. Courts, legislatures, and the media have all become involved in the questions surrounding potential cultural bias in testing (e.g., see Brown, Reynolds, & Whitaker, in press; Elliott, 1987; Spitz, 1986).
This chapter is based substantively on a number of prior works of the author, most notably Reynolds (l982a), Reynolds and Kaiser (1992), Reynolds (1999), and Reynolds, Lowe, and Saenz (1999).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adebimpe, V. R., Gigandet, J., & Harris, E. (1979). MMPI diagnosis of black psychiatric patients. American Journal of Psychiatry, 136, 85–87.
Ackerman, T. A. (1991). A didactic explanation of item bias, item impact, and item validity from a multidimensional perspective. Journal of Educational Measurement, 29(1), 67–91.
Alley, G., & Foster, C. (1978). Nondiscriminatory testing of minority and exceptional children. Focus on Exceptional Children, 9, 1–14.
American Psychiatric Association. (1994). Diagnostic and statistical manual of mental disorders, 4th ed: DSM-IV. Washington, DC: TAuthor.
Anastasi, A. (1982). Psychological testing (5th ed.). New York: Macmillan.
Angoff, W. H., & Ford, S. R. (1973). Item-race interaction on a test of scholastic aptitude. Journal of Educational Measurement, 10, 95–106.
Ardila, A., Rosselli, M., & Puente, A. E. (1994). Neuropsychological evaluation of the Spanish speaker. New York: Plenum.
Bond, L. (1987). The golden rule settlement: A minority perspective. Educational Measurement: Issues and Practice, 6(2), 23–25.
Brown, R. T., Reynolds, C. R., & Whitaker, J. S. (in press). Bias in mental testing since “Bias in Mental Testing.” School Psychology Quarterly.
Bruning, J., & Kintz, B. (1968). Computational handbook of statistics. Glenview, IL: Scott, Foresman.
Burrill, L. E. (1975). Statistical evidence of potential bias in items and tests assessing current educational status. Paper presented at the annual meeting of the Southeastern Conference on Measurement in Education, New Orleans, LA.
Butler-Omololu, C., Doster, J., & Lahey, B. (1984). Some implications for intelligence test construction and administration with children of different racial groups. Journal of Black Psychology, 10(2), 63–75.
Camilli, G., & Shepard, L. A. (1987). The inadequacy of ANOVA for detecting test bias. Journal of Educational Statistics, 12, 87–99.
Camilli, G., & Shepard, L. A. (1994). Methods for identifying biased test items. Thousand Oaks, CA: Sage.
Campbell, D. F., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56, 85–105.
Cardall, C., & Coffman, W. E. (1964). A method of comparing the performance of different groups on the items in a test (RB-64-61). Princeton, NJ: Educational Testing Service.
Cattell, R. B. (1978). The scientific use of factor analysis in behavioral and life sciences. New York: Plenum.
Cattell, R. B., & Baggaley, A. R. (1960). The salient variable similarity index for factor matching. British Journal of Statistical Psychology, 13, 33–46.
Cattell, R. B., Coulter, M. A., & Tsujioka, B. (1966). The taxonomic recognition of types and functional emergents. In R. B. Cattell (Ed.), Handbook of multivariate experimental psychology (pp. 288–329). Chicago: Rand McNally.
Chambers, J. S., Barron, F., & Sprecher, J. W. (1980). Identifying gifted Mexican-American students. Gifted Child Quarterly, 24, 123–128.
Chinn, P. C. (1979). The exceptional minority child: Issues and some answers. Exceptional Children, 46, 532–536.
Chipman, S., Marshall, S., & Scott, P. (1991). Content effect on word-problem performance: A possible source of test bias? American Educational Research Journal, 28, 897–915.
Clarizio, H. (1982). Intellectual assessment of Hispanic children. Psychology in the Schools, 79(1), 61–71.
Cleary, T. A., Humphreys, L. G., Kendrick, S. A., & Wesman, A. (1975). Educational uses of tests with disadvantaged students. American Psychologist, 30, 15–41.
Cronbach, L. J. (1990). Essentials of psychological testing (5th ed.). New York: Harper & Row.
Dana, R. H. (1996). Culturally competent assessment practices in the United States. Journal of Personality Assessment, 66, 472–487.
Elliott, R. (1987). Litigating intelligence. Dover, MA: Auburn House.
Emerling, F. (1990). An investigation of test bias in nonverbal cognitive measures for two ethnic groups. Journal of Psychoeducational Assessment, 8(1), 34–41.
Feldt, L. S. (1969). A test of the hypothesis that Cronbach’s alpha or Kuder-Richardson coefficient twenty is the same for two tests. Psychometrika, 34, 363–373.
Flaugher, R. L. (1978). The many definitions of test bias. American Psychologist, 33, 671–679.
Frisby, C., & Braden, J. (Eds.). (in press). Bias in mental testing. A special, topical issue of School Psychology Quarterly.
Gray-Little, B., & Kaplan, D. A. (1998). Interpretation of psychological tests in clinical and forensic evaluations. In J. Sandoval et al. (Eds.), Test interpretation and diversity (pp. 141–178). Washington, DC: American Psychological Association.
Greenlaw, R., & Jensen, S. (1996). Race norming and the Civil Rights Act of 1991. Public Personnel Management, 25(1), 13–24.
Grove, W., & Meehl, P. A. (1998). Comparative efficiency of informal (subjective, impressionistic) and formal (mechanical, algorithmic) prediction procedures: The clinical-statistical controversy. Psychology, Public Policy, and Law, 2, 293–323.
Guilford Press. (1997). Culturally sensitive assessment: Paying attention to cultural orientation. Child Assessment News, 6, 8–12.
Gulliksen, H., & Wilks, S. S. (1950). Regression tests for several samples. Psychometrika, 15, 91–114.
Gutkin, T. B., & Reynolds, C. R. (1981). Factorial similarity of the WISC-R for white and black children from the standardization sample. Journal of Educational Psychology, 73, 227–231.
Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage.
Hammill, D. (1991). Detroit tests of learning aptitude (3rd ed.). Austin, TX: PRO-ED.
Harman, H. (1976). Modern factor analysis (2nd ed.). Chicago: University of Chicago Press.
Harrington, G. M. (1975). Intelligence tests may favor the majority groups in a population. Nature, 25(8), 708–709.
Harrington, G. M. (1976, September). Minority test bias as a psychometric artifact: The experimental evidence. Paper presented at the annual meeting of the American Psychological Association, Washington, DC.
Helms, J. E. (1992). Why is there no study of cultural equivalence in standardized cognitive ability testing? American Psychologist, 47, 1083–1101.
Hilliard, A. G. (1979). Standardization and cultural bias as impediments to the scientific study and validation of “intelligence.” Journal of Research and Development in Education, 12, 47–58.
Humphreys, L. G. (1973). Statistical definitions of test validity for minority groups. Journal of Applied Psychology, 58, 1–4.
Isern, M. (1986). An investigation of bias in tests of writing ability for bilingual Hispanic college students. Doctoral dissertation, University of Miami. Dissertation Abstracts International, 47, 2135A.
Jackson, G. D. (1975). Another view from the Association of Black Psychologists. American Psychologist, 30, 88–93.
Jensen, A. R. (1974). How biased are cultural loaded tests? Genetic Psychology Monographs, 90, 185–224.
Jensen, A. R. (1976). Test bias and construct validity. Phi Delta Kappan, 58, 340–346.
Jensen, A. R. (1977). An examination of culture bias in the Wonderlic Personnel test. Intelligence, 1, 51–64.
Jensen, A. R. (1980). Bias in mental testing. New York: Free Press.
Jöreskog, K. G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34, 183–202.
Jöreskog, K. G. (1971). Simultaneous factor analysis in several populations. Psychometrika, 36, 409–426.
Jörsekog, K. G., & Sorbom, D. (1989). LISREL 7: A guide to the program and applications. Mooresville, IN: Scientific Software.
Judd, C. M., & McClelland, G. H. (1989). Data analysis: A model comparison approach. San Diego, CA: Harcourt Brace Jovanovich.
Kaiser, H., Hunka, S., & Bianchini, J. (1971). Relating factors between studies based upon different individuals. Multivariate Behavioral Research, 6, 409–422.
Katzenmeyer, W. G., & Stenner, A. J. (1977). Estimation of the invariance of factor structures across race and sex with implications for hypothesis testing. Educational and Psychological Measurement, 37, 111–119.
Kaufman, A. S. (1979). Intelligent testing with the WISC-R. New York: Wiley-Interscience.
Keith, T. Z., & Reynolds, C. R. (1990). Measurement and design issues in child assessment research. In C. R. Reynolds & R. W. Kamphaus (Eds.), Handbook of psychological and educational assessment of children (pp. 29–61). New York: Guilford.
Linn, R. L., & Werts, C. E. (1971). Considerations for studies of test bias. Journal of Educational Measurement, 8, 1–4.
Lonner, W. J. (1985). Issues in testing and assessment in cross-cultural counseling. The Counseling Psychologist, 13, 599–614.
Mayfield, J. W., & Reynolds, C. R. (1997). Black-white differences in memory test performance among children and adolescents. Archives of Clinical Neuropsychology, 12, 111–122.
McGaw, B., & Jöreskog, K. G. (1971). Factorial invariance of ability measures in groups differing in intelligence and socioeconomic status. British Journal of Mathematical and Statistical Psychology, 24, 154–168.
McGurk, F. V. J. (1951). Comparison of the performance of Negro and white high school seniors on cultural and noncultural psychological test questions. Washington, DC: Catholic University of America Press.
Mercer, J. R. (1976, August). Cultural diversity, mental retardation, and assessment: The case for nonlabeling. Paper presented to the Fourth International Congress of the International Association for the Scientific Study of Mental Retardation, Washington, DC.
Miele, F. (1979). Cultural bias in the WISC. Intelligence, 3, 149–164.
Mitrushina, M. N., Boone, K. B., & D’Elia, L. F. (1999). Handbook of normative data for neuropsychological assessment. Oxford: Oxford University Press.
Mulaik, S. A. (1972). The foundation of factor analysis. New York: McGraw-Hill.
Nandakumar, R., Glutting, J. J., & Oakland, T. (1993). Mantel-Haenszel methodology for detecting item bias: An introduction and example using the guide to the assessment of test session behavior. Journal of Psychoeducational Assessment, 11(2), 108–119.
Padilla, A. M. (1988). Early psychological assessment of Mexican American children. Journal of the History of the Behavioral Sciences, 24, 113–115.
Payne, B., & Payne, D. (1991). The ability of teachers to identify academically at-risk elementary students. Journal of Research in Childhood Education, 5(2), 116–126.
Pedhazur, E. J., & Schmelkin, L. P. (1991). Measurement, design, and analysis. Hillsdale, NJ: Erlbaum.
Potthoff, R. F. (1966). Statistical aspects of the problem of biases in psychological tests (Institute of Statistics Mimeo Series No. 479). Chapel Hill: Department of Statistics, University of North Carolina.
Reschley, D. (2000). PASE v. Hannon. In C. R. Reynolds & E. Fletcher-Janzen (Eds.), Encyclopedia of special education (2nd ed., pp. 1325–1326). New York: Wiley.
Reynolds, C. R. (1980a). In support of “Bias in Mental Testing” and scientific inquiry. Behavioral and Brain Sciences, 3, 352.
Reynolds, C. R. (1980b). Differential construct validity of intelligence as popularly measured: Correlations of age with raw scores on the WISC-R for blacks, whites, males, and females. Intelligence, 4, 371–379.
Reynolds, C. R. (1980c). An examination for bias in a preschool battery across race and sex. Journal of Educational Measurement, 17, 137–146.
Reynolds, C. R. (1982a). Construct and predictive bias. In R. A. Berk (Ed.), Handbook of methods for detecting test bias (pp. 199–227). Baltimore, MD: Johns Hopkins University Press.
Reynolds, C. R. (1982b). The problem of bias in psychological assessment. In C. R. Reynolds & T. B. Gutkin (Eds.), The handbook of school psychology (pp. 178–208). New York: Wiley.
Reynolds, C. R. (1997). Measurement and statistical problems in neuropsychological assessment of children. In C. R. Reynolds & E. Fletcher-Janzen (Eds.), Handbook of child clinical neuropsychology (pp. 180–203). New York: Plenum.
Reynolds, C. R. (1998). Need we measure anxiety differently for males and females. Journal of Personality Assessment, 70, 212–221.
Reynolds, C. R. (1999a). Cultural bias in testing of intelligence and personality. In A. Beilack, M. Hersen (Series Eds.), & C. Celar (Vol. Ed.), Comprehensive clinical psychology: Vol. 10: Sociocultural and individual differences (pp. 53–92). New York: Pergamon.
Reynolds, C. R. (1999b). Fundamentals of measurement and assessment in psychology. In A. Beilack, M. Hersen (Series Eds.), & C. R. Reynolds (Vol. Ed.), Comprehensive clinical psychology: Vol. 4: Assessment (pp. 33–56). New York: Pergamon.
Reynolds, C. R. (in press). Why do we ignore research on bias in mental testing? Psychology, Public Policy, and Law.
Reynolds, C. R., & Bigler, E. D. (1994). Test of memory and learning. Austin, TX: PRO-ED.
Reynolds, C. R., & Brown, R. T. (1984). Bias in mental testing: An introduction to the issues. In C. R. Reynolds & R. T. Brown (Eds.), Perspectives on bias in mental testing (pp. 1–39). New York: Plenum.
Reynolds, C. R., Chastain, R., Kaufman, A. S., & McLean, J. (1987). Demographic influences on adult intelligence at ages 16 to 74 years. Journal of School Psychology, 25, 323–342.
Reynolds, C. R., & Harding, R. E. (1983). Outcome in two large sample studies of factorial similarity under six methods of comparison. Educational and Psychological Measurement, 43, 723–728.
Reynolds, C. R., & Kaiser, S. (1992). Test bias in psychological assessment. In T. B. Gutkin & C. R. Reynolds (Eds.), The handbook of school psychology (2nd ed., pp. 487–525). New York: Wiley.
Reynolds, C. R., Lowe, P. A., & Saenz, A. (1999). The problem of bias in psychological assessment. In T. B. Gutkin & C. R. Reynolds (Eds.), The handbook of school psychology (3rd ed., pp. 549–595). New York: Wiley.
Reynolds, C. R., & Paget, K. D. (1981). Factor structure of the revised Children’s Manifest Anxiety Scale for blacks, whites, males, and females with a national normative sample. Journal of Consulting and Clinical Psychology, 49, 352–359.
Reynolds, C. R., & Streur, J. (1982). Comparative structure of the WISC-R for emotionally disturbed and normal children. The Southern Psychologist, 1, 27–35.
Reynolds, C. R., Willson, V. L., & Chatman, S. P. (1984). Item bias on the 1981 revision of the Peabody Picture Vocabulary Test using a new method of detecting bias. Journal of Psychoeducational Assessment, 2, 219–221.
Sandoval, J., & Miller, M. (1979). Accuracy judgements of WISC-R item difficulty for minority groups. Paper presented to the annual meeting of the American Psychological Association.
Schmidt, W. H. (1983). Content biases in achievement tests. Journal of Educational Measurement, 20, 165–178.
Spitz, H. (1986). The raising of intelligence. Hillsdale, NJ: Erlbaum.
Stricker, L. J. (1982). Identifying test items that perform differentially in population subgroups: A partial correlation index. Applied Psychological Measurement, 6, 261–273.
Thissen, D., Steinberg, L., & Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In P. W. Holland & H. Wainer (Eds.), Differential item functioning: Theory and practice (pp. 67–113). Hillsdale, NJ: Erlbaum.
Thorndike, R. L. (1971). Concepts of culture-fairness. Journal of Educational Measurement, 8, 63–70.
Thorndike, R. M. (1978). Correlational procedures for research. New York: Gardner.
Timm, N. H. (1975). Multivariate analysis with applications in education and psychology. Monterey, CA: Brooks/Cole.
Veale, J. R., & Foreman, D. F. (1983). Assessing cultural bias using foil response data: Cultural variation. Journal of Educational Measurement, 20, 249–258.
Williams, R. L. (1974). From dehumanization to black intellectual genocide: A rejoinder. In G. J. Williams & S. Gordon (Eds.), Clinical child psychology: Current practices and future perspectives. New York: Behavioral.
Willson, V. L., Nolan, R. F., Reynolds, C. R., & Kamphaus, R. W. (1989). Race and gender effects on item functioning on the Kaufman Assessment Battery for Children. Journal of School Psychology, 27, 289–296.
Wright, B. J., & Isenstein, V. R. (1977, reprinted 1978). Psychological tests and minorities [DHEW Pub. No. (ADM) 78-482]. Rockville, MD: National Institute of Mental Health, Department of Health, Education and Welfare.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer Science+Business Media New York
About this chapter
Cite this chapter
Reynolds, C.R. (2000). Methods for Detecting and Evaluating Cultural Bias in Neuropsychological Tests. In: Fletcher-Janzen, E., Strickland, T.L., Reynolds, C.R. (eds) Handbook of Cross-Cultural Neuropsychology. Critical Issues in Neuropsychology. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-4219-3_15
Download citation
DOI: https://doi.org/10.1007/978-1-4615-4219-3_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-6894-6
Online ISBN: 978-1-4615-4219-3
eBook Packages: Springer Book Archive