Methods for Detecting and Evaluating Cultural Bias in Neuropsychological Tests

Reynolds, Cecil R.

doi:10.1007/978-1-4615-4219-3_15

Cecil R. Reynolds

Part of the book series: Critical Issues in Neuropsychology (CINP)


Abstract

Since the early 1900s, near the time of Binet’s first offering, intelligence tests have been scrutinized for cultural biases in a variety of forms. Indeed, the issues of bias (or its potential) in psychological testing have been a source of recurring, characteristically intense, social controversy throughout the history of mental measurement (e.g., see Reynolds & Brown, 1984, for a review of early to mid-1900s controversies). Discussions of cultural bias in tests, especially aptitude and ability measures such as are common to neuropsychological examinations, frequently are accompanied by strongly emotion-laden polemics decrying the use of mental tests with members of ethnic minorities. Courts, legislatures, and the media have all become involved in the questions surrounding potential cultural bias in testing (e.g., see Brown, Reynolds, & Whitaker, in press; Elliott, 1987; Spitz, 1986).

This chapter is based substantively on a number of prior works of the author, most notably Reynolds (1982a), Reynolds and Kaiser (1992), Reynolds (1999), and Reynolds, Lowe, and Saenz (1999).


References

Adebimpe, V. R., Gigandet, J., & Harris, E. (1979). MMPI diagnosis of black psychiatric patients. American Journal of Psychiatry, 136, 85–87.
Ackerman, T. A. (1991). A didactic explanation of item bias, item impact, and item validity from a multidimensional perspective. Journal of Educational Measurement, 29(1), 67–91.
Alley, G., & Foster, C. (1978). Nondiscriminatory testing of minority and exceptional children. Focus on Exceptional Children, 9, 1–14.
American Psychiatric Association. (1994). Diagnostic and statistical manual of mental disorders, 4th ed: DSM-IV. Washington, DC: Author.
Anastasi, A. (1982). Psychological testing (5th ed.). New York: Macmillan.
Angoff, W. H., & Ford, S. R. (1973). Item-race interaction on a test of scholastic aptitude. Journal of Educational Measurement, 10, 95–106.
Ardila, A., Rosselli, M., & Puente, A. E. (1994). Neuropsychological evaluation of the Spanish speaker. New York: Plenum.
Bond, L. (1987). The golden rule settlement: A minority perspective. Educational Measurement: Issues and Practice, 6(2), 23–25.
Brown, R. T., Reynolds, C. R., & Whitaker, J. S. (in press). Bias in mental testing since “Bias in Mental Testing.” School Psychology Quarterly.
Bruning, J., & Kintz, B. (1968). Computational handbook of statistics. Glenview, IL: Scott, Foresman.
Burrill, L. E. (1975). Statistical evidence of potential bias in items and tests assessing current educational status. Paper presented at the annual meeting of the Southeastern Conference on Measurement in Education, New Orleans, LA.
Butler-Omololu, C., Doster, J., & Lahey, B. (1984). Some implications for intelligence test construction and administration with children of different racial groups. Journal of Black Psychology, 10(2), 63–75.
Camilli, G., & Shepard, L. A. (1987). The inadequacy of ANOVA for detecting test bias. Journal of Educational Statistics, 12, 87–99.
Camilli, G., & Shepard, L. A. (1994). Methods for identifying biased test items. Thousand Oaks, CA: Sage.
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56, 81–105.
Cardall, C., & Coffman, W. E. (1964). A method of comparing the performance of different groups on the items in a test (RB-64-61). Princeton, NJ: Educational Testing Service.
Cattell, R. B. (1978). The scientific use of factor analysis in behavioral and life sciences. New York: Plenum.
Cattell, R. B., & Baggaley, A. R. (1960). The salient variable similarity index for factor matching. British Journal of Statistical Psychology, 13, 33–46.
Cattell, R. B., Coulter, M. A., & Tsujioka, B. (1966). The taxonomic recognition of types and functional emergents. In R. B. Cattell (Ed.), Handbook of multivariate experimental psychology (pp. 288–329). Chicago: Rand McNally.
Chambers, J. S., Barron, F., & Sprecher, J. W. (1980). Identifying gifted Mexican-American students. Gifted Child Quarterly, 24, 123–128.
Chinn, P. C. (1979). The exceptional minority child: Issues and some answers. Exceptional Children, 46, 532–536.
Chipman, S., Marshall, S., & Scott, P. (1991). Content effect on word-problem performance: A possible source of test bias? American Educational Research Journal, 28, 897–915.
Clarizio, H. (1982). Intellectual assessment of Hispanic children. Psychology in the Schools, 19(1), 61–71.
Cleary, T. A., Humphreys, L. G., Kendrick, S. A., & Wesman, A. (1975). Educational uses of tests with disadvantaged students. American Psychologist, 30, 15–41.
Cronbach, L. J. (1990). Essentials of psychological testing (5th ed.). New York: Harper & Row.
Dana, R. H. (1996). Culturally competent assessment practices in the United States. Journal of Personality Assessment, 66, 472–487.
Elliott, R. (1987). Litigating intelligence. Dover, MA: Auburn House.
Emerling, F. (1990). An investigation of test bias in nonverbal cognitive measures for two ethnic groups. Journal of Psychoeducational Assessment, 8(1), 34–41.
Feldt, L. S. (1969). A test of the hypothesis that Cronbach’s alpha or Kuder-Richardson coefficient twenty is the same for two tests. Psychometrika, 34, 363–373.
Flaugher, R. L. (1978). The many definitions of test bias. American Psychologist, 33, 671–679.
Frisby, C., & Braden, J. (Eds.). (in press). Bias in mental testing. A special, topical issue of School Psychology Quarterly.
Gray-Little, B., & Kaplan, D. A. (1998). Interpretation of psychological tests in clinical and forensic evaluations. In J. Sandoval et al. (Eds.), Test interpretation and diversity (pp. 141–178). Washington, DC: American Psychological Association.
Greenlaw, R., & Jensen, S. (1996). Race norming and the Civil Rights Act of 1991. Public Personnel Management, 25(1), 13–24.
Grove, W., & Meehl, P. E. (1998). Comparative efficiency of informal (subjective, impressionistic) and formal (mechanical, algorithmic) prediction procedures: The clinical-statistical controversy. Psychology, Public Policy, and Law, 2, 293–323.
Guilford Press. (1997). Culturally sensitive assessment: Paying attention to cultural orientation. Child Assessment News, 6, 8–12.
Gulliksen, H., & Wilks, S. S. (1950). Regression tests for several samples. Psychometrika, 15, 91–114.
Gutkin, T. B., & Reynolds, C. R. (1981). Factorial similarity of the WISC-R for white and black children from the standardization sample. Journal of Educational Psychology, 73, 227–231.
Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage.
Hammill, D. (1991). Detroit tests of learning aptitude (3rd ed.). Austin, TX: PRO-ED.
Harman, H. (1976). Modern factor analysis (2nd ed.). Chicago: University of Chicago Press.
Harrington, G. M. (1975). Intelligence tests may favor the majority groups in a population. Nature, 258, 708–709.
Harrington, G. M. (1976, September). Minority test bias as a psychometric artifact: The experimental evidence. Paper presented at the annual meeting of the American Psychological Association, Washington, DC.
Helms, J. E. (1992). Why is there no study of cultural equivalence in standardized cognitive ability testing? American Psychologist, 47, 1083–1101.
Hilliard, A. G. (1979). Standardization and cultural bias as impediments to the scientific study and validation of “intelligence.” Journal of Research and Development in Education, 12, 47–58.
Humphreys, L. G. (1973). Statistical definitions of test validity for minority groups. Journal of Applied Psychology, 58, 1–4.
Isern, M. (1986). An investigation of bias in tests of writing ability for bilingual Hispanic college students. Doctoral dissertation, University of Miami. Dissertation Abstracts International, 47, 2135A.
Jackson, G. D. (1975). Another view from the Association of Black Psychologists. American Psychologist, 30, 88–93.
Jensen, A. R. (1974). How biased are culture-loaded tests? Genetic Psychology Monographs, 90, 185–224.
Jensen, A. R. (1976). Test bias and construct validity. Phi Delta Kappan, 58, 340–346.
Jensen, A. R. (1977). An examination of culture bias in the Wonderlic Personnel test. Intelligence, 1, 51–64.
Jensen, A. R. (1980). Bias in mental testing. New York: Free Press.
Jöreskog, K. G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34, 183–202.
Jöreskog, K. G. (1971). Simultaneous factor analysis in several populations. Psychometrika, 36, 409–426.
Jöreskog, K. G., & Sörbom, D. (1989). LISREL 7: A guide to the program and applications. Mooresville, IN: Scientific Software.
Judd, C. M., & McClelland, G. H. (1989). Data analysis: A model comparison approach. San Diego, CA: Harcourt Brace Jovanovich.
Kaiser, H., Hunka, S., & Bianchini, J. (1971). Relating factors between studies based upon different individuals. Multivariate Behavioral Research, 6, 409–422.
Katzenmeyer, W. G., & Stenner, A. J. (1977). Estimation of the invariance of factor structures across race and sex with implications for hypothesis testing. Educational and Psychological Measurement, 37, 111–119.
Kaufman, A. S. (1979). Intelligent testing with the WISC-R. New York: Wiley-Interscience.
Keith, T. Z., & Reynolds, C. R. (1990). Measurement and design issues in child assessment research. In C. R. Reynolds & R. W. Kamphaus (Eds.), Handbook of psychological and educational assessment of children (pp. 29–61). New York: Guilford.
Linn, R. L., & Werts, C. E. (1971). Considerations for studies of test bias. Journal of Educational Measurement, 8, 1–4.
Lonner, W. J. (1985). Issues in testing and assessment in cross-cultural counseling. The Counseling Psychologist, 13, 599–614.
Mayfield, J. W., & Reynolds, C. R. (1997). Black-white differences in memory test performance among children and adolescents. Archives of Clinical Neuropsychology, 12, 111–122.
McGaw, B., & Jöreskog, K. G. (1971). Factorial invariance of ability measures in groups differing in intelligence and socioeconomic status. British Journal of Mathematical and Statistical Psychology, 24, 154–168.
McGurk, F. V. J. (1951). Comparison of the performance of Negro and white high school seniors on cultural and noncultural psychological test questions. Washington, DC: Catholic University of America Press.
Mercer, J. R. (1976, August). Cultural diversity, mental retardation, and assessment: The case for nonlabeling. Paper presented to the Fourth International Congress of the International Association for the Scientific Study of Mental Retardation, Washington, DC.
Miele, F. (1979). Cultural bias in the WISC. Intelligence, 3, 149–164.
Mitrushina, M. N., Boone, K. B., & D’Elia, L. F. (1999). Handbook of normative data for neuropsychological assessment. Oxford: Oxford University Press.
Mulaik, S. A. (1972). The foundations of factor analysis. New York: McGraw-Hill.
Nandakumar, R., Glutting, J. J., & Oakland, T. (1993). Mantel-Haenszel methodology for detecting item bias: An introduction and example using the guide to the assessment of test session behavior. Journal of Psychoeducational Assessment, 11(2), 108–119.
Padilla, A. M. (1988). Early psychological assessment of Mexican American children. Journal of the History of the Behavioral Sciences, 24, 113–115.
Payne, B., & Payne, D. (1991). The ability of teachers to identify academically at-risk elementary students. Journal of Research in Childhood Education, 5(2), 116–126.
Pedhazur, E. J., & Schmelkin, L. P. (1991). Measurement, design, and analysis. Hillsdale, NJ: Erlbaum.
Potthoff, R. F. (1966). Statistical aspects of the problem of biases in psychological tests (Institute of Statistics Mimeo Series No. 479). Chapel Hill: Department of Statistics, University of North Carolina.
Reschly, D. (2000). PASE v. Hannon. In C. R. Reynolds & E. Fletcher-Janzen (Eds.), Encyclopedia of special education (2nd ed., pp. 1325–1326). New York: Wiley.
Reynolds, C. R. (1980a). In support of “Bias in Mental Testing” and scientific inquiry. Behavioral and Brain Sciences, 3, 352.
Reynolds, C. R. (1980b). Differential construct validity of intelligence as popularly measured: Correlations of age with raw scores on the WISC-R for blacks, whites, males, and females. Intelligence, 4, 371–379.
Reynolds, C. R. (1980c). An examination for bias in a preschool battery across race and sex. Journal of Educational Measurement, 17, 137–146.
Reynolds, C. R. (1982a). Construct and predictive bias. In R. A. Berk (Ed.), Handbook of methods for detecting test bias (pp. 199–227). Baltimore, MD: Johns Hopkins University Press.
Reynolds, C. R. (1982b). The problem of bias in psychological assessment. In C. R. Reynolds & T. B. Gutkin (Eds.), The handbook of school psychology (pp. 178–208). New York: Wiley.
Reynolds, C. R. (1997). Measurement and statistical problems in neuropsychological assessment of children. In C. R. Reynolds & E. Fletcher-Janzen (Eds.), Handbook of child clinical neuropsychology (pp. 180–203). New York: Plenum.
Reynolds, C. R. (1998). Need we measure anxiety differently for males and females? Journal of Personality Assessment, 70, 212–221.
Reynolds, C. R. (1999a). Cultural bias in testing of intelligence and personality. In A. Bellack, M. Hersen (Series Eds.), & C. Belar (Vol. Ed.), Comprehensive clinical psychology: Vol. 10: Sociocultural and individual differences (pp. 53–92). New York: Pergamon.
Reynolds, C. R. (1999b). Fundamentals of measurement and assessment in psychology. In A. Bellack, M. Hersen (Series Eds.), & C. R. Reynolds (Vol. Ed.), Comprehensive clinical psychology: Vol. 4: Assessment (pp. 33–56). New York: Pergamon.
Reynolds, C. R. (in press). Why do we ignore research on bias in mental testing? Psychology, Public Policy, and Law.
Reynolds, C. R., & Bigler, E. D. (1994). Test of memory and learning. Austin, TX: PRO-ED.
Reynolds, C. R., & Brown, R. T. (1984). Bias in mental testing: An introduction to the issues. In C. R. Reynolds & R. T. Brown (Eds.), Perspectives on bias in mental testing (pp. 1–39). New York: Plenum.
Reynolds, C. R., Chastain, R., Kaufman, A. S., & McLean, J. (1987). Demographic influences on adult intelligence at ages 16 to 74 years. Journal of School Psychology, 25, 323–342.
Reynolds, C. R., & Harding, R. E. (1983). Outcome in two large sample studies of factorial similarity under six methods of comparison. Educational and Psychological Measurement, 43, 723–728.
Reynolds, C. R., & Kaiser, S. (1992). Test bias in psychological assessment. In T. B. Gutkin & C. R. Reynolds (Eds.), The handbook of school psychology (2nd ed., pp. 487–525). New York: Wiley.
Reynolds, C. R., Lowe, P. A., & Saenz, A. (1999). The problem of bias in psychological assessment. In T. B. Gutkin & C. R. Reynolds (Eds.), The handbook of school psychology (3rd ed., pp. 549–595). New York: Wiley.
Reynolds, C. R., & Paget, K. D. (1981). Factor structure of the revised Children’s Manifest Anxiety Scale for blacks, whites, males, and females with a national normative sample. Journal of Consulting and Clinical Psychology, 49, 352–359.
Reynolds, C. R., & Streur, J. (1982). Comparative structure of the WISC-R for emotionally disturbed and normal children. The Southern Psychologist, 1, 27–35.
Reynolds, C. R., Willson, V. L., & Chatman, S. P. (1984). Item bias on the 1981 revision of the Peabody Picture Vocabulary Test using a new method of detecting bias. Journal of Psychoeducational Assessment, 2, 219–221.
Sandoval, J., & Miller, M. (1979). Accuracy judgements of WISC-R item difficulty for minority groups. Paper presented to the annual meeting of the American Psychological Association.
Schmidt, W. H. (1983). Content biases in achievement tests. Journal of Educational Measurement, 20, 165–178.
Spitz, H. (1986). The raising of intelligence. Hillsdale, NJ: Erlbaum.
Stricker, L. J. (1982). Identifying test items that perform differentially in population subgroups: A partial correlation index. Applied Psychological Measurement, 6, 261–273.
Thissen, D., Steinberg, L., & Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In P. W. Holland & H. Wainer (Eds.), Differential item functioning: Theory and practice (pp. 67–113). Hillsdale, NJ: Erlbaum.
Thorndike, R. L. (1971). Concepts of culture-fairness. Journal of Educational Measurement, 8, 63–70.
Thorndike, R. M. (1978). Correlational procedures for research. New York: Gardner.
Timm, N. H. (1975). Multivariate analysis with applications in education and psychology. Monterey, CA: Brooks/Cole.
Veale, J. R., & Foreman, D. F. (1983). Assessing cultural bias using foil response data: Cultural variation. Journal of Educational Measurement, 20, 249–258.
Williams, R. L. (1974). From dehumanization to black intellectual genocide: A rejoinder. In G. J. Williams & S. Gordon (Eds.), Clinical child psychology: Current practices and future perspectives. New York: Behavioral.
Willson, V. L., Nolan, R. F., Reynolds, C. R., & Kamphaus, R. W. (1989). Race and gender effects on item functioning on the Kaufman Assessment Battery for Children. Journal of School Psychology, 27, 289–296.
Wright, B. J., & Isenstein, V. R. (1977, reprinted 1978). Psychological tests and minorities [DHEW Pub. No. (ADM) 78-482]. Rockville, MD: National Institute of Mental Health, Department of Health, Education and Welfare.


Author information

Authors and Affiliations

Texas A&M University, College Station, Texas, 77843-4225, USA
Cecil R. Reynolds
Bastrop Mental Health Associates, Bastrop, Texas, 78602, USA
Cecil R. Reynolds


Editor information

Editors and Affiliations

University of Northern Colorado, Colorado Springs, Colorado, USA
Elaine Fletcher-Janzen
Drew University of Medicine and Science, Los Angeles, California, USA
Tony L. Strickland
UCLA School of Medicine, Los Angeles, California, USA
Tony L. Strickland
Texas A&M University, College Station, Texas, USA
Cecil R. Reynolds
Bastrop Mental Health Associates, Bastrop, Texas, USA
Cecil R. Reynolds

About this chapter

Cite this chapter

Reynolds, C.R. (2000). Methods for Detecting and Evaluating Cultural Bias in Neuropsychological Tests. In: Fletcher-Janzen, E., Strickland, T.L., Reynolds, C.R. (eds) Handbook of Cross-Cultural Neuropsychology. Critical Issues in Neuropsychology. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-4219-3_15


DOI: https://doi.org/10.1007/978-1-4615-4219-3_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-6894-6
Online ISBN: 978-1-4615-4219-3
eBook Packages: Springer Book Archive
