Abstract
In this paper we describe WITT, a computational model of categorization and conceptual clustering that has been motivated and guided by research on human categorization. Properties of categories to which humans are sensitive include best or prototypical members, relative contrasts between categories, and polymorphy (neither necessary nor sufficient feature rules). The system uses pairwise feature correlations to determine the “similarity” between objects and clusters of objects, allowing the system a flexible representation scheme that can model common-feature categories and polymorphous categories. This intercorrelation measure is cast in terms of an information-theoretic evaluation function that directs WITT'S search through the space of clusterings. This information-theoretic similarity metric also can be used to explain basic-level and typicality effects that occur in humans. WITT has been tested on both artificial domains and on data from the 1985 World Almanac, and we have examined the effect of various system parameters on the quality of the model's behavior.
Article PDF
Similar content being viewed by others
References
Cheeseman, P., Kelly, J., Self, M., Stutz, J., Taylor, W., & Freeman, D. (1988). Proceedings of the Fifth International Conference on Machine Learning(pp. 54–64). Ann Arbor, MI: Morgan Kaufmann.
DeJong, G., & Mooney, R. (1986). Explanation-based learning:An alternative view. Machine Learning, 1145–176.
Dennis, I., Hampton, J. A., & Lea, S. E. G. (1973). New problem in concept formation. Nature, 243101–102.
Estes, W. K. (1986). Memory storage and retrieval processes in category learning. Journal of Experimental Psychology:General, 115155–174.
Everitt, B. (1974). Cluster analysis. London: Heinemann Educational Books.
Gluck, M. A., & Corter, J. E. (1985). Information, uncertainty, and the utility of categories. Proceedings of the Seventh Annual Conference of the Cognitive Science Society(pp. 283–287). Irvine, CA: Lawrence Erlbaum.
Fisher, D. H. (1987). Knowledge acquisition via incremental conceptual clustering. Machine Learning, 2139–172.
Grossberg, S. (1976). Adaptive pattern classification and universal recoding. Part I:Parallel development and coding of neural feature detectors. Bio-logical Cybernetics, 23121–134.
Homa, D. (1978). Abstraction of ill-defined form. Journal of Experimental Psychology:Human Learning and Memory, 4407–416.
Lance, G. N., & Williams, W. T. (1967). Note on a new information-statistic classificatory program. Computer Journal, 9373–380.
Lebowitz, M. (1987). Experiments with incremental concept formation:UNI-MEM. Machine Learning, 2103–138.
Medin, D. L., Wattenmaker, W. D., & Hampson, S. E. (1987). Family re-semblance, conceptual cohesiveness and category construction. Cognitive Psychology, 19242–279.
Michalski, R. S. (1980). Knowledge acquisition through conceptual cluster-ing:A theoretical framework and an algorithm for partitioning data into conjunctive concepts. International Journal of Policy Analysis and Infor-mation Systems, 4, 219–244.
Michalski, R. S., & Stepp, R. E. (1983a). Learning from observation:Con-ceptual clustering. In R. S. Michalski, J. G. Carbonell, & T. M. Mitchell (Eds.), Machine learning:An artificial intelligence approach. Los Altos, CA: Morgan Kaufmann.
Michalski, R. S., & Stepp, R. E. (1983b). Automated construction of classifi-cations:Conceptual clustering verses numerical taxonomy. IEEE Trans-actions on Pattern Analysis and Machine Intelligence, 5396–410.
Miller, G. A. (1971). Empirical methods in the study of semantics. In D. D. Steinberg & L. A. Jakobovits (Eds.), Semantics. Cambridge: Cambridge University Press.
Mitchell, T. M. (1978). Version spaces:An approach to concept learning. Doctoral dissertation, Department of Electrical Engineering, Stanford University, Palo Alto, CA.
Mitchell, T. M., Keller, R. M., & Kedar-Cabelli, S. T. (1986). Explanation-based generalization:A unifying view. Machine Learning, 1, 47–80.
Murphy, G. L. (1982). Cue validity and levels of categorization. Psychological Bulletin, 91174–177.
Murphy, G. L., & Medin, D. L. (1985). The role of theories in conceptual coherence. Psychological Review, 92289–316.
Orloci, L. (1969). Information analysis of structure in biological collections. Nature, 223483–484.
Posner, M. I., & Keele, S. W. (1968). On the genesis of abstract ideas. Journal of Experimental Psychology, 77353–363.
Quinlan, J. R. (1986). Induction of decision trees, Machine Learning, 181–106.
Rosch, E., Mervis, C., Gray, W., Johnson, D., & Boyes-Braem, P. (1976). Basic objects in natural categories. Cognitive Psychology, 7382–439.
Rosch, E., & Lloyd, B. B. (Eds. ). (1978). Cognition and categorization. Hillsdale, NJ: Lawrence Erlbaum.
Rumelhart, D. E., & McClelland, J. L. (1986). Parallel distributed processing: Explorations in the micro-structure of cognition. Cambridge, MA: MIT Press.
Schank, R. C., Collins, G. C., & Hunter, L. E. (1986). Transcending inductive category formation in learning. Behavioral and Brain Sciences, 9639–686.
Smith, E. E., & Medin, D. L. (1981). Categories and concepts. Cambridge, MA: Harvard University Press.
Sneath, P. H., & Sokal, R. R. (1973). Numerical taxonomy:The principles and practice of numerical classification. San Francisco, CA: Freeman.
Wallace, C. S., & Boulton D. M. (1968). An information measure for classification. Computer Journal, 11185–194.
Winston, P. H. (1975). Learning structural descriptions from examples. In P. H. Winston (Ed.), The psychology of computer vision. New York: McGraw-Hill.
Wittgenstein, L. (1953). Philosophical investigations. Oxford: Basil Blackwell.
Zadeh, L. A. (1965). Fuzzy sets. Information and Control, 8338–353.
Rights and permissions
About this article
Cite this article
Hanson, S.J., Bauer, M. Conceptual Clustering, Categorization, and Polymorphy. Machine Learning 3, 343–372 (1989). https://doi.org/10.1023/A:1022697818275
Issue Date:
DOI: https://doi.org/10.1023/A:1022697818275