Abstract
The articles in this bundle are all associated with the notion of interaction and represent the genesis of the subject of graphical models in its modern form, the origins of these being traceable back to Gibbs [11] and Wright [30] and earlier.
You have full access to this open access chapter, Download chapter PDF
Similar content being viewed by others
Keywords
- Structural Equation Model
- Bayesian Network
- Graphical Model
- Directed Acyclic Graph
- Conditional Independence
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
The articles in this bundle are all associated with the notion of interaction and represent the genesis of the subject of graphical models in its modern form, the origins of these being traceable back to Gibbs [11] and Wright [30] and earlier.
Around 1976, Terry was fascinated by the notion of conditional independence, along the lines later published in Dawid [6, 7]. In 1976, Terry invited me to Perth and we were running a daily research seminar with the theme of studying similarities and differences between Statistics and Statistical Mechanics. In particular, we wondered what the relations were between notions of interaction as represented in linear models, in multi-dimensional contingency tables, and in stochastic models for particle systems; in addition, the purpose was also to understand what was the relation between these concepts and conditional independence.
As we discovered that these were all essentially the same concepts, the similarity being obscured by very different traditions of notation, the term graphical model was coined. Our findings, also obtained in collaboration with John Darroch, were collected in Darroch et al. [4], and later expanded and published in Speed [24], Darroch et al. [5], and Darroch and Speed [3] as well as Lauritzen et al. [19] and to some extent Speed [25], the latter giving an overview of a number of different variants and proofs of what has become known as the Hammersley–Clifford theorem [14, 2].
Of these articles, Darroch et al. [5] rather quickly had a seminal impact and a small community of researchers in the area of graphical models gradually emerged. In a certain sense, the article does not contain much formally new material (if any at all), but for the first time a simple, visual description and interpretation of the class of log-linear models [12, 13], which otherwise could seem obscure, was available. The interpretation of a subclass of the models in terms of conditional independence had an immediate intuitive appeal. In addition, the article identified and emphasized models represented by chordal or triangulated graphs as those where estimation and other issues had a particularly simple solution, the combinatorial theory of these graphs being further studied in Lauritzen et al. [19].
Darroch and Speed [3] studied the notion of interaction from an algebraic point of view in terms of fundamental decompositions of the linear space of functions on a product of finite sets; indeed it essentially but implicitly uses the fundamental decomposition of this space into irreducible components which are stable under a product of symmetric groups [9] and thus gives an elegant algebraic perspective on the Hammersley–Clifford theorem.
Towards the end of 1976, Terry serendipitously came across Wermuth [29], which identified that a completely analogous theory could be developed for the Gaussian case, with chordal graphs playing essentially the same role as in the case of log-linear models; indeed, Dempster [8] had developed the basic computational and statistical theory for these under the name of models for covariance selection. This fact and the corresponding interpretation was emphasized and discussed in Darroch et al. [4] as well as in Speed [24, 25], but received otherwise relatively little attention at the time. Gaussian graphical models have had a remarkable renaissance in connection with the modern analysis of high-dimensional data, for example concerning gene expression [10, 23]. Out of this early work with Gaussian graphical models grew also the article by Speed and Kiiveri [26], which describes and unifies a class of iterative algorithms for fitting Gaussian graphical models of which special cases previously had been considered by e.g. Dempster [8]. Essentially, there are two fundamental types, of which one initially uses the estimate under no restrictions and iteratively ensures that restrictions of the model are satisfied; the other type initially uses a trivial estimator and iteratively ensures that the likelihood equations are satisfied. The article elegantly shows that an abundance of hybrids of these algorithms can be constructed and gives a unified proof of their convergence.
The last two articles [16, 17], represent the genesis of what today is probably the most prolific and well-known type of graphical models; these are based on directed acyclic graphs and admitting interpretation in causal terms similar to that of structural equation models [1]. At the time when these articles appeared they were (undeservedly) largely ignored both by the statistical and structural equation communities. Graphical models based on directed acyclic graphs—now mostly known as Bayesian networks [21]—have an unquestionable prominence in current scientific literature, but the surge of interest in these models was in particular generated by the prolific research activities in computer science, where work such as, for example, Lauritzen and Spiegelhalter [18], Pearl [22], Spirtes et al. [27], Heckerman et al. [15], and Pearl [20] established these models as objects worthy of intense study. In retrospect, it is clear that the global Markov property defined in Kiiveri et al. [17] was not the optimal one as there are independence relations true in any Bayesian network that cannot be derived from it, but fundamentally this article establishes the correct class of directed Markov models for the first time and thus yields a conditional independence perspective on structural equation models, as later elaborated, for example by Spirtes et al. [28].
References
K. A. Bollen. Structural Equations with Latent Variables. John Wiley and Sons, New York, 1989.
P. Clifford. Markov random fields in statistics. In G. R. Grimmett and D. J. A. Welsh, editors, Disorder in Physical Systems: A Volume in Honour of John M. Hammersley, pages 19–32. Oxford University Press, 1990.
J. N. Darroch and T. P. Speed. Additive and multiplicative models and interactions. Ann. Stat., 11:724–738, 1983.
J. N. Darroch, S. L. Lauritzen, and T. P. Speed. Log-linear models for contingency tables and Markov fields over graphs. Unpublished manuscript, 1976.
J. N. Darroch, S. L. Lauritzen, and T. P. Speed. Markov fields and log-linear interaction models for contingency tables. Ann. Stat., 8: 522–539, 1980.
A. P. Dawid. Conditional independence in statistical theory (with discussion). J. Roy. Stat. Soc. B, 41:1–31, 1979.
A. P. Dawid. Conditional independence for statistical operations. Ann. Stat., 8:598–617, 1980.
A. P. Dempster. Covariance selection. Biometrics, 28:157–175, 1972.
P. Diaconis. Group Representations in Probability and Statistics, volume 11 of Lecture Notes–Monograph Series. Institute of Mathematical Statistics, Hayward, CA, 1988.
A. Dobra, C. Hans, B. Jones, J. R. Nevins, and M. West. Sparse graphical models for exploring gene expression data. J. Multivariate Anal., 90:196–212, 2004.
W. Gibbs. Elementary Principles of Statistical Mechanics. Yale University Press, New Haven, Connecticut, 1902.
L. A. Goodman. The multivariate analysis of qualitative data: Interaction among multiple classifications. J. Am. Stat. Assoc., 65: 226–256, 1970.
S. J. Haberman. The Analysis of Frequency Data. University of Chicago Press, Chicago, 1974.
J. M. Hammersley and P. E. Clifford. Markov fields on finite graphs and lattices. Unpublished manuscript, 1971.
D. Heckerman, D. Geiger, and D. M. Chickering. Learning Bayesian networks: The combination of knowledge and statistical data. Mach. Learn., 20:197–243, 1995.
H. Kiiveri and T. P. Speed. Structural analysis of multivariate data: A review. In S. Leinhardt, editor, Sociological Methodology. Jossey-Bass, San Francisco, 1982.
H. Kiiveri, T. P. Speed, and J. B. Carlin. Recursive causal models. J. Aust. Math. Soc. A, 36:30–52, 1984.
S. L. Lauritzen and D. J. Spiegelhalter. Local computations with probabilities on graphical structures and their application to expert systems (with discussion). J. Roy. Stat. Soc. B, 50:157–224, 1988.
S. L. Lauritzen, T. P. Speed, and K. Vijayan. Decomposable graphs and hypergraphs. J. Aust. Math. Soc. A, 36:12–29, 1984.
J. Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge, UK, 2000.
J. Pearl. Fusion, propagation and structuring in belief networks. Artif. Intell., 29:241–288, 1986.
J. Pearl. Probabilistic Inference in Intelligent Systems. Morgan Kaufmann Publishers, San Mateo, CA, 1988.
J. Schäfer and K. Strimmer. An empirical-Bayes approach to inferring large-scale gene association networks. Bioinformatics, 21:754–764, 2005.
T. P. Speed. Relations between models for spatial data, contingency tables and Markov fields on graphs. Adv. Appl. Prob.: Supplement, 10: 111–122, 1978.
T. P. Speed. A note on nearest-neighbour Gibbs and Markov probabilities. Sankhyā Ser. A, 41:184–197, 1979.
T. P. Speed and H. Kiiveri. Gaussian Markov distributions over finite graphs. Ann. Stat., 14:138–150, 1986.
P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction and Search. Springer-Verlag, New York, 1993. Reprinted by MIT Press.
P. Spirtes, T. S. Richardson, C. Meek, R. Scheines, and C. Glymour. Using path diagrams as a structural equation modeling tool. Sociol. Method. Res., 27:182–225, 1998.
N. Wermuth. Analogies between multiplicative models in contingency tables and covariance selection. Biometrics, 32:95–108, 1976.
S. Wright. The method of path coefficients. Ann. Math. Statist., 5: 161–215, 1934.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Lauritzen, S.L. (2012). Interaction Models. In: Dudoit, S. (eds) Selected Works of Terry Speed. Selected Works in Probability and Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1347-9_4
Download citation
DOI: https://doi.org/10.1007/978-1-4614-1347-9_4
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1346-2
Online ISBN: 978-1-4614-1347-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)