Interaction Models

doi:10.1007/978-1-4614-1347-9_4

Steffen L. Lauritzen

Part of the book series: Selected Works in Probability and Statistics ((SWPS))

Abstract

The articles in this bundle are all associated with the notion of interaction and represent the genesis of the subject of graphical models in its modern form, the origins of these being traceable back to Gibbs [11] and Wright [30] and earlier.

Around 1976, Terry was fascinated by the notion of conditional independence, along the lines later published in Dawid [6, 7]. That year he invited me to Perth, where we ran a daily research seminar on the similarities and differences between Statistics and Statistical Mechanics. In particular, we wondered how the notions of interaction represented in linear models, in multi-dimensional contingency tables, and in stochastic models for particle systems were related to one another, and how all of these concepts were related to conditional independence.

As we discovered that these were all essentially the same concepts, the similarity being obscured by very different traditions of notation, the term graphical model was coined. Our findings, also obtained in collaboration with John Darroch, were collected in Darroch et al. [4] and later expanded and published in Speed [24], Darroch et al. [5], Darroch and Speed [3], and Lauritzen et al. [19], and to some extent in Speed [25]. The last of these gives an overview of a number of different variants and proofs of what has become known as the Hammersley–Clifford theorem [14, 2].

Of these articles, Darroch et al. [5] rather quickly had a seminal impact, and a small community of researchers in the area of graphical models gradually emerged. In a certain sense, the article does not contain much formally new material (if any at all), but it made available for the first time a simple, visual description and interpretation of the class of log-linear models [12, 13], which could otherwise seem obscure. The interpretation of a subclass of the models in terms of conditional independence had an immediate intuitive appeal. In addition, the article identified and emphasized models represented by chordal or triangulated graphs as those where estimation and other issues had a particularly simple solution, the combinatorial theory of these graphs being further studied in Lauritzen et al. [19].
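
As a minimal illustration of why chordal graphs make estimation simple, consider the smallest non-trivial case, the chain A – B – C, which encodes the conditional independence of A and C given B. The notation below (cell counts n_{abc}, a plus sign for summation over an index, N for the total count) is assumed here for illustration and is not taken from the articles themselves.

```latex
% Graph A -- B -- C: cliques {A,B} and {B,C}, separator {B}.
% Conditional independence of A and C given B is equivalent to the factorization
p(a,b,c) = \frac{p(a,b)\,p(b,c)}{p(b)} ,
% and the maximum likelihood estimate is available in closed form
% from the marginal counts (for n_{+b+} > 0):
\hat{p}(a,b,c) = \frac{n_{ab+}\, n_{+bc}}{N\, n_{+b+}} .
```

For a non-chordal graph such as the four-cycle, no closed form of this kind is available in general and the estimate has to be computed iteratively.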

Darroch and Speed [3] studied the notion of interaction from an algebraic point of view, in terms of fundamental decompositions of the linear space of functions on a product of finite sets. The article essentially, if implicitly, uses the fundamental decomposition of this space into irreducible components that are stable under a product of symmetric groups [9], and thus gives an elegant algebraic perspective on the Hammersley–Clifford theorem.
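
For orientation, the theorem can be stated in terms of an interaction expansion roughly as follows. This is a sketch in assumed notation (V for the index set of the variables, x_a for the coordinates of x indexed by a, and ξ_a for the interaction terms); none of these symbols come from the articles.

```latex
% Interaction expansion of the log-density of a strictly positive distribution:
\log p(x) = \sum_{a \subseteq V} \xi_a(x_a) .
% Hammersley--Clifford: p is Markov with respect to the undirected graph G
% if and only if the expansion can be chosen with
\xi_a \equiv 0 \quad \text{whenever } a \text{ is not a complete subset of } G .
```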

Towards the end of 1976, Terry serendipitously came across Wermuth [29], which showed that a completely analogous theory could be developed for the Gaussian case, with chordal graphs playing essentially the same role as in the case of log-linear models; indeed, Dempster [8] had developed the basic computational and statistical theory for these models under the name of covariance selection. This fact and the corresponding interpretation were emphasized and discussed in Darroch et al. [4] as well as in Speed [24, 25], but otherwise received relatively little attention at the time. Gaussian graphical models have since had a remarkable renaissance in connection with the modern analysis of high-dimensional data, for example concerning gene expression [10, 23].

Out of this early work with Gaussian graphical models also grew the article by Speed and Kiiveri [26], which describes and unifies a class of iterative algorithms for fitting Gaussian graphical models, special cases of which had previously been considered by, for example, Dempster [8]. Essentially, there are two fundamental types. The first starts from the estimate under no restrictions and iteratively ensures that the restrictions of the model are satisfied; the second starts from a trivial estimator and iteratively ensures that the likelihood equations are satisfied. The article elegantly shows that an abundance of hybrids of these algorithms can be constructed and gives a unified proof of their convergence.
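
The second type of algorithm can be sketched in a few lines. The code below is a minimal illustration, assuming the standard iterative proportional scaling update over the cliques of the graph; it is not claimed to be the formulation used in Speed and Kiiveri [26]. The function names (`invert`, `submatrix`, `fit_by_clique_scaling`) and the example matrix are hypothetical. The starting point is the identity concentration matrix (the trivial estimator), and each step adjusts one clique block so that the fitted covariance agrees with the sample covariance `S` on that clique, which is what the likelihood equations require.

```python
def invert(m):
    """Invert a small square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(m)
    # Augment m with the identity matrix on the right.
    a = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
         for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        p = a[col][col]
        a[col] = [v / p for v in a[col]]
        for r in range(n):
            if r != col:
                f = a[r][col]
                a[r] = [v - f * w for v, w in zip(a[r], a[col])]
    return [row[n:] for row in a]


def submatrix(m, idx):
    """Return the square block of m with rows and columns in idx."""
    return [[m[i][j] for j in idx] for i in idx]


def fit_by_clique_scaling(S, cliques, sweeps=20):
    """Fit the concentration matrix K of a Gaussian graphical model.

    S is the sample covariance matrix; cliques lists the vertex sets of the
    cliques of the graph. Entries of K for non-adjacent pairs are never
    touched, so they stay exactly zero (the model restrictions hold throughout).
    """
    n = len(S)
    # Trivial starting estimator: the identity matrix.
    K = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        for c in cliques:
            sigma = invert(K)                      # current fitted covariance
            s_inv = invert(submatrix(S, c))        # target on the clique
            sig_inv = invert(submatrix(sigma, c))  # current fit on the clique
            # After this update the fitted covariance equals S on clique c.
            for a, i in enumerate(c):
                for b, j in enumerate(c):
                    K[i][j] += s_inv[a][b] - sig_inv[a][b]
    return K


# Illustrative input: chain 0 - 1 - 2, so vertices 0 and 2 are not adjacent.
S = [[1.0, 0.5, 0.3],
     [0.5, 1.0, 0.4],
     [0.3, 0.4, 1.0]]
K = fit_by_clique_scaling(S, cliques=[[0, 1], [1, 2]])
Sigma = invert(K)
```

In this example the fitted covariance `Sigma` reproduces the sample values 0.5 and 0.4 on the two edges, while the entry for the missing edge is no longer the sample value 0.3 but approximately 0.2 = 0.5 × 0.4, as conditional independence of variables 0 and 2 given variable 1 requires. Because the chain is a chordal graph, a single sweep already suffices here; the loop over sweeps matters for non-chordal graphs.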

The last two articles [16, 17] represent the genesis of what today is probably the most prolific and well-known type of graphical model: those based on directed acyclic graphs, which admit interpretation in causal terms similar to that of structural equation models [1]. At the time when these articles appeared, they were (undeservedly) largely ignored by both the statistical and the structural equation communities. Graphical models based on directed acyclic graphs, now mostly known as Bayesian networks [21], have an unquestionable prominence in the current scientific literature, but the surge of interest in these models was generated in particular by the prolific research activities in computer science, where work such as Lauritzen and Spiegelhalter [18], Pearl [22], Spirtes et al. [27], Heckerman et al. [15], and Pearl [20] established these models as objects worthy of intense study. In retrospect, it is clear that the global Markov property defined in Kiiveri et al. [17] was not the optimal one, as there are independence relations true in any Bayesian network that cannot be derived from it. Fundamentally, however, this article establishes the correct class of directed Markov models for the first time and thus yields a conditional independence perspective on structural equation models, as later elaborated, for example, by Spirtes et al. [28].
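
For readers unfamiliar with these models, the defining property in its now-standard form is the recursive factorization below. The notation (D for the directed acyclic graph, pa(v) for the parents of v, nd(v) for its non-descendants) is assumed here and is not necessarily that of [16, 17].

```latex
% Recursive factorization along the directed acyclic graph D on V:
p(x) = \prod_{v \in V} p\bigl(x_v \mid x_{\mathrm{pa}(v)}\bigr) ,
% which implies, for every vertex v, the local directed Markov property
X_v \perp\!\!\!\perp X_{\mathrm{nd}(v) \setminus \mathrm{pa}(v)} \mid X_{\mathrm{pa}(v)} .
```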

References

[1] K. A. Bollen. Structural Equations with Latent Variables. John Wiley and Sons, New York, 1989.
[2] P. Clifford. Markov random fields in statistics. In G. R. Grimmett and D. J. A. Welsh, editors, Disorder in Physical Systems: A Volume in Honour of John M. Hammersley, pages 19–32. Oxford University Press, 1990.
[3] J. N. Darroch and T. P. Speed. Additive and multiplicative models and interactions. Ann. Stat., 11:724–738, 1983.
[4] J. N. Darroch, S. L. Lauritzen, and T. P. Speed. Log-linear models for contingency tables and Markov fields over graphs. Unpublished manuscript, 1976.
[5] J. N. Darroch, S. L. Lauritzen, and T. P. Speed. Markov fields and log-linear interaction models for contingency tables. Ann. Stat., 8:522–539, 1980.
[6] A. P. Dawid. Conditional independence in statistical theory (with discussion). J. Roy. Stat. Soc. B, 41:1–31, 1979.
[7] A. P. Dawid. Conditional independence for statistical operations. Ann. Stat., 8:598–617, 1980.
[8] A. P. Dempster. Covariance selection. Biometrics, 28:157–175, 1972.
[9] P. Diaconis. Group Representations in Probability and Statistics, volume 11 of Lecture Notes–Monograph Series. Institute of Mathematical Statistics, Hayward, CA, 1988.
[10] A. Dobra, C. Hans, B. Jones, J. R. Nevins, and M. West. Sparse graphical models for exploring gene expression data. J. Multivariate Anal., 90:196–212, 2004.
[11] W. Gibbs. Elementary Principles of Statistical Mechanics. Yale University Press, New Haven, Connecticut, 1902.
[12] L. A. Goodman. The multivariate analysis of qualitative data: Interaction among multiple classifications. J. Am. Stat. Assoc., 65:226–256, 1970.
[13] S. J. Haberman. The Analysis of Frequency Data. University of Chicago Press, Chicago, 1974.
[14] J. M. Hammersley and P. E. Clifford. Markov fields on finite graphs and lattices. Unpublished manuscript, 1971.
[15] D. Heckerman, D. Geiger, and D. M. Chickering. Learning Bayesian networks: The combination of knowledge and statistical data. Mach. Learn., 20:197–243, 1995.
[16] H. Kiiveri and T. P. Speed. Structural analysis of multivariate data: A review. In S. Leinhardt, editor, Sociological Methodology. Jossey-Bass, San Francisco, 1982.
[17] H. Kiiveri, T. P. Speed, and J. B. Carlin. Recursive causal models. J. Aust. Math. Soc. A, 36:30–52, 1984.
[18] S. L. Lauritzen and D. J. Spiegelhalter. Local computations with probabilities on graphical structures and their application to expert systems (with discussion). J. Roy. Stat. Soc. B, 50:157–224, 1988.
[19] S. L. Lauritzen, T. P. Speed, and K. Vijayan. Decomposable graphs and hypergraphs. J. Aust. Math. Soc. A, 36:12–29, 1984.
[20] J. Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge, UK, 2000.
[21] J. Pearl. Fusion, propagation and structuring in belief networks. Artif. Intell., 29:241–288, 1986.
[22] J. Pearl. Probabilistic Inference in Intelligent Systems. Morgan Kaufmann Publishers, San Mateo, CA, 1988.
[23] J. Schäfer and K. Strimmer. An empirical-Bayes approach to inferring large-scale gene association networks. Bioinformatics, 21:754–764, 2005.
[24] T. P. Speed. Relations between models for spatial data, contingency tables and Markov fields on graphs. Adv. Appl. Prob.: Supplement, 10:111–122, 1978.
[25] T. P. Speed. A note on nearest-neighbour Gibbs and Markov probabilities. Sankhyā Ser. A, 41:184–197, 1979.
[26] T. P. Speed and H. Kiiveri. Gaussian Markov distributions over finite graphs. Ann. Stat., 14:138–150, 1986.
[27] P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction and Search. Springer-Verlag, New York, 1993. Reprinted by MIT Press.
[28] P. Spirtes, T. S. Richardson, C. Meek, R. Scheines, and C. Glymour. Using path diagrams as a structural equation modeling tool. Sociol. Method. Res., 27:182–225, 1998.
[29] N. Wermuth. Analogies between multiplicative models in contingency tables and covariance selection. Biometrics, 32:95–108, 1976.
[30] S. Wright. The method of path coefficients. Ann. Math. Statist., 5:161–215, 1934.

Author information

Authors and Affiliations

Department of Statistics, University of Oxford, Oxford, UK
Steffen L. Lauritzen

Editor information

Editors and Affiliations

School of Public Health, Div. Biostatistics, University of California, Earl Warren Hall 140, Berkeley, 94720, California, USA
Sandrine Dudoit

Cite this chapter

Lauritzen, S.L. (2012). Interaction Models. In: Dudoit, S. (eds) Selected Works of Terry Speed. Selected Works in Probability and Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1347-9_4

DOI: https://doi.org/10.1007/978-1-4614-1347-9_4
Published: 09 January 2012
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1346-2
Online ISBN: 978-1-4614-1347-9
eBook Packages: Mathematics and Statistics (R0)
