skip to main content
10.1145/565196.565202acmconferencesArticle/Chapter ViewAbstractPublication PagesrecombConference Proceedingsconference-collections
Article

A new approach to analyzing gene expression time series data

Published:18 April 2002Publication History

ABSTRACT

We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time-points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time-points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to non-uniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knockout data that produces biologically meaningful results.

References

  1. J. Aach and G. M. Church. Aligning gene expression time series with time warping algorithms. Bioinformatics, 17:495--508, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  2. B. Brumback and J. Rice. Smoothing spline models for the analysis of nested and crossed samples of curves. Am. Statist. Assoc., 93:961--976, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  3. S. Chu, J. DeRisi, and et al. The transcriptional program of sporulation in budding yeast. Science, 282:699--705, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  4. C. de Boor. A practical guide to splines. Springer, 1978.Google ScholarGoogle ScholarCross RefCross Ref
  5. L. Deng, M. Aksmanovic, D. X. Sun, and C. F. J. X. Wu. Recognition using hidden markov models with polynomial regression functions as nonstationary states. IEEE Transactions on Speech and Audio Processing, 2:507--520, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  6. P. D'haeseleer, X. Wen, S. Fuhrman, and R. Somogyi. Linear modeling of mrna expression levels during cns development and injury. In PSB99, 1999.Google ScholarGoogle Scholar
  7. M.B. Eisen, P.T. Spellman, P.O. Brown, and D. Botstein. Cluster analysis and display of genome-wide expression patterns. PNAS, 95:14863--14868, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  8. N. Friedman, M. Linial, I. Nachman, and D. Pe'er. Using bayesian network to analyze expression data. In RECOMB, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. N. S. Holter, A. Maritan, and et al. Dynamic modeling of gene expression data. PNAS, 98:1693--1698, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  10. G. James and T. Hastie. Functional linear discriminant analysis for irregularly sampled curves. Journal of the Royal Statistical Society, to appear, 2001.Google ScholarGoogle Scholar
  11. S. H Neal, M. Madhusmita, and et al. Fundamental patterns underlying gene expression profiles: Simplicity from complexity. PNAS, 97:8409--8414, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  12. Sharan R. and Shamir R. Algorithmic approaches to clustering gene expression data. Current Topics in Computational Biology, To appear.Google ScholarGoogle Scholar
  13. I. Simon, J. Barnett, and et al. Serial regulation of transcriptional regulators in the yeast cell cycle. Cell, 106:697--708, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  14. T. S. Spellman, G Sherlock, and et al. Comprehensive identification of cell cycle-regulated genes of the yeast saccharomyces cerevisia by microarray hybridization. Mol. Biol. of the Cell, 9:3273--3297, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  15. O. Troyanskaya, M. Cantor, and et al. Missing value estimation methods for dna microarrays. Bioinformatics, 17:520--525, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  16. P. Viola. Alignment by Maximization of Mutual Information. PhD thesis, MIT AI Lab, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. L. P. Zhao, R. Prentice, and L. Breeden. Statistical modeling of large microarray data sets to identify stimulus-response profiles. PNAS, 98:5631--5636, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  18. G. Zhu, Spellman T. S., and et al. Two yeast forkhead genes regulate cell cycle and pseudohyphal growth. Nature, 406:90--94, 2000.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A new approach to analyzing gene expression time series data

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                RECOMB '02: Proceedings of the sixth annual international conference on Computational biology
                April 2002
                341 pages
                ISBN:1581134983
                DOI:10.1145/565196

                Copyright © 2002 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 18 April 2002

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                RECOMB '02 Paper Acceptance Rate35of118submissions,30%Overall Acceptance Rate148of538submissions,28%

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader