Abstract
We give an overview of VisTrails, a system that provides an infrastructure for systematically capturing detailed provenance and streamlining the data exploration process. A key feature that sets VisTrails apart from previous visualization and scientific workflow systems is a novel action-based mechanism that uniformly captures provenance for data products and workflows used to generate these products. This mechanism not only ensures reproducibility of results, but it also simplifies data exploration by allowing scientists to easily navigate through the space of workflows and parameter settings for an exploration task.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Alonso, G., Mohan, C.: Workflow management: The next generation of distributed processing tools. In: Jajodia, S., Kerschberg, L. (eds.) Advanced Transaction Models and Architectures, ch. 2. Kluwer, Dordrecht (1997)
Anderson, E., Callahan, S., Chen, G., Freire, J., Santos, E., Scheidegger, C., Silva, C., Vo, H.: Visualization in radiation oncology: Towards replacing the laboratory notebook. Technical Report UUSCI-2006-017, SCI Institute–Univ. of Utah (2006)
Bavoil, L., Callahan, S., Crossno, P., Freire, J., Scheidegger, C., Silva, C., Vo, H.: VisTrails: Enabling Interactive Multiple-View Visualizations. In: IEEE Visualization 2005, pp. 135–142 (2005)
Brodlie, K., Poon, A., Wright, H., Brankin, L., Banecki, G., Gay, A.: GRASPARC: a problem solving environment integrating computation and visualization. In: IEEE Visualization 1993, pp. 102–109 (1993)
Callahan, S., Freire, J., Santos, E., Scheidegger, C., Silva, C., Vo, H.: Using provenance to streamline data exploration through visualization. Technical Report UUSCI-2006-016, SCI Institute–Univ. of Utah (2006)
Foster, I., Voeckler, J., Wilde, M., Zhao, Y.: Chimera: A virtual data system for representing, querying and automating data derivation. In: Statistical and Scientific Database Management (SSDBM), pp. 37–46 (2002)
Groth, P., Miles, S., Fang, W., Wong, S.C., Zauner, K.-P., Moreau, L.: Recording and using provenance in a protein compressibility experiment. In: Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing (HPDC 2005) (July 2005)
Kreuseler, M., Nocke, T., Schumann, H.: A history mechanism for visual data mining. In: IEEE Symposium on Information Visualization, pp. 49–56 (2004)
Ludäscher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger-Frank, E., Jones, M., Lee, E., Tao, J., Zhao, Y.: Scientific Workflow Management and the Kepler System. Concurrency and Computation: Practice & Experience (2005)
Parker, S.G., Johnson, C.R.: SCIRun: a scientific programming environment for computational steering. Supercomputing (1995)
Simmhan, Y.L., Plale, B., Gannon, D.: A survey of data provenance in e-science. SIGMOD Record 34(3), 31–36 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Freire, J., Silva, C.T., Callahan, S.P., Santos, E., Scheidegger, C.E., Vo, H.T. (2006). Managing Rapidly-Evolving Scientific Workflows. In: Moreau, L., Foster, I. (eds) Provenance and Annotation of Data. IPAW 2006. Lecture Notes in Computer Science, vol 4145. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11890850_2
Download citation
DOI: https://doi.org/10.1007/11890850_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46302-3
Online ISBN: 978-3-540-46303-0
eBook Packages: Computer ScienceComputer Science (R0)