Abstract
The Basic Local Alignment Search Tool (BLAST) is one of the best known sequence comparison programs available in bioinformatics. It is used to compare query sequences to a set of target sequences, with the intention of finding similar sequences in the target set. Here, we present a distributed BLAST service which operates over a set of heterogeneous Grid resources and is made available through a Globus toolkit v.3 Grid service. This work has been carried out in the context of the BRIDGES project, a UK e-Science project aimed at providing a Grid based environment for biomedical research. Input consisting of multiple query sequences is partitioned into sub-jobs on the basis of the number of idle compute nodes available and then processed on these in batches. To achieve this, we have implemented our own Java-based scheduler which distributes sub-jobs across an array of resources utilizing a variety of local job scheduling systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BRIDGES project, http://www.brc.dcs.gla.ac.uk/projects/bridges/
Sinnott, R., Atkinson, M., Bayer, M., Berry, D., Dominiczak, A., Ferrier, M., Gilbert, D., Hanlon, N., Houghton, D., Hunt, E., White, D.: Grid Services Supporting the Usage of Secure Federated, Distributed Biomedical Data. In: Proceedings of the UK e-Science All Hands Meeting, Nottingham, UK (2004)
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic Local Alignment Search Tool. J. Mol. Biol. 215, 403–410 (1990)
EBI BLAST, http://www.ebi.ac.uk/blastall/index.html
NCBI BLAST, website: http://www.ncbi.nlm.nih.gov/BLAST/
Pedretti, K.T., Casavant, T.L., Braun, R.C., Scheetz, T.E., Birkett, C.L., Roberts, C.A.: Three Complementary Approaches to Parallelization of Local BLAST Service on Workstation Clusters (invited paper). In: Proceedings of the 5th International Conference on Parallel Computing Technologies, pp. 271–282 (1999)
Braun, R.C., Pedretti, K.T., Casavant, T.L., Scheetz, T.E., Birkett, C.L., Roberts, C.A.: Parallelization of local BLAST service on workstation clusters. Future Generation Computer Systems 17, 745–754 (2001)
Clifford, R., Mackey, A.J.: Disperse: a simple and efficient approach to parallel database searching. Bioinformatics 16, 564–565 (2000)
Mathog, D.R.: Parallel BLAST on split databases. Bioinformatics 19, 1865–1866 (2003)
Darling, A.E., Carey, L., Feng, W.: The design, implementation and evaluation of mpiBLAST. In: Proceedings of ClusterWorld Conference, Expo and the 4th International Conference on Linux Clusters: The HPC Revolution (2003)
Hokamp, K., Shields, D.C., Wolfe, K.H., Caffrey, D.R.: Wrapping up BLAST and other applications for use on Unix clusters. Bioinformatics 19, 441–442 (2003)
GridBlast at Keck BioCenter, University of Wisconsin, http://bioinf.ncsa.uiuc.edu/
GridBlast at A-Star Bioinformatics Institute Singapore: http://www.bii.astar.edu.sg/infoscience/dcg/gridGridblast/index.asp
North Carolina BioGrid, http://www.ncbiogrid.org/tech/apps.html
RIKEN GridBlast, http://big.gsc.riken.jp/big/Members/fumikazu/Activity_Item.2004-02-02.0425
mpiBLAST, http://mpiblast.lanl.gov/
ScotGRID, http://www.scotgrid.ac.uk/
OpenPBS, http://www.openpbs.org/
Condor, http://www.cs.wisc.edu/condor/
Globus WSRF, http://www.globus.org/wsrf/default.asp
GenBank statistics, http://www.ncbi.nih.gov/Genbank/genbankstats.html
OGSA Open Grid Services Architecture, http://www.gridforum.org/documents/GWD-I-E/GFD-I.030.pdf
Foster, I., Kesselman, C., Tuecke, S.: The Anatomy of the Grid: Enabling Scalable Virtual Organizations. International J. Supercomputer Applications 15(3) (2001)
Globus toolkit version 2.4.3, http://www-fp.globus.org/gt2.4/
Java Cog kit version 1.1, http://www-unix.globus.org/cog/java/1.1/
UK National Grid Service, http://www.ngs.ac.uk/
PERMIS Grid authorisation software, http://www.permis.org
The U.K.: e-Science Programme, http://www.rcuk.ac.uk/escience/
Globus toolkit, http://www.globus.org/toolkit/
Cardiovascular Functional Genomics project, http://www.brc.dcs.gla.ac.uk/projects/cfg
Scottish Bioinformatics Research Network project, http://www.nesc.ac.uk/hub/projects/sbrn
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bayer, M.M., Sinnott, R. (2005). Distributed BLAST in a Grid Computing Context. In: R. Berthold, M., Glen, R.C., Diederichs, K., Kohlbacher, O., Fischer, I. (eds) Computational Life Sciences. CompLife 2005. Lecture Notes in Computer Science(), vol 3695. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11560500_22
Download citation
DOI: https://doi.org/10.1007/11560500_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29104-6
Online ISBN: 978-3-540-31726-5
eBook Packages: Computer ScienceComputer Science (R0)