BLAST distributed execution on partitioned databases with primary fragments. (English)
Palma, José M. Laginha M. (ed.) et al., High performance computing for computational science ‒ VECPAR 2008. 8th international conference, Toulouse, France, June 24‒27, 2008. Revised selected papers. Berlin: Springer (ISBN 978-3-540-92858-4/pbk). Lecture Notes in Computer Science 5336, 544-554 (2008).
Summary: BLAST is one of the most popular computational biology tools. The execution cost of BLAST is highly dependent on database sizes, which have considerably increased following all recent advances in sequencing methods. The evaluation of BLAST in distributed and parallel environments like PC clusters and Grids has been largely investigated in order to obtain better performances. This work evaluates a replicated allocation of the (sequences) database, where each copy is also physically fragmented. We investigate two dynamic workload balancing methods that focus on our database allocation strategy. Preliminary practical results show that we achieve both a balanced workload and very good performances. We briefly discuss ideas that would make our approach feasible for Grid computational environments.