CloudBurst is a parallel read-mapping algorithm optimized for mapping next-generation sequence data to the human genome and other reference genomes, for use in a variety of biological analyses including SNP discovery, genotyping, and personal genomics. It is modeled after the short-read mapping program RMAP, and reports either all alignments or the unambiguous best alignment for each read with any number of mismatches or differences. This level of sensitivity could be prohibitively time-consuming on a single machine, so CloudBurst uses the open-source Hadoop implementation of MapReduce to parallelize execution across multiple compute nodes.
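To illustrate how read mapping can be phrased as a MapReduce job, here is a minimal single-process sketch in Python. It is not CloudBurst's code and does not use Hadoop. It assumes a seed-and-extend scheme: reads are cut into k+1 non-overlapping seeds so that any alignment with at most k mismatches must share at least one exact seed with the reference. All function names are hypothetical, all reads are assumed to have the same length, and only mismatches (no insertions or deletions) are counted.

```python
from collections import defaultdict


def map_phase(reference, reads, seed_len):
    """Emit (seed, record) pairs for reference k-mers and read seeds."""
    # Every k-mer of the reference, tagged with its position.
    for pos in range(len(reference) - seed_len + 1):
        yield reference[pos:pos + seed_len], ("ref", pos)
    # Non-overlapping seeds of each read, tagged with their offset in the read.
    for read_id, read in reads.items():
        for offset in range(0, len(read) - seed_len + 1, seed_len):
            yield read[offset:offset + seed_len], ("read", read_id, offset)


def shuffle(pairs):
    """Group records by seed, as the MapReduce shuffle step would."""
    groups = defaultdict(list)
    for seed, record in pairs:
        groups[seed].append(record)
    return groups


def reduce_phase(groups, reference, reads, max_mismatches):
    """Extend each shared seed into a full alignment and count mismatches."""
    hits = set()  # a set removes duplicates found through different seeds
    for records in groups.values():
        ref_positions = [r[1] for r in records if r[0] == "ref"]
        read_seeds = [(r[1], r[2]) for r in records if r[0] == "read"]
        for read_id, offset in read_seeds:
            read = reads[read_id]
            for pos in ref_positions:
                start = pos - offset
                if start < 0 or start + len(read) > len(reference):
                    continue  # the read would hang off the reference
                window = reference[start:start + len(read)]
                mismatches = sum(a != b for a, b in zip(read, window))
                if mismatches <= max_mismatches:
                    hits.add((read_id, start, mismatches))
    return sorted(hits)


def map_reads(reference, reads, max_mismatches):
    """Report all alignments with at most max_mismatches mismatches."""
    read_len = len(next(iter(reads.values())))  # assumes equal-length reads
    seed_len = read_len // (max_mismatches + 1)
    if seed_len < 1:
        raise ValueError("reads are too short for this many mismatches")
    pairs = map_phase(reference, reads, seed_len)
    return reduce_phase(shuffle(pairs), reference, reads, max_mismatches)


def unambiguous_best(hits):
    """Keep a read's best alignment only if no other hit ties with it."""
    by_read = defaultdict(list)
    for read_id, start, mismatches in hits:
        by_read[read_id].append((mismatches, start))
    best = {}
    for read_id, candidates in by_read.items():
        candidates.sort()
        if len(candidates) == 1 or candidates[0][0] < candidates[1][0]:
            best[read_id] = (candidates[0][1], candidates[0][0])
    return best


reference = "ACGTACGTTAGCCGATAGGCTA"
reads = {"r1": "TAGCCGAT", "r2": "CTATAGGC"}
print(map_reads(reference, reads, max_mismatches=1))
```

In this toy run, `r1` matches the reference exactly at position 8 and `r2` aligns at position 12 with one mismatch. In a real Hadoop job the map and reduce functions would run on separate nodes, with the framework performing the shuffle between them.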
:: DEVELOPER
The Schatz Lab
:: MORE INFORMATION
CloudBurst: Highly Sensitive Short Read Mapping with MapReduce
Center for Bioinformatics and Computational Biology, University of Maryland
Bioinformatics 2009 25(11):1363-1369